--- title: "A gentle introduction to group sequential design" output: rmarkdown::html_vignette bibliography: gsDesign.bib vignette: > %\VignetteIndexEntry{A gentle introduction to group sequential design} %\VignetteEngine{knitr::rmarkdown} %\VignetteEncoding{UTF-8} --- ```{r, include=FALSE} knitr::opts_chunk$set( collapse = FALSE, comment = "#>", dev = "ragg_png", dpi = 96, fig.retina = 1, fig.width = 7.2916667, fig.asp = 0.618, fig.align = "center", out.width = "80%" ) options(width = 58) ``` ## Introduction This article is intended to give a gentle mathematical and statistical introduction to group sequential design. We also provide relatively simple examples from the literature to explain clinical applications. There is no programming shown, but by accessing the source for the article all required programming can be accessed; substantial commenting is provided in the source in the hope that users can understand how to implement the concepts developed here. Hopefully, the few mathematical and statistical concepts introduced will not discourage those wishing to understand some underlying concepts for group sequential design. A group sequential design enables repeated analysis of an endpoint for a clinical trial to enable possible early stopping of a trial for either a positive result, for futility, or for a safety issue. This approach can - limit exposure risk to patients and clinical trial investment past the time where known unacceptable safety risks have been established for the endpoint of interest, - limit investment in a trial where interim results suggest further evaluation for a positive efficacy finding is futile, or - accelerate the availability of a highly effective treatment by enabling early approval following an early positive finding. Examples of outcomes that might be considered include: - a continuous outcome such as change from baseline at some fixed follow-up time in the HAM-D depression score, - absolute or difference or risk ratio for a response rate (e.g., in oncology) or failure rate for a binary (yes/no) outcome, and - a hazard ratio for a time-to-event out such such as time-to-death or disease progression in an oncology trial or for time until a cardiovascular event (death, myocardial infarction or unstable angina). Examples of the above include: - a new treatment for major depression where an interim analysis of a continuous outcome stopped the trial for futility (@binneman20086), - a new treatment for patients with unstable angina undergoing balloon angioplasty with a positive interim finding for a binary outcome of death, myocardial infarction or urgent repeat intervention within 30 days (@CAPTURE), and - a new treatment for patients with lung cancer based on a positive interim finding for time-to-death (@KEYNOTE189). ## Group sequential design framework We assume - A two-arm clinical trial with a control and experimental group. - There are $k$ analyses planned for some integer $k> 1.$ - There is a natural parameter $\delta$ describing the underlying treatment difference with an estimate that has an asymptotically normal and efficient estimate $\hat\delta_j$ with variance $\sigma_j^2$ and corresponding statistical information $\mathcal{I}_j=1/\sigma_j^2$, at analysis $j=1,2,\ldots,k$. A positive value favoring experimental treatment and negative value favoring control. We assume a consistent estimate $\hat\sigma_j^2$ of $\sigma_j^2, j=1,2,\ldots,k$. - The information fraction is defined as $t_j=\mathcal{I}_i/\mathcal{I}_j$ at analysis $j=1,\ldots,k$. - Correlations between estimates at different analyses are $\text{Corr}(\hat\delta_i,\hat\delta_j)=\sqrt{\mathcal{I}_i/\mathcal{I}_j}=\sqrt{t_j}$ for $1\le i\le j\le k.$ - There is a test test $Z_j\approx\hat\delta_j/\hat{\sigma}^2_j.$ For a time-to-event outcome, $\delta$ would typically represent the logarithm of the hazard ratio for the control group versus the experimental group. For a difference in response rates, $\delta$ would represent the underlying response rates. For a continuous outcome such as the HAM-D, we would examine the difference in change from baseline at a milestone time point (e.g., at 6 weeks as in @binneman20086). For $j=1,\ldots,k$, the tests $Z_j$ are asymptotically multivariate normal with correlations as above, and for $i=1,\ldots,k$ have $\text{Cov}(Z_i,Z_j)=\text{Corr}(\hat\delta_i,\hat\delta_j)$ and $E(Z_j)=\delta\sqrt{I_j}.$ This multivariate asymptotic normal distribution for $Z_1,\ldots,Z_k$ is referred to as the *canonical form* by @JTBook who have also summarized much of the surrounding literature. ## Bounds for testing ### One-sided testing We assume that the primary test the null hypothesis $H_{0}$: $\delta=0$ against the alternative $H_{1}$: $\delta = \delta_1$ for a fixed effect size $\delta_1 > 0$ which represents a benefit of experimental treatment compared to control. We assume further that there is interest in stopping early if there is good evidence to reject one hypothesis in favor of the other. For $i=1,2,\ldots,k-1$, interim cutoffs $l_{i}< u_{i}$ are set; final cutoffs $l_{k}\leq u_{k}$ are also set. For $i=1,2,\ldots,k$, the trial is stopped at analysis $i$ to reject $H_{0}$ if $l_{j}