Cookies on this website

We use cookies to ensure that we give you the best experience on our website. If you click 'Accept all cookies' we'll assume that you are happy to receive all cookies and you won't see this message again. If you click 'Reject all non-essential cookies' only necessary cookies providing core functionality such as security, network management, and accessibility will be enabled. Click 'Find out more' for information on how to change your cookie settings.

OBJECTIVE: A schema was recently proposed for assessing the levels of evidence for surrogate validity that included 4 domains: Target, Study Design, Statistical Strength, and Penalties. This report examines one component of the schema. It surveys the literature on methods of statistical validation of surrogate markers and compares these methods head-to-head using simulated datasets. METHODS: Simulated datasets (continuous, multivariate normal) were generated to capture 3 possible relationships of surrogate (S) and true (T) outcome (none, weakly positive, strongly positive) each applied to 4 treatment effects (effect on both surrogate and true outcome, effect on neither, effect on surrogate only, and effect on true outcome only). These datasets were analyzed using single and multitrial statistical approaches, and the results were provided to participants for discussion. RESULTS: The multitrial surrogate threshold effect seemed to capture best the requirement that surrogate validation is demonstrated by a treatment-associated change in the surrogate predicting a treatment-associated change in the outcome. CONCLUSION: There was general agreement that neither a single trial nor any of the single trial statistical methods was adequate to establish surrogate validity. These exercises also showed that summary statistics developed specifically to establish surrogate validity, such as the proportion of the effect explained, were problematic. A sizable statistical research agenda remains, which includes investigating the additional advantage obtained with modeling subject-level data compared to modeling with only trial-level data; and developing and testing multitrial statistical approaches robust to settings with only a few trials.


Conference paper

Publication Date





616 - 619


Biomarkers, Clinical Trials as Topic, Data Interpretation, Statistical, Humans, Models, Theoretical, Outcome Assessment (Health Care), Reproducibility of Results, Rheumatic Diseases, Rheumatology