Synapse - One statistical test is sufficient for assessing new predictive markers

One statistical test is sufficient for assessing new predictive markers Journal Article

Authors:	Vickers, A. J.; Cronin, A. M.; Begg, C. B.
Article Title:	One statistical test is sufficient for assessing new predictive markers
Abstract:	Background. We have observed that the area under the receiver operating characteristic curve (AUC) is increasingly being used to evaluate whether a novel predictor should be incorporated in a multivariable model to predict risk of disease. Frequently, investigators will approach the issue in two distinct stages: first, by testing whether the new predictor variable is significant in a multivariable regression model; second, by testing differences between the AUC of models with and without the predictor using the same data from which the predictive models were derived. These two steps often lead to discordant conclusions. Discussion. We conducted a simulation study in which two predictors, X and X*, were generated as standard normal variables with varying levels of predictive strength, represented by means that differed depending on the binary outcome Y. The data sets were analyzed using logistic regression, and likelihood ratio and Wald tests for the incremental contribution of X* were performed. The patient-specific predictors for each of the models were then used as data for a test comparing the two AUCs. Under the null, the sizes of the likelihood ratio and Wald tests were close to nominal, but the area test was extremely conservative, with test sizes less than 0.006 for all configurations studied. Where X* was associated with outcome, the area test had much lower power than the likelihood ratio and Wald tests. Summary. Evaluation of the statistical significance of a new predictor when there are existing clinical predictors is most appropriately accomplished in the context of a regression model. Although comparison of AUCs is a conceptually equivalent approach to the likelihood ratio and Wald tests, it has vastly inferior statistical properties. Use of both approaches will frequently lead to inconsistent conclusions. Nonetheless, comparison of receiver operating characteristic curves remains a useful descriptive tool for initial evaluation of whether a new predictor might be of clinical relevance. © 2011 Vickers et al; licensee BioMed Central Ltd.
Journal Title:	BMC Medical Research Methodology
Volume:	11
ISSN:	1471-2288
Publisher:	BioMed Central Ltd
Date Published:	2011-01-28
Start Page:	13
Language:	English
DOI:	10.1186/1471-2288-11-13
PROVIDER:	scopus
PMCID:	PMC3042425
PUBMED:	21276237
DOI/URL:	http://www.scopus.com/inward/record.url?eid=2-s2.0-79251564078&partnerID=40&md5=f3a2d9c9ebb3c047b70d32c514d9b56d
Notes:	Export Date: 23 June 2011; Art. No.: 13; Source: Scopus
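
The simulation design summarized in the abstract can be sketched roughly as follows. This is an illustrative sketch only, not the authors' code: the sample size, the mean shifts, the 50% outcome prevalence, the independence of X and X*, and every function name here are assumptions. It fits the nested logistic models by Newton-Raphson, computes the likelihood ratio and Wald tests for X*, and reports the two in-sample AUCs descriptively; the formal test comparing the AUCs is omitted.

```python
import math
import random


def simulate(n, mu_x, mu_xstar, rng):
    """Y ~ Bernoulli(0.5); X and X* are unit-variance normals with mean mu * Y (assumed design)."""
    rows = []
    for _ in range(n):
        y = 1 if rng.random() < 0.5 else 0
        rows.append((y, rng.gauss(mu_x * y, 1.0), rng.gauss(mu_xstar * y, 1.0)))
    return rows


def solve(a, b):
    """Solve a small linear system by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(m[r][c]))
        m[c], m[p] = m[p], m[c]
        for r in range(n):
            if r != c:
                f = m[r][c] / m[c][c]
                for k in range(c, n + 1):
                    m[r][k] -= f * m[c][k]
    return [m[i][n] / m[i][i] for i in range(n)]


def score_info(beta, design, y):
    """Log-likelihood, score vector, and information matrix of a logistic model."""
    k = len(beta)
    ll, grad = 0.0, [0.0] * k
    info = [[0.0] * k for _ in range(k)]
    for row, yi in zip(design, y):
        p = 1.0 / (1.0 + math.exp(-sum(b * v for b, v in zip(beta, row))))
        ll += math.log(p) if yi else math.log(1.0 - p)
        for i in range(k):
            grad[i] += (yi - p) * row[i]
            for j in range(k):
                info[i][j] += p * (1.0 - p) * row[i] * row[j]
    return ll, grad, info


def fit_logistic(design, y, iters=25):
    """Maximum likelihood fit by Newton-Raphson, starting from all-zero coefficients."""
    beta = [0.0] * len(design[0])
    for _ in range(iters):
        _, grad, info = score_info(beta, design, y)
        beta = [b + s for b, s in zip(beta, solve(info, grad))]
    ll, _, info = score_info(beta, design, y)
    return beta, ll, info


def auc(scores, y):
    """AUC as the Mann-Whitney proportion of correctly ordered case/control pairs."""
    pos = [s for s, yi in zip(scores, y) if yi == 1]
    neg = [s for s, yi in zip(scores, y) if yi == 0]
    wins = sum((p > q) + 0.5 * (p == q) for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


def analyze(rows):
    """Likelihood ratio and Wald tests for X*, plus in-sample AUCs of both models."""
    y = [r[0] for r in rows]
    base = [[1.0, r[1]] for r in rows]        # intercept + X
    full = [[1.0, r[1], r[2]] for r in rows]  # intercept + X + X*
    beta0, ll0, _ = fit_logistic(base, y)
    beta1, ll1, info = fit_logistic(full, y)
    lr_stat = max(2.0 * (ll1 - ll0), 0.0)
    var_xstar = solve(info, [0.0, 0.0, 1.0])[2]  # inverse-information entry for X*
    z = beta1[2] / math.sqrt(var_xstar)
    linpred = lambda beta, d: [sum(b * v for b, v in zip(beta, row)) for row in d]
    return {
        "p_lr": math.erfc(math.sqrt(lr_stat / 2.0)),   # chi-square, 1 df
        "p_wald": math.erfc(abs(z) / math.sqrt(2.0)),  # two-sided normal
        "auc_base": auc(linpred(beta0, base), y),
        "auc_full": auc(linpred(beta1, full), y),
    }


if __name__ == "__main__":
    print(analyze(simulate(500, 0.3, 0.5, random.Random(0))))
```

Repeating `analyze` over many simulated data sets with `mu_xstar = 0` would give an empirical estimate of the test sizes, and with `mu_xstar > 0` an estimate of power, which is the kind of comparison the abstract reports.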
MSK Authors

306 Begg
888 Vickers

Related MSK Work

Comparing ROC Curves Derived From Regression Models
Statistics in Medicine 2013

Optimal Cutpoint Estimation With Censored Data
Journal of Statistical Theory and Practice 2013

Prostate Cancer: Detection Of Extracapsular Extension By Genitourinary And General Body Radiologists At MR Imaging
Radiology 2004

Predicting Radiation Induced Valvular Heart Damage
Acta Oncologica 2015

Locally Advanced Breast Cancer: MR Imaging For Prediction Of Response To Neoadjuvant Chemotherapy Results From ACRIN 6657/I-SPY Trial
Radiology 2012