Methods for categorizing a prognostic variable in a multivariable setting (Journal Article)


Authors: Mazumdar, M.; Smith, A.; Bacik, J.
Article Title: Methods for categorizing a prognostic variable in a multivariable setting
Abstract: The literature is filled with examples of categorization of a continuous prognostic variable in a univariable setting followed by the addition of this categorical variable to an existing multivariable model. Typically, an 'optimal' cutpoint for a new prognostic variable is obtained through a systematic search relating the variable to the outcome in a univariable manner. The corresponding categorical variable is then fitted in a multivariable model along with other already established prognostic covariates to assess the additional value of the new variable. This prompts the question of whether the cutpoint search should have been performed in the same multivariable setting where it will ultimately be used. In this paper, we extend the univariable cutpoint search methods (split-sample approach and two-fold cross-validation approach) to the multivariable setting using the -2 × log-likelihood statistic as the correlative measure. A Monte Carlo simulation study demonstrates that both methods are more efficient in detecting the true cutpoint and in estimating the effect size under the multivariable setting as opposed to the univariable setting. The cross-validation method performs better than the split-sample method in univariable as well as multivariable scenarios. For the cross-validation method in the multivariable setting, there is still a substantial loss of power when a cutpoint model is used in cases where there is a continuous relationship between the covariate and the outcome. An example is provided to illustrate the value of the multivariable cross-validation approach. Copyright © 2003 John Wiley & Sons, Ltd.
Keywords: cancer survival; controlled study; survival analysis; major clinical study; cancer risk; validation process; united states; reproducibility of results; breast cancer; classification; proportional hazards models; analytic method; high risk patient; prostate cancer; correlation analysis; statistical analysis; outcomes research; intermethod comparison; multivariate analysis; monte carlo method; osteopontin; nonbiological model; humans; prognosis; human; male; article; categorization; log-likelihood statistics; multivariable setting; split-sample approach; two-fold cross-validation approach
Journal Title: Statistics in Medicine
Volume: 22
Issue: 4
ISSN: 0277-6715
Publisher: John Wiley & Sons  
Date Published: 2003-02-28
Start Page: 559
End Page: 571
Language: English
DOI: 10.1002/sim.1333
PUBMED: 12590414
PROVIDER: scopus
Notes: Export Date: 12 September 2014 -- Source: Scopus
MSK Authors
  1. Madhu Mazumdar
  2. Alexander D Smith
  3. Jennifer M Bacik