Evaluation of algorithms using automated health plan data to identify breast cancer recurrences Journal Article


Authors: Aiello Bowles, E. J.; Kroenke, C. H.; Chubak, J.; Bhimani, J.; O'Connell, K.; Brandzel, S.; Valice, E.; Doud, R.; Theis, M. K.; Roh, J. M.; Heon, N.; Persaud, S.; Griggs, J. J.; Bandera, E. V.; Kushi, L. H.; Kantor, E. D.
Article Title: Evaluation of algorithms using automated health plan data to identify breast cancer recurrences
Abstract: Background: We updated algorithms to identify breast cancer recurrences from administrative data, extending previously developed methods. Methods: In this validation study, we evaluated pairs of breast cancer recurrence algorithms (vs. individual algorithms) to identify recurrences. We generated algorithm combinations that categorized discordant algorithm results as no recurrence [High Specificity and PPV (positive predictive value) Combination] or recurrence (High Sensitivity Combination). We compared individual and combined algorithm results to manually abstracted recurrence outcomes from a sample of 600 people with incident stage I-IIIA breast cancer diagnosed between 2004 and 2015. We used Cox regression to evaluate risk factors associated with age- and stage-adjusted recurrence rates using different recurrence definitions, weighted by inverse sampling probabilities. Results: Among 600 people, we identified 117 recurrences using the High Specificity and PPV Combination, 505 using the High Sensitivity Combination, and 118 using manual abstraction. The High Specificity and PPV Combination had good specificity [98%, 95% confidence interval (CI): 97-99] and PPV (72%, 95% CI: 63-80) but modest sensitivity (64%, 95% CI: 44-80). The High Sensitivity Combination had good sensitivity (80%, 95% CI: 49-94) and specificity (83%, 95% CI: 80-86) but low PPV (29%, 95% CI: 25-34). Recurrence rates using combined algorithms were similar in magnitude for most risk factors. Conclusions: By combining algorithms, we identified breast cancer recurrences with greater PPV than individual algorithms, without additional review of discordant records. Impact: Researchers should consider tradeoffs between accuracy and manual chart abstraction resources when using previously developed algorithms. We provided guidance for future studies that use breast cancer recurrence algorithms with or without supplemental manual chart abstraction. © 2023 American Association for Cancer Research.
Keywords: adult; cancer chemotherapy; controlled study; aged; middle aged; major clinical study; cancer radiotherapy; cancer staging; recurrence risk; follow up; antineoplastic agent; sensitivity and specificity; breast cancer; mastectomy; epidermal growth factor receptor 2; risk factors; validation study; breast neoplasms; algorithms; risk factor; cancer hormone therapy; false negative result; body mass; proportional hazards model; algorithm; breast tumor; comorbidity; predictive value of tests; estrogen receptor; progesterone receptor; false positive result; predictive value; lumpectomy; diagnostic test accuracy study; demographics; charlson comorbidity index; very elderly; humans; human; male; female; article; icd-9; icd-10; breast cancer recurrence
Journal Title: Cancer Epidemiology Biomarkers and Prevention
Volume: 33
Issue: 3
ISSN: 1055-9965
Publisher: American Association for Cancer Research  
Date Published: 2024-03-01
Start Page: 355
End Page: 364
Language: English
DOI: 10.1158/1055-9965.Epi-23-0782
PUBMED: 38088912
PROVIDER: scopus
PMCID: PMC10922110
DOI/URL:
Notes: The MSK Cancer Center Support Grant (P30 CA008748) is acknowledged in the PDF -- Source: Scopus
Altmetric
Citation Impact
BMJ Impact Analytics
MSK Authors
  1. Narre Heon
    16 Heon
  2. Elizabeth David Kantor
    40 Kantor
  3. Jenna Bhimani
    14 Bhimani
  4. Sonia Persaud
    21 Persaud