An algorithm to predict data completeness in oncology electronic medical records for comparative effectiveness research Journal Article


Authors: Merola, D.; Schneeweiss, S.; Schrag, D.; Lii, J.; Lin, K. J.
Article Title: An algorithm to predict data completeness in oncology electronic medical records for comparative effectiveness research
Abstract: Introduction: Electronic health record (EHR) discontinuity (missing out-of-network encounters) can lead to information bias. We sought to construct an algorithm that identifies high EHR-continuity among oncology patients. Methods: Using a linked Medicare-EHR database and regression, we sought to 1) measure how often Medicare claims for outpatient encounters were substantiated by visits recorded in the EHR, and 2) predict continuity ratio, defined as the yearly proportion of outpatient encounters reported to Medicare that were captured by EHR data. The prediction model...s performance was evaluated with the coefficient of determination and Spearman...s correlation. We quantified variable misclassification by decile of continuity ratio using standardized difference and sensitivity. Results: A total of 79,678 subjects met all eligibility criteria. Predicted and observed continuity was highly correlated (σSpearman=0.86). On average across all variables measured, MSD was reduced by a factor of 1/7th and sensitivity was improved 35-fold comparing subjects in the highest vs. lowest decile of CR. Conclusion: In the oncology population, restricting EHR-based study cohorts to subjects with high continuity may reduce misclassification without greatly impacting representativeness. Further work is needed to elucidate the best manner of implementing continuity prediction rules in cohort studies. © 2022 Elsevier Ltd
Keywords: adult; controlled study; major clinical study; cohort analysis; prediction; medicare; electronic medical record; algorithm; quantitative analysis; outpatient; comparative effectiveness; comparative effectiveness research; continuity; human; male; female; article; electronic medical records; eligibility criteria; electronic health record; information bias; data completeness
Journal Title: Annals of Epidemiology
Volume: 76
ISSN: 1047-2797
Publisher: Elsevier Science, Inc.  
Date Published: 2022-12-01
Start Page: 143
End Page: 149
Language: English
DOI: 10.1016/j.annepidem.2022.07.007
PUBMED: 35878784
PROVIDER: scopus
PMCID: PMC9741728
DOI/URL:
Notes: Article -- Export Date: 1 December 2022 -- Source: Scopus
Altmetric
Citation Impact
BMJ Impact Analytics
MSK Authors
  1. Deborah Schrag
    229 Schrag