A scalable quality assurance process for curating oncology electronic health records: The Project GENIE Biopharma Collaborative approach
Journal Article


Authors: Lavery, J. A.; Lepisto, E. M.; Brown, S.; Rizvi, H.; McCarthy, C.; LeNoue-Newton, M.; Yu, C.; Lee, J.; Guo, X.; Yu, T.; Rudolph, J.; Sweeney, S.; AACR Project GENIE Consortium; Park, B. H.; Warner, J. L.; Bedard, P. L.; Riely, G.; Schrag, D.; Panageas, K. S.
Article Title: A scalable quality assurance process for curating oncology electronic health records: The Project GENIE Biopharma Collaborative approach
Abstract: PURPOSE: The American Association for Cancer Research Project Genomics Evidence Neoplasia Information Exchange Biopharma Collaborative is a multi-institution effort to build a pan-cancer repository of genomic and clinical data curated from the electronic health record. For the research community to be confident that data extracted from electronic health record text are reliable, transparency of the approach used to ensure data quality is essential. MATERIALS AND METHODS: Four institutions participating in AACR's Project GENIE created an observational cohort of patients with cancer for whom tumor molecular profiling data, therapeutic exposures, and treatment outcomes are available and will be shared publicly with the research community. A comprehensive approach to quality assurance included assessments of (1) feasibility of the curation model through pressure test cases; (2) accuracy through programmatic queries and comparison with source data; and (3) reproducibility via double curation and code review. RESULTS: Assessments of feasibility resulted in critical modifications to the curation directives. Queries and comparison with source data identified errors that were rectified via data correction and curator retraining. Assessment of intercurator reliability indicated a reliable curation model. CONCLUSION: The transparent quality assurance processes for the GENIE BPC data ensure that the data can be used for analyses that support clinical decision making and advances in precision oncology.
Journal Title: JCO Clinical Cancer Informatics
Volume: 6
ISSN: 2473-4276
Publisher: American Society of Clinical Oncology
Date Published: 2022-01-01
Start Page: e2100105
Language: English
DOI: 10.1200/cci.21.00105
PUBMED: 35192403
PROVIDER: scopus
PMCID: PMC8863125
Notes: Article -- Export Date: 1 April 2022 -- Source: Scopus
MSK Authors
  1. Gregory J Riely
  2. Katherine S Panageas
  3. Hira Abbas Rizvi
  4. Julia E Rudolph
  5. Jessica Ann Lavery
  6. Samantha Brown
  7. Jasme Lee