Creating a synthetic clinical trial: Comparative effectiveness analyses using an electronic medical record Journal Article


Authors: Zauderer, M. G.; Grigorenko, A.; May, P.; Kastango, N.; Wagner, I.; Caroline, A.; Kris, M. G.
Article Title: Creating a synthetic clinical trial: Comparative effectiveness analyses using an electronic medical record
Abstract: PURPOSE Electronic medical records (EMRs) are a vast resource of potentially mineable data that can be used to complement and extend clinical trials. Extracting and analyzing EMR data are impeded by technical complexities associated with large, multiformat databases. We sought to develop and validate a framework that would overcome the difficulties associated with EMR data and create a simple, portable, and expandable system to better use this resource. MATERIALS AND METHODS An Internet-accessible program was developed in Python that applied user-defined criteria to identify and extract patient data from Memorial Sloan Kettering databases. A Worker Application composed of individual modules was developed to identify each patient’s functional status, smoking status, and treatment classification. The validity of this approach was tested by identifying, extracting, and analyzing data from a patient cohort that paralleled a practice-changing, prospective, randomized phase III clinical trial performed at a different institution. We called this a synthetic clinical trial. RESULTS Our synthetic clinical trial identified and extracted data on a cohort of 281 patients with lung cancer who matched inclusion criteria and received their first treatment between October 2003 and July 2010. The data extraction modules were precise and accurate, with F-measures greater than 0.98. Results were similar in directionality and magnitude to the chosen comparator clinical trial. CONCLUSION Our framework offers an accurate and user-friendly interface for identifying and extracting EMR data that can be used to create synthetic clinical trials. Additional studies are needed to validate this approach in other patient cohorts, replicate our findings, and leverage this methodology to improve patient care and accelerate drug development. © 2019 by American Society of Clinical Oncology.
Keywords: adult; controlled study; major clinical study; cisplatin; cancer combination chemotherapy; gemcitabine; prospective study; quality control; carboplatin; randomized controlled trial; cohort analysis; smoking; retrospective study; patient care; electronic medical record; lung adenocarcinoma; karnofsky performance status; clinical evaluation; patient coding; phase 3 clinical trial; conceptual framework; pemetrexed; functional status; comparative effectiveness; proof of concept; data extraction; human; male; female; priority journal; article; internet access
Journal Title: JCO Clinical Cancer Informatics
Volume: 3
ISSN: 2473-4276
Publisher: American Society of Clinical Oncology  
Date Published: 2019-01-01
Language: English
DOI: 10.1200/cci.19.00037
PUBMED: 31225984
PROVIDER: scopus
PMCID: PMC6874028
DOI/URL:
Notes: Article -- Source: Scopus
Altmetric
Citation Impact
BMJ Impact Analytics
MSK Authors
  1. Marjorie G Zauderer
    188 Zauderer
  2. Mark Kris
    869 Kris
  3. Isaac   Wagner
    17 Wagner
  4. Paul   May
    2 May