Representing and utilizing clinical textual data for real world studies: An OHDSI approach Journal Article


Authors: Keloth, V. K.; Banda, J. M.; Gurley, M.; Heider, P. M.; Kennedy, G.; Liu, H.; Liu, F.; Miller, T.; Natarajan, K.; Patterson, O. V.; Peng, Y.; Raja, K.; Reeves, R. M.; Rouhizadeh, M.; Shi, J.; Wang, X.; Wang, Y.; Wei, W. Q.; Williams, A. E.; Zhang, R.; Belenkaya, R.; Reich, C.; Blacketer, C.; Ryan, P.; Hripcsak, G.; Elhadad, N.; Xu, H.
Article Title: Representing and utilizing clinical textual data for real world studies: An OHDSI approach
Abstract: Clinical documentation in electronic health records contains crucial narratives and details about patients and their care. Natural language processing (NLP) can unlock the information conveyed in clinical notes and reports, and thus plays a critical role in real-world studies. The NLP Working Group at the Observational Health Data Sciences and Informatics (OHDSI) consortium was established to develop methods and tools to promote the use of textual data and NLP in real-world observational studies. In this paper, we describe a framework for representing and utilizing textual data in real-world evidence generation, including representations of information from clinical text in the Observational Medical Outcomes Partnership (OMOP) Common Data Model (CDM), the workflow and tools that were developed to extract, transform and load (ETL) data from clinical notes into tables in OMOP CDM, as well as current applications and specific use cases of the proposed OHDSI NLP solution at large consortia and individual institutions with English textual data. Challenges faced and lessons learned during the process are also discussed to provide valuable insights for researchers who are planning to implement NLP solutions in real-world studies. © 2023 Elsevier Inc.
Keywords: outcome assessment; observational study; information science; language processing; natural language processing systems; informatics; electronic health records; medical informatics; natural language processing; workflow; real-world; human; article; electronic health record; records management; electronic health; health records; real-world study; natural languages; textual data
Journal Title: Journal of Biomedical Informatics
Volume: 142
ISSN: 1532-0464
Publisher: Elsevier Inc.  
Date Published: 2023-06-01
Start Page: 104343
Language: English
DOI: 10.1016/j.jbi.2023.104343
PUBMED: 36935011
PROVIDER: scopus
PMCID: PMC10428170
DOI/URL:
Notes: Article -- Source: Scopus
Altmetric
Citation Impact
BMJ Impact Analytics
MSK Authors