Synapse - Adapting large language models for automatic annotation of radiology reports for metastases detection

Adapting large language models for automatic annotation of radiology reports for metastases detection Conference Paper

Authors:	Barabadi, M. A.; Yip Chan, W.; Zhu, X.; Simpson, A. L.; Do, R. K. G.
Title:	Adapting large language models for automatic annotation of radiology reports for metastases detection
Conference Title:	2024 IEEE Canadian Conference on Electrical and Computer Engineering (CCECE)
Abstract:	Automatic identification of metastatic sites in cancer patients from electronic health records is a challenging yet crucial task with significant implications for diagnosis and treatment. In this study, we propose a method to detect metastases from non-structured radiology report texts by accessing only their impression section. We build models based on pre-trained large language models and parameter-efficient fine-tuning. We compare model performances between utilizing non-structured reports and reports following institutional-level templates. By incorporating patient historical data and their timeline into the model, we bridge the gap between structured and non-structured reports. Our experiments are conducted on data gathered at Memorial Sloan Kettering Cancer Center (MSKCC) which have been annotated for metastases presence in three organs: liver, lung, and adrenal glands. Our results suggest that access to previous reports significantly improves model performance, with an average improvement of 7.7 points in terms of F1-score over all datasets. Additionally, incorporating temporal information enhances the accuracy of metastasis detection by 0.4 and 1.1 points on liver and adrenal glands data, respectively. Our method shows potential for automating radiology report labeling on a large scale in an efficient manner, with the potential to deploy on low-cost hardware. © 2024 IEEE.
Keywords:	lung cancer; oncology; radiology; diagnosis; diseases; language processing; natural language processing systems; natural language processing; radiology reports; metastasis detection; natural languages; language model; large language model; large language models; metastases detection; parameter-efficient tuning; modeling performance; structured reports
Journal Title	Conference proceedings of the Canadian Conference on Electrical and Computer Engineering
Conference Dates:	2024 Aug 6-9
Conference Location:	Kingston, Canada
ISBN:	2576-7046
Publisher:	Institute of Electrical and Electronics Engineers Inc.
Date Published:	2024-01-01
Start Page:	340
End Page:	345
Language:	English
DOI:	10.1109/ccece59415.2024.10667245
PROVIDER:	scopus
DOI/URL:	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85205013831&doi=10.1109%2fCCECE59415.2024.10667245&partnerID=40&md5=214f7dc49c208ebaa6a17b7056555f97
Notes:	Conference paper -- ISBN: 979-8-3503-7162-8 -- Source: Scopus

Altmetric

What is Altmetric?

Citation Impact

What is Dimensions Citation Badge?

BMJ Impact Analytics

MSK Authors

260 Do

Related MSK Work

Targeted Generative Data Augmentation For Automatic Metastases Detection From Free Text Radiology Reports

Frontiers in Artificial Intelligence 2025
Developing A Cancer Digital Twin: Supervised Metastases Detection From Consecutive Structured Radiology Reports

Frontiers in Artificial Intelligence 2022
Parameter Efficient Fine Tuning And Few Shot Learning Of Multiscale Vision Transformers For Liver Tumour Segmentation In Ct

Progress in Biomedical Optics and Imaging - Proceedings of SPIE 2025
Generative Inpainting Based Anomaly Detection For Ct Liver Tumor Detection

IEEE Transactions on Radiation and Plasma Medical Sciences 2025
Use Of Natural Language Processing To Infer Sites Of Metastatic Disease From Radiology Reports At Scale

JCO Clinical Cancer Informatics 2024