Cancer type, stage and prognosis assessment from pathology reports using LLMs Journal Article


Authors: Saluja, R.; Rosenthal, J.; Windon, A.; Artzi, Y.; Pisapia, D. J.; Liechty, B. L.; Rory Sabuncu, M. R.
Article Title: Cancer type, stage and prognosis assessment from pathology reports using LLMs
Abstract: Large Language Models (LLMs) have shown significant promise across various natural language processing tasks. However, their application in the field of pathology, particularly for extracting meaningful insights from unstructured medical texts such as pathology reports, remains underexplored and not well quantified. In this project, we leverage state-of-the-art language models, including the GPT family, Mistral models, and the open-source Llama models, to evaluate their performance in comprehensively analyzing pathology reports. Specifically, we assess their performance in cancer type identification, AJCC stage determination, and prognosis assessment, encompassing both information extraction and higher-order reasoning tasks. Based on a detailed analysis of their performance metrics in a zero-shot setting, we developed two instruction-tuned models: Path-llama3.1-8B and Path-GPT-4o-mini-FT. These models demonstrated superior performance in zero-shot cancer type identification, staging, and prognosis assessment compared to the other models evaluated. © 2025 Elsevier B.V., All rights reserved.
Keywords: cancer staging; neoplasm staging; neoplasm; neoplasms; classification; pathology; diagnosis; natural language processing; humans; prognosis; human
Journal Title: Scientific Reports
Volume: 15
Issue: 1
ISSN: 20452322
Publisher: Elsevier B.V.  
Date Published: 2025-01-01
Start Page: 27300
Language: English
DOI: 10.1038/s41598-025-10709-4
PUBMED: 40715326
PROVIDER: scopus
PMCID: PMC12297491
DOI/URL:
Notes: Article -- Source: Scopus
Altmetric
Citation Impact
BMJ Impact Analytics