Synapse - Fast, light, and scalable: Harnessing data-mined line annotations for automated tumor segmentation on brain MRI

Fast, light, and scalable: Harnessing data-mined line annotations for automated tumor segmentation on brain MRI Journal Article

Authors:	Swinburne, N. C.; Yadav, V.; Murthy, K. N. K.; Elnajjar, P.; Shih, H. H.; Panyam, P. K.; Santilli, A.; Gutman, D. C.; Pike, L.; Moss, N. S.; Stone, J.; Hatzoglou, V.; Shah, A.; Juluru, K.; Shah, S. P.; Holodny, A. I.; Young, R. J.; For The M.S.K. MIND Consortium
Article Title:	Fast, light, and scalable: Harnessing data-mined line annotations for automated tumor segmentation on brain MRI
Abstract:	Objectives: While fully supervised learning can yield high-performing segmentation models, the effort required to manually segment large training sets limits practical utility. We investigate whether data-mined line annotations can facilitate brain MRI tumor segmentation model development without requiring manually segmented training data.
Methods: In this retrospective study, a tumor detection model trained using clinical line annotations mined from PACS was leveraged with unsupervised segmentation to generate pseudo-masks of enhancing tumors on T1-weighted post-contrast images (9911 image slices; 3449 adult patients). Baseline segmentation models were trained and employed within a semi-supervised learning (SSL) framework to refine the pseudo-masks. Following each self-refinement cycle, a new model was trained and tested on a held-out set of 319 manually segmented image slices (93 adult patients), with the SSL cycles continuing until Dice score coefficient (DSC) peaked. DSCs were compared using bootstrap resampling. Utilizing the best-performing models, two inference methods were compared: (1) conventional full-image segmentation, and (2) a hybrid method augmenting full-image segmentation with detection plus image patch segmentation.
Results: Baseline segmentation models achieved DSC of 0.768 (U-Net), 0.831 (Mask R-CNN), and 0.838 (HRNet), improving with self-refinement to 0.798, 0.871, and 0.873 (each p < 0.001), respectively. Hybrid inference outperformed full-image segmentation alone: DSC 0.884 (Mask R-CNN) vs. 0.873 (HRNet), p < 0.001.
Conclusions: Line annotations mined from PACS can be harnessed within an automated pipeline to produce accurate brain MRI tumor segmentation models without manually segmented training data, providing a mechanism to rapidly establish tumor segmentation capabilities across radiology modalities.
Key Points:
• A brain MRI tumor detection model trained using clinical line measurement annotations mined from PACS was leveraged to automatically generate tumor segmentation pseudo-masks.
• An iterative self-refinement process automatically improved pseudo-mask quality, with the best-performing segmentation pipeline achieving a Dice score of 0.884 on a held-out test set.
• Tumor line measurement annotations generated in routine clinical radiology practice can be harnessed to develop high-performing segmentation models without manually segmented training data, providing a mechanism to rapidly establish tumor segmentation capabilities across radiology modalities.
© 2023, The Author(s), under exclusive licence to European Society of Radiology.
Keywords:	adult; controlled study; middle aged; primary tumor; retrospective studies; major clinical study; nuclear magnetic resonance imaging; brain tumor; brain neoplasms; magnetic resonance imaging; neoplasms; tumor volume; diagnostic imaging; retrospective study; information storage; lung adenocarcinoma; radiology; brain; glioblastoma; self concept; image processing, computer-assisted; image processing; image segmentation; procedures; data mining; humans; human; male; female; article; deep learning; breast ductal carcinoma; t1 weighted imaging; semi supervised machine learning
Journal Title:	European Radiology
Volume:	33
Issue:	9
ISSN:	0938-7994
Publisher:	Springer
Date Published:	2023-09-01
Start Page:	6582
End Page:	6591
Language:	English
DOI:	10.1007/s00330-023-09583-3
PUBMED:	37042979
PROVIDER:	scopus
PMCID:	PMC10523913
DOI/URL:	https://www.scopus.com/inward/record.uri?eid=2-s2.0-85152569728&doi=10.1007%2fs00330-023-09583-3&partnerID=40&md5=46e0989f65f693b06ca11977c8c77d53
Notes:	The MSK Cancer Center Support Grant (P30 CA008748) is acknowledged in the PDF -- Corresponding author is MSK author: Nathaniel C. Swinburne -- Source: Scopus
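
Illustrative note on the evaluation metric: the abstract reports Dice score coefficients (DSC) and compares them using bootstrap resampling. The sketch below shows one common way to compute a DSC on binary masks and a paired bootstrap p-value on per-slice scores. It is a minimal illustration, not the authors' code: the function names, the flattened-mask representation, the empty-mask convention, and the percentile-style p-value are all assumptions, since the record does not describe the exact procedure.

```python
import random
from typing import Sequence


def dice(pred: Sequence[int], truth: Sequence[int]) -> float:
    """Dice score between two flattened binary masks (1 = tumor pixel)."""
    if len(pred) != len(truth):
        raise ValueError("masks must have the same number of pixels")
    overlap = sum(1 for p, t in zip(pred, truth) if p and t)
    total = sum(1 for p in pred if p) + sum(1 for t in truth if t)
    # Assumed convention: two empty masks count as a perfect match.
    return 1.0 if total == 0 else 2.0 * overlap / total


def paired_bootstrap_p(
    scores_a: Sequence[float],
    scores_b: Sequence[float],
    n_resamples: int = 2000,
    seed: int = 0,
) -> float:
    """Two-sided paired bootstrap p-value for the mean per-slice difference.

    Hypothetical helper: resamples the per-slice differences with
    replacement and checks how often the resampled mean crosses zero.
    """
    if len(scores_a) != len(scores_b) or not scores_a:
        raise ValueError("need two non-empty score lists of equal length")
    rng = random.Random(seed)
    diffs = [a - b for a, b in zip(scores_a, scores_b)]
    n = len(diffs)
    at_or_below = at_or_above = 0
    for _ in range(n_resamples):
        mean = sum(rng.choices(diffs, k=n)) / n
        at_or_below += mean <= 0
        at_or_above += mean >= 0
    return min(1.0, 2.0 * min(at_or_below, at_or_above) / n_resamples)


if __name__ == "__main__":
    # Toy 4-pixel masks: one overlapping pixel, 2 predicted, 1 true.
    print(round(dice([1, 1, 0, 0], [1, 0, 0, 0]), 3))
    # Toy per-slice scores for two hypothetical models.
    print(paired_bootstrap_p([0.90, 0.85, 0.88], [0.80, 0.84, 0.79]))
```

Resampling the paired differences, rather than each model's scores independently, keeps the comparison slice-matched, which suits a setting where both models are scored on the same held-out slices.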

MSK Authors

237 Young
99 Hatzoglou
20 Shah
208 Holodny
35 Juluru
12 Elnajjar
6 Shih
15 Keshava Murthy
5 Gutman
2 Yadav
3 Panyam
2 Santilli

Related MSK Work

Semisupervised training of a brain MRI tumor detection model using mined annotations
Radiology 2022

Foundational segmentation models and clinical data mining enable accurate computer vision for lung cancer
Journal of Imaging Informatics in Medicine 2025

Self-supervised 3D anatomy segmentation using self-distilled masked image transformer (SMIT)
Lecture Notes in Computer Science 2022

Cross-modality (CT-MRI) prior augmented deep learning for robust lung tumor segmentation from small MR datasets
Medical Physics 2019

Self-supervised learning improves robustness of deep learning lung tumor segmentation models to CT imaging differences
Medical Physics 2025