Fast, light, and scalable: Harnessing data-mined line annotations for automated tumor segmentation on brain MRI Journal Article


Authors: Swinburne, N. C.; Yadav, V.; Murthy, K. N. K.; Elnajjar, P.; Shih, H. H.; Panyam, P. K.; Santilli, A.; Gutman, D. C.; Pike, L.; Moss, N. S.; Stone, J.; Hatzoglou, V.; Shah, A.; Juluru, K.; Shah, S. P.; Holodny, A. I.; Young, R. J.; For The M.S.K. MIND Consortium
Article Title: Fast, light, and scalable: Harnessing data-mined line annotations for automated tumor segmentation on brain MRI
Abstract: Objectives: While fully supervised learning can yield high-performing segmentation models, the effort required to manually segment large training sets limits practical utility. We investigate whether data mined line annotations can facilitate brain MRI tumor segmentation model development without requiring manually segmented training data. Methods: In this retrospective study, a tumor detection model trained using clinical line annotations mined from PACS was leveraged with unsupervised segmentation to generate pseudo-masks of enhancing tumors on T1-weighted post-contrast images (9911 image slices; 3449 adult patients). Baseline segmentation models were trained and employed within a semi-supervised learning (SSL) framework to refine the pseudo-masks. Following each self-refinement cycle, a new model was trained and tested on a held-out set of 319 manually segmented image slices (93 adult patients), with the SSL cycles continuing until Dice score coefficient (DSC) peaked. DSCs were compared using bootstrap resampling. Utilizing the best-performing models, two inference methods were compared: (1) conventional full-image segmentation, and (2) a hybrid method augmenting full-image segmentation with detection plus image patch segmentation. Results: Baseline segmentation models achieved DSC of 0.768 (U-Net), 0.831 (Mask R-CNN), and 0.838 (HRNet), improving with self-refinement to 0.798, 0.871, and 0.873 (each p < 0.001), respectively. Hybrid inference outperformed full image segmentation alone: DSC 0.884 (Mask R-CNN) vs. 0.873 (HRNet), p < 0.001. Conclusions: Line annotations mined from PACS can be harnessed within an automated pipeline to produce accurate brain MRI tumor segmentation models without manually segmented training data, providing a mechanism to rapidly establish tumor segmentation capabilities across radiology modalities. Key Points: • A brain MRI tumor detection model trained using clinical line measurement annotations mined from PACS was leveraged to automatically generate tumor segmentation pseudo-masks. • An iterative self-refinement process automatically improved pseudo-mask quality, with the best-performing segmentation pipeline achieving a Dice score of 0.884 on a held-out test set. • Tumor line measurement annotations generated in routine clinical radiology practice can be harnessed to develop high-performing segmentation models without manually segmented training data, providing a mechanism to rapidly establish tumor segmentation capabilities across radiology modalities. © 2023, The Author(s), under exclusive licence to European Society of Radiology.
Keywords: adult; controlled study; middle aged; primary tumor; retrospective studies; major clinical study; nuclear magnetic resonance imaging; brain tumor; brain neoplasms; magnetic resonance imaging; neoplasms; tumor volume; diagnostic imaging; retrospective study; information storage; lung adenocarcinoma; radiology; brain; glioblastoma; self concept; image processing, computer-assisted; image processing; image segmentation; procedures; data mining; humans; human; male; female; article; deep learning; breast ductal carcinoma; t1 weighted imaging; semi supervised machine learning
Journal Title: European Radiology
Volume: 33
Issue: 9
ISSN: 0938-7994
Publisher: Springer  
Date Published: 2023-09-01
Start Page: 6582
End Page: 6591
Language: English
DOI: 10.1007/s00330-023-09583-3
PUBMED: 37042979
PROVIDER: scopus
PMCID: PMC10523913
DOI/URL:
Notes: The MSK Cancer Center Support Grant (P30 CA008748) is acknowledged in the PDF -- Corresponding author is MSK author: Nathaniel C. Swinburne -- Source: Scopus
Altmetric
Citation Impact
BMJ Impact Analytics
MSK Authors
  1. Robert J Young
    228 Young
  2. Akash Deelip Shah
    20 Shah
  3. Andrei Holodny
    207 Holodny
  4. Krishna   Juluru
    35 Juluru
  5. Hao-Hsin Shih
    6 Shih
  6. David Gutman
    5 Gutman
  7. Vivek Yadav
    2 Yadav
  8. Prashanth Kumar Panyam
    3 Panyam