RSNA 2023 abdominal trauma AI challenge: Review and outcomes Review


Authors: Hermans, S.; Hu, Z.; Ball, R. L.; Lin, H. M.; Prevedello, L. M.; Berger, F. H.; Yusuf, I.; Rudie, J. D.; Vazirabad, M.; Flanders, A. E.; Shih, G.; Mongan, J.; Nicolaou, S.; Marinelli, B. S.; Davis, M. A.; Magudia, K.; Sejdić, E.; Colak, E.
Review Title: RSNA 2023 abdominal trauma AI challenge: Review and outcomes
Abstract: Purpose: To evaluate the performance of the winning machine learning models from the 2023 RSNA Abdominal Trauma Detection AI Challenge. Materials and Methods: The competition was hosted on Kaggle and took place between July 26 and October 15, 2023. The multicenter competition dataset consisted of 4274 abdominal trauma CT scans, in which solid organs (liver, spleen, and kidneys) were annotated as healthy, low-grade, or high-grade inju-ry. Studies were labeled as positive or negative for the presence of bowel and mesenteric injury and active extravasation. In this study, performances of the eight award-winning models were retrospectively assessed and compared using various metrics, including the area under the receiver operating characteristic curve (AUC), for each injury category. The reported mean values of these metrics were calculated by averaging the performance across all models for each specified injury type. Results: The models exhibited strong performance in detecting solid organ injuries, particularly high-grade injuries. For binary detection of injuries, the models demonstrated mean AUC values of 0.92 (range, 0.90–0.94) for liver, 0.91 (range, 0.87–0.93) for splenic, and 0.94 (range, 0.93–0.95) for kidney injuries. The models achieved mean AUC values of 0.98 (range, 0.96–0.98) for high-grade liver, 0.98 (range, 0.97–0.99) for high-grade splenic, and 0.98 (range, 0.97–0.98) for high-grade kidney injuries. For the detection of bowel and mesenteric injuries and active extravasation, the models demonstrated mean AUC values of 0.85 (range, 0.74–0.93) and 0.85 (range, 0.79–0.89), respectively. Conclusion: The award-winning models from the artificial intelligence challenge demonstrated strong performance in the detection of traumatic abdominal injuries on CT scans, particularly high-grade injuries. These models may serve as a performance baseline for future investigations and algorithms. © RSNA, 2024.
Keywords: adult; area under the curve; diagnostic accuracy; sensitivity and specificity; computer assisted tomography; retrospective study; disease severity; algorithm; artificial intelligence; liver injury; receiver operating characteristic; spleen injury; machine learning; human; male; female; article; abdominal injury; injury scale
Journal Title: Radiology: Artificial Intelligence
Volume: 7
Issue: 1
ISSN: 2638-6100
Publisher: Radiological Society of North America, Inc.  
Date Published: 2025-01-01
Start Page: e240334
Language: Croatian
DOI: 10.1148/ryai.240334
PROVIDER: scopus
PUBMED: 39503604
DOI/URL:
Notes: Source: Scopus
Altmetric
Citation Impact
BMJ Impact Analytics
MSK Authors