Evolutionary strategies enable systematic and reliable uncertainty quantification: A proof-of-concept pilot study on resting-state functional MRI language lateralization Journal Article


Authors: Stember, J. N.; Dishner, K.; Jenabi, M.; Pasquini, L.; Peck, K. K.; Saha, A.; Shah, A.; O'Malley, B.; Ilica, A. T.; Kelly, L.; Arevalo-Perez, J.; Hatzoglou, V.; Holodny, A.; Shalu, H.
Article Title: Evolutionary strategies enable systematic and reliable uncertainty quantification: A proof-of-concept pilot study on resting-state functional MRI language lateralization
Abstract: Reliable and trustworthy artificial intelligence (AI), particularly in high-stake medical diagnoses, necessitates effective uncertainty quantification (UQ). Existing UQ methods using model ensembles often introduce invalid variability or computational complexity, rendering them impractical and ineffective in clinical workflow. We propose a UQ approach based on deep neuroevolution (DNE), a data-efficient optimization strategy. Our goal is to replicate trends observed in expert-based UQ. We focused on language lateralization maps from resting-state functional MRI (rs-fMRI). Fifty rs-fMRI maps were divided into training/testing (30:20) sets, representing two labels: "left-dominant" and "co-dominant." DNE facilitated acquiring an ensemble of 100 models with high training and testing set accuracy. Model uncertainty was derived from distribution entropies over the 100 model predictions. Expert reviewers provided user-based uncertainties for comparison. Model (epistemic) and user-based (aleatoric) uncertainties were consistent in the independently and identically distributed (IID) testing set, mainly indicating low uncertainty. In a mostly out-of-distribution (OOD) holdout set, both model and user-based entropies correlated but displayed a bimodal distribution, with one peak representing low and another high uncertainty. We also found a statistically significant positive correlation between epistemic and aleatoric uncertainties. DNE-based UQ effectively mirrored user-based uncertainties, particularly highlighting increased uncertainty in OOD images. We conclude that DNE-based UQ correlates with expert assessments, making it reliable for our use case and potentially for other radiology applications.
Keywords: artificial intelligence; fmri; children; deep; image classification; uncertainty quantification; neuroevolution
Journal Title: Journal of Imaging Informatics in Medicine
Volume: 38
Issue: 1
ISSN: 2948-2925
Publisher: Springer  
Date Published: 2025-02-01
Start Page: 576
End Page: 586
Language: English
ACCESSION: WOS:001268929600002
DOI: 10.1007/s10278-024-01188-6
PROVIDER: wos
PMCID: PMC11810852
PUBMED: 38980624
Notes: The MSK Cancer Center Support Grant (P30 CA008748) is acknowledge in the PDF -- Corresponding authors is MSK author: Joseph N. Stember -- Source: Wos
Altmetric
Citation Impact
BMJ Impact Analytics
MSK Authors
  1. Lori H Kelly
    2 Kelly
  2. Akash Deelip Shah
    20 Shah
  3. Kyung Peck
    117 Peck
  4. Andrei Holodny
    207 Holodny
  5. Mehrnaz Jenabi
    26 Jenabi
  6. Atin   Saha
    11 Saha
  7. Joseph Nathaniel Stember
    19 Stember
  8. Ahmet T Ilica
    9 Ilica