Active learning in computational pathology with noise detection empowered by loss-based prior and feature analysis Journal Article


Authors: Huang, Y.; Li, J.; An, H.; Luo, T.; Ji, Z.; Song, Y.; Liu, H.; U, K.; Zhang, S. M.; Zhao, J.; Ge, Y.
Article Title: Active learning in computational pathology with noise detection empowered by loss-based prior and feature analysis
Abstract: AI-based histopathological image analysis has significantly advanced the field of computer-aided diagnosis. While labeled data can enhance model performance, manual annotation by pathologists is labor-intensive and time-consuming, with variability and reliance on coarse slide-level annotations often introducing noise. To address these challenges, we propose introduces BPAL (Beta Mixture Model and Penalized Regression for Active Learning), a novel active learning framework for histopathological whole-slide image analysis. BPAL aims to reduce expert annotation costs and mitigate the impact of noisy samples during training by autonomously managing highly informative samples in each active learning iteration. Our approach integrates two noise detection modules into active learning frameworks. By incorporating Penalized Regression (PR) with parallel computation capabilities into our framework, we enhance the efficiency of noisy sample detection. Leveraging a Beta Mixture Model (BMM) with prior loss knowledge further augments this process by enabling a comprehensive analysis from various angles within the merged feature and label spaces. This approach maximizes the utilization of information extracted from pathological image samples, ensuring a robust and thorough assessment of data quality. We propose a heuristic sampling strategy based on these enhancements. High-information samples identified by the module are categorized into three types: typical samples with high confidence levels that can receive pseudo labels for training, difficult samples requiring expert re-annotation due to complex features, and mislabeled noisy samples. The iterative addition of training sets retains high-information samples while mitigating the impact of noisy samples. Comparative evaluations demonstrate the superior performance of our approach on breast cancer and prostate cancer classification tasks. © 2025 Elsevier Ltd
Keywords: controlled study; human tissue; histopathology; comparative study; nuclear magnetic resonance imaging; cancer diagnosis; diagnostic accuracy; breast cancer; image analysis; prostate cancer; artificial intelligence; cancer classification; digital pathology; computer aided design; feature extraction; learning algorithm; learning frameworks; image retrieval; penalized regression; human; article; active learning; image-analysis; expert systems; deep learning; taxonomies; residual neural network; labeled data; beta mixture model; histopathology image analysis; noisy label detection; beta mixture models; histopathology image analyze; image analyze; noise detection; noisy labels; image noise
Journal Title: Biomedical Signal Processing and Control
Volume: 108
ISSN: 17468094
Publisher: Elsevier Inc.  
Date Published: 2025-10-01
Start Page: 107953
Language: English
DOI: 10.1016/j.bspc.2025.107953
PROVIDER: scopus
DOI/URL:
Notes: Article -- Source: Scopus
Altmetric
Citation Impact
BMJ Impact Analytics
MSK Authors
  1. Kai Cheng U
    2 U