Synapse - Active learning in computational pathology with noise detection empowered by loss-based prior and feature analysis

Active learning in computational pathology with noise detection empowered by loss-based prior and feature analysis Journal Article

Authors:	Huang, Y.; Li, J.; An, H.; Luo, T.; Ji, Z.; Song, Y.; Liu, H.; U, K.; Zhang, S. M.; Zhao, J.; Ge, Y.
Article Title:	Active learning in computational pathology with noise detection empowered by loss-based prior and feature analysis
Abstract:	AI-based histopathological image analysis has significantly advanced the field of computer-aided diagnosis. While labeled data can enhance model performance, manual annotation by pathologists is labor-intensive and time-consuming, with variability and reliance on coarse slide-level annotations often introducing noise. To address these challenges, we propose introduces BPAL (Beta Mixture Model and Penalized Regression for Active Learning), a novel active learning framework for histopathological whole-slide image analysis. BPAL aims to reduce expert annotation costs and mitigate the impact of noisy samples during training by autonomously managing highly informative samples in each active learning iteration. Our approach integrates two noise detection modules into active learning frameworks. By incorporating Penalized Regression (PR) with parallel computation capabilities into our framework, we enhance the efficiency of noisy sample detection. Leveraging a Beta Mixture Model (BMM) with prior loss knowledge further augments this process by enabling a comprehensive analysis from various angles within the merged feature and label spaces. This approach maximizes the utilization of information extracted from pathological image samples, ensuring a robust and thorough assessment of data quality. We propose a heuristic sampling strategy based on these enhancements. High-information samples identified by the module are categorized into three types: typical samples with high confidence levels that can receive pseudo labels for training, difficult samples requiring expert re-annotation due to complex features, and mislabeled noisy samples. The iterative addition of training sets retains high-information samples while mitigating the impact of noisy samples. Comparative evaluations demonstrate the superior performance of our approach on breast cancer and prostate cancer classification tasks. © 2025 Elsevier Ltd
Keywords:	controlled study; human tissue; histopathology; comparative study; nuclear magnetic resonance imaging; cancer diagnosis; diagnostic accuracy; breast cancer; image analysis; prostate cancer; artificial intelligence; cancer classification; digital pathology; computer aided design; feature extraction; learning algorithm; learning frameworks; image retrieval; penalized regression; human; article; active learning; image-analysis; expert systems; deep learning; taxonomies; residual neural network; labeled data; beta mixture model; histopathology image analysis; noisy label detection; beta mixture models; histopathology image analyze; image analyze; noise detection; noisy labels; image noise
Journal Title:	Biomedical Signal Processing and Control
Volume:	108
ISSN:	17468094
Publisher:	Elsevier Inc.
Date Published:	2025-10-01
Start Page:	107953
Language:	English
DOI:	10.1016/j.bspc.2025.107953
PROVIDER:	scopus
DOI/URL:	https://www.scopus.com/inward/record.uri?eid=2-s2.0-105003976609&doi=10.1016%2fj.bspc.2025.107953&partnerID=40&md5=b3bfea1608f109257866dfd9e001eb31
Notes:	Article -- Source: Scopus

Altmetric

What is Altmetric?

Citation Impact

What is Dimensions Citation Badge?

BMJ Impact Analytics

MSK Authors

4 U

Related MSK Work

Image Domain Material Decomposition For Dual Energy Ct Using Unsupervised Learning With Data Fidelity Loss

Medical Physics 2024
Deep Learning Inferred Multiplex Immunofluorescence For Immunohistochemical Image Quantification

Nature Machine Intelligence 2022
Deep Interactive Learning Based Ovarian Cancer Segmentation Of H&E Stained Whole Slide Images To Study Morphological Patterns Of Brca Mutation

Journal of Pathology Informatics 2023
Joint Breast Neoplasm Detection And Subtyping Using Multi Resolution Network Trained On Large Scale H&E Whole Slide Images With Weak Labels

Proceedings of Machine Learning Research 2023
Cellular Structure Image Classification With Small Targeted Training Samples

IEEE Access 2019