Multi-organ CT Segmentation Using Shifted-window Multilayer Perceptron Mixer Conference Paper


Authors: Pan, S.; Wynne, J.; Hu, M.; Wang, T.; Roper, J.; Patel, P.; Liu, T.; Yang, X.
Title: Multi-organ CT Segmentation Using Shifted-window Multilayer Perceptron Mixer
Conference Title: Medical Imaging 2023: Image-Guided Procedures, Robotic Interventions, and Modeling
Abstract: This work proposes a novel U-shaped neural network, Shifted-window MLP (Swin-MLP), that incorporates a convolutional neural network (CNN) and Multilayer Linear Perceptron-Mixer (MLP-Mixer) for automatic CT multi-organ segmentation. The network has a structure like V-net: 1) a Shifted-window MLP-Mixer encoder learns semantic features from the input CT scans, and 2) a decoder, which mirrors the architecture of the encoder, then reconstructs segmentation maps from the encoder's features. Novel to the proposed network, we apply a Shifted-window MLP-Mixer rather than convolutional layers to better model both global and local representations of the input scans. We evaluate the proposed network using an institutional pelvic dataset comprising 120 CT scans, and a public abdomen dataset containing 30 scans. The network's segmentation accuracy is evaluated in two domains: 1) volume-based accuracy is measured by Dice similarity coefficient (DSC), segmentation sensitivity, and precision; 2) surface-based accuracy is measured by Hausdorff distance (HD), mean surface distance (MSD), and residual mean square distance (RMS). The average DSC achieved by MLP-Vnet on the pelvic dataset is 0.866; sensitivity is 0.883, precision is 0.856, HD is 11.523 millimeter (mm), MSD is 3.926 mm, and RMS is 6.262 mm. The average DSC on the public abdomen dataset is 0.903, and HD is 5.275 mm. The proposed MLP-Mixer-Vnet demonstrates significant improvement over CNN-based networks. The automatic multi-organ segmentation tool may potentially facilitate the current radiotherapy treatment planning workflow. © 2023 SPIE.
Keywords: computerized tomography; medical imaging; semantics; ct-scan; convolution; convolutional neural network; hausdorff distance; signal encoding; convolutional neural networks; ct segmentation; similarity coefficients; multi-organ segmentations; multilayer neural networks; mixers (machinery); linear perceptrons; mean-square; multilayers perceptrons; u-shaped; multilayers
Journal Title Progress in Biomedical Optics and Imaging - Proceedings of SPIE
Volume: 12466
Conference Dates: 2023 Feb 19-22
Conference Location: San Diego, CA
ISBN: 1605-7422
Publisher: SPIE  
Date Published: 2023-01-01
Start Page: 12466-62
Language: English
DOI: 10.1117/12.2653956
PROVIDER: scopus
DOI/URL:
Notes: Conference paper: 12466 1U -- Source: Scopus
Altmetric
Citation Impact
BMJ Impact Analytics
MSK Authors
  1. Tonghe Wang
    55 Wang