Associations between radiation oncologist demographic factors and segmentation similarity benchmarks: Insights from a crowd-sourced challenge using Bayesian estimation Journal Article


Authors: Wahid, K. A.; Sahin, O.; Kundu, S.; Lin, D.; Alanis, A.; Tehami, S.; Kamel, S.; Duke, S.; Sherer, M. V.; Rasmussen, M.; Korreman, S.; Fuentes, D.; Cislo, M.; Nelms, B. E.; Christodouleas, J. P.; Murphy, J. D.; Mohamed, A. S. R.; He, R.; Naser, M. A.; Gillespie, E. F.; Fuller, C. D.
Article Title: Associations between radiation oncologist demographic factors and segmentation similarity benchmarks: Insights from a crowd-sourced challenge using Bayesian estimation
Abstract: PURPOSE: The quality of radiotherapy auto-segmentation training data, primarily derived from clinician observers, is of utmost importance. However, the factors influencing the quality of clinician-derived segmentations are poorly understood; our study aims to quantify these factors. METHODS: Organ at risk (OAR) and tumor-related segmentations provided by radiation oncologists from the Contouring Collaborative for Consensus in Radiation Oncology data set were used. Segmentations were derived from five disease sites: breast, sarcoma, head and neck (H&N), gynecologic (GYN), and GI. Segmentation quality was determined on a structure-by-structure basis by comparing the observer segmentations with an expert-derived consensus, which served as a reference standard benchmark. The Dice similarity coefficient (DSC) was primarily used as a metric for the comparisons. DSC was stratified into binary groups on the basis of structure-specific expert-derived interobserver variability (IOV) cutoffs. Generalized linear mixed-effects models using Bayesian estimation were used to investigate the association between demographic variables and the binarized DSC for each disease site. Variables with a highest density interval excluding zero were considered to substantially affect the outcome measure. RESULTS: Five hundred seventy-four, 110, 452, 112, and 48 segmentations were used for the breast, sarcoma, H&N, GYN, and GI cases, respectively. The median percentage of segmentations that crossed the expert DSC IOV cutoff when stratified by structure type was 55% and 31% for OARs and tumors, respectively. Regression analysis revealed that the structure being tumor-related had a substantial negative impact on binarized DSC for the breast, sarcoma, H&N, and GI cases. There were no recurring relationships between segmentation quality and demographic variables across the cases, with most variables demonstrating large standard deviations. CONCLUSION: Our study highlights substantial uncertainty surrounding conventionally presumed factors influencing segmentation quality relative to benchmarks.
Keywords: neoplasm; neoplasms; demography; bayes theorem; observer variation; radiotherapy; radiation oncology; benchmarking; radiotherapy planning, computer-assisted; epidemiology; radiation oncologists; procedures; organs at risk; humans; human; male; female; radiotherapy planning system; radiation oncologist
Journal Title: JCO Clinical Cancer Informatics
Volume: 8
ISSN: 2473-4276
Publisher: American Society of Clinical Oncology  
Date Published: 2024-06-01
Start Page: e2300174
Language: English
DOI: 10.1200/cci.23.00174
PUBMED: 38870441
PROVIDER: scopus
PMCID: PMC11214868
DOI/URL:
Notes: Article -- Source: Scopus
Altmetric
Citation Impact
BMJ Impact Analytics
MSK Authors
  1. Diana Lin
    16 Lin