已发表论文

利用超声和机器学习结合 SHAP 预测乳腺癌腋窝淋巴结转移

 

Authors Bai G, Zhong X, Wu Y, Lin W, Zhou S, Zhou P

Received 25 May 2025

Accepted for publication 13 September 2025

Published 26 September 2025 Volume 2025:17 Pages 2183—2197

DOI https://doi.org/10.2147/CMAR.S542680

Checked for plagiarism Yes

Review by Single anonymous peer review

Peer reviewer comments 2

Editor who approved publication: Dr Sanjeev K. Srivastava

Gengyan Bai, Xiaohong Zhong, Youping Wu, Weijie Lin, Shoulan Zhou, Ping Zhou

Department of Ultrasound, Women and Children’s Hospital, School of Medicine, Xiamen University, Xiamen, Fujian, People’s Republic of China

Correspondence: Xiaohong Zhong, Email feitianlu.fpm@163.com

Background: Accurate preoperative prediction of axillary lymph node (ALN) metastasis in breast cancer is crucial for surgical planning and reducing morbidity. Conventional ultrasound and Doppler methods are limited by subjectivity, while existing machine learning (ML) models often lack interpretability and multi-center validation.
Aim: To evaluate 11 ML algorithms and develop a validated model integrating ultrasound and Doppler features for ALN metastasis prediction, using SHapley Additive exPlanations (SHAP) for interpretability.
Methods: This retrospective dual-center study included 303 patients from Xiamen (internal cohorts: 212 training, 91 validation) and 102 from Longyan (external validation). Features were extracted from preoperative ultrasound and Doppler images. Recursive feature elimination (RFE) and SHAP selected key predictors. Gradient Boosting was identified as optimal and compared to B-mode/Doppler submodels and clinicopathological scores (Logical, Tumor, Tenon). Performance was assessed via AUC, calibration, decision curve analysis (DCA), and a web calculator was developed.
Results: Five features—tumor diameter, cortex-to-hilum ratio, lymph node systolic/diastolic ratio, peak systolic velocity, and end-diastolic velocity—were selected. The combined model achieved AUCs of 0.981 (training), 0.975 (internal validation), and 0.987 (external validation), outperforming scores (AUCs 0.517– 0.700). It showed superior calibration (Brier scores 0.045– 0.061) and net benefit in DCA.
Conclusion: The Gradient Boosting model with SHAP provides accurate, interpretable ALN metastasis prediction, supporting noninvasive risk stratification and personalized breast cancer management.

Keywords: breast cancer, axillary lymph node metastasis, ultrasound, doppler ultrasound, machine learning, SHapley Additive exPlanations