已发表论文

用于识别冠心病患者慢性肾病风险的机器学习模型:一项回顾性研究

 

Authors He T, Zhao J, Hou L, Su K, Li Y

Received 5 August 2025

Accepted for publication 18 November 2025

Published 5 December 2025 Volume 2025:18 Pages 7327—7340

DOI https://doi.org/10.2147/IJGM.S558568

Checked for plagiarism Yes

Review by Single anonymous peer review

Peer reviewer comments 3

Editor who approved publication: Dr Redoy Ranjan

Ting He,1,* Jinbo Zhao,2,* Ling Hou,2 Ke Su,2 Yuanhong Li2 

1Department of Central Hospital of Tujia and Miao Autonomous Prefecture, Hubei University of Medicine, Shiyan, Hubei, 442000, People’s Republic of China; 2Cardiovascular Disease Center, Central Hospital of Tujia and Miao Autonomous Prefecture, Hubei University of Medicine, Enshi, Hubei, 445000, People’s Republic of China

*These authors contributed equally to this work

Correspondence: Yuanhong Li, Cardiovascular Disease Center, Central Hospital of Tujia and Miao Autonomous Prefecture, Hubei University of Medicine, 158 Wuyang Avenue, Enshi, Hubei Province, 445000, People’s Republic of China, Email lyh0101@vip.163.com

Purpose: Coronary heart disease (CHD) has a significant co-morbid association with chronic kidney disease (CKD), but identification tools for the risk of concomitant CKD in patients with CHD are still lacking. The purpose of this research was to construct machine learning (ML) models for identifying undetected CKD in CHD patients.
Methods: 1786 CHD patients undergoing coronary intervention were retrospectively included. Lasso regression and multifactor logistic regression were used to screen feature variables. Five ML models, ie, logistic regression (LR), support vector machine (SVM), random forest (RF), gradient boosting machine (GBM), and extreme gradient boosting (XGBoost), were constructed. Participants were divided into the training set and validation set in a 7:3 ratio. The evaluation metrics included the area under the curve, calibration curve, and decision curve.
Results: Totally, 1786 CHD patients were enrolled and split into training (70%) and validation (30%) sets. The prevalence of CKD was 21.8% (390/1786). Multivariate logistic regression analysis showed that men, advanced age, hypertension, diabetes mellitus, history of atrial fibrillation (AF), high Gensini, low hemoglobin, low plateletcrit (PCT), high triglycerides (TG), high lipoprotein(a) (Lp(a)), hyperkalemia, high uric acid to albumin ratio (UAR), high systemic inflammation response index (SIRI), low lymphocyte to monocyte ratio (LMR), and high apolipoprotein B to apolipoprotein A1 (ApoB/ApoA1) ratio were the key clinical and laboratory test indicators of CKD. The XGBoost model performed optimally in the validation set (AUC=0.909, 95% CI 0.881 − 0.937). SHapley Additive explanation analysis identified UAR, hypertension, Gensini score, age, and SIRI as the top 5 key features.
Conclusion: The XGBoost model constructed on routine clinical data was effective in identifying CKD risk in CHD patients, with UAR as a novel strong predictor. Decision curve analysis confirmed the clinical utility of the model, indicating that it may be used to guide decisions for enhanced monitoring and early intervention over a wide range of risk thresholds.

Keywords: coronary heart disease, chronic kidney disease, machine learning, XGBoost