已发表论文

使用六种不同的机器学习算法构建心力衰竭检测的临床预测模型:识别关键的临床预后特征

 

Authors Qu FZ , Ding J, An XF, Peng R, He N, Liu S, Jiang X

Received 1 October 2024

Accepted for publication 23 December 2024

Published 28 December 2024 Volume 2024:17 Pages 6523—6534

DOI https://doi.org/10.2147/IJGM.S493789

Checked for plagiarism Yes

Review by Single anonymous peer review

Peer reviewer comments 4

Editor who approved publication: Dr Vinay Kumar

Fang Zhou Qu,1 Jiang Ding,2 Xi Feng An,3 Rui Peng,4 Ni He,5 Sheng Liu,1 Xin Jiang5 

1Medical School, Xizang Minzu University, Xianyang, People’s Republic of China; 2Institute of Electrical Power Systems, Graz University of Technology, Graz, Austria; 3The First Affiliated Hospital of Jinan University, Guangzhou, People’s Republic of China; 4Affiliated Nanhua Hospital, University of South China, Hengyang, People’s Republic of China; 5Department of Cardiology, Shaanxi Provincial People’s Hospital, Xi’an, People’s Republic of China

Correspondence: Xin Jiang, Department of Cardiology, Shaanxi Provincial People’s Hospital, Xi’an, Shanxi Province, People’s Republic of China, Email 18560453060@163.com

Purpose: Heart failure (HF) is a clinical syndrome in which structural or functional abnormalities of the heart result in impaired ventricular filling or ejection capacity. In order to improve the adaptability of models to different patient populations and data situations. This study aims to develop predictive models for HF risk using six machine learning algorithms, providing valuable insights into the early assessment and recognition of HF by clinical features.
Patients and Methods: The present study focused on clinical characteristics that significantly differed between groups with left ventricular ejection fractions (LVEF) [≤ 40% and > 40%]. Following the elimination of features with significant missing values, the remaining features were utilized to construct predictive models employing six machine learning algorithms. The optimal model was selected based on various performance metrics, including the area under the curve (AUC), accuracy, precision, recall, and F1 score. Utilizing the optimal model, the significance of clinical features was assessed, and those with importance values exceeding 0.8 were identified as crucial to the study. Finally, a correlation analysis was conducted to examine the relationships between these features and other significant clinical features.
Results: The logistic regression (LR) model was determined to be the optimal machine learning algorithm in this study, achieving an accuracy of 0.64, a precision of 0.45, a recall of 0.72, an F1 score of 0.51, and an AUC of 0.81 in the training set and 0.91 in the testing set. In addition, the analysis of feature importance indicated that blood calcium, angiotensin-converting enzyme inhibitors (ACEI) dosage, mean hemoglobin concentration, and survival duration were critical to the study, each possessing importance values exceeding 0.8. Furthermore, correlation analysis revealed a strong relationship between blood calcium and ionized calcium (|cor|=0.99), as well as a significant association between ACEI dosage (|cor|=0.68) and left ventricular metrics (|cor|=0.58); on the other hand, no correlations were observed between mean hemoglobin levels and other clinical characteristics.
Conclusion: The present study identified LR as the most effective risk prediction model for patients with HF, highlighting blood calcium, ACEI dosage, and mean hemoglobin level as significant predictors. These findings provide significant insights for the clinical prevention and early intervention of HF.

Keywords: left ventricular ejection fractions, logistic regression, blood calcium, area under the curve, correlation analysis