已发表论文

基于机器学习的 COVID-19 疾病进展预测

 

Authors Xu F, Chen X, Yin X, Qiu Q, Xiao J, Qiao L, He M, Tang L, Li X, Zhang Q, Lv Y, Xiao S, Zhao R, Guo Y, Chen M, Chen D, Wen L, Wang B, Nian Y, Liu K

Received 30 November 2020

Accepted for publication 26 March 2021

Published 29 April 2021 Volume 2021:14 Pages 1589—1598

DOI https://doi.org/10.2147/IJGM.S294872

Checked for plagiarism Yes

Review by Single anonymous peer review

Peer reviewer comments 2

Editor who approved publication: Dr Scott Fraser

Background: Since December 2019, COVID-19 has spread throughout the world. Clinical outcomes of COVID-19 patients vary among infected individuals. Therefore, it is vital to identify patients at high risk of disease progression.
Methods: In this retrospective, multicenter cohort study, COVID-19 patients from Huoshenshan Hospital and Taikang Tongji Hospital (Wuhan, China) were included. Clinical features showing significant differences between the severe and nonsevere groups were screened out by univariate analysis. Then, these features were used to generate classifier models to predict whether a COVID-19 case would be severe or nonsevere based on machine learning. Two test sets of data from the two hospitals were gathered to evaluate the predictive performance of the models.
Results: A total of 455 patients were included, and 21 features showing significant differences between the severe and nonsevere groups were selected for the training and validation set. The optimal subset, with eleven features in the –nearest neighbor model, obtained the highest area under the curve (AUC) value among the four models in the validation set. D-dimer, CRP, and age were the three most important features in the optimal-feature subsets. The highest AUC value was obtained using a support vector–machine model for a test set from Huoshenshan Hospital. Software for predicting disease progression based on machine learning was developed.
Conclusion: The predictive models were successfully established based on machine learning, and achieved satisfactory predictive performance of disease progression with optimal-feature subsets.
Keywords: COVID-19, disease progression, machine-learning models