论文已发表
注册即可获取德孚的最新动态
IF 收录期刊
使用机器学习预测 6-12 岁儿童胰岛素抵抗的简单方法
Received 8 July 2022
Accepted for publication 13 September 2022
Published 27 September 2022 Volume 2022:15 Pages 2963—2975
DOI https://doi.org/10.2147/DMSO.S380772
Checked for plagiarism Yes
Review by Single anonymous peer review
Peer reviewer comments 2
Editor who approved publication: Professor Gian Paolo Fadini
Background: Due to the increasing insulin resistance (IR) in childhood, rates of diabetes and cardiovascular disease may rise in the future and seriously threaten the healthy development of children. Finding an easy way to predict IR in children can help pediatricians to identify these children in time and intervene appropriately, which is particularly important for practitioners in primary health care.
Patients and Methods: Seventeen features from 503 children 6– 12 years old were collected. We defined IR by HOMA-IR greater than 3.0, thus classifying children with IR and those without IR. Data were preprocessed by multivariate imputation and oversampling to resolve missing values and data imbalances; then, recursive feature elimination was applied to further select features of interest, and 5 machine learning methods—namely, logistic regression (LR), support vector machine (SVM), random forest (RF), extreme gradient boosting (XGBoost), and gradient boosting with categorical features support (CatBoost)—were used for model training. We tested the trained models on an external test set containing information from 133 children, from which performance metrics were extracted and the optimal model was selected.
Results: After feature selection, the numbers of chosen features for the LR, SVM, RF, XGBoost, and CatBoost models were 6, 9, 10, 14, and 6, respectively. Among them, glucose, waist circumference, and age were chosen as predictors by most of the models. Finally, all 5 models achieved good performance on the external test set. Both XGBoost and CatBoost had the same AUC (0.85), which was highest among those of all models. Their accuracy, sensitivity, precision, and F1 scores were also close, but the specificity of XGBoost reached 0.79, which was significantly higher than that of CatBoost, so XGBoost was chosen as the optimal model.
Conclusion: The model developed herein has a good predictive ability for IR in children 6– 12 years old and can be clinically applied to help pediatricians identify children with IR in a simple and inexpensive way.
Keywords: children, insulin resistance, machine learning, artificial intelligence