已发表论文

利用机器学习鉴定半胱氨酸蛋白酶抑制剂 3 为骨质疏松症的潜在诊断生物标志物

 

Authors Liu H , Feng Y, Lin B, Zhang L, Wu B , Wu J

Received 2 October 2025

Accepted for publication 23 November 2025

Published 1 December 2025 Volume 2025:18 Pages 16805—16823

DOI https://doi.org/10.2147/JIR.S569594

Checked for plagiarism Yes

Review by Single anonymous peer review

Peer reviewer comments 2

Editor who approved publication: Dr Ujjwol Risal

Hongyu Liu,1,2 Yiqi Feng,1,2 Binbin Lin,1,2 Lingling Zhang,3 Buling Wu,1,2 Jingyi Wu3 

1Shenzhen Clinical College of Stomatology, School of Stomatology, Southern Medical University, Shenzhen, Guangdong, People’s Republic of China; 2Shenzhen Stomatology Hospital (Pingshan) of Southern Medical University, Shenzhen, Guangdong, People’s Republic of China; 3Department of Oral Implantology, Stomatological Hospital, School of Stomatology, Southern Medical University, Guangzhou, Guangdong, People’s Republic of China

Correspondence: Buling Wu, Email bulingwu@smu.edu.cn Jingyi Wu, Email Jingyiwu1121@163.com

Background: Osteoporosis (OP) is one of the most common systemic bone metabolic diseases, but its specific pathogenesis remains unclear. Cystatin 3 (CST3) is a cysteine protease inhibitor involved in various physiological and pathological processes, yet its role in osteoporosis has not been clarified. This study aims to explore the diagnostic value and potential mechanism of CST3 in OP.
Methods: Transcriptome data of OP patients and healthy individuals were obtained from the Gene Expression Omnibus (GEO) database. After normalization and batch effect correction, differentially expressed genes (DEGs) were screened. Gene Ontology (GO) enrichment analysis, enrichment term interaction analysis, and protein-protein interaction (PPI) analysis were performed on these DEGs. Characteristic genes were screened using Least Absolute Shrinkage and Selection Operator (LASSO) regression and Random Forest (RF) algorithms, and their diagnostic efficacy was evaluated by combining with ROC curves. A rat model of OP was constructed, and the expression of characteristic genes in bone marrow mesenchymal stem cells (BMSCs) was verified by quantitative real-time polymerase chain reaction (qRT-PCR). The functions of genes related to the characteristic genes and their potential association with immune infiltration were analyzed through co-expression analysis, PPI network, and CIBERSORT algorithm.
Results: A total of 178 DEGs were screened, which were enriched in pathways such as extracellular matrix regulation and collagen metabolism. Machine learning algorithms identified CST3 and FLJ36848 as characteristic genes, with the area under the ROC curve (AUC) of both exceeding 0.9, showing excellent diagnostic efficacy. Moreover, the diagnostic efficacy of CST3 in the validation set was superior to that of FLJ36848. Animal experiments confirmed that the expression of CST3 was upregulated in BMSCs of OP rats, while the expressions of ALP and OCN were downregulated. The PPI network showed that CST3 interacted with 178 node genes. Immune infiltration analysis revealed that the infiltration proportions of M2-type macrophages, NK cells, etc. were significantly increased in the CST3 high-expression group, suggesting that CST3 may be involved in the progression of OP by regulating the immune microenvironment.
Conclusion: This study found that CST3 is related to the pathogenesis of osteoporosis and may represent a promising biomarker associated with osteoporosis progression, which could be explored as a potential therapeutic target in future studies. Its potential mechanisms involve the association of CST3 with the regulation of extracellular matrix decomposition, collagen metabolism, calcium ion transmembrane transport, as well as immune cell infiltration and its function in osteoporosis. It should be clearly stated that this study still lacks direct functional evidence and verification with large-scale clinical samples. Therefore, these findings still need to be verified by more animal experiments and clinical trials, and the specific molecular mechanisms require further in-depth research.

Keywords: osteoporosis, cystatin 3, CST3, machine learning algorithms, immune cells