Prediction of subsequent fragility fractures: application of machine learning

Zabihiyeganeh, Mozhdeh; Mirzaei, Alireza; Tabrizian, Pouria; Rezaee, Aryan; Sheikhtaheri, Abbas; Kadijani, Azade Amini; Kadijani, Bahare Amini; Sharifi Kia, Ali

doi:10.1186/s12891-024-07559-y

Research
Open access
Published: 04 June 2024

Prediction of subsequent fragility fractures: application of machine learning

Mozhdeh Zabihiyeganeh¹,
Alireza Mirzaei^1,2,
Pouria Tabrizian¹,
Aryan Rezaee^1,3,
Abbas Sheikhtaheri⁴,
Azade Amini Kadijani¹,
Bahare Amini Kadijani⁵ &
…
Ali Sharifi Kia^1,6

BMC Musculoskeletal Disorders volume 25, Article number: 438 (2024) Cite this article

371 Accesses
Metrics details

Abstract

Background

Machine learning (ML) has shown exceptional promise in various domains of medical research. However, its application in predicting subsequent fragility fractures is still largely unknown. In this study, we aim to evaluate the predictive power of different ML algorithms in this area and identify key features associated with the risk of subsequent fragility fractures in osteoporotic patients.

Methods

We retrospectively analyzed data from patients presented with fragility fractures at our Fracture Liaison Service, categorizing them into index fragility fracture (n = 905) and subsequent fragility fracture groups (n = 195). We independently trained ML models using 27 features for both male and female cohorts. The algorithms tested include Random Forest, XGBoost, CatBoost, Logistic Regression, LightGBM, AdaBoost, Multi-Layer Perceptron, and Support Vector Machine. Model performance was evaluated through 10-fold cross-validation.

Results

The CatBoost model outperformed other models, achieving 87% accuracy and an AUC of 0.951 for females, and 93.4% accuracy with an AUC of 0.990 for males. The most significant predictors for females included age, serum C-reactive protein (CRP), 25(OH)D, creatinine, blood urea nitrogen (BUN), parathyroid hormone (PTH), femoral neck Z-score, menopause age, number of pregnancies, phosphorus, calcium, and body mass index (BMI); for males, the predictors were serum CRP, femoral neck T-score, PTH, hip T-score, BMI, BUN, creatinine, alkaline phosphatase, and spinal Z-score.

Conclusion

ML models, especially CatBoost, offer a valuable approach for predicting subsequent fragility fractures in osteoporotic patients. These models hold the potential to enhance clinical decision-making by supporting the development of personalized preventative strategies.

Peer Review reports

Background

Osteoporosis represents a significant public health concern within the aging population [1, 2]. Epidemiological data suggest that approximately one-third of women and one-fifth of men over the age of 50 will experience at least one osteoporotic fracture in their lifetime [3]. The incidence of such fractures is estimated to increase almost two folds by 2045 [4]. Patients with a history of fragility fracture face an elevated risk of subsequent fractures, linked to increased morbidity, mortality, and diminished quality of life [5, 6], thereby necessitating prevention of a subsequent fracture.

The identification of risk factors for subsequent fragility fractures is a crucial element in preventing re-fracture [7]. Prior research has identified numerous predictors, including age, gender, the site of the initial fracture, and comorbid conditions like hypertension and diabetes [6, 8,9,10,11]. Despite the recognized importance of these factors in preventing further fractures, they are often overlooked in clinical decision-making due to a lack of personalized risk assessment tools [12].

The World Health Organization developed the Fracture Risk Assessment Tool (FRAX) to evaluate the 10-year probability of bone fractures due to osteoporosis using clinical risk factors [13]. Despite being a significant advancement in fracture risk assessment, FRAX has several limitations, including but not limited to not taking into account changes in risk factors over time and providing a static risk assessment [14].

In response to these limitations, there have been significant strides in applying machine learning (ML) in personalized medicine [15, 16], including the prediction of cancer recurrence [17, 18], to enhance osteoporosis management. Numerous studies have employed a variety of ML techniques such as logistic regression, XGBoost, random forest, K-nearest neighbor, support vector machine, decision trees, and neural networks. These methods address various facets of osteoporosis from risk prediction and early detection to diagnosis, treatment, and management [19,20,21,22,23].

The potential of ML to predict re-fracture risk in osteoporotic patients remains largely untapped. A predictive ML model could facilitate personalized preventative strategies encompassing structured exercise, fall prevention, nutritional supplementation, custom orthoses, and prophylactic pharmacotherapy [24]. This study aims to develop an ML-based model to predict the risk of subsequent fragility fractures in patients with a history of such fractures, incorporating clinically relevant features.

Methods

Data sources and study population

This retrospective analysis received approval from the institutional review board of our institute, designated by the code IR.IUMS.REC.1401.106, which granted a waiver for informed consent. This study involved patients presenting with fragility fractures at the FLS of Shafa Orthopedic Hospital, affiliated with the Iran University of Medical Sciences in Tehran, from 2020 to 2023. The cohort was categorized into two groups: those with an initial fragility fracture (n = 905) and those with a subsequent fragility fracture (n = 195). The index fragility fractures were located in the distal radius (38%), lumbar spine (18%), femoral neck (15%), proximal humerus (5%), and other locations (24%). The re-fractures were mainly located in the distal radius (47%), femoral neck (32%), proximal humerus (14%), and other locations (7%). The mean time interval between the primary and secondary fragility fracture was 41.2 ± 31.7 months (range 1-120).

Re-fractures were mainly self-reported. However, the clinical history of patients was checked by the involved rheumatologist to make sure it was a subsequent osteoporotic fracture and not a traumatic fracture.

Inclusion criteria were those that were regarded for FLS (age ≥ 50 years and osteoporosis-related fractures). Any fracture caused by low-trauma fracture, often following a fall from standing height or less, was considered an osteoporotic fracture, excluding fractures at the toes, metatarsal bones, fingers, metacarpal bones, skull, facial bones, and mandible [25].

In total, 1100 patients who were registered during the study period were included in the analysis. Input features were extracted as an Excel file from the data captured by the FLS system. We excluded features considered irrelevant to the osteoporotic fracture based on the earlier evidence [26,27,28,29,30] and physician opinion. Features with more than 30% missing values or more than 95% of the data distributed in one class were excluded. In total, 118 features were identified at initial inspection, of which 27 features met the study criteria and were used for training the models. Since the FLS database in our center is grounded upon the workup of the causes of secondary osteoporosis, factors such as ESR, CRP, PTH, 25(OH)D, ALP, etc. which could indicate a secondary root of osteoporosis, were included in the feature sets.

Model training was done for males and females separately, considering the exclusion of pregnancy frequency and menopause age in the male group. As a result, model training in the male group was performed with 25 features. Characteristics of these features are demonstrated in detail in Table 1.

Table 1 Patients’ characteristics

Full size table

Quantitative variables are demonstrated with mean ± standard deviation for normally distributed quantitative parameters, with median (range) for non-normally distributed quantitative parameters, and with numbers (%) for qualitative parameters.

Data preprocessing

Outliers in the dataset were identified as data points lying beyond ± 3 standard deviations from the mean of a given feature. These outliers were subsequently replaced with the nearest values within the interquartile range boundaries. Numerical data underwent normalization to scale the values, while categorical variables were transformed via one-hot encoding, assigning 1 for “Yes” and 0 for “No.”

The rate of missing data for the male dataset varied from 1.03 to 17.01%, and for the female dataset, it ranged from 1.54 to 21.77%. For normally distributed numerical variables, the mean of the feature was used to impute missing values. In contrast, the median was employed for skewed numerical data. The mode was used for imputing missing categorical data, chosen based on the most frequent value within each class (re-fracture or no re-fracture). Detailed missing data rates for each feature are tabulated in Table 1.

Features and feature selection

The primary outcome, subsequent fragility fracture, was recorded as a binary variable (yes/no). The dataset comprised 26 features, excluding the target variable. These features encompassed demographics (age, sex, menopause age, BMI), laboratory results (CRP, ALP, serum Vitamin D, PTH), medical history (comorbidities, medication use), and densitometry measurements (BMD, T-score, Z-score).

Seven distinct feature sets were engineered to predict fragility in both genders. Six of these were derived using recursive feature elimination with cross-validation (RFECV) applied to random forest, XGBoost, CatBoost, logistic regression, LightGBM, and AdaBoost algorithms. The seventh set was manually selected based on prior evidence and clinician expertise, deemed relevant for predicting future fragility risk.

Data balancing

Initial models, based on features selected by physician opinion and trained using the XGBoost algorithm, demonstrated suboptimal performance (AUC = 0.502 for females and AUC = 0.498 for males), likely due to an imbalance in re-fracture instances. To address this, the synthetic minority oversampling technique (SMOTE) was implemented to augment the underrepresented class (re-fracture) in the datasets [31].

Model Development, evaluation, and explainability

We employed an array of models for development, including random forest, XGBoost, CatBoost, logistic regression, LightGBM, AdaBoost, MLP, and SVM, utilizing 10-fold cross-validation as illustrated in Fig. 1. Hyperparameter optimization for these models was conducted using a variable grid for each algorithm in combination with GridSearchCV from the scikit-learn library.

Model performance was assessed using accuracy, the area under the receiver operating characteristic curve (AUC ROC), precision, recall, F1 score, logistic loss, and Brier score. Model comparison hinged on the F1 score and accuracy, leading to the selection of the optimal models for both male and female patient groups. The contribution of individual features to the model performance was determined using Shapley Additive Explanations (SHAP) [32].

Results

Feature selection

Tables S1 and S2 present the details of the feature sets created using the male and female patients’ dataset.

Model performance and evaluation

A summarized evaluation of the performance of various predictive models for female patients, using feature sets one through seven, is provided in Tables S3-S9. Generally, the CatBoost algorithm demonstrated superior performance across the majority of feature sets, with the exception of feature set 5, where the LightGBM algorithm was more effective. Logistic regression exhibited the least robust performance across all feature sets, with the exception of feature set 7, where the SVM model was the least effective.

The performance details of the predictive models for male patients across different feature sets are documented in Tables S10-S16. The CatBoost algorithm consistently outperformed the other models across all feature sets. Logistic regression generally displayed the least favorable performance, except in feature sets 4, 5, and 7, where the SVM model showed the weakest results.

The optimal model for predicting subsequent fragility fractures in female patients was the CatBoost model trained on feature set 2, achieving an accuracy of 0.870 and an F1 score of 0.882. For male patients, the most effective model was the CatBoost trained on feature set 6, with an accuracy of 0.934 and an F1 score of 0.938. The performance metrics for the top five predictive models for female and male patients are presented in Tables 2 and 3, respectively.

Table 2 Top 5 female patients’ prediction models

Full size table

Table 3 Top 5 male patients’ prediction models

Full size table

Feature importance

Female patient’s prediction model

As depicted in Fig. 2, age, serum CRP, serum level of 25(OH)D (vitamin D3), serum creatinine, serum BUN, serum PTH, femoral neck Z-score, menopause age, number of pregnancies, serum phosphorus, serum calcium, and BMI had the highest contribution to the model’s prediction.

Male patients’ prediction model

As presented in Fig. 3, serum CRP, femoral neck T-score, serum PTH, hip T-score, BMI, serum BUN, serum creatinine, serum ALP, and spinal Z-score had the highest amount of contribution to the model’s performance in order.

Error analysis

Female patient’s prediction model

In total, there were 155 errors, of which 9 were false positives and 146 were false negatives. According to Figure S1, which presents the confusion matrix and heatmap of the error cases, ALP, PTH, 25(OH)D, age, menopause age, CRP, and BMI were more related to the error cases. As the color in the grid gets darker, it resembles a higher relation with errors.

Male patient’s prediction model

Overall, there were 9 errors, which 6 were false negatives and 3 were false positives. As depicted in Figure S2, ALP, PTH, 25(OH)D, CRP, BUN, and BMI were most related to the error cases.

Discussion

In this research, we assessed the predictive capabilities of various machine learning (ML) models in predicting subsequent fragility fractures within distinct male and female cohorts. Additionally, we identified the most contributing features in these predication models. For both genders, the CatBoost model emerged as the most accurate, yielding the highest predictive accuracy at 93.4% for males and 87% for females. The SHAP analysis revealed that in the female-specific models, the features that contributed most significantly included age, CRP, 25(OH)D, creatinine, BUN, PTH, femoral neck Z-score, menopause age, number of pregnancies, phosphorus, calcium, and BMI. For the male-specific models, the features with the greatest impact on the model’s predictive power were CRP, femoral neck T-score, PTH, hip T-score, BMI, BUN, creatinine, ALP, and spinal Z-score. To date, various studies have investigated the risk factors of re-fracture in osteoporotic patients sustaining a fragility fracture [6, 8,9,10,11]. Although these studies have provided valuable information, there is still a gap in the clinical application of this data, mainly due to the inability of physicians to interpret and implement these data in the process of treatment decision-making. ML algorithms are able to interpret this data according to the feature importance and provide a personalized risk for re-fracture, thereby translating the patients’ data into clinical practice [15, 16].

Following the advent of ML in medical sciences, the potential of these algorithms in osteoporosis management has been evaluated in many studies [33]. Although the use of ML algorithms in the prevention of subsequent fragility fractures has been considered, it has not received as much attention as it deserves. Shimizu et al. [34] evaluated the capability of ML algorithms for prediction and feature selection of re-fracture after surgical treatment of non-vertebral index fragility fracture. More than 7000 patients with an index fragility fracture were included in their study, randomly divided into training (75%) and test (25%) datasets. A decision-tree-based model (Light-GBM), Artificial Neural Network, and SVM model were developed for the prediction purpose. LightGBM model showed moderate accuracy for the prediction in the training (AUC = 0.90) and test dataset (AUC = 0.75), whereas the other models revealed poor performance (AUC < 0.60). Rheumatoid arthritis (RA) and chronic kidney disease (CKD) were the most relevant features for predicting the subsequent fracture. In the present study, we evaluated various ML models, including LightGBM and SVM. CatBoost was the most predictive ML model in our study, with a maximum AUC of 0.990 for the male group and 0.956 for the female group. However, the male and female populations were not evaluated separately in the study of Shimizu et al. Considering the smaller number of patients compared to the study of Shimizu et al., we used a cross-validation approach to test the performance of machine learning models. Features that had the highest contribution to the model’s prediction were significantly different from those reported by Shimizu et al., which could be attributed to the registration protocol. Since our center was a subspecialized orthopedic hospital, patients with RA, CKD, hyperthyroidism, and other important underlying disorders were not generally referred to our FLS department.

Ma et al. [35] compared the effectiveness of different ML algorithms in predicting new fractures after the treatment of index osteoporotic vertebral compression fractures. In a retrospective analysis of 529 patients, ML models including decision trees, random forests, SVM, gradient boosting machines (GBM), neural networks, regularized discriminant analysis (RDA), and logistic regression were compared in terms of their effectiveness in predicting new fractures occurring after surgical treatment of index fracture. The dataset was subdivided into the training (75%) and test set (25%). ML models were developed in training sets after ten cross-validations. Subsequently, the performance of each model was assessed in the test dataset. Almost all models predicted better than logistic regression, with random forest showing the maximum AUC (0.940). In contrast to the study of Ma et al., which was limited to the prediction of subsequent vertebral fragility fracture, the present study was not restricted by the location of the fragility fracture. Even so, both studies reveal the promising role of ML in the prediction of subsequent fragility fracture. The CatBoost algorithm, which was the best-predicting model in the present study, was not used in the study of Ma et al. Again, the male and female populations were not evaluated separately in the study of Ma et al.

Vries et al. [12] compared three ML algorithms, including the Cox regression, random survival forests (RSF), and an artificial neural network (ANN)-DeepSurv model, to design a risk assessment tool for future fractures. In total, 7578 patients with osteopenia or osteoporosis were included, of which 805 (11%) patients sustained a subsequent major osteoporotic fracture (MOF). For the complete dataset, including the osteopenia and osteoporosis patients, no significant difference was found between the discriminative ability of the three models. In the osteopenia group, the Cox regression model significantly outperformed the other models, with an AUC of 0.701 one year after the index fracture. Age, prior falls, simultaneous vertebral fracture, history of epilepsy, and age of menopause were independently associated with the incidence of subsequent MOF in the complete dataset using the Cox regression model. The predictive capability of the ML models used in the present study was remarkably higher than the study of Vries et al. This difference can be attributed to several factors, including the patient population, the type of fractures, or the ML model itself. These differences should be further investigated in future studies.

Regarding the feature importance, some features that were already acknowledged as predictors of fragility fracture, including age, sex, menopause age, and densitometry parameters, were found to be important features in our model’s development, as well. In addition, some features that were less frequently reported as predictors of subsequent fragility fracture in the general osteoporotic population were also included in our model’s development, including the CRP, BUN, and creatinine. High CRP levels, as a marker of chronic inflammation, have been earlier attributed to the increased risk of fragility fractures, although previous studies have yielded conflicting results [36, 37]. BUN and creatinine are acknowledged predictors of fragility fracture in osteoporotic patients with chronic kidney diseases, explained by the association between renal function and BMD [38,39,40]. However, these markers are rarely notified as predictors of fragility fractures in the general osteoporotic population, which could infer the power of ML algorithms to explore their predictive power.

Altogether, the results of the present study show that ML models could play an important role in the perdiction of subsequent fragility fractures. Therefore, optimization of these methods in the future could be regarded to empower clinicians to provide personalized re-fracture strategies. Such tools have already been designed for index fragility fractures (Fracture Risk Assessment Tool). However, the prevention of second re-fracture has received less attention and deserves more investigations in the future.

The present study had some strengths and weak points. The number of ML models evaluated in the present study was more than in earlier studies, and CatBoost, which was shown to be the most accurate model, was not used in earlier studies. Evaluation of the models separately for males and females could be the other strong point of the study, as menopause could be regarded as a confounding factor in males when models are trained on both sexes. The absence of an external validation set and a smaller number of patients, particularly in the male group, could be regarded as the weak points of this study. In addition, the study population was recruited from a subspecialized orthopedic hospital, and patients with important underlying disorders such as RA, CKD, hyperthyroidism, and other underlying disorders were not generally referred to our hospital. For this reason, the elaborated model might not be generalizable to other healthcare settings and patients with certain disorders.

Conclusion

Machine learning (ML) models, and the CatBoost algorithm in particular, have demonstrated a strong ability to predict subsequent fragility fractures. As such, these models show promise as effective tools in predicting future fragility fractures in patients with osteoporosis. The further refinement and optimization of these ML models could aid clinicians in creating tailored prevention strategies to reduce the risk of future fragility fractures.

Data availability

No datasets were generated or analysed during the current study.

References

Akkawi I, Zmerly H, Osteoporosis. Curr Concepts Joints. 2018;6(2):122–7.
Google Scholar
Shariatzadeh H, Modaghegh BS, Mirzaei A. The Effect of Dynamic Hyperextension Brace on osteoporosis and hyperkyphosis reduction in postmenopausal osteoporotic women. Archives bone Joint Surg. 2017;5(3):181–5.
Google Scholar
Johnell O, Kanis JA. An estimate of the worldwide prevalence and disability associated with osteoporotic fractures. Osteoporosis international: a journal established as result of cooperation between the European Foundation for Osteoporosis and the National Osteoporosis Foundation of the USA. 2006;17(12):1726–33.
Odén A, McCloskey EV, Kanis JA, Harvey NC, Johansson H. Burden of high fracture probability worldwide: secular increases 2010–2040. Osteoporosis international: a journal established as result of cooperation between the European Foundation for Osteoporosis and the National Osteoporosis Foundation of the USA. 2015;26(9):2243–8.
Bliuc D, Nguyen ND, Nguyen TV, Eisman JA, Center JR. Compound risk of high mortality following osteoporotic fracture and refracture in elderly women and men. J bone Mineral Research: Official J Am Soc Bone Mineral Res. 2013;28(11):2317–24.
Article Google Scholar
Center JR, Bliuc D, Nguyen TV, Eisman JA. Risk of subsequent fracture after low-trauma fracture in men and women. JAMA. 2007;297(4):387–94.
Article CAS PubMed Google Scholar
Mirzaei A, Jahed SA, Nojomi M, Rajaei A, Zabihiyeganeh M. A study of the value of trabecular bone score in fracture risk assessment of postmenopausal women. Taiwan J Obstet Gynecol. 2018;57(3):389–93.
Article PubMed Google Scholar
Ruan WD, Wang P, Ma XL, Ge RP, Zhou XH. Analysis on the risk factors of second fracture in osteoporosis-related fractures. Chin J Traumatol = Zhonghua Chuang shang za zhi. 2011;14(2):74–8.
PubMed Google Scholar
Izquierdo-Avino R, Cebollada-Gadea L, Jordan-Jarque M, Bordonaba-Bosque D, López-Cabanas JA. Risk of osteoporotic fracture and refracture: the importance of index fracture site. Archives Osteoporos. 2023;18(1):27.
Article CAS Google Scholar
Hsiao PC, Chen TJ, Li CY, Chu CM, Su TP, Wang SH, et al. Risk factors and incidence of repeat osteoporotic fractures among the elderly in Taiwan: a population-based cohort study. Medicine. 2015;94(7):e532.
Article PubMed PubMed Central Google Scholar
Ma X, Xia H, Wang J, Zhu X, Huang F, Lu L, et al. Re-fracture and correlated risk factors in patients with osteoporotic vertebral fractures. J Bone Miner Metab. 2019;37(4):722–8.
Article CAS PubMed Google Scholar
de Vries BCS, Hegeman JH, Nijmeijer W, Geerdink J, Seifert C, Groothuis-Oudshoorn CGM. Comparing three machine learning approaches to design a risk assessment tool for future fractures: predicting a subsequent major osteoporotic fracture in fracture patients with osteopenia and osteoporosis. Osteoporos International: J Established as Result Cooperation between Eur Foundation Osteoporos Natl Osteoporos Foundation USA. 2021;32(3):437–49.
Kanis JA, McCloskey EV, Johansson H, Oden A, Ström O, Borgström F. Development and use of FRAX® in osteoporosis. Osteoporos Int. 2010;21(2):407–13.
Article Google Scholar
El Miedany Y. FRAX: re-adjust or re-think. Archives Osteoporos. 2020;15(1):150.
Article Google Scholar
Johnson KB, Wei WQ, Weeraratne D, Frisse ME, Misulis K, Rhee K, et al. Precision Medicine, AI, and the future of Personalized Health Care. Clin Transl Sci. 2021;14(1):86–93.
Article PubMed Google Scholar
Schork NJ. Artificial Intelligence and Personalized Medicine. Cancer Treat Res. 2019;178:265–83.
Article CAS PubMed PubMed Central Google Scholar
Lou SJ, Hou MF, Chang HT, Chiu CC, Lee HH, Yeh SJ et al. Machine learning algorithms to predict recurrence within 10 years after breast Cancer surgery: a prospective cohort study. Cancers. 2020;12(12).
Mosayebi A, Mojaradi B, Bonyadi Naeini A, Khodadad Hosseini SH. Modeling and comparing data mining algorithms for prediction of recurrence of breast cancer. PLoS ONE. 2020;15(10):e0237658.
Article CAS PubMed PubMed Central Google Scholar
Wu X, Park S. A prediction model for osteoporosis risk using a machine-learning Approach and its validation in a large cohort. jkms. 2023;38(21):e162–0.
PubMed PubMed Central Google Scholar
Dzierżak R, Omiotek Z. Application of deep convolutional neural networks in the diagnosis of osteoporosis. Sensors. 2022;22(21):8189.
Article PubMed PubMed Central Google Scholar
Grygorieva N, Dubetska H, Koshel N, Pisaruk A, Antoniuk-Shcheglova I. Mathematical model of the bone biological age based on the bone mineral density and quality indicex and Ukrainian FRAX model. PAIN JOINTS SPINE. 2022;12(1):16–22.
Article Google Scholar
Kim SK, Yoo TK, Oh E, Kim DW, editors. Osteoporosis risk prediction using machine learning and conventional methods. 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC); 2013 3–7 July 2013.
Lin Y-T, Chu C-Y, Hung K-S, Lu C-H, Bednarczyk EM, Chen H-Y. Can machine learning predict pharmacotherapy outcomes? An application study in osteoporosis. Comput Methods Programs Biomed. 2022;225:107028.
Article PubMed Google Scholar
Wilson N, Hurkmans E, Adams J, Bakkers M, Balážová P, Baxter M et al. Prevention and management of osteoporotic fractures by non-physician health professionals: a systematic literature review to inform EULAR points to consider. RMD open. 2020;6(1).
Cosman F, de Beur SJ, LeBoff MS, Lewiecki EM, Tanner B, Randall S, et al. Clinician’s guide to Prevention and treatment of osteoporosis. Osteoporos Int. 2014;25(10):2359–81.
Article CAS PubMed PubMed Central Google Scholar
Tung C-W, Hsu Y-C, Shih Y-H, Chang P-J, Lin C-L. Dipstick Proteinuria and reduced estimated glomerular filtration rate as independent risk factors for osteoporosis. Am J Med Sci. 2018;355(5):434–41.
Article PubMed Google Scholar
Tariq S, Tariq S, Lone KP, Khaliq S. Alkaline phosphatase is a predictor of bone Mineral Density in postmenopausal females. Pak J Med Sci. 2019;35(3):749–53.
Article PubMed PubMed Central Google Scholar
de Pablo P, Cooper MS, Buckley CD. Association between bone mineral density and C-reactive protein in a large population-based sample. Arthr Rhuem. 2012;64(8):2624–31.
Article Google Scholar
Van Schoor N, Visser M, Pluijm S, Kuchuk N, Smit J, Lips P. Vitamin D deficiency as a risk factor for osteoporotic fractures. Bone. 2008;42(2):260–6.
Article PubMed Google Scholar
Cooper L, Clifton-Bligh PB, Nery ML, Figtree G, Twigg S, Hibbert E, et al. Vitamin D supplementation and bone mineral density in early postmenopausal women12. Am J Clin Nutr. 2003;77(5):1324–9.
Article CAS PubMed Google Scholar
Chawla NV, Bowyer KW, Hall LO, Kegelmeyer WP. SMOTE: synthetic minority over-sampling technique. J Artif Intell Res. 2002;16:321–57.
Article Google Scholar
Lundberg SM, Lee S-I. A unified approach to interpreting model predictions. Adv Neural Inf Process Syst. 2017;30.
Smets J, Shevroja E, Hügle T, Leslie WD, Hans D. Machine Learning Solutions for Osteoporosis-A review. J bone Mineral Research: Official J Am Soc Bone Mineral Res. 2021;36(5):833–51.
Article Google Scholar
Shimizu H, Enda K, Shimizu T, Ishida Y, Ishizu H, Ise K et al. Machine Learning Algorithms: Prediction and Feature Selection for Clinical Refracture after Surgically Treated Fragility Fracture. Journal of clinical medicine. 2022;11(7).
Ma Y, Lu Q, Yuan F, Chen H. Comparison of the effectiveness of different machine learning algorithms in predicting new fractures after PKP for osteoporotic vertebral compression fractures. J Orthop Surg Res. 2023;18(1):62.
Article PubMed PubMed Central Google Scholar
Briot K, Geusens P, Em Bultink I, Lems WF, Roux C. Inflammatory diseases and bone fragility. Osteoporos Int. 2017;28(12):3301–14.
Article CAS PubMed Google Scholar
Ishii S, Cauley JA, Greendale GA, Crandall CJ, Danielson ME, Ouchi Y, et al. C-Reactive protein, bone strength, and nine-year fracture risk: data from the study of women’s Health across the Nation (SWAN). J Bone Miner Res. 2013;28(7):1688–98.
Article CAS PubMed Google Scholar
Park BK, Yun KY, Kim SC, Joo JK, Lee KS, Choi OH. The relationship between renal function and bone marrow density in healthy Korean women. jmm. 2017;23(2):96–101.
PubMed PubMed Central Google Scholar
Jassal SK, von Muhlen D, Barrett-Connor E. Measures of renal function, BMD, bone loss, and osteoporotic fracture in older adults: the Rancho Bernardo Study. J Bone Miner Res. 2007;22(2):203–10.
Article CAS PubMed Google Scholar
Li S, Zhan J, Wang Y, Wang Y, He J, Huang W, et al. Association between renal function and bone mineral density in healthy postmenopausal Chinese women. BMC Endocr Disorders. 2019;19(1):146.
Article Google Scholar

Download references

Funding

No funding was received to assist with the preparation of this manuscript.

Author information

Authors and Affiliations

Bone and Joint Reconstruction Research Center, Department of Orthopedics, School of Medicine, University of Medical Sciences, Baharestan Sq, Tehran, Iran
Mozhdeh Zabihiyeganeh, Alireza Mirzaei, Pouria Tabrizian, Aryan Rezaee, Azade Amini Kadijani & Ali Sharifi Kia
Department of Orthopaedic Surgery, University of Minnesota, Minneapolis, MN, USA
Alireza Mirzaei
Student Research Committee, School of Medicine, Iran University of Medical Sciences, Tehran, Iran
Aryan Rezaee
Department of Health Information Management, School of Health Management and Information Sciences, Iran University of Medical Sciences, Tehran, Iran
Abbas Sheikhtaheri
Department of Medical Physics, Faculty of Medical Sciences, Tarbiat Modares University, Tehran, Iran
Bahare Amini Kadijani
Department of Computer Science, Faculty of Science, Western University, London, ON, Canada
Ali Sharifi Kia

Authors

Mozhdeh Zabihiyeganeh
View author publications
You can also search for this author in PubMed Google Scholar
Alireza Mirzaei
View author publications
You can also search for this author in PubMed Google Scholar
Pouria Tabrizian
View author publications
You can also search for this author in PubMed Google Scholar
Aryan Rezaee
View author publications
You can also search for this author in PubMed Google Scholar
Abbas Sheikhtaheri
View author publications
You can also search for this author in PubMed Google Scholar
Azade Amini Kadijani
View author publications
You can also search for this author in PubMed Google Scholar
Bahare Amini Kadijani
View author publications
You can also search for this author in PubMed Google Scholar
Ali Sharifi Kia
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M. Z: Study conception and design, Writing the draft, and revising the manuscript. A. M: Study conception and design, Data preparation, Data analysis, Writing the draft, and revising the manuscript. P. T: Data Collection, Contributed to the data analysis, Interpretation of the results, Provided critical revisions to the manuscript. A. R: Data Collection, Contributed to the data analysis, Interpretation of the results, Provided critical revisions to the manuscript. A. Sh: Study conception and design, Contributed to the data analysis, Revising the manuscript and Supervision. A. A. K: Data Collection, Provided expertise and guidance throughout the research project, and revised the manuscript for intellectual content. B. A. K: Data Collection, Provided expertise and guidance throughout the research project, and revised the manuscript for intellectual content. A. S. K: Study conception and design, Data preparation, Data analysis, Writing the draft, and revising the manuscript. All authors have read and approved the final manuscript, and ensure that this is the case.

Corresponding author

Correspondence to Ali Sharifi Kia.

Ethics declarations

Ethics approval and consent to participate

The study was approved by the research ethics committee of the Iran University of Medical Sciences (IUMS) (IR.IUMS.REC.1401.106). All patients’ data was collected anonymously. Therefore, informed consent was waived for this study by the research ethics committee of the Iran University of Medical Sciences. In addition, all methods were performed in accordance with the Declaration of Helsinki and Iranian research ethics guidelines.

Consent to publish

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Supplementary Material 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Zabihiyeganeh, M., Mirzaei, A., Tabrizian, P. et al. Prediction of subsequent fragility fractures: application of machine learning. BMC Musculoskelet Disord 25, 438 (2024). https://doi.org/10.1186/s12891-024-07559-y

Download citation

Received: 29 November 2023
Accepted: 29 May 2024
Published: 04 June 2024
DOI: https://doi.org/10.1186/s12891-024-07559-y

Prediction of subsequent fragility fractures: application of machine learning

Abstract

Background

Methods

Results

Conclusion

Background

Methods

Data sources and study population

Data preprocessing

Features and feature selection

Data balancing

Model Development, evaluation, and explainability

Results

Feature selection

Model performance and evaluation

Feature importance

Female patient’s prediction model

Male patients’ prediction model

Error analysis

Female patient’s prediction model

Male patient’s prediction model

Discussion

Conclusion

Data availability

References

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent to publish

Competing interests

Additional information

Publisher’s Note

Electronic supplementary material

Supplementary Material 1

Supplementary Material 2

Rights and permissions

About this article

Cite this article

Share this article

Keywords

BMC Musculoskeletal Disorders

Contact us