Skip to main content

Machine learning-based models to predict one-year mortality among Chinese older patients with coronary artery disease combined with impaired glucose tolerance or diabetes mellitus

Abstract

Purpose

An accurate prediction of survival prognosis is beneficial to guide clinical decision-making. This prospective study aimed to develop a model to predict one-year mortality among older patients with coronary artery disease (CAD) combined with impaired glucose tolerance (IGT) or diabetes mellitus (DM) using machine learning techniques.

Methods

A total of 451 patients with CAD combined with IGT and DM were finally enrolled, and those patients randomly split 70:30 into training cohort (n = 308) and validation cohort (n = 143).

Results

The one-year mortality was 26.83%. The least absolute shrinkage and selection operator (LASSO) method and ten-fold cross-validation identified that seven characteristics were significantly associated with one-year mortality with creatine, N-terminal pro-B-type natriuretic peptide (NT-proBNP), and chronic heart failure being risk factors and hemoglobin, high density lipoprotein cholesterol, albumin, and statins being protective factors. The gradient boosting machine model outperformed other models in terms of Brier score (0.114) and area under the curve (0.836). The gradient boosting machine model also showed favorable calibration and clinical usefulness based on calibration curve and clinical decision curve. The Shapley Additive exPlanations (SHAP) found that the top three features associated with one-year mortality were NT-proBNP, albumin, and statins. The web-based application could be available at https://starxueshu-online-application1-year-mortality-main-49cye8.streamlitapp.com/.

Conclusions

This study proposes an accurate model to stratify patients with a high risk of one-year mortality. The gradient boosting machine model demonstrates promising prediction performance. Some interventions to affect NT-proBNP and albumin levels, and statins, are beneficial to improve survival outcome among patients with CAD combined with IGT or DM.

Introduction

Coronary artery disease (CAD) is the main cause of mortality and morbidity worldwide, causing over 8 million deaths worldwide each year and a huge burden on social care systems [1]. Asians exhibited greater susceptibility to CAD and diabetes mellitus (DM) compared to Westerners, due to higher levels of central obesity and total adiposity or higher degree of insulin resistance and endothelial dysfunction [2,3,4]. Notably, China has the world’s largest number of people with DM, accounting for a quarter of the world’s diabetic patients [5]. As a poor prognostic factor, DM is associated with a two- to four-fold increased risk of developing CAD [6, 7]. Compared to non-diabetic subjects, more severe and diffuse coronary atherosclerosis have been found in diabetic subjects [8]. Similarly, patients with DM more likely experience acute myocardial infarction than patients without DM [9]. That is to say, DM induces subsequent cardiovascular diseases and is detrimental to outcomes among patients with CAD. Survival prediction of patients with CAD helps make appropriate therapeutic strategies, which is able to significantly improve the prognosis of those patients. Proper interventions can be performed in advance and prognosis can be improved more effectively if we promptly achieve risk identification of CAD progression [10]. However, there are few studies on the survival risk factors of older patients with CAD combined with impaired glucose tolerance (IGT) or DM.

Machine learning provides researchers with powerful statistical methods to intercept associations between patient’s features and outcomes, allowing for objective integration of data to predict clinical prognosis. It has been demonstrated to be highly effective methods for prognostic prediction and decision making in plenty of analysis of diverse and massive electronic health record data [11, 12]. Prognostic prediction models have been developed in terms of different machine learning algorithms, and prediction performance of these models was superior to traditional regression-based heart disease prediction [13,14,15]. Recent systematic review identified that the best predictive model for cardiovascular complications in diabetic patients was neural network, followed by gradient boosting machine model [16]. However, there is still a lack of studies on machine learning for prognostic prediction specifically among older patients with CAD and IGT or DM. Therefore, the purpose of the study was to develop a model to predict one-year mortality among older patients with CAD combined with IGT or DM using machine learning techniques. The logistic regression model, and three machine learning algorithms, including gradient boosting machine model, random forest, and decision tree, were introduced in the study. In addition, a web-based calculator was developed to present optimal model and promote clinical application.

Methods

Design and patients

This prospective study analyzed 974 older patients with CAD admitted in the Department of Geriatric Cardiology, Chinese people’s Liberation Army (PLA) General Hospital. Patients were included from October, 2007 to July, 2011 if they (1) aged above 60 years, (2) diagnosed with CAD, and (3) had IGT or DM. Figure 1 shows the flowchart outlining patient’s enrollment and study design. Chinese PLA General Hospital was their designated hospital and had comprehensive medical treatment and final death records, which makes it easier for us to track them for long-term effectively and accurately judge the end point. A total of 451 patients with CAD combined with IGT and DM were finally enrolled, and those patients randomly split 70:30 into training cohort (n = 308) and validation cohort (n = 143). The randomization of patients was achieved using computer. The training cohort was used to train and optimize models using the logistic regression model and three machine learning algorithms, and the validation cohort was used to test prediction performance of these models. This study protocol was approved by the Ethics Committee of Chinese People’s Liberation Army General Hospital (Beijing, China) and in accordance with the Helsinki Declaration of 1975 (as revised in 1983).

Figure 1
figure 1

Flowchart outlining patient’s enrollment and study design.

Definition and outcome

According to the guidelines of the American College of Cardiology (ACC)/American Heart Association (AHA)/European College of Cardiology (ESC), the diagnosis of CAD was made by chief physicians based on clinical histories, angina symptoms, cardiac markers, and specific examinations, including electrocardiogram (rest and exercise), echocardiography, radionuclide imaging, computed tomography, and coronary angiography [17, 18]. IGT was defined as 2-h glucose ≥ 7.8mmol/l after a 75-g oral glucose tolerance test (OGTT) [19]. DM was defined as patients with fasting glucose ≥ 7.0mmol/l, OGTT 2-h glucose ≥ 11.1mmol/l, or oral hypoglycemic drugs or insulin [20]. The median interval of follow-up was 5.06 years. Due to the failure of multiple organs in the older population and the priority of all-cause mortality in the outcome study, the primary outcome of the present study was all-cause mortality. One-year mortality was defined as patients died within one year after discharge. Mortality was determined by telephone interviews and medical records, including legal documents of dead time, place and others.

Data and variables

The present study collected patient’s (1) demographics, including age, gender, current smoker, body mass index (BMI, kg/m2), and heart rate, (2) laboratory examination, including glucose (mmol/L), hemoglobin (g/dL), high density lipoprotein cholesterol (HDL-C, mmol/L), low density lipoprotein cholesterol (LDL-C, mmol/L), albumin (g/dL), serum creatinine (µmol/L), uric acid (µmol/L), N-terminal pro-B-type natriuretic peptide (NT-proBNP, pg/ml), (3) comorbidities, including hypertension, chronic heart failure (CHF), and chronic kidney disease (CKD), (4) drugs used during hospitalization, including aspirin, clopidogrel, beta receptor blockers, calcium channel blockers (CCB), nitrates, angiotensin converting enzyme inhibitor (ACEI) or angiotensin receptor blocker (ARB), and statins. All information was obtained and preserved by trained researchers. To verify the accuracy of the results, other independent researchers performed logistical check and data re-evaluation. Blood tests were conducted at the central laboratory in the Department of Biochemistry, Chinese PLA General Hospital.

Modelling and validation

To develop prediction model for one-year mortality among those patients, the logistic regression model and three machine learning models including gradient boosting machine model, random forest, and decision tree were used to develop prediction models in the training group. The least absolute shrinkage and selection operator (LASSO) method combined with ten-fold cross-validation was used to screen characteristics associated with one-year mortality, and significant variables were used as inputs to train and optimize models. Random hyper-parameter search was performed to determine the model parameters after applying 5-fold cross-validation and 100 iterations of bootstrapping procedures to achieve the best area under the curve (AUC). Patients in the validation group were used to assess predictive effectiveness of models using the Brier score, AUC, calibration curve, and decision curve. The following formula was used to calculate the Brier score.

$$Brier Score= \frac{1}{N}\sum _{i=1}^{n}{({p}_{i}-{o}_{i})}^{2}$$

In the formula, \(N\) is the sample size, \({p}_{i}\) is the predicted probability of death within one year, and \({o}_{i}\) is the actual probability of death within one year (“No” vs. “Yes”).

Shaley Additive exPlanation (SHAP) was performed to determine feature contributions using the following formula.

$$g\left({z}^{{\prime }}\right)={\varphi }_{0}+\sum _{j=1}^{M}{\varphi }_{j}{{Z}^{{\prime }}}_{j}$$

In the formula, \(g\) is the interpretation model, \(M\) is the number of inputted parameters, \({\varphi }_{0}\) is a constant, and \({\varphi }_{j}\) is the attribution value (Shapley value) of each model parameter.

Developed web calculator

A web-based calculator for predicting one-year mortality among those patients was developed using the “Streamlit” (https://share.streamlit.io/) application in terms of the optimal model. For friendly use of the web calculator, the present study deployed three panels, including the panel for choosing and filling model parameters, the panel for getting predicted one-year mortality, and the panel for model introduction.

Statistics and implementation

Categorical variables were presented as proportion, and continuous variables that were not normally distributed were presented as median and interquartile range (IQR). A comparison of characteristics was performed between patients with and without one-year mortality using Wilcoxon rank tests for not normally distributed continuous variables and using Chi-square or continuous adjusted Chi-square tests for categorical variables. Reclassification of patients was performed according to the best cut-off value of the optimal model, and two risk (high-risk vs. low-risk) groups were categorized in the validation group. Traditional statistical analyses were performed using R programming language (version 4.1, http://www.R-project.org). A P value of less than 0.05 was considered as significant (two-tailed). Machine learning modelling and interpretation were performed in an open-source web application of Jupyter Notebook in which authors are able to use Python language (version 3.9).

Results

Patient’s basic characteristics

The median age of the entire cohort was 86.00 [82.00, 89.00] years (Table 1). The majority of patients were male (89.6%). The comorbidity burden was relatively heavy, because 87.1% patients had hypertension, 85.1% patients had DM, 71.0% had chronic CAD, 31.0% had CHF, and 36.6% had CKD. Drugs used during hospitalization was common among those patients. To elaborate, aspirin, clopidogrel, beta receptor blocker, CCB, nitrates, ACEI/ARB and statins were administrated among 46.3%, 62.7%, 73.2%, 71.4%, 85.4%, 57.6% and 67.2% patients, respectively. The one-year mortality was 26.83% among all patients. Subgroup analysis was performed to compare clinical characteristics between patients with and without one-year mortality. Patients who died within one year tended to have an older age, a lower BMI, and a higher heart rate (P < 0.05 for all). Meanwhile, patients who were dead within one year had higher glucose, serum creatinine, uric acid and NT-proBNP levels, and lower hemoglobin, HDL-C, LDL-C and albumin levels (P < 0.05 for all). In addition, patients with one-year mortality tended to have an elevated proportion of comorbidities such as more DM, CHF and CKD (P < 0.05 for all). Regarding administration of drugs during hospitalization, patients with one-year mortality had significant less aspirin, clopidogrel, ACEI/ARB, and statins (P < 0.05 for all).

Table 1 Patient’s demographics, tests, comorbidities, and drugs

Development of models

According to the least absolute shrinkage and selection operator (LASSO) method and ten-fold cross-validation (Fig. 2A B), seven clinical characteristics were identified to be significantly associated with one-year mortality, and these characteristics included hemoglobin, HDL-C, albumin, serum creatinine, NT-proBNP, CHF, and statins. In addition, serum creatinine, NT-proBNP, and CHF were risk factors based on the LASSO method, with hemoglobin, HDL-C, albumin, and statins being protective factors. Correlation coefficients between those clinical characteristics were calculated and presented as a correlation matrix (Fig. 2C). It demonstrated that no serious collinearity existed because all the correlation coefficients were below 0.80. Thus, the seven clinical characteristics were served as model predictors to establish models using the logistic regression and three machine learning models. The optimal full super-parameters for each model were obtained after training and optimizing models (Table 2).

Figure 2
figure 2

A: The least absolute shrinkage and selection operator (LASSO) coefficient profiles of all variables; B: Selection of appropriate parameters; C: Correlation coefficients between clinical characteristics; D: Overall performance of each model; E: Calibration curve of each model; F: Clinical decision curve analysis of each model; G: The heatmap of SHAP value; H: Analysis of feature importance based on SHAP summary plot; I: The screenshot of online calculator.

Table 2 Full super-parameters of techniques

Validation of models

As for the overall performance of each model, the Brier score was 0.116 for logistic regression model, 0.114 for gradient boosting machine model, 0.143 for decision tree model, and 0.126 for random forest model. Corresponding AUC was 0.827 [95% confidence interval (CI): 0.731–0.924], 0.836 (95% CI: 0.743–0.929), 0.760 (95% CI: 0.652–0.869), and 0.829 (95% CI: 0.728–0.930), respectively (Fig. 2D). The above findings elucidated that the gradient boosting machine model was the optimal model in terms of the Brier score and AUC. Calibration curve showed that all models, in particular the gradient boosting machine model, had favorable calibration (Fig. 2E). Clinical decision curve analysis showed that the gradient boosting machine model and the logistic regression model both showed favorable clinical usefulness (Fig. 2F). Based on the gradient boosting machine model, the study further analyzed the feature importance for significant clinical characteristics in the whole population. The individual SHAP value and the mean SHAP value both found that the top three features associated with one-year mortality were NT-proBNP, albumin, and statins (Fig. 2G H).

Optimal web-based calculator

As the gradient boosting machine model was the optimal model in the study, we further developed a web-based calculator for predicting one-year mortality using this model in the “Streamlit” application (https://share.streamlit.io/). In the calculator, the present study deployed a panel for choosing model parameters, a panel for obtaining predicted one-year mortality, and a panel for introduction of the model (Fig. 2I). This calculator could be available at https://starxueshu-online-application1-year-mortality-main-49cye8.streamlitapp.com/.

Sometimes, if the web-based calculator has gone to sleep (shut down) and can not be opened, users are able to access to it via clicking ‘Yes, get this app back up!’. After about 30 s, the web-based calculator would be accessible.

Reclassification of patients

Patients were recategorized according to the best cut-off value (26.4%) in the gradient boosting machine model. Patients with a predicted risk of less than 26.4% were categorized into the low-risk group, whereas patients who had a predicted risk of 26.4% or above were categorized into the high-risk group. To be specific, the predicted risk was 7.72% in the low-risk group, and 62.31% in the high-risk group; the actual risk was 7.22% (7/97) in the low-risk group, and 50.00% (23/46) in the high-risk group. It indicated that patients in the high-risk group were near seven times more likely to suffer from one-year mortality after discharge from hospitalization.

Discussion

The present study developed an accurate model to predict one-year mortality among older patients with CAD and IGT or DM. Three machine learning techniques were introduced for analysis in the study, and the gradient boosting machine model showed the optimal prediction performance as compared to other models. The AUC of the model was up to 0.836, indicating excellent prediction effectiveness. In addition, to encourage clinical use of the model, the study proposed a web-based application, and the application was user-friendly. The LASSO method and ten-fold cross-validation identified that seven clinical characteristics were significantly associated with one-year mortality with creatine, NT-proBNP, and CHF being risk factors and hemoglobin, HDL-C, albumin, and statins used during hospitalization being protective factors. The SHAP found that the top three features associated with one-year mortality were NT-proBNP, albumin, and statins.

In the past two decades, the incidence of DM in China has been on a significant rise. Half of the adults (50.1%) in China are thought to have prediabetes. In other words, they have impaired fasting glucose and/or IGT [21, 22]. In addition, the prognosis of CAD is worse in patients with DM than in nondiabetic patients. IGT is also a risk factor for cardiovascular diseases [23, 24]. Most of the current studies on the prognosis of CAD focus on the analysis of patients’ overall survival and risk factors. Studies on the risk of short-term mortality in older patients with CAD combined with IGT or DM are still rare. The present study developed a new model with the aim of helping clinicians to identify patients at risk of short-term mortality in time for early warning and treatment to improve the survival of older patients with CAD.

In the present study, the LASSO method and ten-fold cross-validation identified that seven clinical characteristics were significantly associated with one-year mortality with creatine, NT-proBNP, and CHF being risk factors and hemoglobin, HDL-C, albumin, and statins used during hospitalization being protective factors. The SHAP found that the top three features associated with one-year mortality were NT-proBNP, albumin, and statins. To elaborate, NT-proBNP is a well-known biomarker for CAD, and it may facilitate early diagnosis of heart failure and stratification of cardiac risk. NT-proBNP has been reported to be a significant predictor of mortality for cardiovascular diseases [25]. Elevated NT-proBNP levels are associated with left ventricular insufficiency and poorer clinical prognosis in patients with heart failure and other related diseases. These observations have since been extended to patients with CAD, where elevated NT-proBNP levels may also be the result of myocardial ischemia [26]. As demonstrated previously, increasing quartiles of NT-proBNP were strongly related to an increase in the odds of one-year mortality [27]. In a study of patients with CAD, NT-proBNP levels were strongly associated with one-year mortality for the fourth versus the first quartile [28].

Serum albumin is an acute phase protein whose serum levels are affected by the nutritional status, inflammatory responses, and fluid status, exerting a protective effect in atherosclerosis via its anti-inflammatory, anti-oxidant, and anti-thrombotic roles [29]. Several studies have revealed that low serum albumin levels predicted adverse outcomes in the general population, as well as in patients with CAD [30]. A previous meta-analysis demonstrated that low serum albumin levels are associated with an increased risk of death, not only in subjects free from cardiovascular diseases, but also in patients who already experienced cardiovascular diseases [31]. These findings suggested that serum albumin could be a potential prognostic biomarker for cardiovascular diseases. Older patients are often at risk for malnutrition, infection, overloaded fluid, and liver and kidney insufficiency, all of which lead to the development of hypoproteinemia and an increased risk of short-term mortality in older patients. The inclusion of serum albumin into the model in our report is benefit for clinicians to identify potential risks in older patients with CAD in a timely manner, thus assisting clinical decision-making and therapy.

It is well-known that stains can improve clinical prognosis in the primary and secondary prevention of CAD. Both abnormal inflammatory responses or abnormal lipid metabolism are risk factors for CAD and DM. Dyslipidemia often accompanies abnormal glucose metabolism. As an essential risk factor for CAD, DM can exacerbate the progression of atherosclerosis, resulting in poor clinical outcomes. Abnormal blood glucose metabolism, including IGT and DM, has become increasingly common. As a key risk factor for CAD, DM can aggravate the progression of atherosclerosis and lead to adverse clinical outcomes [32, 33]. CAD and DM have common abnormal inflammatory response or abnormal lipid metabolism, and glucose metabolism is closely related to lipid metabolism [34]. Therefore, lipid metabolism and its management play an important role in CAD patients with CAD combined with IGT or DM.

As one of the important prognostic markers for patients with CAD, epidemiologic studies have demonstrated inverse relationship between HDL-C levels and CAD occurrence. HDL-C can mediate reverse transport of cholesterol to regulate atherosclerotic process and exhibits anti-inflammatory, anti-oxidant, and anti-thrombotic effects on vascular system. Studies reported that elevated HDL-C levels after treatment with stains can reduced the incidence of death [35]. The seven parameters included in our model are all common clinical indicators, with advantages of accessibility and convenience, which contribute to clinical application and popularization of the model. For high-risk patients, we should intervene in advance to control adverse prognostic factors and reduce death risk of patients with CAD combined with IGT or DM.

Actually speaking, previous studies have developed models to predict survival outcome among patients with CAD using machine learning techniques. For instance, Sherazi et al. [36] developed a machine learning-based one-year mortality prediction model after applying 22 continuous variables, 43 categorical variables, and 4 discrete variables. Although the AUC of the best machine learning model could be up to 0.898, too many variables in the model may restrict clinical application of the model. In addition, web-based calculator was not available in the above study. Ke et al. [10] showed that the gradient boosting decision tree had the optimal performance with the AUC of 0.918, but the model was used to predict the in-hospital mortality among patients with CAD. In the present study, the AUC of our model was up to 0.836, indicating excellent prediction effectiveness. To encourage clinical use of the model, the study also proposed a web-based application, and the application was user-friendly. In clinical settings, users are able to fill parameters of each feature according to patient’s conditions in the panel of selecting parameters, and then the probability of one-year mortality could be presented in the panel of results via submitting all these parameters. Based on the reclassification of patients, patients in the high-risk group were near seven times more likely to suffer from one-year mortality. Therefore, for patients in the high-risk group, effective measures need to be taken early based on the findings of the study, including regulating the levels of hemoglobin, albumin, creatine and NT-proBNP, treating CHF, and administering statins for appropriate patients.

The present study had some drawbacks. For one thing, although this study analyzed up to 26 variables, respiratory infections may have a significant impact on survival outcome and were not included for analysis of this study. For another thing, because the model was not externally validated in the present study, the generalization of the model in other cohorts needs future investigation.

Conclusions

The present study proposes an accurate model that is able to stratify patients with a high risk of one-year mortality. The gradient boosting machine model demonstrates promising prediction performance. Some interventions to affect NT-proBNP and albumin levels, and statins used during hospitalization, are beneficial to improve survival outcome among patients with CAD combined with IGT or DM.

Data Availability

All data and material are available under the requirement to the corresponding authors.

References

  1. Kasprzak D, Rzezniczak J, Ganowicz T, Luczak T, Slomczynski M, Hiczkiewicz J, et al. A review of Acute Coronary Syndrome and its potential impact on cognitive function. Glob Heart. 2021;16(1):53. https://doi.org/10.5334/gh.934. PMID: 34381674.

    Article  PubMed  PubMed Central  Google Scholar 

  2. Ramachandran A, Chamukuttan S, Shetty SA, Arun N, Susairaj P. Obesity in Asia–is it different from rest of the world. Diabetes Metab Res Rev. 2012;28(Suppl 2):47–51. https://doi.org/10.1002/dmrr.2353. PMID: 23280866.

    Article  PubMed  Google Scholar 

  3. Gobardhan SN, Dimitriu-Leen AC, van Rosendael AR, van Zwet EW, Roos CJ, Oemrawsingh PV, et al. Prevalence by computed tomographic angiography of coronary plaques in south asian and white patients with type 2 diabetes Mellitus at Low and High Risk using Four Cardiovascular Risk Scores (UKPDS, FRS, ASCVD, and JBS3). Am J Cardiol. 2017;119(5):705–11. https://doi.org/10.1016/j.amjcard.2016.11.029. PMID: 28024655.

    Article  PubMed  Google Scholar 

  4. Lee JW, Brancati FL, Yeh HC. Trends in the prevalence of type 2 diabetes in Asians versus whites: results from the United States National Health Interview Survey, 1997–2008. Diabetes Care. 2011;34(2):353-7. PMID: 21216863. doi: https://doi.org/10.2337/dc10-0746.

  5. Xu S, Scott CAB, Coleman RL, Tuomilehto J, Holman RR. Predicting the risk of developing type 2 diabetes in chinese people who have coronary heart disease and impaired glucose tolerance. J Diabetes. 2021;13(10):817–26. https://doi.org/10.1111/1753-0407.13175. PMID: 33665904.

    Article  CAS  PubMed  Google Scholar 

  6. Manoharan P, Nadarajah R, Suganthan N. Assessment of Additional Risk Factors for Cardiovascular Disease and Awareness Among Adult Patients With Diabetes Mellitus: A Cross-Sectional Study From Northern Sri Lanka. Cureus. 2022;14(10):e30047. PMID: 36381940. doi: https://doi.org/10.7759/cureus.30047.

  7. Herlitz J, Malmberg K, Karlson BW, Ryden L, Hjalmarson A. Mortality and morbidity during a five-year follow-up of diabetics with myocardial infarction. Acta Med Scand. 1988;224(1):31–8. https://doi.org/10.1111/j.0954-6820.1988.tb16735.x. PMID: 3046232.

    Article  CAS  PubMed  Google Scholar 

  8. Natali A, Vichi S, Landi P, Severi S, L’Abbate A, Ferrannini E. Coronary atherosclerosis in type II diabetes: angiographic findings and clinical outcome. Diabetologia. 2000;43(5):632–41. https://doi.org/10.1007/s001250051352. PMID: 10855538.

    Article  CAS  PubMed  Google Scholar 

  9. Esteghamati A, Abbasi M, Nakhjavani M, Yousefizadeh A, Basa AP, Afshar H. Prevalence of diabetes and other cardiovascular risk factors in an Iranian population with acute coronary syndrome. Cardiovasc Diabetol. 2006;5:15. PMID: 16842631. doi: https://doi.org/10.1186/1475-2840-5-15.

  10. Ke J, Chen Y, Wang X, Wu Z, Zhang Q, Lian Y, et al. Machine learning-based in-hospital mortality prediction models for patients with acute coronary syndrome. Am J Emerg Med. 2022;53:127–34. https://doi.org/10.1016/j.ajem.2021.12.070. PMID: 35033770.

    Article  PubMed  Google Scholar 

  11. Deo RC. Machine learning in Medicine: will this time be different? Circulation. 2020;142(16):1521–3. https://doi.org/10.1161/CIRCULATIONAHA.120.050583. PMID: 33074761.

    Article  PubMed  Google Scholar 

  12. Liang H, Tsui BY, Ni H, Valentim CCS, Baxter SL, Liu G, et al. Evaluation and accurate diagnoses of pediatric diseases using artificial intelligence. Nat Med. 2019;25(3):433–8. https://doi.org/10.1038/s41591-018-0335-9. PMID: 30742121.

    Article  CAS  PubMed  Google Scholar 

  13. Bharti R, Khamparia A, Shabaz M, Dhiman G, Pande S, Singh P. Prediction of Heart Disease Using a Combination of Machine Learning and Deep Learning. Comput Intell Neurosci. 2021;2021:8387680. PMID: 34306056. doi: https://doi.org/10.1155/2021/8387680.

  14. Ali MM, Paul BK, Ahmed K, Bui FM, Quinn JMW, Moni MA. Heart disease prediction using supervised machine learning algorithms: Performance analysis and comparison. Comput Biol Med. 2021;136:104672. PMID: 34315030. doi: https://doi.org/10.1016/j.compbiomed.2021.104672.

  15. Alaa AM, Bolton T, Di Angelantonio E, Rudd JHF, van der Schaar M. Cardiovascular disease risk prediction using automated machine learning: a prospective study of 423,604 UK Biobank participants. PLoS ONE. 2019;14(5):e0213653. https://doi.org/10.1371/journal.pone.0213653. PMID: 31091238.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  16. Kee OT, Harun H, Mustafa N, Abdul Murad NA, Chin SF, Jaafar R et al. Cardiovascular complications in a diabetes prediction model using machine learning: a systematic review. Cardiovasc Diabetol. 2023;22(1):13. PMID: 36658644. doi: https://doi.org/10.1186/s12933-023-01741-7.

  17. Fisher MA, Borgnakke WS, Taylor GW. Periodontal disease as a risk marker in coronary heart disease and chronic kidney disease. Curr Opin Nephrol Hy. 2010;19(6):519–26. https://doi.org/10.1097/MNH.0b013e32833eda38. PMID: WOS:000282981200002.

    Article  Google Scholar 

  18. Vallee A, Cinaud A, Blachier V, Lelong H, Safar ME, Blacher J. Coronary heart disease diagnosis by artificial neural networks including aortic pulse wave velocity index and clinical parameters. J Hypertens. 2019;37(8):1682–8. https://doi.org/10.1097/HJH.0000000000002075. PMID: 30870247.

    Article  CAS  PubMed  Google Scholar 

  19. Beulens JWJ, Rutters F, Ryden L, Schnell O, Mellbin L, Hart HE, et al. Risk and management of pre-diabetes. Eur J Prev Cardiol. 2019;26(2Suppl):47–54. https://doi.org/10.1177/2047487319880041. PMID: WOS:000501438900006.

    Article  PubMed  Google Scholar 

  20. Schleicher E, Gerdes C, Petersmann A, Muller-Wieland D, Muller UA, Freckmann G et al. Definition, Classification and Diagnosis of Diabetes Mellitus. Exp Clin Endocr Diab. 2022;130:S1-S8. PMID: WOS:000784362000001. doi: https://doi.org/10.1055/a-1624-2897.

  21. Xu Y, Wang L, He J, Bi Y, Li M, Wang T, et al. Prevalence and control of diabetes in chinese adults. JAMA. 2013;310(9):948–59. https://doi.org/10.1001/jama.2013.168118. PMID: 24002281.

    Article  CAS  PubMed  Google Scholar 

  22. Task Force on diabetes p-d, cardiovascular diseases of the European Society of C, European Association for the Study of D, Ryden L, Grant PJ, Anker SD, et al. ESC guidelines on diabetes, pre-diabetes, and cardiovascular diseases developed in collaboration with the EASD - summary. Diab Vasc Dis Res. 2014;11(3):133–73. https://doi.org/10.1177/1479164114525548. PMID: 24800783.

    Article  Google Scholar 

  23. Huang Y, Cai X, Mai W, Li M, Hu Y. Association between prediabetes and risk of cardiovascular disease and all cause mortality: systematic review and meta-analysis. BMJ. 2016;355:i5953. https://doi.org/10.1136/bmj.i5953. PMID: 27881363.

    Article  PubMed  PubMed Central  Google Scholar 

  24. George A, Bhatia RT, Buchanan GL, Whiteside A, Moisey RS, Beer SF, et al. Impaired glucose tolerance or newly diagnosed diabetes Mellitus diagnosed during admission adversely affects prognosis after myocardial infarction: an observational study. PLoS ONE. 2015;10(11):e0142045. https://doi.org/10.1371/journal.pone.0142045. PMID: 26571120.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  25. Kragelund C, Gronning B, Kober L, Hildebrandt P, Steffensen R. N-terminal pro-B-type natriuretic peptide and long-term mortality in stable coronary heart disease. N Engl J Med. 2005;352(7):666–75. https://doi.org/10.1056/NEJMoa042330. PMID: 15716560.

    Article  CAS  PubMed  Google Scholar 

  26. de Lemos JA, Morrow DA, Bentley JH, Omland T, Sabatine MS, McCabe CH, et al. The prognostic value of B-type natriuretic peptide in patients with acute coronary syndromes. N Engl J Med. 2001;345(14):1014–21. https://doi.org/10.1056/NEJMoa011053. PMID: 11586953.

    Article  PubMed  Google Scholar 

  27. James S, Armstrong P, Califf R, Simoons ML, Venge P, Wallentin L, et al. Troponin T levels and risk of 30-day outcomes in patients with the acute coronary syndrome: prospective verification in the GUSTO-IV trial. Am J Med. 2003;115(3):178–84. https://doi.org/10.1016/s0002-9343(03)00348-6. PMID: 12935823.

    Article  CAS  PubMed  Google Scholar 

  28. Westerhout CM, Fu Y, Lauer MS, James S, Armstrong PW, Al-Hattab E, et al. Short- and long-term risk stratification in acute coronary syndromes: the added value of quantitative ST-segment depression and multiple biomarkers. J Am Coll Cardiol. 2006;48(5):939–47. https://doi.org/10.1016/j.jacc.2006.04.085. PMID: 16949483.

    Article  PubMed  Google Scholar 

  29. Zhang Z, Pereira SL, Luo M, Matheson EM. Evaluation of blood biomarkers Associated with risk of Malnutrition in older adults: a systematic review and Meta-analysis. Nutrients. 2017;9(8):829. https://doi.org/10.3390/nu9080829. PMID: 28771192.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  30. Danesh J, Collins R, Appleby P, Peto R. Association of fibrinogen, C-reactive protein, albumin, or leukocyte count with coronary heart disease: meta-analyses of prospective studies. JAMA. 1998;279(18):1477-82. PMID: 9600484. doi: https://doi.org/10.1001/jama.279.18.1477.

  31. Pignatelli P, Farcomeni A, Menichelli D, Pastori D, Violi F. Serum albumin and risk of cardiovascular events in primary and secondary prevention: a systematic review of observational studies and bayesian meta-regression analysis. Intern Emerg Med. 2020;15(1):135–43. https://doi.org/10.1007/s11739-019-02204-2. PMID: 31605272.

    Article  PubMed  Google Scholar 

  32. Babes EE, Bustea C, Behl T, Abdel-Daim MM, Nechifor AC, Stoicescu M et al. Acute coronary syndromes in diabetic patients, outcome, revascularization, and antithrombotic therapy. Biomed Pharmacother. 2022;148:112772. PMID: 35245735. doi: https://doi.org/10.1016/j.biopha.2022.112772.

  33. Dwivedi AK, Dubey P, Reddy SY, Clegg DJ. Associations of Glycemic Index and Glycemic load with Cardiovascular Disease: updated evidence from Meta-analysis and cohort studies. Curr Cardiol Rep. 2022;24(3):141–61. https://doi.org/10.1007/s11886-022-01635-2. PMID: 35119682.

    Article  PubMed  Google Scholar 

  34. Xu W, Tian M, Zhou Y. The relationship between insulin resistance, adiponectin and C-reactive protein and vascular endothelial injury in diabetic patients with coronary heart disease. Exp Ther Med. 2018;16(3):2022-6. PMID: 30186434. doi: https://doi.org/10.3892/etm.2018.6407.

  35. Brown BG, Zhao XQ, Chait A, Fisher LD, Cheung MC, Morse JS, et al. Simvastatin and niacin, antioxidant vitamins, or the combination for the prevention of coronary disease. N Engl J Med. 2001;345(22):1583–92. https://doi.org/10.1056/NEJMoa011090. PMID: 11757504.

    Article  CAS  PubMed  Google Scholar 

  36. Sherazi SWA, Jeong YJ, Jae MH, Bae JW, Lee JY. A machine learning-based 1-year mortality prediction model after hospital discharge for clinical patients with acute coronary syndrome. Health Inf J. 2020;26(2):1289–304. https://doi.org/10.1177/1460458219871780. PMID: 31566458.

    Article  Google Scholar 

Download references

Acknowledgements

We appreciate Mingxing Lei and all other staff for their continued cooperation and contribution in the development of machine learning-based models.

Funding

This work was supported by grants from the Excellent Youth Incubation Program of Chinese People’s Liberation Army General Hospital (2020-YQPY-007), the Natural Science Foundation of Hainan Province (821QN389, 821MS112, 822MS193, 822MS198, 820MS126, 820QN383), the Military Medical Science and Technology Youth Incubation Program (20QNPY110, 19QNP060), the National Natural Science Foundation of China (81900357, 81941021, 81903392, 81901252, 82001476, 81802804, 81801251), the National Key R&D Program of China (2018YFC2000400), the National S&T Resource Sharing Service Platform Project of China (YCZYPT[2018]07), the Specific Research Fund of Innovation Platform for Academicians of Hainan Province (YSPTZX202216), the Medical Big Data R&D Project of Chinese People’s Liberation Army General Hospital (MBD2018030), the Traditional Chinese Medicine Service Capacity Cultivating Program (2021ZY053), the China Postdoctoral Science Foundation Funded Project (2019M650359, 2020M682816, 2021T140298), the Heatstroke Treatment and Research Center of PLA (413EGZ1D10), the Simulation Training for Treatment of Heat Stroke, and the Major Science and Technology Programme of Hainan Province (ZDKJ2019012). The sponsors had no role in the design, conduct, interpretation, review, approval or control of this article.

Author information

Authors and Affiliations

Authors

Contributions

Y.L., L.G., C.N., P.Z., Y.Z., Q.L., P.P. and S.F. wrote the main manuscript text and prepared all figures. All authors reviewed the manuscript.

Corresponding authors

Correspondence to Yali Zhao, Qiong Liu, Ping Ping or Shihui Fu.

Ethics declarations

Ethics approval and consent to participate

The current study received the approval from Ethics Committee of Chinese People’s Liberation Army General Hospital (Beijing, China). Prior to the current study, written informed consents were required from all participants.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary Material 1

Supplementary Material 2

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Li, Y., Guan, L., Ning, C. et al. Machine learning-based models to predict one-year mortality among Chinese older patients with coronary artery disease combined with impaired glucose tolerance or diabetes mellitus. Cardiovasc Diabetol 22, 139 (2023). https://doi.org/10.1186/s12933-023-01854-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12933-023-01854-z

Keywords