Skip to main content

An innovative model for predicting coronary heart disease using triglyceride-glucose index: a machine learning-based cohort study

Abstract

Background

Various predictive models have been developed for predicting the incidence of coronary heart disease (CHD), but none of them has had optimal predictive value. Although these models consider diabetes as an important CHD risk factor, they do not consider insulin resistance or triglyceride (TG). The unsatisfactory performance of these prediction models may be attributed to the ignoring of these factors despite their proven effects on CHD. We decided to modify standard CHD predictive models through machine learning to determine whether the triglyceride-glucose index (TyG-index, a logarithmized combination of fasting blood sugar (FBS) and TG that demonstrates insulin resistance) functions better than diabetes as a CHD predictor.

Methods

Two-thousand participants of a community-based Iranian population, aged 20–74 years, were investigated with a mean follow-up of 9.9 years (range: 7.6–12.2). The association between the TyG-index and CHD was investigated using multivariate Cox proportional hazard models. By selecting common components of previously validated CHD risk scores, we developed machine learning models for predicting CHD. The TyG-index was substituted for diabetes in CHD prediction models. All components of machine learning models were explained in terms of how they affect CHD prediction. CHD-predicting TyG-index cut-off points were calculated.

Results

The incidence of CHD was 14.5%. Compared to the lowest quartile of the TyG-index, the fourth quartile had a fully adjusted hazard ratio of 2.32 (confidence interval [CI] 1.16–4.68, p-trend 0.04). A TyG-index > 8.42 had the highest negative predictive value for CHD. The TyG-index-based support vector machine (SVM) performed significantly better than diabetes-based SVM for predicting CHD. The TyG-index was not only more important than diabetes in predicting CHD; it was the most important factor after age in machine learning models.

Conclusion

We recommend using the TyG-index in clinical practice and predictive models to identify individuals at risk of developing CHD and to aid in its prevention.

Introduction

CHD is a major public health challenge and contributes to the global disease burden. Despite improved prevention methods and treatment techniques [1, 2], it is still the leading cause of morbidity and mortality worldwide, representing 32% of all deaths [3], and an enormous stress on the national health finances [4, 5]. Thus, CHD risk assessment is a global public health priority.

Various CHD predictive models such as Framingham [6], Systematic COronary Risk Evaluation (SCORE) [7], Reynolds [8], American College of Cardiology/American Heart Association (ACC/AHA) [9], Joint British Societies’ consensus recommendations for the prevention of cardiovascular disease (JBS3) [10], Multi-Ethnic Study of Atherosclerosis (MESA) [11], QRISK [12] and prediction for atherosclerotic cardiovascular risk in China (China-PAR) [13], have been developed for predicting CHD incidence, but none has optimal predictive value [14]. All such models consider diabetes as an important CHD risk factor, but not one considers either insulin resistance or TG [14,15,16,17].

A better prediction of CHD may be possible by considering insulin resistance, which occurs years or even decades before diabetes [18]. Previous Mendelian randomized analyses, systematic reviews, and meta-analyses have advocated the association between insulin resistance and CHD by altering vascular wall responses for insulin and promoting atherosclerosis [19,20,21]. The hyperinsulinemic-euglycemic clamp test is the gold standard of insulin resistance measurement, but it is not applicable in clinical studies because of its invasive, complicated, and expensive protocol [22, 23]. Another validated index is the homeostasis model assessment of insulin resistance (HOMA-IR) calculated by dividing serum glucose by insulin concentrations. Circulating insulin concentration is not routinely measured in primary care. Moreover, it has limited value in subjects receiving subcutaneous insulin. Therefore, HOMA-IR is not a suitable index for primary prevention strategies [23]. The TyG-index is a logarithmized product of FBS and TG. It has been shown to correlate highly with the hyperinsulinemic-euglycemic clamp and HOMA-IR [24]. Moreover, it is a simple, low-cost protocol that can be used in all subjects regardless of their insulin treatment status [23]. Additionally, it contains TG, another risk factor for CHD [25, 26] as indicated by several studies; nonetheless, it has not been considered in previous models [6,7,8,9,10,11,12,13]. Therefore, it seems sensible to modify these models with the TyG-index and then evaluate their effectiveness.

Machine learning algorithms have been demonstrated to be extremely useful in predicting cardiovascular disease [27]. Their ability to capture complex interactions and nonlinear relationships between variables and outcomes makes them superior to standard statistical models [28]. Several studies have shown that machine learning algorithms outperform traditional models [29,30,31]. Despite this, no study has explored the impact of TyG-index on the prediction of CHD through machine learning. For these reasons, machine learning models should be chosen to fully assess how TyG-index and diabetes impact and interact with other variables when predicting CHD.

In view of the above, the primary objective of the current study was to investigate the association between the TyG-index and CHD in a 10-year prospective cohort study. The ultimate objective was to modify standard CHD predictive models through machine learning to determine whether the TyG-index functions better than diabetes as a CHD predictor.

Methods

Study population

This cohort study was conducted using data from Yazd Healthy Heart Project (YHHP) a population-based epidemiological study evaluating cardiovascular diseases and metabolic disorders [32].

In YHHP, 100 clusters and 20 families from each cluster were defined, and one adult (aged 20–74 years) from each family was randomly selected for participation and evaluation in the first phase conducted in 2005–2006 (n = 2000, men = 1000, women = 1000) [32].

After 10 years of follow-up (2015–2016), the participants were re-invited to Yazd Cardiovascular Research Centre (YCRC) to be re-evaluated [32].

Included participants

From the 2000 participants, 17 were omitted from the study due to loss during the second phase; from the 1983 individuals participating in the baseline examination, 62 were excluded due to diagnosis of CHD at baseline, 78 due to death during the study, and 308 due to missing data. The remaining 1552 participants (804 men, mean age 48.6 ± 14.7 years) were included in the present study (Fig. 1).

Fig. 1
figure 1

Flow diagram of participants attending the 10-year follow-up study

Ethical approval

The present study was approved by the Shahid Sadoughi University of Medical Sciences ethics committee (ethics code: IR.SSU.REC.1401.069) and conducted based on the Declaration of Helsinki on medical research [33]. Informed consent was obtained from study participants during the initial and follow-up phases. The present research is reported based on strengthening the reporting of observational studies in the epidemiology (STROBE) statement [34].

Biochemical analysis

Blood samples for laboratory tests were collected after overnight fasting. After centrifugation, serum uric acid (SUA), glucose, and TG were assessed using Pars Azmoon kits (Pars Azmoon Inc., Tehran, Iran). Bionic kits (Bionic Company, Tehran, Iran) were used to analyse lipid profiles (total cholesterol, low-density lipoprotein [LDL] and high-density lipoprotein [HDL]). All analyses were conducted using a biochemical auto-analyser (BT 3000, Italy) [32].

Anthropometric and blood pressure measurements

Height was measured in both phases using a stadiometer fixed on a wall with no dents or bumps. While the participants were standing barefoot, their heels, hips, shoulders, and head touching the wall, and their head fixed horizontally to the nearest 0.5 cm. Participants were weighed to the nearest 0.1 kg in the first phase using a digital scale (Seca, Germany) with minimal clothing and in the second phase using another digital scale (Model BF511, Omron Co. Karada body scan, Osaka, Japan). The superior border of the iliac crest and widest part of the buttock were considered to measure waist and hip circumferences, respectively, to the nearest 0.1 cm using a non-stretchable tape.

An automatic digital blood pressure monitor (Omron, M6 comfort, Osaka, Japan) was used to measure blood pressure of the participants’ right arms, while they were in the sitting position. Blood pressure measurements were taken by a trained nurse twice, with an interval of 5 min [32].

Data collection

Data including demographic features, education, physical activity, smoking habits, family history of premature CHD, and dietary habits were collected by completing questionnaires.

Trained interviewers completed questionnaires to assess physical activity, educational attainment, dietary habits and smoking status, in the first phase of the study. For educational attainment, participants were categorized as having a primary, high school, or academic education. Physical activity was assessed using the International Physical Activity Questionnaire (IPAQ) [35]. Participants were categorized as having low, moderate, or vigorous level of activity if their activity was < 600, 600–1200, or  > 1200 kilocalories/week, respectively. Participants were divided into groups of smokers or non-smokers based on their current smoking status. CHD occurrence in either father or brother less than 45 years of age, or mother or sister less than 55 years of age was defined as a family history of premature CHD [32]. A questionnaire was used to determine the use of fried foods, salt, removing poultry skin, eating out, meat consumption, and removing fat from meat.

CHD events were defined as occurrences of fatal or non-fatal CHD, myocardial infarction (MI), percutaneous coronary intervention (PCI), coronary artery bypass grafting (CABG), and new angina. The diagnosis of new angina was based on positive findings from the Rose angina questionnaire [36] in addition to positive electrocardiogram changes, elevated cardiac enzymes, and positive exercise tolerance test or coronary artery angiogram.

The time of outcome for fatal or non-fatal CHD, MI, CABG, positive exercise test, positive cardiac enzymes, and PCI was determined based on medical records. All Rose angina questionnaires [36] and electrocardiograms were investigated by an expert medical practitioner.

Statistical analysis

Statistical analyses were performed with SPSS version 24.0 (IBM Corp., Armonk, NY, USA), Python 3, and R version 4.2.2 (www.R-project.org). Continuous variables were described as mean ± standard deviation (SD) and compared by independent T-test or ANOVA. Categorical variables were described as numbers (percentage) and compared using chi-square tests.

The TyG-index, the primary exposure variable of interest, was defined as:

$$TyG-index=ln\left(\frac{Tg\left(mg/dL\right)\times fasting\,glucose\left(mg/dL\right)}{2}\right)$$

and analysed as quartiles based on sex-specific distributions and as continuous measures. Multivariable Cox proportional hazard models were used to estimate the risk of CHD development. Four models were evaluated: model I was adjusted for age and sex; model II was further adjusted for physical activity, education, family history of premature CHD, and smoking; model III was further adjusted for total cholesterol, HDL, body mass index (BMI), waist-to-hip ratio, blood pressure, SUA, and LDL; and, model IV was further adjusted for consuming fried foods, adding salt, removing poultry skin, using high fat dairy products, dining out, meat consumption, and removing fat from meat. Finally, medication use was adjusted in our models for investigating whether it could modify the association.

The “OptimalCutpoints” [37] R package was used to assess TyG-index cut-off points that can predict CHD. We stratified these cut-points based on sex and diabetes status.

In accordance with previous studies [31, 38], we selected several machine-learning models to construct CHD-prediction models (logistic regression, decision tree, random forest, K nearest neighbor (KNN), and SVM). To simulate previous standard CHD predictor models, we investigated the literature and selected the common components between Framingham risk scores [6], SCORE CVD death risk score [7], QRISK risk calculator [12], Reynolds CVD risk score [8], ACC/AHA pooled cohort hard CVD risk calculator [9], JBS3 risk score [10], MESA risk score [11], and China-PAR risk predictor [13]. As a result of these investigations, age, sex, blood pressure, total cholesterol, HDL, waist-to-hip ratio, diabetes, smoking status, and family history of premature heart disease were considered in simulating a standard CHD prediction model. As part of the preprocessing of data, all missing values and evaluated outliers and highly correlated features were excluded. Because of imbalanced outcome data (14.5% incidence), we used SMOTE (over-sampling method) [39], which has been proven reliable for CHD [38]. After standardizing continuous variables and randomly splitting data into 70/30, we trained models on the larger part of the dataset and evaluated their performance on the smaller part. Afterward, we modified our dataset, by substituting the TyG-index for diabetes, and repeated the previous steps. For demonstrating the comparison of true positive, true negative, false negative, and false positive values of models, we used confusion matrices. We chose to use different color spectra to help illustrate the comparison, and make it easier to understand. To report model performance we calculated area under the curves (AUC), sensitivity, specificity, Cohen-kappa score Matthew's correlation coefficient, and F1-score. We used the generally accepted AUC index [31] and DeLong test [40] to compare the performance of these models. In order to make sense of machine learning models and counter the black box character of machine learning models, we used the “Dalex” library [41] to determine how much the performance of a model changes when a selected explanatory variable is removed.

Results

Additional file 1: Table S1 summarizes the baseline characteristics of the study participants according to the follow-up process. Participants lost to follow-up were significantly older and less frequently male than participants who completed the follow-up.

Additional file 1: Table S2 explains the baseline characteristics of the study participants based on their gender.

The baseline characteristics of participants according to TyG-index quartiles are presented in Table 1. Participants in the highest quartile of serum TyG-index levels (TyG-index > 9.32) were older and had higher total cholesterol, TG, SUA, and fasting blood glucose levels, higher diabetes rates, blood pressure and anthropometric indices, lower HDL levels, and less education.

Table 1 Baseline clinical characteristics and biological variables of the participants according to serum TyG-indexa quartiles

TyG-index and incidence of CHD

The overall incidence of new-onset CHD in the second visit was 14.5%. The incidence of CHD was 6.4%, 11.1%, 14%, and 26% in quartiles 1 to 4, respectively.

Compared with the Q1 group, the hazard ratio (HR) and 95% CI of CHD incidence in model I were 1.51 (0.91–2.51), 1.68 (1.03–2.74), and 2.63 (1.67–4.15) in Q2, Q3, and Q4 groups, respectively. After final adjustment (model IV), HR in Q4 was slightly decreased but still significant. Adjusted HR levels per 1-unit increase in TyG-index were 1.87 (1.59–2.21), 1.70 (1.35–2.14) and 2.16 (1.69–2.77) in the total sample, in men, and in women, respectively (Table 2).

Table 2 Risk of CHDa according to quartiles of TyG-indexb, overall and stratified by gender

When stratifying for gender, the association between TyG-index and risk of CHD in men was no longer significant after adjusting for laboratory markers and dietary patterns, yet it was still significantly associated with CHD in women: HR 4.65 (1.34–16.1) for Q4 compared to Q1. Diabetes medications confounded the association between TyG-index and CHD but dyslipidaemia treatment did not. A TyG-index higher than 9.07 in women and 8.92 in men had the highest sensitivity and specificity simultaneously for predicting CHD (Table 3).

Table 3 TyG-indexa cut-off points

Table 4 shows the statistical functions, as well as the confusion matrices for predicting models consisting of true positive, false positive, true negative, and false negative values. Random forest models had the highest sensitivity and specificity. A significant improvement was seen in the SVM model after modification with the TyG-index. Other models showed no significant changes. In Fig. 2, all the components of these models are compared in terms of their impact on prediction. Eliminating diabetes decreased AUC by around 2% in the decision tree, whereas in other models, it did not affect AUC. Depending on the model, TyG-index removal decreased AUC from 1 to 22%. The current study showed that the TyG-index was much better than diabetes in predicting CHD; overall, it was the second most important factor after age.

Table 4 Comparison of the primary and TyG-indexa-modified* versions of CHDb predictive models using machine learning
Fig. 2
figure 2

Impact of different components of machine learning models on predicting CHD

Discussion

The results of this prospective cohort study in a community-based Iranian population followed for 9.9 years indicate that higher a TyG-index is associated with a higher risk of CHD. This association was more evident in females. Additionally, TyG-index outperformed diabetes in CHD prediction models.

CHD and TyG-index association

An association between the TyG-index and CHD was previously confirmed in both observational [23, 42,43,44,45,46,47,48,49] and meta-analyses studies [19, 50, 51], but the inconsistency in predictive values, the incompleteness of confounding factors (especially diet and medications), and the need to investigate the association in non-diabetic patients in observational studies and heterogeneity in meta-analyses prompted the current study [19].

Previous studies have suggested TyG-index cutoff points of 9 and 9.323 for preventing CHD [52]. The results of the current study will aid healthcare providers in our region to screen their patients for a TyG-index of ≥ 8.42, which our results showed as having the highest negative predictive value, and to consider pharmacological treatment for values of ≥ 9.28, which had the highest positive predictive value in the current study, and to control those under 8.99, which had the highest sensitivity and specificity simultaneously.

Mechanisms

FBS and TG are reflections of insulin resistance in the liver and adipocytes, respectively [53]. A combination of these two factors, the TyG-index demonstrated 96.5% sensitivity and 85% specificity for detecting insulin resistance, a better performance than that of HOMA-IR [51]. Resistance to insulin can trigger inflammatory processes, lipid metabolism deregulations, sympathetic nervous system over-activation, endothelial dysfunction, and eventually, thrombosis and CHD [43, 45, 46, 51, 54,55,56,57]. Therefore, the TyG-index can serve as a simple, practical, cost-effective, reproducible, and reliable surrogate marker for insulin resistance measurement in CHD prevention plans [54].

TyG-index and gender

Studies have shown that the TyG-index plays a significant role in CHD incidence in women [42, 43, 45, 46, 54, 58, 59]. Nonetheless, one study reported a greater role in men [60], and another found no differences between genders [55]. The current study found an association in both genders which persisted only in women after multivariable adjustment. This finding may be explained by the fact that nearly half of the female participants were over 50 years of age and susceptible to menopause at the baseline. Insulin resistance and higher CHD risks can occur after menopause because of decreasing estrogen levels [45, 46, 54, 55, 59]. Furthermore, the TyG-index was an independent risk factor for CHD until model II in non-diabetic participants. The lack of association in diabetic participants may have been due to lifestyle changes and medication consumption during the 10 years of follow-up [61]. Our analysis showed that diabetes treatment made the association non-significant. The first line of diabetes treatment is metformin which can decrease insulin resistance [62], confirming the insignificant association between the TyG-index and CHD in diabetic participants.

Prediction of CHD based on TyG-index

Previous studies have suggested that the TyG-index predicts cardiovascular events more accurately than hemoglobin A1c [23]. In addition, several studies have implicated that adding the TyG-index to the Framingham risk score can increase its predictive power [48, 49] Previous studies concluded that SVM and random forest were the most effective model for predicting CHD [38, 63, 64], the current study found that random forest achieved the highest AUC. In both random forest and SVM models, diabetes played no role, while the TyG-index was the second most influential component. The current study found that the use of the TyG-index instead of diabetes in machine learning models can significantly improve the predictive power of CHD predicting models. Machine learning models demonstrated that the TyG-index was not only more important than diabetes in predicting CHD, but it also was the most important factor after age. To the best of our knowledge, the TyG-index is not used in any clinical guideline [19], but the American Diabetes Association (ADA) suggested in 2022 that patients with elevated TG levels (≥ 150 mg/dL [1.7 mmol/L]) should implement enhanced lifestyle interventions and optimal glycemic control [65]. Our findings advocate the inclusion of the TyG-index in future CHD prevention guidelines.

Strengths and limitations

The following strengths of the current study should be noted. This study is the first to evaluate the predictive power of TyG-index in CHD using machine-learning techniques. To the best of our knowledge, the optimal cut-off points had not previously been evaluated in the Iranian population. The community-based prospective nature of our study and definite outcome determination minimize the chance of reverse causation and recall bias. Including both old and young populations was another advantage the current study had over others, as most previous studies recruited middle-aged and older adults. Furthermore, the current study attempted to ameliorate the adjustment of confounders by adding family history of premature CHD, medication use, dietary habits, complete lipid profile components and all anthropometric features to our models. The long follow-up time in the present study acts as a double-edged sword; indeed, it can reflect the lifetime risk of CHD, but on the other hand, our inability to evaluate and control voluntarily health check-ups or lifestyle changes during the ten-year study period may have affected our findings. Compared to previous studies, we had an identical method for defining of CHD by investigating ECGs, cardiac enzymes, using the Rose angina questionnaire, exercise tolerance test, and coronary artery angiogram.

This study had several limitations. First, it was embedded in an observational setting, and despite a wide range of adjustments, we cannot rule out the possibility of unmeasured confounders. Single baseline TyG-index investigation may incline our results to intra-individual variation. Second, we may have observed gender-specific results due to the lack of data on menopausal status. Third, only Iranian subjects were included, so our findings might not be generalizable to other countries.

Conclusion

The TyG-index can be used in clinical practice and predictive models as a highly valuable index for predicting and preventing CHD, but further studies are needed to validate our findings.

Availability of data and materials

The datasets used and/or analyzed during the current study are available from the corresponding author on reasonable request.

Abbreviations

TyG-index:

Triglyceride-glucose index

CHD:

Coronary heart disease

CI:

Confidence interval

HOMA-IR:

Homeostatic Model Assessment for Insulin Resistance

YHHP:

YAZD healthy heart project

TG:

Triglyceride

LDL:

Low-density lipoprotein

HDL:

High-density lipoprotein

FBS:

Fasting blood sugar

SUA:

Serum uric acid

IPAQ:

International Physical Activity Questionnaire

SD:

Standard deviation

BMI:

Body mass index

KNN:

K nearest neighbour

SVM:

Support vector machine

ADA:

American Diabetes Association

ECG:

Electro-cardio-gram

AUC:

Area under the curves

SCORE:

Systematic COronary Risk Evaluation

ACC/AHA:

American College of Cardiology/American Heart Association

China-PAR:

Prediction for atherosclerotic cardiovascular risk in China

MESA:

Multi-Ethnic Study of Atherosclerosis

JBS3:

Joint British Societies’ consensus recommendations for the prevention of cardiovascular disease

References

  1. Virani SS, Alonso A, Aparicio HJ, Benjamin EJ, Bittencourt MS, Callaway CW, Carson AP, Chamberlain AM, Cheng S, Delling FN. Heart disease and stroke statistics—2021 update: a report from the American Heart Association. Circulation. 2021;143(8):e254–743.

    PubMed  Google Scholar 

  2. Lopez AD, Mathers CD, Ezzati M, Jamison DT, Murray CJ. Global and regional burden of disease and risk factors, 2001: systematic analysis of population health data. The lancet. 2006;367(9524):1747–57.

    Google Scholar 

  3. Cardiovascular diseases (CVDs) [https://www.who.int/news-room/fact-sheets/detail/cardiovascular-diseases-(cvds)]. Accessed 6 Jan 2023.

  4. Cui H, Liu Q, Wu Y, Cao L. Cumulative triglyceride-glucose index is a risk for CVD: a prospective cohort study. Cardiovasc Diabetol. 2022;21(1):22.

    PubMed  PubMed Central  CAS  Google Scholar 

  5. Zhao D, Liu J, Wang M, Zhang X, Zhou M. Epidemiology of cardiovascular disease in China: current features and implications. Nat Rev Cardiol. 2019;16(4):203–12.

    PubMed  Google Scholar 

  6. D’Agostino RB, Grundy S, Sullivan LM, Wilson P, Group CRP. Validation of the Framingham coronary heart disease prediction scores: results of a multiple ethnic groups investigation. Jama. 2001;286(2):180–7.

    PubMed  Google Scholar 

  7. Conroy RM, Pyörälä K, Ae Fitzgerald, Sans S, Menotti A, De Backer G, De Bacquer D, Ducimetiere P, Jousilahti P, Keil U. Estimation of ten-year risk of fatal cardiovascular disease in Europe: the SCORE project. Eur Heart J. 2003;24(11):987–1003.

    PubMed  CAS  Google Scholar 

  8. Ridker PM, Paynter NP, Rifai N, Gaziano JM, Cook NR. C-reactive protein and parental history improve global cardiovascular risk prediction: the Reynolds Risk Score for men. Circulation. 2008;118(22):2243–51.

    PubMed  PubMed Central  CAS  Google Scholar 

  9. Goff DC Jr, Lloyd-Jones DM, Bennett G, Coady S, D’agostino RB, Gibbons R, Greenland P, Lackland DT, Levy D, O’donnell CJ. 2013 ACC/AHA guideline on the assessment of cardiovascular risk: a report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines. Circulation. 2014;129:S49–73.

    PubMed  Google Scholar 

  10. Board JBS. Joint British Societies’ consensus recommendations for the prevention of cardiovascular disease (JBS3). Heart. 2014;100:ii1–67.

    Google Scholar 

  11. McClelland RL, Jorgensen NW, Budoff M, Blaha MJ, Post WS, Kronmal RA, Bild DE, Shea S, Liu K, Watson KE. 10-year coronary heart disease risk prediction using coronary artery calcium and traditional risk factors: derivation in the MESA (Multi-Ethnic Study of Atherosclerosis) with validation in the HNR (Heinz Nixdorf Recall) study and the DHS (Dallas Heart Study). J Am Coll Cardiol. 2015;66(15):1643–53.

    PubMed  PubMed Central  CAS  Google Scholar 

  12. Hippisley-Cox J, Coupland C, Vinogradova Y, Robson J, Minhas R, Sheikh A, Brindle P. Predicting cardiovascular risk in England and Wales: prospective derivation and validation of QRISK2. BMJ. 2008;336(7659):1475–82.

    PubMed  PubMed Central  Google Scholar 

  13. Yang X, Li J, Hu D, Chen J, Li Y, Huang J, Liu X, Liu F, Cao J, Shen C. Predicting the 10-year risks of atherosclerotic cardiovascular disease in Chinese population: the China-PAR project (prediction for ASCVD risk in China). Circulation. 2016;134(19):1430–40.

    PubMed  Google Scholar 

  14. Sofogianni A, Stalikas N, Antza C, Tziomalos K. Cardiovascular risk prediction models and scores in the era of personalized medicine. J Pers Med. 2022;12(7):1180.

    PubMed  PubMed Central  Google Scholar 

  15. Studziński K, Tomasik T, Krzysztoń J, Jóźwiak J, Windak A. Effect of using cardiovascular risk scoring in routine risk assessment in primary prevention of cardiovascular disease: an overview of systematic reviews. BMC Cardiovasc Disord. 2019;19(1):1–16.

    Google Scholar 

  16. Karmali KN, Persell SD, Perel P, Lloyd-Jones DM, Berendsen MA, Huffman MD. Risk scoring for the primary prevention of cardiovascular disease. Cochrane Database Syst Rev. 2017. https://doi.org/10.1002/14651858.CD006887.pub4.

    Article  PubMed  PubMed Central  Google Scholar 

  17. Adeva-Andany MM, Martínez-Rodríguez J, González-Lucán M, Fernández-Fernández C, Castro-Quintela E. Insulin resistance is a cardiovascular risk factor in humans. Diabetes Metab Syndr. 2019;13(2):1449–55.

    PubMed  Google Scholar 

  18. Wu Z, Cui H, Li W, Zhang Y, Liu L, Liu Z, Zhang W, Zheng T, Yang J. Comparison of three non-insulin-based insulin resistance indexes in predicting the presence and severity of coronary artery disease. Front Cardiovasc Med. 2022;9:918359.

    PubMed  PubMed Central  CAS  Google Scholar 

  19. Liu X, Tan Z, Huang Y, Zhao H, Liu M, Yu P, Ma J, Zhao Y, Zhu W, Wang J. Relationship between the triglyceride-glucose index and risk of cardiovascular diseases and mortality in the general population: a systematic review and meta-analysis. Cardiovasc Diabetol. 2022;21(1):124.

    PubMed  PubMed Central  CAS  Google Scholar 

  20. Gast KB, Tjeerdema N, Stijnen T, Smit JW, Dekkers OM. Insulin resistance and risk of incident cardiovascular events in adults without diabetes: meta-analysis. PLoS ONE. 2012;7(12):e52036.

    PubMed  PubMed Central  CAS  Google Scholar 

  21. Chen W, Wang S, Lv W, Pan Y. Causal associations of insulin resistance with coronary artery disease and ischemic stroke: a Mendelian randomization analysis. BMJ Open Diabetes Res Care. 2020;8(1):e001217.

    PubMed  PubMed Central  Google Scholar 

  22. Tao LC, Xu JN, Wang TT, Hua F, Li JJ. Triglyceride-glucose index as a marker in cardiovascular diseases: landscape and limitations. Cardiovasc Diabetol. 2022;21(1):68.

    PubMed  PubMed Central  CAS  Google Scholar 

  23. Wang A, Tian X, Zuo Y, Chen S, Meng X, Wu S, Wang Y. Change in triglyceride-glucose index predicts the risk of cardiovascular disease in the general population: a prospective cohort study. Cardiovasc Diabetol. 2021. https://doi.org/10.1186/s12933-021-01305-7.

    Article  PubMed  PubMed Central  Google Scholar 

  24. Won KB, Kim YS, Lee BK, Heo R, Han D, Lee JH, Lee SE, Sung JM, Cho I, Park HB, et al. The relationship of insulin resistance estimated by triglyceride glucose index and coronary plaque characteristics. Medicine (Baltimore). 2018;97(21):e10726.

    PubMed  CAS  Google Scholar 

  25. Hokanson JE, Austin MA. Plasma triglyceride level is a risk factor for cardiovascular disease independent of high-density lipoprotein cholesterol level: a meta-analysis of population-based prospective studies. J Cardiovasc Risk. 1996;3(2):213–9.

    PubMed  CAS  Google Scholar 

  26. Hokanson JE, Austin MA. Plasma triglyceride level is a risk factor for cardiovascular disease independent of high-density lipoprotein cholesterol level: a metaanalysis of population-based prospective studies. J Cardiovasc Risk. 1996;3(2):213–9.

    PubMed  CAS  Google Scholar 

  27. Weng SF, Reps J, Kai J, Garibaldi JM, Qureshi N. Can machine-learning improve cardiovascular risk prediction using routine clinical data? PLoS ONE. 2017;12(4):e0174944.

    PubMed  PubMed Central  Google Scholar 

  28. Subramani S, Varshney N, Anand MV, Soudagar MEM, Al-Keridis LA, Upadhyay TK, Alshammari N, Saeed M, Subramanian K, Anbarasu K. Cardiovascular diseases prediction by machine learning incorporation with deep learning. Front Med. 2023;10:1150933.

    Google Scholar 

  29. Li Y, Sperrin M, Ashcroft DM, Van Staa TP. Consistency of variety of machine learning and statistical models in predicting clinical risks of individual patients: longitudinal cohort study using cardiovascular disease as exemplar. BMJ. 2020;371:m3919.

    PubMed  PubMed Central  Google Scholar 

  30. Allan S, Olaiya R, Burhan R. Reviewing the use and quality of machine learning in developing clinical prediction models for cardiovascular disease. Postgrad Med J. 2022;98(1161):551–8.

    PubMed  Google Scholar 

  31. Beunza J-J, Puertas E, García-Ovejero E, Villalba G, Condes E, Koleva G, Hurtado C, Landecho MF. Comparison of machine learning algorithms for clinical event prediction (risk of coronary heart disease). J Biomed Inform. 2019;97:103257.

    PubMed  Google Scholar 

  32. Sarebanhassanabadi M, Mirhosseini SJ, Mirzaei M, Namayandeh SM, Soltani MH, Salehi-Abargouei A. The association between a dietary habits score and the risk of metabolic syndrome: a cohort study. Clin Nutr. 2020;39(1):282–90.

    PubMed  Google Scholar 

  33. Ashcroft RE. The declaration of Helsinki. The Oxford textbook of clinical research ethics. Oxford: Oxford University Press; 2008. p. 141–8.

    Google Scholar 

  34. Vandenbroucke JP, Ev Elm, Altman DG, Gøtzsche PC, Mulrow CD, Pocock SJ, Poole C, Schlesselman JJ, Egger M, Initiative S. Strengthening the reporting of observational studies in epidemiology (STROBE): explanation and elaboration. Ann Intern Med. 2007;147(8):W163–94.

    PubMed  Google Scholar 

  35. Hallal PC, Victora CG. Reliability and validity of the international physical activity questionnaire (IPAQ). Med Sci Sports Exerc. 2004;36(3):556.

    PubMed  Google Scholar 

  36. Dg COOK, Shaper A, MacFarlane P. Using the WHO (Rose) angina questionnaire in cardiovascular epidemiology. Int J Epidemiol. 1989. https://doi.org/10.1093/ije/18.3.607.

    Article  Google Scholar 

  37. Computing Optimal Cutpoints in Diagnostic Tests https://cran.r-project.org/web/packages/OptimalCutpoints/OptimalCutpoints.pdf. Accessed 16 Jan 2023.

  38. Trigka M, Dritsas E. Long-term coronary artery disease risk prediction with machine learning models. Sensors. 2023;23(3):1193.

    PubMed  PubMed Central  Google Scholar 

  39. Fernández A, Garcia S, Herrera F, Chawla NV. SMOTE for learning from imbalanced data: progress and challenges, marking the 15-year anniversary. J Artific Intell Res. 2018;61:863–905.

    Google Scholar 

  40. DeLong ER, DeLong DM, Clarke-Pearson DL. Comparing the areas under two or more correlated receiver operating characteristic curves: a nonparametric approach. Biometrics. 1988;44:837–45.

    PubMed  CAS  Google Scholar 

  41. Baniecki H, Kretowicz W, Piatyszek P, Wisniewski J, Biecek P. Dalex: responsible machine learning with interactive explainability and fairness in python. J Mach Learn Res. 2021;22(1):9759–65.

    Google Scholar 

  42. Cho YK, Han KD, Kim HS, Jung CH, Park JY, Lee WJ. Triglyceride-glucose index is a useful marker for predicting future cardiovascular disease and mortality in young korean adults: a nationwide population-based cohort study. J Lipid Atheroscler. 2022;11(2):178–86.

    PubMed  PubMed Central  Google Scholar 

  43. Kim J, Shin SJ, Kang HT. The association between triglyceride-glucose index, cardio-cerebrovascular diseases, and death in Korean adults: a retrospective study based on the NHIS-HEALS cohort. PLoS ONE. 2021;16(11):e0259212.

    PubMed  PubMed Central  CAS  Google Scholar 

  44. Li H, Zuo Y, Qian F, Chen S, Tian X, Wang P, Li X, Guo X, Wu S, Wang A. Triglyceride-glucose index variability and incident cardiovascular disease: a prospective cohort study. Cardiovasc Diabetol. 2022;21(1):105.

    PubMed  PubMed Central  Google Scholar 

  45. Li S, Guo B, Chen H, Shi Z, Li Y, Tian Q, Shi S. The role of the triglyceride (triacylglycerol) glucose index in the development of cardiovascular events: a retrospective cohort analysis. Sci Rep. 2019. https://doi.org/10.1038/s41598-019-43776-5.

    Article  PubMed  PubMed Central  Google Scholar 

  46. Tian X, Zuo Y, Chen S, Liu Q, Tao B, Wu S, Wang A. Triglyceride–glucose index is associated with the risk of myocardial infarction: an 11-year prospective study in the Kailuan cohort. Cardiovasc Diabetol. 2021. https://doi.org/10.1186/s12933-020-01210-5.

    Article  PubMed  PubMed Central  Google Scholar 

  47. Yu LZ, Ma RL, Zhang XH, He J, Guo H, Hu YH, Wang XP, Mu LL, Yan YZ, Guo SX. The relationship between triglyceride glucose index and risk of cardiovascular disease among Kazakh and Uygur population in Xinjiang: a retrospective cohort analysis. Zhonghua Yu Fang Yi Xue Za Zhi. 2022;56(6):800–5.

    PubMed  CAS  Google Scholar 

  48. Barzegar N, Tohidi M, Hasheminia M, Azizi F, Hadaegh F. The impact of triglyceride-glucose index on incident cardiovascular events during 16 years of follow-up: Tehran lipid and glucose study. Cardiovasc Diabetol. 2020. https://doi.org/10.1186/s12933-020-01121-5.

    Article  PubMed  PubMed Central  Google Scholar 

  49. Sánchez-Íñigo L, Navarro-González D, Fernández-Montero A, Pastrana-Delgado J, Martínez JA. The TyG index may predict the development of cardiovascular events. Eur J Clin Invest. 2016;46(2):189–97.

    PubMed  Google Scholar 

  50. Sajdeya O, Beran A, Mhanna M, Alharbi A, Burmeister C, Abuhelwa Z, Malhas SE, Khader Y, Sayeh W, Assaly R, et al. triglyceride glucose index for the prediction of subclinical atherosclerosis and arterial stiffness: a meta-analysis of 37,780 individuals. Curr Probl Cardiol. 2022;47(12):101390.

    PubMed  Google Scholar 

  51. Ding X, Wang X, Wu J, Zhang M, Cui M. Triglyceride–glucose index and the incidence of atherosclerotic cardiovascular diseases: a meta-analysis of cohort studies. Cardiovasc Diabetol. 2021. https://doi.org/10.1186/s12933-021-01268-9.

    Article  PubMed  PubMed Central  Google Scholar 

  52. Araújo SP, Juvanhol LL, Bressan J, Hermsdorff HHM. Triglyceride glucose index: a new biomarker in predicting cardiovascular risk. Prev Med Rep. 2022;29:101941.

    PubMed  PubMed Central  Google Scholar 

  53. Tian X, Chen S, Zhang Y, Zhang X, Xu Q, Wang P, Wu S, Wang A, Luo Y. Time course of the triglyceride glucose index accumulation with the risk of cardiovascular disease and all-cause mortality. Cardiovasc Diabetol. 2022;21(1):183.

    PubMed  PubMed Central  CAS  Google Scholar 

  54. Liu J, Bu H, Ding Z, Zhang Y, Chen Y, Gao Y. The association of triglyceride glucose index for coronary artery disease in postmenopausal women. Clin Appl Thromb Hemost. 2022;28:10760296221094030.

    PubMed  PubMed Central  CAS  Google Scholar 

  55. Yang K, Liu W. Triglyceride and glucose index and sex differences in relation to major adverse cardiovascular events in hypertensive patients without diabetes. Front Endocrinol (Lausanne). 2021;12:761397.

    PubMed  Google Scholar 

  56. Chen F, Pan Y, Liu Z, Huang R, Wang J, Shao J, Gong Y, Sun X, Jiang X, Wang W, et al. Impact of visit-to-visit triglyceride-glucose index variability on the risk of cardiovascular disease in the elderly. Int J Endocrinol. 2022;2022:5125884.

    PubMed  PubMed Central  Google Scholar 

  57. Liu Y, Wu M, Xu J, Sha D, Xu B, Kang L. Association between Triglyceride and glycose (TyG) index and subclinical myocardial injury. Nutr Metab Cardiovasc Dis. 2020;30(11):2072–6.

    PubMed  CAS  Google Scholar 

  58. Nakagomi A, Sunami Y, Kawasaki Y, Fujisawa T, Kobayashi Y. Sex difference in the association between surrogate markers of insulin resistance and arterial stiffness. J Diabetes Complications. 2020;34(6):107442.

    PubMed  Google Scholar 

  59. Lu YW, Chang CC, Chou RH, Tsai YL, Liu LK, Chen LK, Huang PH, Lin SJ. Gender difference in the association between TyG index and subclinical atherosclerosis: results from the I-Lan Longitudinal aging study. Cardiovasc Diabetol. 2021;20(1):206.

    PubMed  PubMed Central  Google Scholar 

  60. Ye Z, Xie E, Gao Y, Li P, Tu Y, Guo Z, Li Q, Wu Y, Yu X, Li Y, et al. The triglyceride glucose index is associated with future cardiovascular disease nonlinearly in middle-aged and elderly Chinese adults. BMC Endocr Disord. 2022;22(1):242.

    PubMed  PubMed Central  CAS  Google Scholar 

  61. Park B, Lee Y-J, Lee HS, Jung D-H. The triglyceride-glucose index predicts ischemic heart disease risk in Koreans: a prospective study using National Health Insurance Service data. Cardiovasc Diabetol. 2020. https://doi.org/10.1186/s12933-020-01186-2.

    Article  PubMed  PubMed Central  Google Scholar 

  62. Davies MJ, Aroda VR, Collins BS, Gabbay RA, Green J, Maruthur NM, Rosas SE, Del Prato S, Mathieu C, Mingrone G. Management of hyperglycemia in type 2 diabetes, 2022. A consensus report by the American Diabetes Association (ADA) and the European Association for the Study of Diabetes (EASD). Diabetes Care. 2022;45(11):2753–86.

    PubMed  PubMed Central  CAS  Google Scholar 

  63. Hassan CAU, Iqbal J, Irfan R, Hussain S, Algarni AD, Bukhari SSH, Alturki N, Ullah SS. Effectively predicting the presence of coronary heart disease using machine learning classifiers. Sensors (Basel). 2022. https://doi.org/10.3390/s22197227.

    Article  PubMed  PubMed Central  Google Scholar 

  64. Garavand A, Salehnasab C, Behmanesh A, Aslani N, Zadeh AH, Ghaderzadeh M. Efficient model for coronary artery disease diagnosis: a comparative study of several machine learning algorithms. J Healthc Eng. 2022;2022:5359540.

    PubMed  PubMed Central  Google Scholar 

  65. Draznin B, Aroda VR, Bakris G, Benson G, Brown FM, Freeman R, Green J, Huang E, Isaacs D, Kahan S. 10. Cardiovascular disease and risk management: standards of medical care in diabetes-2022. Diabetes Care. 2022;45:S144–74.

    Google Scholar 

Download references

Acknowledgements

We thank all study participants, their relatives, the members of the survey team and project developments and management team of YHHP and Yazd cardiovascular research center.

Funding

This study had no funding.

Author information

Authors and Affiliations

Authors

Contributions

MSH involved in the conception, design, and conduct of the study. SRM and ZHM involved in conception, design, analysis and interpretation of the results and wrote the first draft of the manuscript. PMV, AK and SS revised it critically for important intellectual content. All authors edited, reviewed, and approved the final version of the manuscript. MSH is the guarantor of this work and, as such, had full access to all the data in the study and takes responsibility for the integrity of the data and the accuracy of the data analysis.

Corresponding author

Correspondence to Mohammadtaghi Sarebanhassanabadi.

Ethics declarations

Ethics approval and consent to participate

The present study was approved by the Shahid Sadoughi University of Medical. Sciences ethics committee (ethics code: IR.SSU.REC.1401.069) and conducted based on the Declaration of Helsinki on medical research [33]. Informed consent was obtained from study participants during the initial and the follow-up phase. The present research is reported based on strengthening the reporting of observational studies in epidemiology (STROBE) statement.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Table S1.

Baseline clinical characteristics and biological variables of the participants according to inclusion process. Table S2. Comparison of baseline clinical characteristics and biological variables between genders.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Mirjalili, S.R., Soltani, S., Heidari Meybodi, Z. et al. An innovative model for predicting coronary heart disease using triglyceride-glucose index: a machine learning-based cohort study. Cardiovasc Diabetol 22, 200 (2023). https://doi.org/10.1186/s12933-023-01939-9

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12933-023-01939-9

Keywords