If you don't remember your password, you can reset it by entering your email address and clicking the Reset Password button. You will then receive an email that contains a secure link for resetting your password
If the address matches a valid account an email will be sent to __email__ with instructions for resetting your password
Corresponding author at: Centre for Health Policy, School of Population and Global Health The University of Melbourne, Level 4, 207 Bouverie Street, Carlton, Victoria 3053, Australia. Tel.: +61 3 90356387; Fax: +61 3 9348 1174.
Centre for Chronic Disease Prevention, James Cook University, Cairns, Qld, AustraliaSchool of Health Sciences, University of South Australia, Adelaide, SA, Australia
Centre for Health Policy, Melbourne School of Population and Global Health, University of Melbourne, Melbourne, Vic, AustraliaMenzies Institute for Medical Research, University of Tasmania, Hobart, Tas, Australia
Centre for Health Policy, Melbourne School of Population and Global Health, University of Melbourne, Melbourne, Vic, AustraliaHealth Economics Research Centre, Nuffield Department of Population Health, University of Oxford, Oxford, UK
Cardiovascular disease (CVD) is the leading cause of death for Indigenous Australians. There is widespread belief that current tools have deficiencies for assessing CVD risk in this high-risk population. We sought to develop a 5-year CVD risk score using a wide range of known risk factors to further improve CVD risk prediction in this population.
Methods
We used clinical and demographic information on Indigenous people aged between 30 and 74 years without a history of CVD events who participated in the Well Person’s Health Check (WPHC), a community-based survey. Baseline assessments were conducted between 1998 and 2000, and data were linked to administrative hospitalisation and death records for identification of CVD events. We used Cox proportional hazard models to estimate the 5-year CVD risk, and the Harrell’s c-statistic and the modified Hosmer-Lemeshow (mH-L) χ2 statistic to assess the model discrimination and calibration, respectively.
Results
The study sample consisted of 1,583 individuals (48.1% male; mean age 45.0 year). The risk score consisted of sex, age, systolic blood pressure, diabetes mellitus, waist circumference, triglycerides, and albumin creatinine ratio. The bias-corrected c-statistic was 0.72 and the bias-corrected mH-L χ2 statistic was 12.01 (p-value, 0.212), indicating good discrimination and calibration, respectively. Using our risk score, the CVD risk of the Indigenous Australians could be stratified to a greater degree compared to a recalibrated Framingham risk score.
Conclusions
A seven-factor risk score could satisfactorily stratify 5-year risk of CVD in an Indigenous Australian cohort. These findings inform future research targeting CVD risk in Indigenous Australians.
Early prevention of cardiovascular disease (CVD) is recognised to be crucial for indigenous populations given their substantially higher CVD risk compared to non-indigenous people [
]. The excess CVD risk among Indigenous Australians is largely due to the very high prevalence of traditional CVD risk factors such as hypertension and diabetes. In addition, other highly prevalent risk factors such as albuminuria have also been shown to be associated with CVD in Indigenous Australians [
ACC/AHA guideline on the assessment of cardiovascular risk: a report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines.
]. One explanation for this underestimation may be that certain CVD risk factors for Indigenous people such as albuminuria and waist circumference are not included in the Framingham CVD equations [
Cardiovascular risk assessment in low-resource settings: a consensus document of the European Society of Hypertension Working Group on Hypertension and Cardiovascular Risk in Low Resource Settings.
]. When information on traditional risk factors is not available, a standard approach for risk stratification is to produce a model or risk chart based on a reduced set of factors [
In the present study, we aimed to develop and validate prediction models for 5-year risk of CVD using data from an Indigenous Australian cohort. We developed two models: a primary model that included any relevant factors and a reduced information model that does not contain laboratory variables. After internally validating these models, we used them to classify the study cohort as groups with high, moderate and low CVD risks alongside classification based on the Australian guidelines for CVD prevention [
], and compared the average risks in each predicted category with the observed risks. Then, we examined the extent to which the Australian guidelines may misclassify Indigenous Australians with and without high 5-year CVD risk.
Methods
Study Cohort
The cohort used in the present study was obtained from the Well Person’s Health Check (WPHC) study [
]. Briefly, the WPHC was conducted between 1998 and 2000 and included 3,508 people in 26 rural and remote Indigenous communities in North Queensland. For our analysis, we included Indigenous men and women aged between 30 and 74 years who could be linked to hospitalisation and death records for identification of the CVD events occurring after the study entry. We excluded people with a history of CVD events. In contrast to the study by Hua et al. [
], we did not exclude people with missing risk factors at baseline. Instead, we used imputations to replace the missing values with the estimates. As a result, the study cohort consisted of 1,583 individuals (see supplementary document - Figure S1 for a flow chart of the cohort selection).
Outcome Events and Follow-Up Time
We used the definition of CVD from the Framingham Heart Study for our outcome events, which included a composite of coronary heart disease (coronary insufficiency, myocardial infarction, and angina pectoris), cerebrovascular events (ischaemic stroke, haemorrhagic stroke and transient ischaemic attack), congestive heart failure, peripheral vascular disease, and coronary deaths [
]. These events were identified using the International Classification of Diseases and procedure codes (versions ICD-9-CM and ICD-10-AM; for details, see Hua et al.) [
]. The follow-up time was calculated as the duration between baseline measurement and hospitalisation of a CVD event, coronary death, non-coronary death or last date of data collection (1 December 2014), whichever came first. To maintain alignment with the CVD risk calculators recommended by the Australian guidelines for prevention of CVD [
], a 5-year follow-up period was generated for all individuals. This was done by considering an individual as right censored if there was no recorded CVD event within the 5-year follow-up period.
Statistical Analysis
Missing Data and Multiple Imputation
The percentage of observations with missing data ranged from 0% to 15.8% depending on variables recorded in the data set. We used multiple imputation [
] to replace missing data with appropriate estimates (see detailed methods in supplementary document - section 2).
Model Selection
We used Cox proportional hazard models to estimate the 5-year CVD risk. Two models were developed: (i) the primary model with variables selected from risk factors included in the Framingham CVD models [
] which have been shown to be associated with CVD risk in Indigenous populations; and (ii) the reduced information model that excluded risk factors requiring laboratory measurements. Initial covariates considered in the primary model included sex, age, total cholesterol, high-density lipoprotein (HDL) cholesterol, low-density lipoprotein (LDL) cholesterol, total cholesterol to HDL ratio, triglycerides, systolic blood pressure (SBP), diastolic blood pressure (DBP), current smoking, diabetes status, fasting blood glucose, body mass index (BMI), waist circumference and ACR (see supplementary document - section 3 for methods of risk factor measurements). Of note, the diabetes condition was identified based on the patient self-reports and confirmed by the primary care medical records. In the reduced information model, only sex, age, SBP, DBP, current smoking, diabetes status, BMI and waist circumference were considered. To select the optimal set of predictors, we performed both stepwise and manual procedures, and included variables with a p-value smaller than 0.15 in the final models (see supplementary document - section 4 for detailed methods). We used the scaled Schoenfeld residuals [
where is the 5-year CVD risk, is the survival rate at 5 years of a person with mean values of risk factors, is the estimated coefficient for the i-th risk factor, and and are the actual and mean values, respectively, of the i-th risk factor. We estimated and and their 95% CIs by fitting the final model to each of the imputed data sets and combined the estimates using Rubin’s rule (see supplementary document - section 5 for detailed methods).
] to evaluate the model discrimination, i.e. the model’s ability to separate persons with events from those without events. The c-statistic is defined as the proportion of all possible pairs of persons, at least one of whom has an event, in which the predicted survival time is larger for the person who lived longer. A c index of 0.7 or larger indicates good discrimination.
To quantify the model calibration, i.e. the agreement between the observed and predicted risks, we used the modified Hosmer-Lemeshow (mH-L) χ2 statistic proposed by D’Agostino and Nam for survival models [
]. The participants were divided into deciles based on their predicted CVD risks, and the Kaplan-Meier estimate for failure within each decile was compared with the mean predicted risk in the same decile. A mH-L χ2 statistic larger than 20 indicates a poor calibration [
To internally validate the models, we calculated the apparent c and mH-L χ2 statistics and corrected these for the optimism that might arise from overfitting using the algorithm developed by Harrell [
] (see supplementary document - section 6 for detailed methods). The corrected c and mH-L χ2 statistics were shown to represent nearly unbiased assessment of the model performance [
We classified 1,583 individuals as either a high (>15%), moderate (10–15%) or low 5-year CVD risk (<10%) using the algorithm provided by the Australian guidelines for CVD prevention [
]. We used the Kaplan-Meier estimator to compute 5-year CVD risks of people within the same category in different imputed data sets and combined them to estimate the observed risks. The Australian guidelines automatically classify people with one or more of the following conditions as high-risk people: diabetes and age >60 years, diabetes with microalbuminuria, moderate or severe chronic kidney disease, a history of familial hypercholesterolaemia, a SBP ≥ 180 mmHg, serum total cholesterol >7.5 mmol/L, or age >74. For people not automatically classified as high risk, the 1991 Framingham equation [
] is recommended by the Australian guidelines for calculating the absolute risks, which are used to classify these people into high (>15%), moderate (10–15%) and low (<10%) risks. Therefore, people classified as high risk by the guidelines may have or not have any of the above-mentioned conditions. We used the same cut-off points for risk categories as those in the guidelines to classify people using our new equations. Because the Australian guidelines do not recommend any equation for the calculation of the actual risk of those automatically classified as high risk, we used the 1991 Framingham equation [
] to examine their absolute risks in reference to the observed risks and the risks predicted by our new models and the recalibrated 2008 Framingham model.
To examine the potential misclassification of 5-year CVD risk using the Australian guidelines, we calculated the proportions of people classified as high risk and medium-low risk by the Australian guidelines, and within each of these groups determined the proportions of people with a contrasting risk category based on our new models.
Results
Of 1,583 people, 142 developed CVD within 5 years after the first screening. We found that sex, age, SBP, waist circumference, diabetes status, triglycerides and ACR ratio were associated with 5-year CVD risk. Table 1 provides summary statistics of these risk factors and the 5-year survival rate of an average person. Current smoking and cholesterol-related variables did not significantly contribute to the prediction (p-values >0.5; see supplementary document - section 7 for coefficients of all variables included in the full model). Table 2 shows the regression coefficients and hazard ratios (HRs) for CVD events associated with the risk factors in the primary and reduced information models. The proportional hazard assumption was satisfied in fitting the models to any of the imputed data sets.
Table 1Baseline risk factors and 5-year survival rate of the study cohort (N = 1,583).
Estimated using the fitted Cox proportional hazard models for a person with mean risk factors.
Primary equation
0.9298
Reduced information equation
0.9344
Abbreviations: ACR, Albumin creatinine ratio; BMI, body mass index; BP, blood pressure; HDL, high-density lipoprotein; LDL, low-density lipoprotein; SE, standard deviation of the mean.
]. The bias-corrected c-statistics of the primary and reduced information models were 0.722 and 0.719, respectively, indicating good discrimination. The bias-corrected mH-L χ2 statistic (p-values) of the primary and reduced information models were 12.01 (0.212) and 12.49 (0.188), respectively, indicating good calibration. Compared to the recalibrated 2008 Framingham model [
], which had a bias-corrected c-statistic of 0.727 and a bias-corrected mH-L χ2 statistic (p-value) of 29.47 (0.0005), our new models perform similarly in terms of discrimination, but far better in terms of calibration. The observed and predicted risks in deciles of the primary and reduced information models were similar (Figure 1).
Table 3Performance of the new models and the recalibrated 2008 Framingham model for 5-year CVD risk prediction.
These values are slightly different from those in the study by Hua et al. [11] because in the present study we used multiple imputation to replace missing values with the estimates and evaluated performance of the recalibrated 2008 Framingham model on the 20 imputed data sets, while in the previous study [11] the recalibrated 2008 Framingham model was evaluated on the cohort with complete cases.
These values are slightly different from those in the study by Hua et al. [11] because in the present study we used multiple imputation to replace missing values with the estimates and evaluated performance of the recalibrated 2008 Framingham model on the 20 imputed data sets, while in the previous study [11] the recalibrated 2008 Framingham model was evaluated on the cohort with complete cases.
Optimism
6.31
5.03
10.30
Biased-corrected mH-L χ2 statistic (p-value)
12.01 (0.212)
12.49 (0.188)
29.47 (<0.001)
* These values are slightly different from those in the study by Hua et al.
because in the present study we used multiple imputation to replace missing values with the estimates and evaluated performance of the recalibrated 2008 Framingham model on the 20 imputed data sets, while in the previous study
Figure 1Mean observed and predicted 5-year cardiovascular risks in deciles of the predicted risks using (A) the primary model and (B) the reduced information model. The error bars represent 95% confidence intervals.
Table 4 shows mean observed and predicted absolute risks for the whole cohort and different groups stratified by sex, age and predicted risk levels. The predicted mean 5-year risks for the whole cohort, women and men were 9.4%, 8.8% and 10.1%, respectively, using the primary model, and were 9.4%, 8.6% and 10.2%, respectively, using the reduced information model. These predicted values were almost identical to the observed 5-years risks (9.4%, 8.7% and 10.1% for the whole cohort, women and men, respectively).
Estimated for all people including persons automatically classified as those with a high cardiovascular risk by the guidelines. See criteria in the previous note.
Estimated for all people including persons automatically classified as those with a high cardiovascular risk by the guidelines. See criteria in the previous note.
Estimated for all people including persons automatically classified as those with a high cardiovascular risk by the guidelines. See criteria in the previous note.
Estimated for all people including persons automatically classified as those with a high cardiovascular risk by the guidelines. See criteria in the previous note.
Estimated for all people including persons automatically classified as those with a high cardiovascular risk by the guidelines. See criteria in the previous note.
Estimated for all people including persons automatically classified as those with a high cardiovascular risk by the guidelines. See criteria in the previous note.
Estimated for all people including persons automatically classified as those with a high cardiovascular risk by the guidelines. See criteria in the previous note.
Estimated for persons not automatically classified as those with high CVD risk by the guidelines [4], i.e. persons do not have any of the following conditions: (1) Diabetes and age>60 years, (2) Diabetes with albumin creatinine ratio >2.5 mg/mmol for males or >3.5 mg/mmol for females, (3) moderate or severe chronic kidney disease, (4) a previous diagnosis of familial hypercholesterolaemia, (5) Systolic blood pressure (BP)≥180 mmHg or diastolic BP > 110 mmHg, and (6) total cholesterol >7.5 mmol/L.
Estimated for all people including persons automatically classified as those with a high cardiovascular risk by the guidelines. See criteria in the previous note.
Estimated for persons not automatically classified as those with high CVD risk by the guidelines [4], i.e. persons do not have any of the following conditions: (1) Diabetes and age>60 years, (2) Diabetes with albumin creatinine ratio >2.5 mg/mmol for males or >3.5 mg/mmol for females, (3) moderate or severe chronic kidney disease, (4) a previous diagnosis of familial hypercholesterolaemia, (5) Systolic blood pressure (BP)≥180 mmHg or diastolic BP > 110 mmHg, and (6) total cholesterol >7.5 mmol/L.
Estimated for all people including persons automatically classified as those with a high cardiovascular risk by the guidelines. See criteria in the previous note.
Estimated for persons not automatically classified as those with high CVD risk by the guidelines [4], i.e. persons do not have any of the following conditions: (1) Diabetes and age>60 years, (2) Diabetes with albumin creatinine ratio >2.5 mg/mmol for males or >3.5 mg/mmol for females, (3) moderate or severe chronic kidney disease, (4) a previous diagnosis of familial hypercholesterolaemia, (5) Systolic blood pressure (BP)≥180 mmHg or diastolic BP > 110 mmHg, and (6) total cholesterol >7.5 mmol/L.
Estimated for all people including persons automatically classified as those with a high cardiovascular risk by the guidelines. See criteria in the previous note.
Estimated for persons not automatically classified as those with high CVD risk by the guidelines [4], i.e. persons do not have any of the following conditions: (1) Diabetes and age>60 years, (2) Diabetes with albumin creatinine ratio >2.5 mg/mmol for males or >3.5 mg/mmol for females, (3) moderate or severe chronic kidney disease, (4) a previous diagnosis of familial hypercholesterolaemia, (5) Systolic blood pressure (BP)≥180 mmHg or diastolic BP > 110 mmHg, and (6) total cholesterol >7.5 mmol/L.
Estimated for all people including persons automatically classified as those with a high cardiovascular risk by the guidelines. See criteria in the previous note.
, i.e. persons do not have any of the following conditions: (1) Diabetes and age>60 years, (2) Diabetes with albumin creatinine ratio >2.5 mg/mmol for males or >3.5 mg/mmol for females, (3) moderate or severe chronic kidney disease, (4) a previous diagnosis of familial hypercholesterolaemia, (5) Systolic blood pressure (BP)≥180 mmHg or diastolic BP > 110 mmHg, and (6) total cholesterol >7.5 mmol/L.
¶ Estimated for all people including persons automatically classified as those with a high cardiovascular risk by the guidelines. See criteria in the previous note.
] slightly overestimated the mean risk of the moderate-risk group (12.4% against 11.7%) and underestimated the mean risk of the low-risk group (3.1% against 5.7%). In the high-risk group classified by the guidelines’ algorithm, the 1991 Framingham equation [
] underestimated the mean risk of the whole group (13.8% against 17.1%), and overestimated the mean risk of persons who did not have any of the conditions pre-defined in the guidelines (20.9% against 17.1%). The recalibrated 2008 Framingham model [
] overestimated the mean risk in the high-risk group (26.4% against 19.5%), but slightly underestimated the mean risk in the moderate- (12.3% against 13.2%) and low-risk (4.5% against 5.1%) groups. In contrast, mean risks predicted by the primary model were relatively close to the observed risks in the high- (25.0% vs 25.4%), moderate- (12.3% vs 12.6%) and low-risk (4.9% vs 4.6%) groups. Similar mean predicted and observed risks in three risk categories were observed using the reduced information model.
When risk was calculated separately for women and men (see Tables S5 and S6 in the supplementary document), we found that, in women, the CVD risk predicted by the 1991 Framingham model [
] was only one third of the observed CVD risk in the youngest group (30–34 years old), while this ratio was much larger in other age groups of women (0.53–0.86) and in any age groups of men (0.6–1.1). When the cohort was classified into major CVD risk categories, the 1991 Framingham equation [
] underestimated CVD risk in all categories in women, while this was not the case in men with moderate risk. In both women and men, the agreement between the observed and predicted CVD risk using the recalibrated 2008 Framingham model [
] and the guidelines for risk classification, the proportions of cohort (i) with high risk were 17.1%, 18.3%, 22.0% and 28.4%, respectively, (ii) with moderate risk were 13.8%, 11.6%, 13.4% and 6.8%, respectively, and (iii) with low risk were 69.1%, 70.2%, 64.7% and 64.8%, respectively (Figure 2). Given the good calibration and the predicted mean risks almost identical to the observed mean risks in any risk categories of the new models, these figures suggest that the guidelines overestimate the proportion of people with high risk, underestimate the proportion of people with moderate and low risk, and potentially misclassify a small proportion of people with actual high, moderate or low risk. The agreement between the guidelines and the new models in classification of proportions with different risk categories was higher in the low-risk category than in the moderate- and high-risk categories.
Figure 2Percentage of cohort with high (>15%), moderate (10–15%) and low 5-year cardiovascular risk (<10%) classified by the new models, the recalibrated 2008 Framingham model and the guidelines.
Using our new models as a standard tool for identification of a person with a high 5-year CVD risk, we found potential misclassification of the high risk and moderate-low risk people by the Australian guidelines (Figure 3). Sixteen per cent (16%) of the population were false positives, i.e. the individuals classified as high risk by the guidelines but as moderate-low risk by either the primary or reduced information model. Four per cent (4%) or 6% of the population were false negatives on the basis of the primary or reduced information model, respectively, i.e. the individuals classified as moderate-low risk by the guidelines but as high risk by the primary or reduced information model. The estimated sensitivity and specificity of guidelines were 74% and 81%, respectively, based on the primary model, and were 67% and 80%, respectively, based on the reduced information model.
Figure 3Potential misclassification of the high risk and moderate-low risk by the Australian guidelines for prevention of CVD based on the (A) primary model and (B) reduced information model for 5-year CVD risk prediction. The bars represent proportions of people classified as high risk (>15%) and moderate-low risk classified by the guidelines, within each of which the upper part represents the proportion of people that are misclassified on the basis of the classification by the new models.
In this study, we developed the first 5-year CVD risk prediction models based on Indigenous Australian data. Both the primary model and the reduced information model performed well, and had a far better calibration (mH-L χ2 statistic) than the recalibrated 2008 Framingham model [
]. This was expected as in the recalibrated model only the baseline survival rate and means of risk factors were derived from the Indigenous data, while the predictors and their coefficients were obtained from the Framingham equation [
]. As the reduced information model performed similarly to the primary model, it provides a convenient tool for a quick assessment of 5-year CVD risk where no laboratory variables are needed. Using multiple imputation combined with bootstrapping, we confirmed the findings in the study by Hua et al. [
] underestimated the CVD risk more severely in younger women (aged 30–34 years) compared to older women and to men. Despite this, the observed CVD risk in this age group is low (mean, 3.2; 95% CI, 1.2–5.2) and, therefore, this underestimation would not have impacted the cardiovascular health of younger women more strongly compared to older women and men.
In terms of discrimination, the primary model and the reduced information model performed similarly to the recalibrated 2008 Framingham model [
Development and validation of a prognostic model for survival time data: application to prognosis of HIV positive patients treated with antiretroviral therapy.
]) were not collected and therefore could not be controlled for in our analysis. Our data set also did not contain information on socioeconomic deprivation, a risk factor for CVD mortality in an Australian cohort [
]. Therefore, socioeconomic deprivation might be a potential risk factor for CVD including both fatal and non-fatal cardiovascular events as defined in our study. It is important to note that the distributions of risk factors in our study sample are different from those in the Framingham cohort [
]. For example, participants in our study sample have a lower mean total cholesterol (198.9 vs 213.9 mg/dL), but higher mean SBP (134.5 vs 127.6 mmHg) and substantially higher mean rates of smoking (53.4% vs 34.7%) and diabetes (23.2% vs 5.0%) compared to those in the Framingham study [
]. Two studies of Indigenous Australians showed that microalbuminuria (ACR between 3 and 30 mg/mmol) and macroalbuminuria (ACR ≥ 30 mg/mmol) increased the risk of coronary heart disease by two to three times compared to normal albuminuria, after adjusting for other CVD risk factors [
]. We did not categorise ACR because this reduced the model calibration.
The outcomes from our model fitting showed that, in the presence of waist circumference, BMI was not significantly associated with CVD risk. This is in line with the results from the study by Wang and Hoy [
], which found that waist circumference was a better predictor for CVD risk than BMI in Indigenous Australians. This may be explained by the fact that Indigenous Australians have a greater amount of abdominal fat compared to the European Australians and the use of BMI for weight status classification has been found to be inappropriate in Indigenous populations [
Surprisingly, smoking was not a significant predictor of 5-year CVD risk in our study. An explanation for this might be that smoking was masked by SBP and triglycerides, as smoking was strongly correlated with the latter. In the study by Luke et al. [
], smoking was also found to be significantly associated with dyslipidaemia and SBP. Although smoking did not significantly contribute to the prediction of 5-year CVD risk, we found it significant when fitting a model without the constraint on the 5-year follow-up time. Hence, the fact the smoking is not included in our models for 5-year CVD risk prediction does not mean that reducing smoking is not an important component of the CVD prevention strategy.
The agreement between the Australian guidelines’ algorithm [
] and our new models in risk classification were only moderate. The false positives represented the largest disagreement, which can lead to unnecessary treatment and increased health care costs. The consequences of false negatives are potentially severe with individuals who would otherwise be medicated not receiving treatment to reduce CVD risk. Although the proportion of people with false negatives was relatively small (about 4–6% of the population), this is roughly equivalent to a large number of 31,000–47,000 Indigenous people with high CVD risk but could potentially be untreated, assuming that the distributions of the risk factors in the WPHC was similar to those in the Indigenous Australian population [
], they should be considered as a contemporarily evidence-based tool for CVD risk assessment in remote Indigenous Australians. Since CVD is the leading cause of death in Indigenous Australians [
], translation of our findings to clinical practice is an important step towards addressing the target set by the Commonwealth of Australia, that is to close the gap in life expectancy between Indigenous and non-Indigenous Australians within a generation [
]. Detailing strategies for this translation is beyond the scope of the present study, but it is worth noting that developing a tool for risk assessment that is readily available and acceptable to health professionals and Indigenous communities is crucial. Communication of this tool and our findings with physicians can be done in collaboration with the Heart Foundation and other relevant organisations via the internet, scientific meetings and media.
Strengths and Limitations
In the present study, we used a relatively large community-based cohort and high-quality data for model development, multiple imputation to deal with missing data, and rigorous methods to select the predictors and assess the model performance. However, our study has some limitations. The cohort used for model development might not represent the Indigenous population in North Queensland because the participants in the screening program were volunteers. Because our models were developed for remote Indigenous people and have not been tested in other populations, their generalisability is currently inconclusive. However, they provide an additional evidence-based tool for risk assessment that can be used alongside existing risk scores such as the recalibrated Framingham model [
]. As data on treatment for hypertension were not ascertained in our study, our models did not include treatment as a covariate, which is aligned with the 1991 Framingham equation [
] accounted for the effect of anti-hypertensive treatment and therefore it is important to explore this treatment effect in future studies for risk stratification in Indigenous populations. Other potential risk factors such as family history of CVD [
] were not collected, and thus additional relevant predictors might be missed to be included in the models. Our equations were designed as a risk prediction tool and thus they may not represent all key risk factors associated with the aetiology of CVD. An example of this is the inclusion of triglycerides as the sole lipid fraction in our primary model while there is a well-established strong correlation between CVD and other lipid fractions such as total and HDL cholesterol [
]. In this respect the role of triglycerides in our equation represents both the direct impact as well as the indirect impact on CVD via its correlation with other lipid fractions in Indigenous Australians [
]. Although we rigorously assess the model performance internally, external validation of the models using other data sources is needed and this warrants further study in the future.
Conclusions
The algorithm for classification of people as high, moderate and low 5-year CVD risk recommended by the Australian guidelines for prevention of CVD [
] is subject to misclassification of Indigenous people in these major risk categories, and is not accurate in estimating mean absolute risk in each of these categories. The prediction models we developed improved these. The reduced information model, with good discrimination and calibration, provides a convenient and reasonable tool for a quick assessment of 5-year CVD risk where no laboratory variables are required.
Funding Source
This study is supported by the Australia National Health and Medical Research Council (project number NHMRC/1107140). The sponsor has no role in study design; or collection, analysis and interpretation of data; or writing of the manuscript; or the decision to submit the article for publication.
Conflicts of Interest
The authors declared that there is no potential conflict of interest.
Appendix A. Supplementary data
The following is Supplementary data to this article:
ACC/AHA guideline on the assessment of cardiovascular risk: a report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines.
Cardiovascular risk assessment in low-resource settings: a consensus document of the European Society of Hypertension Working Group on Hypertension and Cardiovascular Risk in Low Resource Settings.
Development and validation of a prognostic model for survival time data: application to prognosis of HIV positive patients treated with antiretroviral therapy.