- Research article
- Open Access
Lifestyle variables and the risk of myocardial infarction in the General Practice Research Database
BMC Cardiovascular Disorders volume 7, Article number: 38 (2007)
The primary objective of this study is to estimate the association between body mass index (BMI) and the risk of first acute myocardial infarction (AMI). As a secondary objective, we considered the association between other lifestyle variables, smoking and heavy alcohol use, and AMI risk.
This study was conducted in the general practice research database (GPRD) which is a database based on general practitioner records and is a representative sample of the United Kingdom population. We matched cases of first AMI as identified by diagnostic codes with up to 10 controls between January 1st, 2001 and December 31st, 2005 using incidence density sampling. We used multiple imputation to account for missing data.
We identified 19,353 cases of first AMI which were matched on index date, GPRD practice and age to 192,821 controls. There was a modest amount of missing data in the database, and the patients with missing data had different risks than those with recorded values. We adjusted our analysis for each lifestyle variable jointly and also for age, sex, and number of hospitalizations in the past year. Although a record of underweight (BMI <18.0 kg/m2) did not alter the risk for AMI (adjusted odds ratio (OR): 1.00; 95% confidence interval (CI): 0.87–1.11) when compared with normal BMI (18.0–24.9 kg/m2), obesity (BMI ≥30 kg/m2) predicted an increased risk (adjusted OR: 1.41; 95% CI: 1.35–1.47). A history of smoking also predicted an increased risk of AMI (adjusted OR: 1.81; 95% CI: 1.75–1.87) as did heavy alcohol use (adjusted OR: 1.15; 95% CI: 1.06–1.26).
This study illustrates that obesity, smoking and heavy alcohol use, as recorded during routine care by a general practitioner, are important predictors of an increased risk of a first AMI. In contrast, low BMI does not increase the risk of a first AMI.
Obesity is a growing public health problem that is associated with an increased rate of cardiovascular events. About one in three patients admitted to hospital with acute coronary syndrome in Europe were obese with additionally half of the patient population being overweight .
Clinical databases based on general practice records are a potentially useful source of information (when it is available) for studying the magnitude of risk factors such as obesity, smoking and heavy alcohol use at the population level in a real-world setting. However, these databases often have missing data on some patients which needs to be properly accounted for in any analysis. Several methods exist [2, 3], but multiple imputation has been systematically shown to be superior to case deletion and indicator variable methods in reducing bias [4–6].
As obesity is a growing public health concern, it is important to identify the impact of body mass index (BMI) in the occurrence of the first acute myocardial infarction (AMI). The primary objective of this study is to estimate the association between BMI and the risk of the first AMI. As a secondary objective, we considered the association between other lifestyle variables, smoking and heavy alcohol use, and AMI risk. Finally, we sought to determine if the choice of how to deal with missing information was important.
This study is based on the United Kingdom's General Practice Research Database (GPRD) . This is a large clinical database based on the medical charts of general practitioners. It records information such as prescriptions issued and medical diagnoses made using the United Kingdom specific READ and OXMIS medical codes. The recorded information on drug exposure and diagnoses has been validated and proven to be of high quality [7, 8]. The GPRD also records information on factors such as BMI, blood pressure, smoking and alcohol consumption . However, these variables are reported by validation studies to have non-trivial amounts of missing data [7, 8] and this can lead to biased estimates of effect .
We identified all first-ever AMIs recorded in the GPRD between January 1st, 2001 and December 31st, 2005 using the medical codes recorded in the database as our cases. These medical codes are described in Additional File 1. To be eligible to be selected as a case, a patient needed to be at least 18 years of age and have no previous record of an AMI before the index event. The date recorded in the database for the first AMI was taken as the index date for the case. We matched each case with up to 10 controls based on age (± 2 years), GPRD practice and index date. On the index date, the control must not have had a previous AMI, must still be registered in the GPRD and be alive to be eligible as a control.
Both cases and controls were required to have at least 3 years of follow-up in the GPRD before the index date to allow adequate time to assess comorbid conditions.
BMI was defined as the most recently available pre-AMI body weight (in kilograms) divided by the square of the height (in meters) (kg/m2) and was used to categorize patients according to the World Health Organization's definition : underweight (BMI: <18.0 kg/m2), normal weight (BMI: 18.0–24.9 kg/m2), overweight (BMI: 25–29.9 kg/m2) and obese (BMI: ≥30 kg/m2).
For smoking we grouped subjects into the categories of never smokers and ever smokers. For heavy alcohol use we used at least one clinical diagnosis recorded in the database. For BMI and smoking status, we used the closer to the index date recorded value in the database. However, for most patients BMI and smoking status are recorded only once in the GPRD .
Ethical review for this study was done by the Independent Scientific Advisory Committee for MHRA database research
Conditional logistic regression was used to estimate the odds ratios (ORs) for the different BMI categories. We handled missing data using three different typical approaches (case deletion, indicator variable and multiple imputation). It was important to include a broad spectrum of covariates as predictors in our multiple imputation model [11, 12]. We considered a crude model for BMI, smoking and heavy alcohol use, separately. Because of the cross-sectional nature of our data, we could not assess whether comorbidities preceded obesity, and so we did not adjust for these variables in our statistical models (although they were used in the multiple imputation to infer BMI). Instead, we limited our statistical adjustment to each lifestyle variable jointly, as well as age, sex and number of hospitalizations in the past year (as a proxy for overall health status).
More details of the imputation and analysis are discussed in Additional File 2.
We identified 19,353 cases of AMI which were matched to 192,821 controls. Selected characteristics of the cases and the controls are described in Table 1. The pattern of missing data in this study is also shown in Table 1 as are the post-imputation results of some variables. The cases have higher rates and levels of known cardiovascular risk factors including diabetes and angina as well as elevated blood pressure and serum cholesterol levels.
Table 2 describes the distribution of BMI and smoking among subjects with imputed values for BMI as opposed to subjects with measured BMI. Of note is that the size of the underweight category is much greater among those subjects with imputed BMIs; it is 1.8% versus 3.8% in the cases and 1.8% versus 4.7% in the controls. In general, patients with imputed values have systematically lower rates of smoking and lower BMI values than subjects with recorded information.
Table 3 describes the relationship between BMI and the rate of AMI. A pronounced increased risk in the obese patients was found regardless of how we account for missing data. Using the adjusted estimates from the multiple imputation analysis, there is an increase in risk in the obese (adjusted OR: 1.41; 95% confidence interval (CI): 1.35–1.47). The change in adjusted OR for the underweight, as based on different methods of handling missing data, was the most important with a 15.3% change in the estimate between case deletion (adjusted OR: 1.15; 95% CI: 0.96–1.37) and multiple imputation (adjusted OR: 1.00; 95% CI: 0.87–1.11).
Table 4 describes the results for ever smoker versus never smoker using the three different approaches for missing values. In this population sample we confirm the well-known finding that ever smoking is a strong risk factor for having an AMI (adjusted OR: 1.81; 95% CI: 1.75–1.87). This effect was consistently shown with all three different methods used to account for missing values.
Furthermore, subjects with a clinical diagnosis of heavy alcohol use appeared to have a small increased risk of a first AMI (adjusted OR: 1.15; 95% CI: 1.06–1.26).
This is the first study evaluating the association between BMI and the first AMI, using a clinical database based on general practitioner records (GPRD). It is a case-control study that includes a large sample of consecutive, unselected cases with AMI and matched controls. Therefore, it reflects real life data including a large proportion of female and elderly patients. In this study we also assessed the impact of smoking and heavy alcohol use on the occurrence of the first AMI. We used three different methods to account for missing data, namely case deletion, indicator variable and the more sophisticated multiple imputation method.
BMI as a risk factor
In our study, we observe that low and normal BMI values are not associated with an increased risk of a first AMI but that high BMI values are. This shape could be described as a J-shaped association between BMI categories in which we have no effect on one direction from normal and an increased risk in the other. To our knowledge, this is the first study to describe this effect for first AMI in a United Kingdom population sample.
Despite previous research, controversy remains regarding the relationship between BMI and AMI [13–16]. Some studies have shown that BMI has a U-shaped effect (bimodal occurrence) of adverse events and adverse outcomes with an increased risk in underweight and morbidly obese people, but with a lower risk for overweight and obese when compared to normal-weight patients. However, these studies have often not comprehensively accounted for potential sources of confounding, with underestimates of the effect of overweight and obesity on longevity and overestimates of the risks of leanness. Major potential sources of bias particular to studies of BMI and mortality include (1) failure to adequately account for missing values, (2) failure to adequately account for potential sources of confounding (e.g. pre-existing disease or concomitant illnesses such as cancer, leukemia and lymphoma), (3) unmeasured factors that affected outcomes, and (4) inappropriate adjustment for the biological effects of obesity (i.e. for conditions that included in the causal pathway between obesity and AMI), including hypertension and diabetes. Also some prior studies are not very informative as they are hospital-based and they focus on the outcome after AMI. Furthermore, a study of AMI patients followed for 8–10 years showed that although overall obesity (as assessed by BMI) is inversely related to mortality, abdominal obesity appears to be an independent predictor of all-cause mortality in men and perhaps also in women .
Other studies have found similar results with ours but mostly for mortality. In the Multifactor Primary Prevention Study, when the BMI category 20.0–22.5 kg/m2 was used as the reference group, the underweight group did not carry a higher risk for an AMI (adjusted Hazard Ratio (HR): 1.08; 95% CI: 0.76–1.52) or for coronary artery bypass graft without prior AMI (adjusted HR: 0.86; 95% CI: 0.25–2.90). However, overweight and obese patients were carrying a higher risk for AMI when compared with the normal BMI category .
In a prospective study of more than 1,000,000 adults in the United States the curve for the risk of death from cardiovascular disease among subjects who never smoked and had no history of disease was J-shaped; this indicated that a high BMI was most predictive of death from cardiovascular disease than a low BMI. However, the curve for the risk of death from all other causes was U-shaped .
A recent meta-analysis including 302,296 participants worldwide and 18,000 coronary heart disease events during follow-up showed that there was an increased risk for coronary events associated with overweight and obesity; the adjusted relative risk (and 95% CI) was 1.32 (1.24–1.40) for BMI of 25.0 to 29.9 kg/m2 and 1.81 (1.56–2.10) for BMI ≥30 kg/m2, when compared with normal BMI .
Smoking and heavy alcohol use as risk factors
An increased risk of a first AMI was associated with ever smoking. This finding was consistently found regardless of the method used to deal with missing values. This strong association between smoking and AMI has been shown before. For example, the INTERHEART study found that tobacco use is one of the most important causes of AMI globally, especially in men. The risk for AMI was increased regardless of the form of tobacco use, including different types of smoking and chewing tobacco and inhalation of second hand tobacco smoke .
Another study also found that the type or yield of cigarettes did not result in significantly different findings, with similar risk for smoker of low versus high tar cigarettes .
Heavy alcohol use was also consistently associated with a higher risk of a first AMI in our study. Heavy alcohol use is a known risk factor for cardiovascular risk. The INTERHEART study, among others, also found this association .
We used three common methods to deal with missing data. In cases where there is a difference between the results of the case deletion, indicator variable and multiple imputation, simulation studies have demonstrated the superiority of multiple imputation method when missing data exceed 10% of the total . In our study, only smoking met that criterion <10% missing among cases and only slightly more among controls. In all methods, smoking was a strong risk factor for AMI, with little to no change in estimate as we accounted for missing data with different methods.
The pattern of obesity by measured versus unmeasured BMI (as shown in Table 2) demonstrates the circumstances under which multiple imputation will make a difference in the results of a study. The only category of weight shows important differences in the estimates of the effect of BMI on AMI between those with a measure of BMI and those without one is the underweight. In the underweight we found a 15.3% difference in the estimates of the risk of AMI between using case deletion versus multiple imputations to handle missing data. While we are fortunate in this case not to have this bias shift the inference (as neither is statistically significant), this is not guaranteed in future studies. In such cases, the estimate from multiple imputation should be preferred [4–6, 12].
Strengths and limitations
This is a broad and unselected population sample of the United Kingdom population that allows us to infer the current levels of risk. Due to the comprehensive nature of the covariates in the database, we were able to use an extremely rich wealth of information in imputing missing data. This allows us to describe the empirical risk associated with different levels of BMI as seen by general practitioners. Recently, the INTERHEART study also reported that, among others, smoking, obesity, bad dietary habits and alcohol intake, as well as lack of regular physical activity account for most of the risk of AMI worldwide in both sexes and at all ages in all regions .
However, this study also has several limitations. We defined the "first AMI" as the first event occurred after at least 3 consecutive years of being followed in the GPRD and being free of an AMI. This might also include some patients who had their AMI after long intervals. However, as there is very good validation for hospital referrals (and communication with specialists) , it is very uncommon if a patient with a previous MI was not followed by a Cardiologist/Specialist and/or did not have any follow-up tests for at least 3 years. Also in general, in database studies collection of data is often less standardized or less accurate; however, GPRD is a popular database and many validation studies have proven satisfactory accuracy and completeness of the data . Despite adjustments using multivariate analyses, unmeasured factors that affected outcomes were likely present. The BMI was used as a marker for total body fat, while the distribution of body fat is unknown. However, there is evidence supporting that there is a good correlation between BMI and central obesity, a known risk factor for cardiovascular events . We treated smoking as a binary variable. This approach has been known to be subject to misclassification  in the GPRD. However, we avoided classifying the patients as never, current and ex smokers as the GPRD does not systematically track quitting and starting patterns among smokers. Also there is no information on the duration, intensity or type of smoking available in the GPRD. The same limitation applies for the clinical diagnosis of heavy alcohol use. We do not have information on the severity of AMI; it was shown that different levels of healthy lifestyle are associated with the severity of cardiac events and outcomes after the event .
Furthermore, we are not able to verify the assumption that the missing data were ignorable (an assumption of multiple imputation in that the missing data can be completely predicted from the observed data) [4–6, 12]. It is possible that more information would be required to generate an unbiased prediction of the data than is present in this database and this cannot be tested without this data. However, it is quite plausible that the nature of data collection in the GPRD will be such that the data is not missing at random and so the estimates of missing BMI values should be interpreted with caution.
The estimates of effect found in this paper are not protected against misclassification of the exposure. Also, the temporal sequence of variables that are measured cross-sectionally (like BMI) in the GPRD cannot be captured. As can be seen in Figure 1, the analysis of these variables requires assumptions about whether the covariate is a common cause of the exposure and the outcome (and thus a confounder) [27, 28] or if it lies in the causal pathway between the exposure and the outcome (and should not be adjusted for). Our study makes the assumption, as has been seen in other contexts , that the estimate adjusted only for age and sex is the correct model given our understanding of the relationships between the candidate confounders and the exposure. Future researchers, however, can and should test these conceptual models with longitudinal data.
Future work in this area in the GPRD should account for the properties of the missing data in this database. However, once the missing data are properly accounted for, the GPRD appears to be a rich source of data on lifestyle risk factors at the population level. The interesting finding of a J-shaped relationship between BMI and risk of first AMI, while seen for mortality in previous work, is novel for first AMI and should be explored further.
This work on obesity can be extended to other areas where the relationship between obesity and the disease is less well-known . Meanwhile, physicians should continue to advise patients to try and modify lifestyle factors, where possible, to reduce AMI risk.
De Bacquer D, De Backer G, Cokkinos D, Keil U, Montaye M, Ostor E, Pyorala K, Sans S: Overweight and obesity in patients with established coronary heart disease: are we meeting the challenge?. Eur Heart J. 2004, 25: 121-8. 10.1016/j.ehj.2003.10.024.
Mulnier HE, Seaman HE, Raleigh VS, Soedamah-Muthu SS, Colhoun HM, Lawrenson RA: Mortality in people with type 2 diabetes in the UK. Diabet Med. 2006, 23: 516-21. 10.1111/j.1464-5491.2006.01838.x.
Andersohn F, Suissa S, Garbe E: Use of first- and second-generation cyclooxygenase-2-selective nonsteroidal antiinflammatory drugs and risk of acute myocardial infarction. Circulation. 2006, 113: 1950-7. 10.1161/CIRCULATIONAHA.105.602425.
van der Heijden GJ, Donders AR, Stijnen T, Moons KG: Imputation of missing values is superior to complete case analysis and the missing-indicator method in multivariable diagnostic research: a clinical example. J Clin Epidemiol. 2006, 59: 1102-9. 10.1016/j.jclinepi.2006.01.015.
Barzi F, Woodward M: Imputations of missing values in practice: results from imputations of serum cholesterol in 28 cohort studies. Am J Epidemiol. 2004, 160: 34-45. 10.1093/aje/kwh175.
Greenland S, Finkle WD: A critical look at methods for handling missing covariates in epidemiologic regression analyses. Am J Epidemiol. 1995, 142: 1255-64.
Lawrenson R, Williams T, Farmer R: Clinical information for research; the use of general practice databases. J Public Health Med. 1999, 21: 299-304. 10.1093/pubmed/21.3.299.
Jick SS, Kaye JA, Vasilakis-Scaramozza C, Garcia Rodriguez LA, Ruigomez A, Meier CR, Schlienger RG, Black C, Jick H: Validity of the general practice research database. Pharmacotherapy. 2003, 23: 686-9. 10.1592/phco.23.5.686.32205.
Gorelick MH: Bias arising from missing data in predictive models. J Clin Epidemiol. 2006, 59: 1115-23. 10.1016/j.jclinepi.2004.11.029.
Physical Status: The Use and Interpretation of Anthopometry. Report of WHO Expert Committee. WHO Technical Report Series 854. 1995, Geneva: World Health Organization
Moons KG, Donders RA, Stijnen T, Harrell FE: Using the outcome for imputation of missing predictor values was preferred. J Clin Epidemiol. 2006, 59: 1092-101. 10.1016/j.jclinepi.2006.01.009.
Schafer JL: Multiple imputation: a primer. Stat Methods Med Res. 1999, 8: 3-15. 10.1191/096228099671525676.
Minutello RM, Chou ET, Hong MK, Bergman G, Parikh M, Iacovone F, Wong SC: Impact of body mass index on in-hospital outcomes following percutaneous coronary intervention (report from the New York State Angioplasty Registry). Am J Cardiol. 2004, 93: 1229-32. 10.1016/j.amjcard.2004.01.065.
Powell BD, Lennon RJ, Lerman A, Bell MR, Berger PB, Higano ST, Holmes DR, Rihal CS: Association of body mass index with outcome after percutaneous coronary intervention. Am J Cardiol. 2003, 91: 472-6. 10.1016/S0002-9149(02)03252-6.
Reeves BC, Ascione R, Chamberlain MH, Angelini GD: Effect of body mass index on early outcomes in patients undergoing coronary artery bypass surgery. J Am Coll Cardiol. 2003, 42: 668-76. 10.1016/S0735-1097(03)00777-0.
Diercks DB, Roe MT, Mulgund J, Pollack CV, Kirk JD, Gibler WB, Ohman EM, Smith SC, Boden WE, Peterson ED: The obesity paradox in non-ST-segment elevation acute coronary syndromes: results from the Can Rapid risk stratification of Unstable angina patients Suppress ADverse outcomes with Early implementation of the American College of Cardiology/American Heart Association Guidelines Quality Improvement Initiative. Am Heart J. 2006, 152: 140-8. 10.1016/j.ahj.2005.09.024.
Kragelund C, Hassager C, Hildebrandt P, Torp-Pedersen C, Kober L, TRACE study group: Impact of obesity on long-term prognosis following acute myocardial infarction. Int J Cardiol. 2005, 98: 123-31. 10.1016/j.ijcard.2004.03.042.
Dudas KA, Wilhelmsen L, Rosengren A: Predictors of coronary bypass grafting in a population of middle-aged men. Eur J Cardiovasc Prev Rehabil. 2007, 14: 122-7. 10.1097/01.hjr.0000209814.82701.3f.
Calle EE, Thun MJ, Petrelli JM, Rodriguez C, Heath CW: Body-mass index and mortality in a prospective cohort of U.S. adults. N Engl J Med. 1999, 341: 1097-105. 10.1056/NEJM199910073411501.
Bogers RP, Bemelmans WJ, Hoogenveen RT, Boshuizen HC, Woodward M, Knekt P, van Dam RM, Hu FB, Visscher TL, Menotti A, Thorpe RJ, Jamrozik K, Calling S, Strand BH, Shipley MJ, for the BMI-CHD Collaboration Investigators: Association of overweight with increased risk of coronary heart disease partly independent of blood pressure and cholesterol levels: a meta-analysis of 21 cohort studies including more than 300 000 persons. Arch Intern Med. 2007, 167: 1720-8. 10.1001/archinte.167.16.1720.
Teo KK, Ounpuu S, Hawken S, Pandey MR, Valentin V, Hunt D, Diaz R, Rashed W, Freeman R, Jiang L, Zhang X, Yusuf S, INTERHEART Study Investigators: Tobacco use and risk of myocardial infarction in 52 countries in the INTERHEART study: a case-control study. Lancet. 2006, 368: 647-58. 10.1016/S0140-6736(06)69249-0.
Gallus S, Randi G, Negri E, Tavani A, La Vecchia C: Tar yield and risk of acute myocardial infarction: pooled analysis from three case-control studies. Eur J Cardiovasc Prev Rehabil. 2007, 14: 299-303. 10.1097/01.hjr.0000244574.17853.ed.
Yusuf S, Hawken S, Ounpuu S, Dans T, Avezum A, Lanas F, McQueen M, Budaj A, Pais P, Varigos J, Lisheng L, INTERHEART Study Investigators: Effect of potentially modifiable risk factors associated with myocardial infarction in 52 countries (the INTERHEART study): case-control study. Lancet. 2004, 364: 937-52. 10.1016/S0140-6736(04)17018-9.
Farin HM, Abbasi F, Reaven GM: Comparison of body mass index versus waist circumference with the metabolic changes that increase the risk of cardiovascular disease in insulin-resistant individuals. Am J Cardiol. 2006, 98: 1053-6. 10.1016/j.amjcard.2006.05.025.
Lewis JD, Brensinger C: Agreement between GPRD smoking data: a survey of general practitioners and a population-based survey. Pharmacoepidemiol Drug Saf. 2004, 13: 437-41. 10.1002/pds.902.
Panagiotakos DB, Pitsavos C, Stefanadis C, GREECS Study Investigators: Short-term prognosis of patients with acute coronary syndromes through the evaluation of physical activity status, the adoption of Mediterranean diet and smoking habits: the Greek Acute Coronary Syndromes (GREECS) study. Eur J Cardiovasc Prev Rehabil. 2006, 13: 901-8. 10.1097/01.hjr.0000221863.42286.1e.
Hernán MA, Hernández-Díaz S, Werler MM, Mitchell AA: Causal knowledge as a prerequisite for confounding evaluation: an application to birth defects epidemiology. Am J Epidemiol. 2002, 155: 176-84. 10.1093/aje/155.2.176.
Hernán MA, Hernández-Díaz S, Robins JM: A structural approach to selection bias. Epidemiology. 2004, 15: 615-25. 10.1097/01.ede.0000135174.63482.43.
Schneider-Lindner V, Delaney JA, Dial S, Dascal A, Suissa S: Antimicrobial Drugs and Community-Acquired Methicillin-Resistant Staphylococcus aureus, UK. Emerg Infect Dis. 2007, 13 (7): 994-1000.
The pre-publication history for this paper can be accessed here:http://www.biomedcentral.com/1471-2261/7/38/prepub
This study was funded by the Canadian Institutes of Health Research (CIHR) and the Canadian Foundation for Innovation. The funding sources listed had no role in the study design, writing, analysis or the decision to submit this manuscript.
The author(s) declare that they have no competing interests.
All authors contributed to the conception and design of the study. JD and RS conducted the statistical analysis of the paper. All authors contributed to the interpretation of the data and developed the statistical analysis. JD and SD wrote the paper with critical contributions from all authors. The final manuscript was approved by all authors.
Electronic supplementary material
Additional file 1: List of medical codes used to identify the first acute myocardial infarction. This file documents the READ and OXMIS medical codes that were used to identify the event of myocardial infarction in this study. (DOC 33 KB)
Additional file 2: Imputation variables and method. This file contains more detailed information about the statistical methods used to implement multiple imputation to handle missing data in the paper. (DOC 22 KB)
Authors’ original submitted files for images
Below are the links to the authors’ original submitted files for images.
About this article
Cite this article
Delaney, J.A., Daskalopoulou, S.S., Brophy, J.M. et al. Lifestyle variables and the risk of myocardial infarction in the General Practice Research Database. BMC Cardiovasc Disord 7, 38 (2007). https://doi.org/10.1186/1471-2261-7-38
- Body Mass Index
- Acute Myocardial Infarction
- Multiple Imputation
- Normal Body Mass Index
- Body Mass Index Category