Lifestyle variables and the risk of myocardial infarction in the General Practice Research Database
© Delaney et al; licensee BioMed Central Ltd. 2007
Received: 16 May 2007
Accepted: 18 December 2007
Published: 18 December 2007
The primary objective of this study is to estimate the association between body mass index (BMI) and the risk of first acute myocardial infarction (AMI). As a secondary objective, we considered the association between other lifestyle variables, smoking and heavy alcohol use, and AMI risk.
This study was conducted in the General Practice Research Database (GPRD), a database based on general practitioner records that is a representative sample of the United Kingdom population. We matched cases of first AMI, as identified by diagnostic codes, with up to 10 controls between January 1st, 2001 and December 31st, 2005 using incidence density sampling. We used multiple imputation to account for missing data.
We identified 19,353 cases of first AMI which were matched on index date, GPRD practice and age to 192,821 controls. There was a modest amount of missing data in the database, and the patients with missing data had different risks than those with recorded values. We adjusted our analysis for each lifestyle variable jointly and also for age, sex, and number of hospitalizations in the past year. Although a record of underweight (BMI <18.0 kg/m2) did not alter the risk for AMI (adjusted odds ratio (OR): 1.00; 95% confidence interval (CI): 0.87–1.11) when compared with normal BMI (18.0–24.9 kg/m2), obesity (BMI ≥30 kg/m2) predicted an increased risk (adjusted OR: 1.41; 95% CI: 1.35–1.47). A history of smoking also predicted an increased risk of AMI (adjusted OR: 1.81; 95% CI: 1.75–1.87) as did heavy alcohol use (adjusted OR: 1.15; 95% CI: 1.06–1.26).
This study illustrates that obesity, smoking and heavy alcohol use, as recorded during routine care by a general practitioner, are important predictors of an increased risk of a first AMI. In contrast, low BMI does not increase the risk of a first AMI.
Obesity is a growing public health problem that is associated with an increased rate of cardiovascular events. About one in three patients admitted to hospital with acute coronary syndrome in Europe were obese, and an additional half of the patient population was overweight.
Clinical databases based on general practice records are a potentially useful source of information (when it is available) for studying the magnitude of risk factors such as obesity, smoking and heavy alcohol use at the population level in a real-world setting. However, these databases often have missing data on some patients which needs to be properly accounted for in any analysis. Several methods exist [2, 3], but multiple imputation has been systematically shown to be superior to case deletion and indicator variable methods in reducing bias [4–6].
As obesity is a growing public health concern, it is important to identify the impact of body mass index (BMI) in the occurrence of the first acute myocardial infarction (AMI). The primary objective of this study is to estimate the association between BMI and the risk of the first AMI. As a secondary objective, we considered the association between other lifestyle variables, smoking and heavy alcohol use, and AMI risk. Finally, we sought to determine if the choice of how to deal with missing information was important.
This study is based on the United Kingdom's General Practice Research Database (GPRD). This is a large clinical database based on the medical charts of general practitioners. It records information such as prescriptions issued and medical diagnoses made using the United Kingdom specific READ and OXMIS medical codes. The recorded information on drug exposure and diagnoses has been validated and proven to be of high quality [7, 8]. The GPRD also records information on factors such as BMI, blood pressure, smoking and alcohol consumption. However, these variables are reported by validation studies to have non-trivial amounts of missing data [7, 8] and this can lead to biased estimates of effect.
We identified all first-ever AMIs recorded in the GPRD between January 1st, 2001 and December 31st, 2005 using the medical codes recorded in the database as our cases. These medical codes are described in Additional File 1. To be eligible to be selected as a case, a patient needed to be at least 18 years of age and have no previous record of an AMI before the index event. The date recorded in the database for the first AMI was taken as the index date for the case. We matched each case with up to 10 controls based on age (± 2 years), GPRD practice and index date. On the index date, the control must not have had a previous AMI, must still be registered in the GPRD and be alive to be eligible as a control.
Both cases and controls were required to have at least 3 years of follow-up in the GPRD before the index date to allow adequate time to assess comorbid conditions.
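The eligibility and matching rules above can be illustrated with a minimal sketch. It is not the authors' code: the `Patient` record, its field names, and the use of birth year as a stand-in for age are all assumptions made for illustration. It shows one way to select up to 10 controls per case who share the practice, are within 2 years of age, have at least 3 years of prior follow-up, are still registered, and are free of AMI on the index date.

```python
from dataclasses import dataclass
from datetime import date, timedelta
from typing import List, Optional
import random

# Hypothetical record layout; the real GPRD schema is not described in the paper.
@dataclass
class Patient:
    patient_id: int
    birth_year: int
    practice_id: int
    registered_from: date
    registered_to: date               # end of registration or death, whichever comes first
    first_ami: Optional[date] = None  # None when no AMI is recorded

MIN_FOLLOW_UP = timedelta(days=3 * 365)  # at least 3 years before the index date

def eligible_control(case: Patient, candidate: Patient, index_date: date) -> bool:
    """Apply the matching and eligibility rules on the case's index date."""
    if candidate.patient_id == case.patient_id:
        return False
    if candidate.practice_id != case.practice_id:
        return False
    # Age within +/- 2 years, approximated here by birth year (an assumption).
    if abs(candidate.birth_year - case.birth_year) > 2:
        return False
    # Registered (and alive) on the index date, with enough prior follow-up.
    if not (candidate.registered_from + MIN_FOLLOW_UP <= index_date <= candidate.registered_to):
        return False
    # No AMI up to the index date. A later AMI does not exclude the candidate,
    # which is what incidence density sampling allows.
    if candidate.first_ami is not None and candidate.first_ami <= index_date:
        return False
    return True

def sample_controls(case: Patient, population: List[Patient],
                    max_controls: int = 10,
                    rng: Optional[random.Random] = None) -> List[Patient]:
    """Randomly draw up to max_controls eligible controls for one case."""
    if case.first_ami is None:
        raise ValueError("a case must have a recorded first AMI")
    rng = rng or random.Random(0)
    pool = [p for p in population if eligible_control(case, p, case.first_ami)]
    return rng.sample(pool, min(max_controls, len(pool)))
```

A patient who later becomes a case can still serve as a control for an earlier case, which is why the check compares the candidate's first AMI only against the index date.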
BMI was defined as the most recently available pre-AMI body weight (in kilograms) divided by the square of the height (in meters) (kg/m2) and was used to categorize patients according to the World Health Organization's definition: underweight (BMI: <18.0 kg/m2), normal weight (BMI: 18.0–24.9 kg/m2), overweight (BMI: 25–29.9 kg/m2) and obese (BMI: ≥30 kg/m2).
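As a worked illustration of this definition, the following sketch computes BMI and assigns the categories using the cut-offs as stated in this paper. The function names are hypothetical, and treating values between 24.9 and 25 (or between 29.9 and 30) as belonging to the lower category is an assumption, since the text does not say how such values were handled.

```python
def bmi(weight_kg: float, height_m: float) -> float:
    """Body mass index: weight in kilograms divided by height in meters squared."""
    if weight_kg <= 0 or height_m <= 0:
        raise ValueError("weight and height must be positive")
    return weight_kg / height_m ** 2

def bmi_category(value: float) -> str:
    """Categorize a BMI value with the cut-offs given in the text (kg/m2)."""
    if value < 18.0:
        return "underweight"
    if value < 25.0:   # assumption: 24.9 < BMI < 25 counts as normal weight
        return "normal weight"
    if value < 30.0:   # assumption: 29.9 < BMI < 30 counts as overweight
        return "overweight"
    return "obese"

# Example: 70 kg at 1.75 m gives a BMI of about 22.9, in the normal weight range.
example = bmi_category(bmi(70.0, 1.75))
```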
For smoking, we grouped subjects into the categories of never smokers and ever smokers. For heavy alcohol use, we required at least one clinical diagnosis recorded in the database. For BMI and smoking status, we used the recorded value closest to the index date. However, for most patients BMI and smoking status are recorded only once in the GPRD.
Ethical review for this study was done by the Independent Scientific Advisory Committee for MHRA database research.
Conditional logistic regression was used to estimate the odds ratios (ORs) for the different BMI categories. We handled missing data using three different typical approaches (case deletion, indicator variable and multiple imputation). It was important to include a broad spectrum of covariates as predictors in our multiple imputation model [11, 12]. We considered a crude model for BMI, smoking and heavy alcohol use, separately. Because of the cross-sectional nature of our data, we could not assess whether comorbidities preceded obesity, and so we did not adjust for these variables in our statistical models (although they were used in the multiple imputation to infer BMI). Instead, we limited our statistical adjustment to each lifestyle variable jointly, as well as age, sex and number of hospitalizations in the past year (as a proxy for overall health status).
More details of the imputation and analysis are discussed in Additional File 2.
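The details of the authors' imputation are in Additional File 2 and are not reproduced here. As general background, estimates from several imputed copies of a dataset are commonly combined with Rubin's rules; the sketch below shows that pooling step for log odds ratios. The function names are hypothetical, and the normal-approximation interval (rather than a t reference distribution) is a simplifying assumption.

```python
import math
from typing import List, Tuple

def pool_rubin(estimates: List[float], variances: List[float]) -> Tuple[float, float]:
    """Combine per-imputation estimates and variances with Rubin's rules.

    Returns the pooled estimate and its total variance.
    """
    m = len(estimates)
    if m < 2 or m != len(variances):
        raise ValueError("need at least two imputations with matching variances")
    q_bar = sum(estimates) / m                       # pooled point estimate
    within = sum(variances) / m                      # average within-imputation variance
    between = sum((q - q_bar) ** 2 for q in estimates) / (m - 1)
    total = within + (1 + 1 / m) * between           # total variance
    return q_bar, total

def pooled_odds_ratio(log_ors: List[float], std_errors: List[float],
                      z: float = 1.96) -> Tuple[float, float, float]:
    """Pool log odds ratios and return (OR, lower, upper) on the odds ratio scale.

    Uses a normal approximation for the interval (a simplification).
    """
    q_bar, total = pool_rubin(log_ors, [se ** 2 for se in std_errors])
    se = math.sqrt(total)
    return math.exp(q_bar), math.exp(q_bar - z * se), math.exp(q_bar + z * se)
```

The between-imputation term is what widens the interval when the imputed copies disagree, so the pooled uncertainty reflects the missing data rather than treating imputed values as observed.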
Table 1. Lifestyle information and percentage of missing data, comparing patients with acute myocardial infarction (cases) to the general population from which the cases arose (controls).
Columns: Cases (n = 19,353); Controls (n = 192,821).
Row labels: Basic Descriptive Statistics; Mean age (SD); % heavy alcohol use; # hospitalizations/past year (SD); Rates of missing values (%); Body Mass Index; Estimated Systolic Blood Pressure (SD); Estimated Diastolic Blood Pressure (SD); Estimated Serum Cholesterol (SD); Chronic Obstructive Pulmonary Disease.
Table 2. Comparison of distributions of body mass index and smoking among subjects with measured body mass index values and those with imputed body mass index values.
Columns: Cases, Measured (N = 15,423) and Imputed (N = 3,930); Controls, Measured (N = 146,725) and Imputed (N = 46,096).
Row label: Body Mass Index.
Table 3. Relationship between body mass index and acute myocardial infarction using three different methods to account for missing values (odds ratio, 95% confidence interval). The normal BMI category (18.0–24.9 kg/m2) was used as the reference group.
Column labels: Body mass index (kg/m2); Multiple Imputation (10 copies).
Row groups: Crude Estimates of Effect; Adjusted* Estimates of Effect.
Table 4. Relationship between smoking status and acute myocardial infarction using three different methods to account for missing values, analyzed using conditional logistic regression (odds ratio, 95% confidence interval). The never smoking group was used as the reference.
Column label: Multiple Imputation (10 copies).
Row groups: Crude Estimates of Effect; Adjusted* Estimates of Effect.
Furthermore, subjects with a clinical diagnosis of heavy alcohol use appeared to have a small increased risk of a first AMI (adjusted OR: 1.15; 95% CI: 1.06–1.26).
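For readers who want to reuse a reported estimate, for example in a meta-analysis, an odds ratio and its 95% confidence interval can be converted back to the log scale. The sketch below does this for the heavy alcohol use estimate. It assumes the interval is a Wald interval, symmetric on the log odds scale, which the paper does not state explicitly; the function name is hypothetical.

```python
import math
from typing import Tuple

def log_or_and_se(lower: float, upper: float, z: float = 1.96) -> Tuple[float, float]:
    """Recover the log odds ratio and its standard error from a 95% CI.

    Assumes the interval is symmetric on the log scale (Wald interval).
    """
    log_lower, log_upper = math.log(lower), math.log(upper)
    log_or = (log_lower + log_upper) / 2        # midpoint on the log scale
    se = (log_upper - log_lower) / (2 * z)      # half-width divided by z
    return log_or, se

# Heavy alcohol use: adjusted OR 1.15 (95% CI 1.06 to 1.26), as reported above.
log_or, se = log_or_and_se(1.06, 1.26)
implied_or = math.exp(log_or)  # close to the reported 1.15, up to rounding
```

The implied odds ratio agrees with the reported value to within rounding, which is consistent with the symmetry assumption.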
This is the first study evaluating the association between BMI and the first AMI, using a clinical database based on general practitioner records (GPRD). It is a case-control study that includes a large sample of consecutive, unselected cases with AMI and matched controls. Therefore, it reflects real life data including a large proportion of female and elderly patients. In this study we also assessed the impact of smoking and heavy alcohol use on the occurrence of the first AMI. We used three different methods to account for missing data, namely case deletion, indicator variable and the more sophisticated multiple imputation method.
BMI as a risk factor
In our study, we observe that low and normal BMI values are not associated with an increased risk of a first AMI but that high BMI values are. This could be described as a J-shaped association across BMI categories, with no effect in one direction from normal and an increased risk in the other. To our knowledge, this is the first study to describe this effect for first AMI in a United Kingdom population sample.
Despite previous research, controversy remains regarding the relationship between BMI and AMI [13–16]. Some studies have shown that BMI has a U-shaped effect (bimodal occurrence) on adverse events and adverse outcomes, with an increased risk in underweight and morbidly obese people but a lower risk for overweight and obese patients when compared to normal-weight patients. However, these studies have often not comprehensively accounted for potential sources of confounding, leading to underestimates of the effect of overweight and obesity on longevity and overestimates of the risks of leanness. Major potential sources of bias particular to studies of BMI and mortality include (1) failure to adequately account for missing values, (2) failure to adequately account for potential sources of confounding (e.g. pre-existing disease or concomitant illnesses such as cancer, leukemia and lymphoma), (3) unmeasured factors that affected outcomes, and (4) inappropriate adjustment for the biological effects of obesity (i.e. for conditions that are included in the causal pathway between obesity and AMI), including hypertension and diabetes. Also, some prior studies are not very informative as they are hospital-based and focus on the outcome after AMI. Furthermore, a study of AMI patients followed for 8–10 years showed that although overall obesity (as assessed by BMI) is inversely related to mortality, abdominal obesity appears to be an independent predictor of all-cause mortality in men and perhaps also in women.
Other studies have found results similar to ours, but mostly for mortality. In the Multifactor Primary Prevention Study, when the BMI category 20.0–22.5 kg/m2 was used as the reference group, the underweight group did not carry a higher risk for an AMI (adjusted Hazard Ratio (HR): 1.08; 95% CI: 0.76–1.52) or for coronary artery bypass graft without prior AMI (adjusted HR: 0.86; 95% CI: 0.25–2.90). However, overweight and obese patients carried a higher risk for AMI when compared with the normal BMI category.
In a prospective study of more than 1,000,000 adults in the United States, the curve for the risk of death from cardiovascular disease among subjects who never smoked and had no history of disease was J-shaped; this indicated that a high BMI was more predictive of death from cardiovascular disease than a low BMI. However, the curve for the risk of death from all other causes was U-shaped.
A recent meta-analysis including 302,296 participants worldwide and 18,000 coronary heart disease events during follow-up showed that there was an increased risk for coronary events associated with overweight and obesity; the adjusted relative risk (and 95% CI) was 1.32 (1.24–1.40) for BMI of 25.0 to 29.9 kg/m2 and 1.81 (1.56–2.10) for BMI ≥30 kg/m2, when compared with normal BMI.
Smoking and heavy alcohol use as risk factors
An increased risk of a first AMI was associated with ever smoking. This finding was consistent regardless of the method used to deal with missing values. This strong association between smoking and AMI has been shown before. For example, the INTERHEART study found that tobacco use is one of the most important causes of AMI globally, especially in men. The risk for AMI was increased regardless of the form of tobacco use, including different types of smoking, chewing tobacco and inhalation of second hand tobacco smoke.
Another study also found that the type or yield of cigarettes did not result in significantly different findings, with similar risk for smokers of low versus high tar cigarettes.
Heavy alcohol use was also consistently associated with a higher risk of a first AMI in our study. Heavy alcohol use is a known risk factor for cardiovascular disease. The INTERHEART study, among others, also found this association.
We used three common methods to deal with missing data. Where the results of case deletion, the indicator variable method and multiple imputation differ, simulation studies have demonstrated the superiority of multiple imputation when missing data exceed 10% of the total. In our study, only smoking had less than 10% missing data among cases, and only slightly more among controls. With all methods, smoking was a strong risk factor for AMI, with little to no change in the estimate as we accounted for missing data in different ways.
The pattern of obesity by measured versus unmeasured BMI (as shown in Table 2) demonstrates the circumstances under which multiple imputation will make a difference in the results of a study. The only weight category that shows important differences in the estimated effect of BMI on AMI between those with a measure of BMI and those without one is the underweight category. In the underweight, we found a 15.3% difference in the estimates of the risk of AMI between case deletion and multiple imputation. While we are fortunate in this case that this bias does not shift the inference (as neither estimate is statistically significant), this is not guaranteed in future studies. In such cases, the estimate from multiple imputation should be preferred [4–6, 12].
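To make the two simpler approaches concrete, the following sketch shows how each one prepares records before model fitting. It is illustrative only: the dictionary-based record layout and the field name `bmi_category` are assumptions, not the GPRD structure or the authors' code.

```python
from typing import Any, Dict, List

Record = Dict[str, Any]

def case_deletion(records: List[Record], field: str) -> List[Record]:
    """Case deletion: keep only records with an observed value for the field."""
    return [r for r in records if r.get(field) is not None]

def indicator_variable(records: List[Record], field: str,
                       missing_label: str = "missing") -> List[Record]:
    """Indicator variable method: keep every record and code missing values
    as their own category, so "missing" enters the model as an extra level."""
    prepared = []
    for r in records:
        copy = dict(r)  # leave the input records untouched
        if copy.get(field) is None:
            copy[field] = missing_label
        prepared.append(copy)
    return prepared

# Hypothetical records for illustration only.
records = [
    {"id": 1, "bmi_category": "obese"},
    {"id": 2, "bmi_category": None},
    {"id": 3, "bmi_category": "normal weight"},
]
complete_cases = case_deletion(records, "bmi_category")
with_indicator = indicator_variable(records, "bmi_category")
```

In a matched analysis, case deletion can also leave matched sets without a case or without any control, and such sets no longer contribute to a conditional logistic regression. Neither approach uses the other covariates to inform the missing value, which is the step that multiple imputation adds.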
Strengths and limitations
This is a broad and unselected sample of the United Kingdom population that allows us to infer the current levels of risk. Due to the comprehensive nature of the covariates in the database, we were able to use a rich body of information in imputing missing data. This allows us to describe the empirical risk associated with different levels of BMI as seen by general practitioners. Recently, the INTERHEART study also reported that, among others, smoking, obesity, bad dietary habits and alcohol intake, as well as lack of regular physical activity, account for most of the risk of AMI worldwide in both sexes and at all ages in all regions.
However, this study also has several limitations. We defined the "first AMI" as the first event occurring after at least 3 consecutive years of follow-up in the GPRD free of an AMI. This definition might include some patients whose recorded AMI followed an earlier, unrecorded one after a long interval. However, as hospital referrals (and communication with specialists) are very well validated, it would be very uncommon for a patient with a previous MI not to be followed by a cardiologist or other specialist, or not to have any follow-up tests, for at least 3 years. Also, data collection in database studies is often less standardized or less accurate; however, the GPRD is widely used, and many validation studies have shown satisfactory accuracy and completeness of the data. Despite adjustment using multivariate analyses, unmeasured factors that affected outcomes were likely present. BMI was used as a marker for total body fat, while the distribution of body fat is unknown. However, there is evidence of a good correlation between BMI and central obesity, a known risk factor for cardiovascular events. We treated smoking as a binary variable. This approach is known to be subject to misclassification in the GPRD. However, we avoided classifying patients as never, current and ex-smokers, as the GPRD does not systematically track quitting and starting patterns among smokers. There is also no information on the duration, intensity or type of smoking in the GPRD. The same limitation applies to the clinical diagnosis of heavy alcohol use. We do not have information on the severity of AMI; different levels of healthy lifestyle have been shown to be associated with the severity of cardiac events and outcomes after the event.
Furthermore, we are not able to verify the assumption that the missing data were ignorable (an assumption of multiple imputation in that the missing data can be completely predicted from the observed data) [4–6, 12]. It is possible that more information would be required to generate an unbiased prediction of the data than is present in this database and this cannot be tested without this data. However, it is quite plausible that the nature of data collection in the GPRD will be such that the data is not missing at random and so the estimates of missing BMI values should be interpreted with caution.
Future work in this area in the GPRD should account for the properties of the missing data in this database. However, once the missing data are properly accounted for, the GPRD appears to be a rich source of data on lifestyle risk factors at the population level. The interesting finding of a J-shaped relationship between BMI and risk of first AMI, while seen for mortality in previous work, is novel for first AMI and should be explored further.
This work on obesity can be extended to other areas where the relationship between obesity and the disease is less well-known. Meanwhile, physicians should continue to advise patients to try and modify lifestyle factors, where possible, to reduce AMI risk.
This study was funded by the Canadian Institutes of Health Research (CIHR) and the Canadian Foundation for Innovation. The funding sources listed had no role in the study design, writing, analysis or the decision to submit this manuscript.
- De Bacquer D, De Backer G, Cokkinos D, Keil U, Montaye M, Ostor E, Pyorala K, Sans S: Overweight and obesity in patients with established coronary heart disease: are we meeting the challenge? Eur Heart J. 2004, 25: 121-8. 10.1016/j.ehj.2003.10.024.
- Mulnier HE, Seaman HE, Raleigh VS, Soedamah-Muthu SS, Colhoun HM, Lawrenson RA: Mortality in people with type 2 diabetes in the UK. Diabet Med. 2006, 23: 516-21. 10.1111/j.1464-5491.2006.01838.x.
- Andersohn F, Suissa S, Garbe E: Use of first- and second-generation cyclooxygenase-2-selective nonsteroidal antiinflammatory drugs and risk of acute myocardial infarction. Circulation. 2006, 113: 1950-7. 10.1161/CIRCULATIONAHA.105.602425.
- van der Heijden GJ, Donders AR, Stijnen T, Moons KG: Imputation of missing values is superior to complete case analysis and the missing-indicator method in multivariable diagnostic research: a clinical example. J Clin Epidemiol. 2006, 59: 1102-9. 10.1016/j.jclinepi.2006.01.015.
- Barzi F, Woodward M: Imputations of missing values in practice: results from imputations of serum cholesterol in 28 cohort studies. Am J Epidemiol. 2004, 160: 34-45. 10.1093/aje/kwh175.
- Greenland S, Finkle WD: A critical look at methods for handling missing covariates in epidemiologic regression analyses. Am J Epidemiol. 1995, 142: 1255-64.
- Lawrenson R, Williams T, Farmer R: Clinical information for research; the use of general practice databases. J Public Health Med. 1999, 21: 299-304. 10.1093/pubmed/21.3.299.
- Jick SS, Kaye JA, Vasilakis-Scaramozza C, Garcia Rodriguez LA, Ruigomez A, Meier CR, Schlienger RG, Black C, Jick H: Validity of the general practice research database. Pharmacotherapy. 2003, 23: 686-9. 10.1592/phco.23.5.686.32205.
- Gorelick MH: Bias arising from missing data in predictive models. J Clin Epidemiol. 2006, 59: 1115-23. 10.1016/j.jclinepi.2004.11.029.
- Physical Status: The Use and Interpretation of Anthropometry. Report of WHO Expert Committee. WHO Technical Report Series 854. 1995, Geneva: World Health Organization.
- Moons KG, Donders RA, Stijnen T, Harrell FE: Using the outcome for imputation of missing predictor values was preferred. J Clin Epidemiol. 2006, 59: 1092-101. 10.1016/j.jclinepi.2006.01.009.
- Schafer JL: Multiple imputation: a primer. Stat Methods Med Res. 1999, 8: 3-15. 10.1191/096228099671525676.
- Minutello RM, Chou ET, Hong MK, Bergman G, Parikh M, Iacovone F, Wong SC: Impact of body mass index on in-hospital outcomes following percutaneous coronary intervention (report from the New York State Angioplasty Registry). Am J Cardiol. 2004, 93: 1229-32. 10.1016/j.amjcard.2004.01.065.
- Powell BD, Lennon RJ, Lerman A, Bell MR, Berger PB, Higano ST, Holmes DR, Rihal CS: Association of body mass index with outcome after percutaneous coronary intervention. Am J Cardiol. 2003, 91: 472-6. 10.1016/S0002-9149(02)03252-6.
- Reeves BC, Ascione R, Chamberlain MH, Angelini GD: Effect of body mass index on early outcomes in patients undergoing coronary artery bypass surgery. J Am Coll Cardiol. 2003, 42: 668-76. 10.1016/S0735-1097(03)00777-0.
- Diercks DB, Roe MT, Mulgund J, Pollack CV, Kirk JD, Gibler WB, Ohman EM, Smith SC, Boden WE, Peterson ED: The obesity paradox in non-ST-segment elevation acute coronary syndromes: results from the Can Rapid risk stratification of Unstable angina patients Suppress ADverse outcomes with Early implementation of the American College of Cardiology/American Heart Association Guidelines Quality Improvement Initiative. Am Heart J. 2006, 152: 140-8. 10.1016/j.ahj.2005.09.024.
- Kragelund C, Hassager C, Hildebrandt P, Torp-Pedersen C, Kober L, TRACE study group: Impact of obesity on long-term prognosis following acute myocardial infarction. Int J Cardiol. 2005, 98: 123-31. 10.1016/j.ijcard.2004.03.042.
- Dudas KA, Wilhelmsen L, Rosengren A: Predictors of coronary bypass grafting in a population of middle-aged men. Eur J Cardiovasc Prev Rehabil. 2007, 14: 122-7. 10.1097/01.hjr.0000209814.82701.3f.
- Calle EE, Thun MJ, Petrelli JM, Rodriguez C, Heath CW: Body-mass index and mortality in a prospective cohort of U.S. adults. N Engl J Med. 1999, 341: 1097-105. 10.1056/NEJM199910073411501.
- Bogers RP, Bemelmans WJ, Hoogenveen RT, Boshuizen HC, Woodward M, Knekt P, van Dam RM, Hu FB, Visscher TL, Menotti A, Thorpe RJ, Jamrozik K, Calling S, Strand BH, Shipley MJ, for the BMI-CHD Collaboration Investigators: Association of overweight with increased risk of coronary heart disease partly independent of blood pressure and cholesterol levels: a meta-analysis of 21 cohort studies including more than 300 000 persons. Arch Intern Med. 2007, 167: 1720-8. 10.1001/archinte.167.16.1720.
- Teo KK, Ounpuu S, Hawken S, Pandey MR, Valentin V, Hunt D, Diaz R, Rashed W, Freeman R, Jiang L, Zhang X, Yusuf S, INTERHEART Study Investigators: Tobacco use and risk of myocardial infarction in 52 countries in the INTERHEART study: a case-control study. Lancet. 2006, 368: 647-58. 10.1016/S0140-6736(06)69249-0.
- Gallus S, Randi G, Negri E, Tavani A, La Vecchia C: Tar yield and risk of acute myocardial infarction: pooled analysis from three case-control studies. Eur J Cardiovasc Prev Rehabil. 2007, 14: 299-303. 10.1097/01.hjr.0000244574.17853.ed.
- Yusuf S, Hawken S, Ounpuu S, Dans T, Avezum A, Lanas F, McQueen M, Budaj A, Pais P, Varigos J, Lisheng L, INTERHEART Study Investigators: Effect of potentially modifiable risk factors associated with myocardial infarction in 52 countries (the INTERHEART study): case-control study. Lancet. 2004, 364: 937-52. 10.1016/S0140-6736(04)17018-9.
- Farin HM, Abbasi F, Reaven GM: Comparison of body mass index versus waist circumference with the metabolic changes that increase the risk of cardiovascular disease in insulin-resistant individuals. Am J Cardiol. 2006, 98: 1053-6. 10.1016/j.amjcard.2006.05.025.
- Lewis JD, Brensinger C: Agreement between GPRD smoking data: a survey of general practitioners and a population-based survey. Pharmacoepidemiol Drug Saf. 2004, 13: 437-41. 10.1002/pds.902.
- Panagiotakos DB, Pitsavos C, Stefanadis C, GREECS Study Investigators: Short-term prognosis of patients with acute coronary syndromes through the evaluation of physical activity status, the adoption of Mediterranean diet and smoking habits: the Greek Acute Coronary Syndromes (GREECS) study. Eur J Cardiovasc Prev Rehabil. 2006, 13: 901-8. 10.1097/01.hjr.0000221863.42286.1e.
- Hernán MA, Hernández-Díaz S, Werler MM, Mitchell AA: Causal knowledge as a prerequisite for confounding evaluation: an application to birth defects epidemiology. Am J Epidemiol. 2002, 155: 176-84. 10.1093/aje/155.2.176.
- Hernán MA, Hernández-Díaz S, Robins JM: A structural approach to selection bias. Epidemiology. 2004, 15: 615-25. 10.1097/01.ede.0000135174.63482.43.
- Schneider-Lindner V, Delaney JA, Dial S, Dascal A, Suissa S: Antimicrobial Drugs and Community-Acquired Methicillin-Resistant Staphylococcus aureus, UK. Emerg Infect Dis. 2007, 13 (7): 994-1000.
The pre-publication history for this paper can be accessed here: http://www.biomedcentral.com/1471-2261/7/38/prepub
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.