Skip to main content
  • Research article
  • Open access
  • Published:

Variations in cardiovascular disease under-diagnosis in England: national cross-sectional spatial analysis



There is under-diagnosis of cardiovascular disease (CVD) in the English population, despite financial incentives to encourage general practices to register new cases. We compared the modelled (expected) and diagnosed (observed) prevalence of three cardiovascular conditions- coronary heart disease (CHD), hypertension and stroke- at local level, their geographical variation, and population and healthcare predictors which might influence diagnosis.


Cross-sectional observational study in all English local authorities (351) and general practices (8,372) comparing model-based expected prevalence with diagnosed prevalence on practice disease registers. Spatial analyses were used to identify geographic clusters and variation in regression relationships.


A total of 9,682,176 patients were on practice CHD, stroke and transient ischaemic attack, and hypertension registers. There was wide spatial variation in observed: expected prevalence ratios for all three diseases, with less than five per cent of expected cases diagnosed in some areas. London and the surrounding area showed statistically significant discrepancies in observed: expected prevalence ratios, with observed prevalence much lower than the epidemiological models predicted. The addition of general practitioner supply as a variable yielded stronger regression results for all three conditions.


Despite almost universal access to free primary healthcare, there may be significant and highly variable under-diagnosis of CVD across England, which can be partially explained by persistent inequity in GP supply. Disease management studies should consider the possible impact of under-diagnosis on population health outcomes. Compared to classical regression modelling, spatial analytic techniques can provide additional information on risk factors for under-diagnosis, and can suggest where healthcare resources may be most needed.

Peer Review reports


Geographic variation in the incidence and prevalence of CVD is a well-known phenomenon in population surveys. Such variations in both CHD and stroke incidence and prevalence are largely explained by area- and person-specific factors such as population socio-economic composition, demographic structure and ethnic diversity [14], mediated through established CVD risk factors [5]. In the case of England, this is exemplified at a regional level by a North-South gradient in prevalence and outcomes, with higher prevalences in the North [6].

From a health services perspective, it is important to ensure that as high a proportion as possible of actual cases have been diagnosed and well-managed when secondary prevention is known to be effective, as is the case for CVD [7]. Under-diagnosis and/or under-treatment of CVD has been found in some previous United Kingdom (UK) studies [8, 9], but it was not possible to differentiate these causes. The recent availability of registered prevalence, performance and outcome data through the Quality & Outcomes Framework (QOF) pay-for-performance programme, in which almost all UK general practices participate, has provided accurate counts of diagnosed disease prevalence, and assurance that identified CVD is being increasingly well-managed. For example, in 2008-9, achievement by English general practices of all the 89 QOF points available for CHD management was 99.1 per cent [10]. The QOF also provides a financial reward for registering new cases of disease.

However, the extent of under-diagnosis may adversely affect population CVD outcomes. Epidemiologic models can be used to provide estimates of expected prevalence for small populations, using local data for known risk factors. Such models can be used for:

  • targeting case-finding initiatives, by comparing diagnosed/observed and expected prevalence

  • supporting needs assessment

  • strategic planning

  • underpinning commissioning of health services by providing denominators

  • health equity audit

  • supporting resource allocation [11].

A recent study which compared observed (QOF-registered) and expected prevalences of CHD and hypertension suggested that under-diagnosis may vary geographically [12]. However, the CHD model used by the authors was derived from the General Practice Research Database, resulting in a lack of independence between the observed and predicted datasets. Several disease prevalence models are now available on the Association of Public Health Observatories' (APHO) website [13]. These provide independently-derived prevalence estimates for resident populations of English local authority (LA), primary care trust (PCT), and practice populations, which can be compared with QOF disease registers [10]. Using these models, we investigated the observed and expected prevalence of three cardiovascular conditions (CHD, hypertension and stroke) at LA level to identify unmet population health care needs, their geographical variation, and population and healthcare predictors which might influence diagnosis.


Data sources: registered prevalence and primary care supply

Counts of general practice-registered, i.e. observed prevalence of CHD, stroke and hypertension in April 2007 were obtained from English general practice disease registers, produced for the purposes of incentivizing practices for achievement of QOF treatment targets. These patients are clinically confirmed cases of CVD who are receiving regular follow-up for their disease. QOF prevalence rates are based on total populations registered with practices, but to compare observed prevalence with expected prevalence estimated from resident population-based contextual data from the Census and other sources e.g. deprivation and proportion ethnic minority population, we derived residence-based QOF prevalence estimates for LAs using a lookup table - a pooled extract of England practice registers - from the National Strategic Tracing Service (now Personal Demographics Service) which apportioned practice populations to LA areas as at January 2006, by providing the exact number of practice population resident in each LA [14].

We apportioned counts of CVD patients registered by practices to LAs, in accordance with the proportion of each practice population resident in that area, which assumed that CVD prevalence was geographically uniform across a practice population (the average practice population is only about 6,400). We divided the aggregated count of CVD patients in each LA by total mid-2006 LA population estimates to give estimated crude prevalence, as used for QOF prevalence. Where less than 50 patients fell into an LA, the numbers were excluded from the look-up process. Three LAs could not be mapped due to discrepancies between QOF and NSTS datasets. In order to investigate the effect of healthcare supply upon diagnosis levels we included a measure of general practitioner availability in the form of the number of general practitioners (GPs) per thousand LA population, calculated in the same way [15].

Data sources: expected/estimated prevalence

Expected prevalence for each LA was obtained from the APHO epidemiologic models, which are based on the socio-demographic and behavioural characteristics of respondents with the respective conditions in the Health Survey for England (HSfE). To produce the models, HSfE data for the years 2003-4 was pooled (sample size 21,233) to increase the cases of diseases. Surveys over this period included boosts for ethnic minority and older people, and focused on CVD and its risk factors. Of the respondents included, 14,574 (68.6%) were of White ethnicity, 308 were Mixed (1.5%), 1,991 (9.4%) were Black or Black British, and 3,725 (17.5%) Asian or Asian British. The outcome variables were patient-reported doctor-diagnosed CHD and stroke, and for hypertension, normotensive-treated, hypertensive-treated but uncontrolled, and hypertensive-untreated groups; i.e. a combination of patient-reported and objectively-measured variables. Patient reports of doctor-diagnosed CHD and stroke have been extensively validated elsewhere [1620]. For example, in the British Regional Heart Study, 80 per cent of men with a GP record of angina reported their diagnosis, and 70 per cent of men who reported an angina diagnosis had confirmation of this from the record review. The prevalence of diagnosed angina in these older men was 10.1 per cent according to self-reported history and 8.9 per cent according to GP record review [16]. At that time (1999) some cases, such as newly-registered patients, may not have had their diagnosis clearly recorded in GP records.

Ordinary least-squares (OLS) logistic regression models were fitted and explanatory variables for each disease outcome identified by reverse stepwise selection.

The baseline odds of each disease were obtained directly from the HSfE dataset. The strength of association between each explanatory variable and disease caseness was then used to calculate the relative odds, which were applied to the baseline odds to derive the prevalence estimates for each sub-group of risk factors. The variables which can be included in each local model are limited by the availability of local data for them from Census and other national sources. The core model variables are ten-year age band, gender (male and female), ethnicity (Asian/Asian British, Black/Black British, White, Mixed and Other including Chinese) and deprivation (based on Index of Multiple Deprivation 2004 scores) [21]. In the case of CHD and stroke, smoking prevalence is also included, and the stroke model does not include ethnicity. The models use 2006 mid-year quinary age-band population estimates by ethnic group from the UK Office for National Statistics (ONS), which were summed to 10-year age bands to match the model [22]. LAs are stratified into deprivation score bands based on cut-offs of Lower Super Output Area quintiles - the ONS categories used in the HSfE. Internal validation included using the models to predict the response for each subject in the source data, and area under the receiver operating characteristics (AUROC) curve. AUROC curve values were 0.834, 0.844, and 0.807 for the stroke, CHD and hypertension models respectively. External validation showed that prevalence gradients derived from the models - for example with age and smoking status - agree well with published results from population-based studies.

In the case of the CHD and stroke models, smoking status is also included. Local smoking prevalence estimates are not available from the HSfE because of small sample sizes, so the CHD model uses synthetic estimates from the Neighbourhood Statistics website, which are for the period 2003-2005 [23]. Model assumptions include that the proportion of smokers, ex-smokers and never-smokers is uniform across ethnic categories and that the proportion of ex-smokers in each age-sex group is constant across areas. Sensitivity analysis has shown that varying the smoking prevalence has a very small effect on prevalence. Further technical details of the models are available in additional files 1 (CHD), 2 (hypertension) and 3 (stroke) in the web appendix, and also on the APHO website [13].

Spatial analyses

Observed: expected prevalence ratios for LAs were calculated in Excel 2007 and mapped using the geographic information systems package ArcGIS 9. Two exploratory spatial data analysis methods commonly used in geographical studies were used to investigate patterns in O:E relationships, Local Moran's I (LMI) analysis and geographically weighted regression (GWR). The LMI technique is used to identify geographic clusters and outliers in data by testing for randomness in spatial distribution across a dataset, localities with significance scores (Z scores) greater than two standard deviations being considered to be either clusters or outliers [24]. Strongly positive Z scores indicate statistically significant similar values in close geographic proximity hence the presence of a cluster; a strongly negative Z score demonstrates a locality with a significantly dissimilar value in relation to its neighbouring localities thus indicating an outlier.

GWR is a form of spatial statistics which disaggregates geographic data into spatial blocks using a probability distribution kernel, which moves from location to location across the dataset to test for geographic variation in regression relationships. In situations where there is geographic variation in the strength of a regression relationship, a phenomenon referred to as spatial non-stationarity, the use of GWR will improve model goodness-of-fit to data, expressed as the trade-off between statistical predictor bias (linked to R-squared values) and variance (linked to degrees of freedom). In comparison to a classical model GWR will produce higher correlation coefficients, lower residuals and higher degrees of freedom than traditional ordinary least squares (OLS) regression [25, 26].

In this paper, we used GWR to assess whether a linear regression relationship between observed and expected prevalence existed and if so whether it varied in strength over space, the purpose of which should be viewed as distinct from that of mapping observed to expected ratios, as the latter aims to measure equality between two variables rather than to assess predictability of an association. Both OLS and GWR models were run in the software package GWR 3 to test for spatial non-stationarity. The optimal bandwidth for the kernel was estimated using the Akaike Information Criterion [27]. Two rounds of regression were performed, the first a univariate regression involving expected prevalence as the independent variable and observed prevalence as the dependent variable; the second bivariate including whole-time equivalent GP supply as an additional variable.

Our research conformed to the Helsinki Declaration, and to local legislation. It did not require ethical approval or patient consent as it is a secondary analysis of publicly-available data.


Observed/diagnosed prevalence of CVD

Observed (O) and expected (E) prevalence summary statistics are shown in Table 1. Total English population was used as the denominator for consistency with standard reporting of QOF prevalence. The mean prevalence of QOF-diagnosed CHD in LAs was 2.57% (95% CIs 2.45-2.69), and the expected prevalence was 4.52% (95% CIs 4.42-4.61), giving an O:E ratio of 0.57 i.e. about 60 per cent of expected cases are diagnosed. Although the prevalence of stroke is less than half that of CHD, the O:E ratio is very similar. The expected prevalence of hypertension is much higher (about 24% of over 16s), and the O:E ratio is only 0.37 i.e. less than 40% of cases may be diagnosed. There was wide variation in the O:E ratio between LAs, with less than five per cent of expected cases diagnosed in some areas for all three diseases.

Table 1 observed (GP-registered) and expected/modelled prevalence summary statistics

Spatial analysis

Mapping of observed and expected prevalence demonstrated that observed and predicted prevalence for both diseases showed strong north-south and southwest-southeast gradients, with the southeast generally showing lowest disease levels, especially in the case of expected prevalence. However, within these geographical trends in both QOF and modelled prevalence, mapping of O:E ratios demonstrated spatial variation between neighbouring LAs. LMI Z scores indicated the presence of some statistically significant clusters and outliers in O:E ratios (Figure 1, Figure 2 and Figure 3).

Figure 1
figure 1

Clusters and outliers in observed to expected ratios for CHD (red indicates clusters and blue indicates outliers).

Figure 2
figure 2

Clusters and outliers in observed to expected ratios for hypertension (red indicates clusters and blue indicates outliers).

Figure 3
figure 3

Clusters and outliers in observed to expected ratios for stroke (red indicates clusters and blue indicates outliers).

For all three conditions, particularly CHD and stroke, London and much of its hinterland showed statistically significant discrepancies in O:E ratios, with observed prevalence much lower than the epidemiological models predicted. For CHD there were also significant clusters and outliers in parts of northern England, especially in Cumbria and Yorkshire, though unlike for the London area the clusters typically related to O:E ratios tending towards unity.

Further geographic analysis with GWR revealed significant spatial variation in the O:E relationship for CHD, hypertension and stroke, with GWR describing the dataset more accurately than traditional OLS regression. The linearity of the relationship between observed and expected prevalence was weakest for hypertension with the lowest overall regression coefficients and highest residual sum of squares for observed against expected prevalence; stronger associations were observed for CHD and stroke.

Even maximum coefficient values for hypertension, found along the south and east coasts, were less than 0.26; coefficients were lowest in the north and a pronounced north-south gradient was observed. The pattern for stroke was somewhat more complex with coefficients typically less than 0.21 for much of the northern parts of the country and the Midlands, yet with parts of Kent and the Sussex Counties having coefficients up to 0.69. CHD followed a similar geographic structure to stroke, with values of less than 0.08 for much of the Midlands and north, yet rising to 0.66 in eastern and southern coastal areas. There were no statistically significant patterns in Cook's D and standardised residual results for any of the three conditions examined. However, although O:E ratios tended towards 1 in the north and declined further south, the GWR results showed that the predictability of the relationship between observed and expected prevalence decreased with an increasing latitude. The addition of GP supply as a variable yielded stronger regression results for all three conditions (Table 2).

Table 2 comparison of classical regression and GWR results


Main findings

The findings presented here indicate that despite almost universal access to free primary healthcare services, and a significant financial incentive for these services to find and register new cases on their computer systems, there may be significant under-diagnosis of all three CVD conditions examined in this study in many areas in England. The difference between expected and recorded disease is most marked for hypertension, with strikingly wide variation in diagnosis levels between areas. Hypertension and stroke showed similar geographic variation to CHD, but without the clusters/outliers for northern England; additionally the contrast between the ratios for London and LAs immediately to the north was more pronounced for stroke.

An obvious question is: why was there such a discrepancy between expected and practice-registered disease prevalence, when the model-based CHD and stroke prevalence estimates were based on patient reports of doctor-diagnosed disease? Reasons why practice computer systems may under-record cases are likely to include inadequate searching of practice records (which are now in the UK mainly electronic) for previous diagnoses or CVD-related prescriptions [28, 29], lack of linkage of practice and hospital records (practice still have to enter codes for hospital admissions manually), and high population mobility in urban areas: data for new patients is often still entered into practice systems manually and previous diagnoses may not be added for some time [30]. The QOF does not appear to have resulted in substantial improvement in recording. The crude QOF CHD prevalence was 3.44% in 2006-7, the year after QOF was implemented, but was 3.54% in 2009/10 - a minimal increase. In the case of hypertension, for which the modelled estimates used a combination of doctor-diagnosed disease and blood pressure measurements, the gap is greatest in younger and middle age groups, when males in particular are less likely to be seen by their practices [31]. In contrast, diagnosis appears to be more complete in the north of England.

From a spatial analytic perspective, there are significant geographic discrepancies between QOF prevalence and modelled prevalence for all three conditions in much of London and its hinterland. The inclusion of a measure of GP supply in the GWR analyses suggests that, despite needs-based healthcare resource allocation in the UK, persistent differences in availability of primary care services is an important limiting factor in diagnosis. An analysis of Gini coefficients to measure geographical equity in GPs per 100,000 population in England and Scotland showed that equity in England rose between 1974 and 1994, but then decreased, and in 2006 it was below the 1974 level [32].

The results also provide some contrast to a study investigating revascularisation rates in males, which found that even after adjusting for the higher CHD burden in the north of the country related to socio-demographic composition, the likelihood of receiving surgery during the 1990s decreased outside of southern England, suggesting a geographic imbalance in the provision of tertiary cardiology care [33]. Considerable efforts have been made subsequently to ensure that access to tertiary care is more equitable. However our analysis suggests that from a primary care perspective, it is London and the surrounding area which would benefit most from increased resource allocation, although GP supply itself seems to be only one influential factor affecting diagnosis levels. Further work will need to investigate the effectiveness and yield of strategies for CVD case-finding by practices, and to validate the predictions of the prevalence models at practice level.

Strengths and limitations of the study

Strengths of the study include the use of new data from practice disease registers and recently-developed geographical analytical techniques and software. Limitations include the fact that modelling was carried out at LA level, which may conceal much wider variation at lower levels. The spatial scale of the analyses includes many LAs which are highly heterogeneous in socio-demographic composition, so the study may have missed significant small area variation; this is especially likely in northern areas where regression associations were weakest. There is imprecision in the model prevalence estimates, for example, because of the size of the Health Survey sample we used, especially in subgroups with small numbers of cases - the precision of the estimates is also affected by the prevalence of the disease outcomes. Model restrictions include that the proportion of smokers, ex-smokers and never-smokers is uniform across ethnic categories, when, for example, smoking prevalence varied from 16.4% in the Black/Black British sample population to 24.1% in the White sample population. Smoking prevalence is itself a modelled estimate, and ex-smoking prevalence is assumed to be the same nationwide. In addition the number of variables in the local prevalence models is constrained by the availability of local data on risk factors, so that it is not possible to include other well-established CVD risk factors as variables. Some of the local risk factor data relies on estimates based on 2001 Census data, although this will be improved by, for example, more accurate and timely data on smoking prevalence from the new ONS Integrated Household Survey. However we were reassured by the results of model validation, e.g. ROC curves. In addition, comparing model predictions to QOF registrations is also a form of external validation. For example, QOF registered counts of CHD were greater than expected counts in only three of 352 LAs, and in these only by small amounts. If the models were very imprecise, much more under-estimation would be expected. Further model validation is needed, and is occurring as the models are being used to guide local case-finding initiatives.


Despite the absence of barriers to primary healthcare, there is likely to be significant and highly variable under-diagnosis of CVD across England, which can be partially explained by persistent inequity in GP supply. However, the distinctive composition and dynamics of London's population probably adds complexities to the identification of CVD in that region. Studies of disease management should consider the impact of this "iceberg" of undiagnosed disease on hospital utilisation and population health outcomes. Spatial analytic techniques can provide additional information about geographical variation compared to classical regression modelling, and can suggest where more healthcare resources may be most needed.

Data sharing

Datasets and statistical code for GWR are available from the corresponding author at


  1. Cox AM, McKevitt C, Rudd AG, Wolfe CD: Socioeconomic status and stroke. Lancet Neurol. 2006, 5 (2): 181-188. 10.1016/S1474-4422(06)70351-9.

    Article  PubMed  Google Scholar 

  2. Hajat C, Tilling K, Stewart JA, Lemic-Stojcevic N, Wolfe CDA: Ethnic Differences in Risk Factors for Ischemic Stroke: A European Case-Control Study. Stroke. 2004, 35 (7): 1562-1567. 10.1161/01.STR.0000131903.04708.b8.

    Article  PubMed  Google Scholar 

  3. Hemingway H, Langenberg C, Damant J, Frost C, Pyorala K, Barrett-Connor E: Prevalence of Angina in Women Versus Men: A Systematic Review and Meta-Analysis of International Variations Across 31 Countries. Circulation. 2008, 117 (12): 1526-1536. 10.1161/CIRCULATIONAHA.107.720953.

    Article  PubMed  Google Scholar 

  4. McFadden E, Luben R, Wareham N, Bingham S, Khaw KT: Social Class, Risk Factors, and Stroke Incidence in Men and Women. A Prospective Study in the European Prospective Investigation Into Cancer in Norfolk Cohort. Stroke. 2009, 40 (4): 1070-1077. 10.1161/STROKEAHA.108.533414.

    Article  PubMed  Google Scholar 

  5. Morris RW, Whincup PH, Lampe FC, Walker M, Wannamethee SG, Shaper AG: Geographic variation in incidence of coronary heart disease in Britain: the contribution of established risk factors. Heart. 2001, 86 (3): 277-283. 10.1136/heart.86.3.277.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  6. Doran T, Drever F, Whitehead M: Health underachievement and overachievement in English local authorities. J Epidemiol Comm Health. 2006, 60 (8): 686-693. 10.1136/jech.2005.041889.

    Article  Google Scholar 

  7. Clark AM, Hartling L, Vandermeer B, McAlister FA: Meta-Analysis: Secondary Prevention Programs for Patients with Coronary Artery Disease. Ann Intern Med. 2005, 143 (9): 659-672.

    Article  PubMed  Google Scholar 

  8. Ashworth M, Lloyd D, Smith RS, Wagner A, Rowlands G: Social deprivation and statin prescribing: a cross-sectional analysis using data from the new UK general practitioner Quality and Outcomes Framework. J Publ Health. 2007, 29 (1): 40-47. 10.1093/pubmed/fdl068.

    Article  CAS  Google Scholar 

  9. Ward P, Noyce P, Leger A: Are GP practice prescribing rates for coronary heart disease drugs equitable? A cross sectional analysis in four primary care trusts in England. J Epidemiol Comm Health. 2004, 58: 89-96. 10.1136/jech.58.2.89.

    Article  CAS  Google Scholar 

  10. The Information Centre: Quality and Outcomes Framework (QOF) for April 2006 to March 2007, England: Numbers of patients on QOF disease registers, and unadjusted prevalence rates. The Information Centre. 2008, [cited 2008 Apr. 25];, []

    Google Scholar 

  11. Soljak M, Flowers J: Closing the Gap: Using Prevalence Models for Long-term Conditions in the United Kingdom. J Ambul Care Manage. 2008, 31 (3): 211-215.

    Article  PubMed  Google Scholar 

  12. Campbell SM, Reeves D, Kontopantelis E, Sibbald B, Roland M: Effects of Pay for Performance on the Quality of Primary Care in England. N Engl J Med. 2009, 361 (4): 368-378. 10.1056/NEJMsa0807651.

    Article  CAS  PubMed  Google Scholar 

  13. Association of Public Health Observatories Browsing Disease prevalence models. 2009, APHO, cited 10/3/2010., []

  14. Connecting for Health: The Personal Demographics Service. 2010, Connecting for Health, cited 30/9/2010., []

    Google Scholar 

  15. Information Centre for Health & Social Care: NHS Staff 1996 - 2006 (General Practice). 2010, Information Centre for health & social care, cited 30/9/2010., []

    Google Scholar 

  16. Lampe FC, Walker M, Lennon LT, Whincup PH, Ebrahim S: Validity of a Self-reported History of Doctor-diagnosed Angina. J Clin Epidemiol. 1999, 52 (1): 73-81. 10.1016/S0895-4356(98)00146-2.

    Article  CAS  PubMed  Google Scholar 

  17. Glymour MM, Avendano M: Can Self-Reported Strokes Be Used to Study Stroke Incidence and Risk Factors? Evidence From the Health and Retirement Study. Stroke. 2009, 40 (3): 873-879. 10.1161/STROKEAHA.108.529479.

    Article  PubMed  Google Scholar 

  18. Kazumasa Y, Ai I, Hiroyasu I, Manami I, Shoichiro T: Self-reported stroke and myocardial infarction had adequate sensitivity in a population-based prospective study. Journal of clinical epidemiology. 2009, 62 (6): 667-673. 10.1016/j.jclinepi.2008.07.016.

    Article  Google Scholar 

  19. O'Mahony PG, Dobson R, Rodgers H, James OFW, Thomson RG: Validation of a Population Screening Questionnaire to Assess Prevalence of Stroke. Stroke. 1995, 26 (8): 1334-1337.

    Article  PubMed  Google Scholar 

  20. Walker MK, Whincup PH, Shaper AG, Lennon LT, Thomson AG: Validation of Patient Recall of Doctor-diagnosed Heart Attack and Stroke: A Postal Questionnaire and Record Review Comparison. Am J Epidemio. 1998, 148 (4): 355-361.

    Article  CAS  Google Scholar 

  21. Department of Communities & Local Government: The English Indices of Deprivation 2004 (revised). 2008, DCLG, cited 18/7/2008., []

    Google Scholar 

  22. Office for National Statistics: Population Estimates by Ethnic Group (experimental). National Statistics Online. 2007, cited 18/7/2008., []

    Google Scholar 

  23. Office for National Statistics: Model Based Estimate for Smoking in Healthy Lifestyle Behaviours: Model Based Estimates, 2003-2005. 2007, Neighbourhood Statistics, [$ph=60_61_62&step=4&productId=969&instanceSelection=022951&timeId=242&containerAreaId=276706&startColumn=1&numberOfColumns=8&viewAction=parent]

    Google Scholar 

  24. Anselin L: Local indicators of spatial association: LISA. Geog Anal. 2009, 27: 93-115. 10.1111/j.1538-4632.1995.tb00338.x.

    Article  Google Scholar 

  25. Brunsdon C, Fotheringham S, Charlton M: Geographically weighted regression-modelling spatial non-stationarity. J Royal Stat Soc: Series D (The Statistician). 1998, 47 (3): 431-10.1111/1467-9884.00145.

    Google Scholar 

  26. Fotheringham AS, Brunsdon C, Charlton M: Geographically Weighted Regression: The Analysis of Spatially Varying Relationships. 2002, Chichester: Wiley

    Google Scholar 

  27. Akaike H: A new look at statistical model identification. Automatic Control, IEEE Transact. 1974, 19 (6): 716-723. 10.1109/TAC.1974.1100705.

    Article  Google Scholar 

  28. Gray J, Majeed A, Kerry S, Rowlands G: Identifying patients with ischaemic heart disease in general practice: cross sectional study of paper and computerised medical records. BMJ. 2000, 321 (7260): 548-550. 10.1136/bmj.321.7260.548.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  29. Donnan PT, Dougall HT, Sullivan FM: Optimal strategies for identifying patients with myocardial infarction in general practice. Fam Pract. 2003, 20 (6): 706-710. 10.1093/fampra/cmg614.

    Article  PubMed  Google Scholar 

  30. Millett C, Zelenyanszki C, Binysh K, Lancaster J, Majeed A: Population mobility: characteristics of people registering with general practices. Pub Health. 2005, 119 (7): 632-638. 10.1016/j.puhe.2004.09.004.

    Article  CAS  Google Scholar 

  31. Craig R, Mindell J: Health Survey for England 2006: Volume 1 CVD and risk factors adults, obesity and risk factors children. Information Centre for Health & Social Care. 2008, cited 26/11/2008., []

    Google Scholar 

  32. Goddard M, Gravelle H, Hole A, Marini G: Where did all the GPs go? Increasing supply and geographical equity in England and Scotland. J Health Serv Res Policy. 2010, 15 (1): 28-35. 10.1258/jhsrp.2009.009003.

    Article  PubMed  Google Scholar 

  33. Morris RW, Whincup PH, Papacosta O, Walker M, Thomson A: Inequalities in coronary revascularisation during the 1990s: evidence from the British regional heart study. Heart. 2005, 91 (5): 635-640. 10.1136/hrt.2004.037507.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

Pre-publication history

Download references


This study was funded by the Care Quality Commission and supported by the Department of Public Health & Primary Care at Imperial College London. We thank the Association of Public Health Observatories for supporting and hosting the disease prevalence models, and the NHS Information Centre for supplying some of the data used in this study. The development of the disease prevalence models was funded by the Department of Health through the Informing Healthier Choices Programme. The Department of Primary Care & Public Health at Imperial College is grateful for support from the NIHR Collaboration for Leadership in Applied Health Research & Care (CLAHRC) Scheme, the NIHR Biomedical Research Centre scheme, and the Imperial Centre for Patient Safety and Service Quality. MS was funded by the Economic & Social Research Council. The views expressed in the article are those of the authors.

Author information

Authors and Affiliations


Corresponding author

Correspondence to Michael Soljak.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

MS and HW developed the prevalence models. MS conceived and planned the study, and wrote the first draft of the paper. ES and TI undertook the spatial analysis, and all authors were involved in the interpretation of the data. All authors revised it critically for important intellectual content and approved the final version.

Michael Soljak, Edgar Samarasundera, Tejal Indulkar, Hannah Walford and Azeem Majeed contributed equally to this work.

Electronic supplementary material


Additional file 1: CHD prevalence modelling briefing document v5. This document describes how the CHD prevalence model was developed from Health Survey for England data and how the model was applied to local population data. (PDF 1 MB)


Additional file 2: Hypertension prevalence modelling briefing document v2. This document describes how the hypertension prevalence model was developed from Health Survey for England data and how the model was applied to local population data. (PDF 1 MB)


Additional file 3: Stroke prevalence modelling briefing document v2. This document describes how the hypertension prevalence model was developed from Health Survey for England data and how the model was applied to local population data. (PDF 988 KB)

Authors’ original submitted files for images

Below are the links to the authors’ original submitted files for images.

Authors’ original file for figure 1

Authors’ original file for figure 2

Authors’ original file for figure 3

Rights and permissions

This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Soljak, M., Samarasundera, E., Indulkar, T. et al. Variations in cardiovascular disease under-diagnosis in England: national cross-sectional spatial analysis. BMC Cardiovasc Disord 11, 12 (2011).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: