Harnessing technology and molecular analysis to understand the development of cardiovascular diseases in Asia: a prospective cohort study (SingHEART)
BMC Cardiovascular Disorders volume 19, Article number: 259 (2019)
Cardiovascular disease (CVD) imposes much mortality and morbidity worldwide. The use of “deep learning”, advancements in genomics, metabolomics, proteomics and devices like wearables have the potential to unearth new insights in the field of cardiology. Currently, in Asia, there are no studies that combine the use of conventional clinical information with these advanced technologies. We aim to harness these new technologies to understand the development of cardiovascular disease in Asia.
Singapore is a multi-ethnic country in Asia with well-represented diverse ethnicities including Chinese, Malays and Indians. The SingHEART study is the first technology driven multi-ethnic prospective population-based study of healthy Asians. Healthy male and female subjects aged 21–69 years old without any prior cardiovascular disease or diabetes mellitus will be recruited from the general population. All subjects are consented to undergo a detailed on-line questionnaire, basic blood investigations, resting and continuous electrocardiogram and blood pressure monitoring, activity and sleep tracking, calcium score, cardiac magnetic resonance imaging, whole genome sequencing and lipidomic analysis. Outcomes studied will include mortality and cause of mortality, myocardial infarction, stroke, malignancy, heart failure, and the development of co-morbidities.
An initial target of 2500 patients has been set. From October 2015 to May 2017, an initial 683 subjects have been recruited and have completed the initial work-up the SingHEART project is the first contemporary population-based study in Asia that will include whole genome sequencing and deep phenotyping: including advanced imaging and wearable data, to better understand the development of cardiovascular disease across different ethnic groups in Asia.
Cardiovascular disease (CVD) is a major cause of mortality and morbidity worldwide. Traditional large population-based studies in Western cohorts (e.g. Framingham Heart Study, Cardiovascular Health Study, Atherosclerosis Risk in Communities Study, Jackson Heart cohort) have formed the foundation of our initial knowledge on the risk factors of cardiovascular disease [1,2,3,4]. With advancements in technology, there have been many more exciting developments in our understanding of the development and prevention of CVD. Advances made in genomics, metabolomics, proteomics and data from wearables have the potential to deliver important new insights in the field of cardiology, especially after smaller datasets are combined into multi-domain, high-dimensionality “big data” and analyzed using novel machine learning techniques.
In the last couple of years, the American Heart Association (AHA) and Google Life Sciences (GLS) have forged a joint collaboration to use technology to tackle the scourge of CVD . This collaboration will allow the scientific community to leverage on the technical capabilities and insights offered by Google Life Sciences. With the unique opportunity to access such resources, this project is forecasted to allow researchers to conceptualize and test new approaches . Recent modern prospective population-based studies (Project Baseline, MURDOCK study) have also been developed to answer these questions with the aid of technology [6, 7]. Project Baseline aims to characterize participants across clinical, molecular, imaging, sensor, self-reported, behavioural, psychological, environmental and other health-related measurements from onsite visits, continuous data collection through sensor technology, and regular engagement via an online portal and mobile app . The MURDOCK study aims to reclassify cardiovascular risk using integrated clinical and molecular biosignatures. However, these studies are based primarily in Western populations .
Ethnic differences in CVD exist [8,9,10,11]. The Multi-Ethnic Study of Atherosclerosis (MESA) study found distinct racial differences in the risk of incident heart failure . The mechanisms are multifactorial with complex interactions between unmodifiable genetic factors and acquired risk factors. Often times, it has been difficult to distinguish the contributory role between socioeconomic and genetic factors towards development of disease with residual uncertainty regarding causality and pathogenic mechanisms [12, 13]. Currently, in Asia, there are no population-based studies that combine the use of conventional clinical information with advanced ‘discovery’-type molecular techniques, advanced quantitative cardiac imaging and wearable technologies to characterize the general community. We aim to harness these new technologies to understand the development of cardiovascular disease in Asia.
Study design and population
Singapore is a developed city state of 5.6 million people in Asia with well-represented diverse ethnicities including Chinese (74.3%), Malays (13.4%) and Indians (9.1%). The elderly ≥65 yr account for 12.4% of the population and the average life-expectancy is 82.9 years old .
The SingHEART study is the first multi-ethnic prospective population-based study of healthy Asians harnessing the latest technologies. In summary, healthy male and female subjects aged 21–69 years old without any prior cardiovascular disease (Ischemic heart disease, stroke, peripheral vascular disease) or diabetes mellitus will be recruited from the general population. The complete inclusion and exclusion criteria are found in the supplementary appendix (Additional file 1: Appendix 1). These patients are drawn from a cohort of healthy individuals who have already volunteered to the institutional Biobank program. Subjects fulfilling the inclusion criteria will be recruited from the public via advertisements (e.g. posters, local newspaper).
Written informed consent will be obtained from every subject and includes follow-up of up to 20 years, including agreement to permit tracking of outcomes via national and disease registries. Subjects will be informed of incidental abnormalities picked up during the tests. If genetic abnormalities are detected, there is an option for genetic counselling. All data will be anonymized for analysis.
Ethical approval was obtained from the Singhealth institutional review board.
This study aims to comprehensively characterize a healthy Asian population at baseline and at follow-up, using multiple modalities to aid in the understanding of the traditional and novel factors that contribute to the development of cardiovascular disease. Emphasis will be placed on understanding how these traditional and novel factors interact, as well as on elucidating the ethnic differences in the development of cardiovascular disease. The specific objectives are as follows:
To characterize cardiovascular health in Asia by measuring multiple systems simultaneously and longitudinally.
To assess lifestyle, diet, physical activity and sleep via traditional questionnaire surveys and wearable technologies and their impact on the development of cardiovascular disease.
To characterize baseline genetic, metabolic and advanced cardiac imaging profiles in healthy individuals and the identification of novel markers influencing the development of cardiovascular disease with the potential for therapeutics.
Validate patient-reported sleep/physical activity and that derived by wearables, study the impact of physical activity on cardiac structure, investigate relationship between calcium score and lifestyle factors, etc.
To study the development and progression of cardiovascular disease in these patients to help develop new preventive, diagnostic and predictive tools for the Asian population
To understand the differential effects of ethnicity on the development of cardiovascular disease
To use both traditional statistical analysis and newer methodologies (e.g. machine learning) to process and analyze the data
Each subject will undergo the following investigations at baseline and at specific intervals on follow-up for up to 20 years, as described in Table 1.
The questionnaire will include sections on demographics, socioeconomic status, medical history, lifestyle, diet and exercise, quality of life as assessed by the EQ-5D-3 L, Pittsburgh sleep quality index and International Index of Erectile Function (IIEF)- 5 (only for males). See supplementary appendix for the complete questionnaire (Additional file 2: Appendix 2).
Basic blood investigations
This will include full blood count, renal and liver function, fasting lipids and fasting glucose. As diabetes mellitus is an exclusion criterion, HbA1c will not be measured routinely. Samples are biobanked for future use through the NHCS biobank.
A standard 12-lead resting ECG will be recorded. Variables studied will include rate, rhythm, axis, conduction intervals, morphologies (including QRS, S-T and T wave abnormalities) and arrhythmias.
Ambulatory BP monitor
Ambulatory blood pressure monitoring will be performed for 24 h via a cuff-monitoring (Spacelab Healthcare Model 90,227/90217A). Data collected will include the systolic, diastolic and mean blood pressure during the day and night, as well as data on dipping.
Continuous ECG monitoring
Continuous ECG monitoring will be performed for 24 h–72 h via a wearable, multi-lead ECG monitoring patch which stores data for up to 3 days (ePatch® ECG recorder AMS3000). Data collected will include variations in heart rate and the various types of arrhythmias present.
Activity and sleep tracker
A commercially available wearable fitness device able to track heart rate, physical activity, exercise and sleep will be used. The participants will wear this activity tracker for 5–7 days. Currently, the study uses the Fitbit Charge HR. Data for each subject will be downloaded from the Fitbit Application Programming Interface (https://dev.fitbit.com/reference/web-api/quickstart). Step counts are available at two levels; intraday step counts in 15-min intervals and daily totals. Intraday HR data are available at 5-min intervals, along with confidence levels. Intraday sleep tracking data containing details of each sleep session will also be recorded.
Electron beam CT scanning using contiguous 3-mm slices during a single breath hold will be performed by a 320-slice CT scanner (Toshiba Aquilion ONE) with the following parameters: tube voltage 120kVp, tube current 200-400 mA (dependent on patient size and shape as visually assessed by the radiographers), gantry rotation time 350 ms and 3 mm section thickness. The non-contrast scan will be volume prospective and ECG-gated and synchronized to the RR interval with a scan time of 100 ms/slice. A calcified lesion will be defined as at least two contiguous voxels with an attenuation coefficient > 130 Hounsfield units. Coronary calcium scores will be calculated as previously described  via Vitrea Workstation.
Cardiovascular phenotyping using cardiovascular magnetic resonance (CMR) will be performed in all healthy volunteers (3 T Ingenia, Philips or 1.5 T MAGNETOM Aera, Siemens). Conventional balanced steady-state free precession cine images of the vertical and horizontal long-axis planes and the sagittal LV outflow tract view will be acquired. Short-axis cines will be obtained from the mitral valve annulus to the apex (8 mm slices with 2 mm gap). In addition, a single breath hold 3D LV short axis stack will also be acquired in the same orientation. Aortic flow will be assessed using velocity-coded phase contrast imaging. Cardiac volumes, left ventricular mass, atrial sizes and aortic root will be measured in all patients using the CMR 42 software (Circle Cardiovascular Imaging). Normal CMR reference ranges in the Asian population were recently published using standardized protocols in our Image Analysis Laboratory .
20 μl of lipid internal standard mix and 10 μl of 14:0 phosphatidylcholine will be added to 100 μl of serum in a microcentrifuge tube. After an equilibration period of 30 s, 1.2 ml of HPLC-grade methanol will be added to the mixture, followed by vortexing. The mixture will be incubated at 50 °C for 10 min, followed by centrifugation to pellet the precipitated protein. The supernatant will then be removed and placed in a clean microcentrifuge tube for drying under nitrogen gas. 100 μl of methanol will be used to reconstitute the dried extract. The reconstituted lipid solution will be separated using a LC-MS (liquid chromatography – mass spectrometry) system (Agilent 1260) and a Thermo Scientific Accucore HILIC column (100 × 2.1 mm; particle size 2.6 μm). Mobile phase A consists of acetonitrile/water (95:5) with 10 mM ammonium acetate, pH 8.0 and mobile phase B consists of acetonitrile/water (50:50) with 10 mM ammonium acetate, pH 8.0. For the separation, the column will be equilibrated with 100% mobile phase A, increasing to 20% mobile phase B in 5 min, then held for 5 min. The column will finally be re-equilibrated with 100% mobile phase A for 5 min. Finally, mass spectrometry and data acquisition will be performed using an Agilent 6430 triple-quadrupole mass spectrometer .
Whole genome sequencing will be done in Illumina HiSeq X sequencers at 30X coverage. 1.0 micrograms of DNA per sample will be used for library preparation. Sequencing libraries will be generated using the Truseq Nano DNA HT sample preparation kit (Illumina, SUA) and idenx codes will be added to each sample. Briefly, the genome will be fragmented by sonication to a size of 350 bp, and the DNA fragments will be end-polished, A-tailed and ligated with full-length adapter for Illumina sequencing with further PCR amplification. Lastly, PCR products will be purified (AMPure XP system) and the libraries analysed for size distribution by Agilent 2100 Bioanalyzer and quantified using real-time PCR.
Outcomes will be tracked 6 monthly for 20 years via review of medical records. The outcomes studied will include mortality and cause of death, myocardial infarction, stroke, malignancy, heart failure, and the development of comorbidities (e.g. ischemic heart disease, peripheral vascular disease, diabetes mellitus, renal failure, etc.). Patients provided explicit consent for the matching of outcomes against databases and registries. Information on mortality, myocardial infarction, stroke, chronic kidney disease and malignancy will be obtained from national state-mandated registries (e.g. Singapore Myocardial Infarction Registry, Singapore Stroke Registry, Singapore Renal Registry, Singapore Cancer Registry, Singapore Renal Registry, etc).
The SingHEART Steering Committee is responsible for the overall conduct of the study. The protocol design and execution of the study is entirely under the oversight of the SingHEART Steering Committee, and the funding sources have no access to patient-specific data. Because volunteers are drawn from the Biobank cohort, the SingHEART program adheres to protocols managing patient data anonymization, patient confidentiality/privacy, incidental finding management; and biosample/genomic DNA samples. Data is entered electronically into a pre-designed database by trained study coordinators and the accuracy of the data will be regularly audited. Requests to use the data or biospecimens for research will require approval from the steering committee in accordance with standardized procedures. The study team meets once every month to review study progress and to address any concerns raised by the study coordinators.
The initial target enrolment is 2500 based on feasibility and initial funding availability, with the opportunity to extend recruitment with further funding. Multiple linear regression and logistic regression analyses using generalised linear models will be performed to study the relationships between factors. Odds ratios, 95% confidence intervals and p-values will be reported. Cox-proportional hazards models will be used to analyse longer-term health outcomes and hazards ratios reported. Multivariate adjustment for confounders will be performed. Machine learning methodology appropriate for high-dimensionality datasets (e.g. deep learning systems and neural networks, classification techniques involving decision trees and probabilistic prediction) will also be performed in tandem with conventional statistical analysis.
From October 2015 to May 2017, an initial 400 subjects have been recruited and all have completed the baseline investigations as documented above. Table 2 describes the baseline characteristics of the initial subjects. Further recruitment is currently ongoing and as of February 2018, 683 patients have been recruited.
In recent times, there has been a revolution in healthcare brought about by quantum leaps in technology. This has brought with it the unprecedented opportunity to better characterise and manage diseases and improve healthcare. Different advancements include the use of “artificial intelligence” and “deep learning”, the development of molecular medicine with improvements in genomic, metabolomic and proteomic screening and the use of devices like wearables which can constantly monitor various physiological data.
As mentioned earlier, there have been several collaborations primarily in Western cohorts to tap on this technology boom to better characterise cardiovascular diseases [5,6,7]. The AHA and GLS aim to leverage on the technology and analysis by Google technical capabilities and insights offered by Google Life Sciences to analyze the various aspects of cardiovascular diseases . Project Baseline, a collaboration between Duke, Stanford and Verily, aims to collect broad phenotypic data across multiple domains including continuous sensors from thousands of participants, harnessing the power of technology to do so . The SingHEART study aims to be one of the most comprehensive phenotypic studies of cardiovascular diseases in the world. We will characterize phenotypes on multiple fronts including a) questionnaires on lifestyle, diet, exercise data, etc. b) detailed cardiac imaging and investigations including cardiac MRI, 24 h ECG and blood pressure monitoring, and calcium score c) wearables for constant physiological monitoring and d) genomic and lipidomic screening. One of the unique features of SingHEART is the pre-hoc intention to pool data across multiple domains into larger datasets suitable for analysis by machine learning algorithms. Early work from this project on the relationship between wearable data and cardiovascular risk factors in normal volunteers have recently been published based on this approach  which will provide the foundation for further insights and understanding into the development and prevention of CVD in this cohort.
Beyond the use of technology to characterize cardiovascular diseases, SingHEART affords the opportunity to better elucidate differences across different Asian ethnicities. Ethnic differences have been known to exist across the spectrum of cardiovascular diseases. With regards to heart failure (HF), the Multi-Ethnic Study of Atherosclerosis (MESA) study found the risk of incident HF to be different amongst races . Prior studies in Asian patients with HF showed increased adverse outcomes in Malays and Indians compared to Chinese . For coronary heart disease (CHD), Whites appear to have higher risk of CHD compared to blacks, Latinos and Asians . A local study found Indians to have the greatest incidence of myocardial infarction but Malays to have the highest mortality rate . SingHEART aims to understand the mechanistic causes behind these ethnic differences. Furthermore, the overlapping influence of genetics and environmental or behavioral factors on the development of disease may be difficult to tease out in the traditional setting [12, 13]. With growth of technology and genetic research, this divide has been narrowed . Studies like SingHEART with detailed genotypic, phenotypic and socioeconomic characterization will allow the opportunity to study this differentiation in greater depth.
The SingHEART project is the first contemporary population-based study in Asia that will allow us to harness up-to-date technology to better understand the development of cardiovascular disease across different ethnic groups. This will provide invaluable opportunities to develop pertinent public health prevention strategies, guide allocation of healthcare resources, improve the diagnosis and management of cardiovascular disease, and serve as a foundation for future research.
Availability of data and materials
The datasets generated and/or analyzed during the current study are not publicly available due the ongoing nature of this study, but are available from the corresponding author on reasonable request.
American Heart Association
Coronary heart disease
Cardiovascular magnetic resonance
Google life sciences
Multi-ethnic study of atherosclerosis
Magnetic resonance imaging
National Heart Centre (Singapore)
Institute of Precision Medicine
Mahmood SS, Levy D, Vasan RS, Wang TJ. The Framingham heart study and the epidemiology of cardiovascular disease: a historical perspective. Lancet. 2014;383:999–1008.
Sempos CT, Bild DE, Manolio TA. Overview of the Jackson heart study: a study of cardiovascular diseases in African American men and women. Am J Med Sci. 1999;317:142–6.
Fried LP, Borhani NO, Enright P, Furberg CD, Gardin JM, Kronmal RA, Kuller LH, Manolio TA, Mittelmark MB, Newman A, O'Leary DH. The cardiovascular health study: design and rationale. Ann Epidemiol. 1991;1:263–76.
The ARIC Investigators. The Atherosclerosis Risk in Communities (ARIC) Study: design and objectives. Am J Epidemiol. 1989;129:687–702.
American Heart Association. The American Heart Association and Google Life Sciences Announce Collaboration to Change Trajectory of Heart Disease. 2015. Available from: http://newsroom.heart.org/news/the-american-heart-association-and-google-life-sciences-announce-collaboration-to-change-trajectory-of-heart-disease.
Project Baseline. Verily launches landmark study with Duke and Stanford as First Initiative of Project Baseline. 2017. Available from: https://static.googleusercontent.com/media/www.projectbaseline.com/en//press/assets/BaselinePressRelease.pdf.
Shah SH, Granger CB, Hauser ER, Kraus WE, Sun JL, Pieper K, Nelson CL, Delong ER, Califf RM, Newby LK, MURDOCK Horizon 1 Cardiovascular Disease Investigators. Reclassification of cardiovascular risk using integrated clinical and molecular biosignatures: design of and rationale for the measurement to understand the reclassification of disease of Cabarrus and Kannapolis (MURDOCK) Horizon 1 Cardiovascular Disease Study. Am Heart J. 2010;160:371.e2–9.e2.
Bahrami H, Kronmal R, Bluemke DA, Olson J, Shea S, Liu K, Burke GL, Lima JA. Differences in the incidence of congestive heart failure by ethnicity: the multi-ethnic study of atherosclerosis. Arch Intern Med. 2008;168:2138–45.
Lee R, Chan SP, Chan YH, Wong J, Lau D, Ng K. Impact of race on morbidity and mortality in patients with congestive heart failure: a study of the multiracial population in Singapore. Int J Cardiol. 2009;134:422–5.
Rana JS, Liu JY, Moffet HH, Jaffe MG, Sidney S, Karter AJ. Ethnic differences in risk of coronary heart disease in a large contemporary population. Am J Prev Med. 2016;5:637–41.
Mak KH, Chia KS, Kark JD, Chua T, Tan C, Foong BH, Lim YL, Chew SK. Ethnic differences in acute myocardial infarction in Singapore. Eur Heart J. 2003;24:151–60.
Whitfield KE, McClearn G. Genes, environment, and race: quantitative generic approaches. Am Psychol. 2005;60:104–14.
Frierson GM, Howard EN, DeFina LE, Powell-Wiley TM, Willis BL. Effect of race and socioeconomic status on cardiovascular risk factor burden: the Cooper Center longitudinal study. Ethn Dis. 2013;23:35–42.
Ministry of Health, Singapore. Population and vital statistics. 2016. Available from: https://www.moh.gov.sg/content/moh_web/home/statistics/Health_Facts_Singapore/Population_And_Vital_Statistics.html.
Agatston AS, Janowitz WR, Hildner FJ, Zusmer NR, Viamonte M, Detrano R. Quantification of coronary artery calcium using ultrafast computed tomography. J Am Coll Cardiol. 1990;15:827–32.
Le TT, Tan RS, De Deyn M, Goh EP, Han Y, Leong BR, Cook SA, Chin CW. Cardiovascular magnetic resonance reference ranges for the heart and aorta in Chinese at 3T. J Cardiovasc Magn Reson. 2016;18:21.
Lim WK, Davila S, Teo JX, Yang C, Pua CJ, Blöcker C, Lim JQ, Ching J, Yap JJL, Tan SY, Sahlén A, Chin CW, Teh BT, Rozen SG, Cook SA, Yeo KK, Tan P. Beyond fitness tracking: the use of consumer-grade wearable data from normal volunteers in cardiovascular and lipidomics research. PLoS Biol. 2018;16:e2004285.
We would like to thank the individuals who volunteered for the study.
We would like to thank the Lee foundation for their grant support for the SingHeart study, conducted at National Heart Centre Singapore. The work was also supported by core funding from SingHealth and Duke NUS through their institute of Precision Medicine (PRISM) and a center grant awarded to National Heart Centre Singapore from the National Medical Research Council, Ministry of Health, Republic of Singapore (NMRC/CG/M006/2017_NHCS).
The funding bodies were not involved in design, conduct, data interpretation, or writing of the manuscript.
Ethics approval and consent to participate
The research study was performed in accordance with the Declaration of Helsinki, and ethical approval was obtained from the Singhealth institution review board. Written informed consent was obtained from all study participants.
Consent for publication
C.S.L. is supported by a Clinician Scientist Award from the National Medical Research Council of Singapore; has received research support from Boston Scientific, Bayer, Thermofisher, Medtronic, and Vifor Pharma; and has consulted for Bayer, Novartis, Takeda, Merck, Astra Zeneca, Janssen Research & Development, LLC, Menarini, Boehringer Ingelheim, Abbott Diagnostics, Corvia, Stealth BioTherapeutics, Roche, and Amgen. The rest of the authors have no relevant competing interest to declare.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Yap, J., Lim, W.K., Sahlén, A. et al. Harnessing technology and molecular analysis to understand the development of cardiovascular diseases in Asia: a prospective cohort study (SingHEART). BMC Cardiovasc Disord 19, 259 (2019). https://doi.org/10.1186/s12872-019-1248-3