Structured risk assessment and stratification is an essential part of modern cardiology. The European Society of Cardiology (ESC) Guidelines for Preventive Cardiology (1) recommend the use of formal stratification tools to classify subjects according to their risk of a first or subsequent cardiovascular disease (CVD) event, as do guidelines for patients with established CVD, including ST-elevation myocardial infarction and atrial fibrillation (AF) (2,3). Treatment is then started, or its intensity modified, according to the estimated risk.
Despite these recommendations, it has been observed that risk assessment tools ‘are not adequately implemented in clinical practice’ (1). One might speculate on the reasons why, but it seems quite natural that the performance of the recommended instruments is a relevant factor. For example, the concordance (C) statistics for the CHA2DS2-VASc score for the prediction of ischemic stroke in AF (4) and for the SMART score for the prediction of the 10-year risk of vascular complications in patients with CVD (5) did not exceed 0.70, which can be classified as ‘modest’ (the C-statistic is the probability that the model assigns a higher predicted risk to a randomly selected subject who experiences the event than to one who does not). Other risk scores, such as the GRACE score for death/myocardial infarction after acute coronary syndrome admission, have better discriminatory power (C-statistic 0.73-0.77) (6), but there is clearly room for improvement.
Multiple factors contribute to variation in the performance of risk prediction instruments, including, but not limited to, the population and endpoint of interest, the sample size, the number of potential predictive variables, and the analytic complexity of the data. Risk stratification tools in cardiology are usually derived from classical regression analyses of (large) routine clinical practice data sets. The outcome to be predicted (the ‘dependent’ variable Y) is then modelled as a function of a series of selected, predefined predictor (or ‘independent’) variables (X). This supervised approach is straightforward from a statistical point of view and results in a transparent model. The relations between the X’s and Y are estimated by regression coefficients (‘betas’) that can easily be understood by the end users of the model.
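To make the model form concrete, a logistic regression, the workhorse behind many such scores, relates the predictors to the predicted risk as follows (standard notation; this generic equation is illustrative and not taken from any specific score):

```latex
% Predicted probability of the outcome Y for a subject with predictor
% values X_1, ..., X_p; beta_0 is the intercept and beta_1, ..., beta_p
% are the regression coefficients ('betas') estimated from the data.
P(Y = 1 \mid X_1, \dots, X_p)
  = \frac{1}{1 + \exp\!\left[-\left(\beta_0 + \beta_1 X_1 + \dots + \beta_p X_p\right)\right]}
```

Each beta can be read directly as the change in the log-odds of the outcome per unit change in the corresponding predictor, which is what makes such models transparent to their end users.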
Goldstein et al. recently argued that the field should move beyond these regression techniques and apply machine learning (ML) instead, ‘to address analytic challenges’ (7). They argue that the performance of ‘classical’ regression is suboptimal in the case of non-linear X-Y relationships, dependency of X-Y relationships on other X’s (interactions), and when many X’s are present. Indeed, ML techniques such as ridge regression and LASSO regression, or the ‘Nearest Neighbour’ method, are useful alternatives to overcome these challenges (7), as sketched below. My personal view is that the application of ML techniques to risk assessment in cardiology goes hand-in-hand with the exploration of ‘Big Data’.
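A minimal sketch of such a comparison, using scikit-learn on synthetic data (the cohort, parameter settings, and variable names are my own illustrative assumptions, not taken from the cited work): penalized variants such as ridge (L2) and LASSO (L1) regression shrink the betas, which tends to stabilize models when many X’s are present.

```python
# Minimal sketch (illustrative assumptions, not the analysis of any cited
# study): fit an unpenalized logistic model and ridge-/LASSO-penalized
# variants on synthetic data, then compare their C-statistics.
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import roc_auc_score
from sklearn.model_selection import train_test_split

# Synthetic 'cohort': 2000 subjects, 50 candidate predictors (X's), of
# which only 8 truly carry signal -- the 'many X's' setting discussed above.
X, y = make_classification(n_samples=2000, n_features=50, n_informative=8,
                           random_state=0)
X_train, X_test, y_train, y_test = train_test_split(X, y, random_state=0)

models = {
    "unpenalized": LogisticRegression(penalty=None, max_iter=5000),  # scikit-learn >= 1.2
    "ridge (L2)":  LogisticRegression(penalty="l2", C=1.0, max_iter=5000),
    "LASSO (L1)":  LogisticRegression(penalty="l1", C=1.0, solver="liblinear"),
}

for name, model in models.items():
    model.fit(X_train, y_train)
    # For a binary outcome, the C-statistic equals the area under the ROC
    # curve computed from the predicted probabilities.
    c_stat = roc_auc_score(y_test, model.predict_proba(X_test)[:, 1])
    print(f"{name}: C-statistic = {c_stat:.3f}")
```

On real registry data one would, of course, tune the penalty strength (the C parameter above) by cross-validation rather than fixing it a priori.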

[Figure: reproduced with permission from Simoons et al. Eur Heart J 2002;23:1148-52.]