AI in Healthcare and Epigenetics: Enhancing Disease Understanding and Treatment

the health strategist
institute for strategic health transformation 
& digital technology

Joaquim Cardoso MSc.


Chief Research and Strategy Officer (CRSO),
Chief Editor and Senior Advisor

October 17, 2023

What is the message?

The article discusses how artificial intelligence (AI), particularly machine learning (ML) and deep learning (DL), is being increasingly utilized in various aspects of medicine, and highlights the potential of AI to revolutionize healthcare.

It focuses on the intersection of AI and epigenetics, emphasizing that AI-driven approaches hold significant promise in understanding, diagnosing, and managing complex diseases, including Alzheimer’s disease, schizophrenia, and atrial fibrillation.

The article provides insights into how AI is employed to enhance medical tasks, improve disease prediction, accelerate drug discovery, and develop personalized treatment plans.

However, it also acknowledges challenges associated with the use of AI in healthcare and the need for further refinement in AI-driven methods.

In summary, the article serves as a discussion of the growing role of AI in healthcare, particularly its applications in epigenetics, and how it is poised to transform medical research, diagnosis, and treatment across various fields.

DEEP DIVE

The Potential for Artificial Intelligence Applied to Epigenetics [excerpt]

ScienceDirect

Manlio Vinciguerra MSc, PhDS

September 16, 2023

Abbreviations and Acronyms

See original publication. This is an excerpt version.

Artificial intelligence (AI) has been increasingly applied to various fields in medicine, including diagnosis, treatment, drug discovery, and health management. Although AI is a broad concept, machine learning (ML) is the application of AI into a system or machine, which helps it to self-learn and improve continually. Machine learning approaches can be supervised, semi-supervised, or unsupervised. Supervised learning is the type of ML in which computers are trained using well “labeled” training data (tagged with the correct output), and on the basis of that data, they predict the output.

Deep learning (DL) is a subset of the broader family of ML methods. Deep learning uses complex algorithms and deep neural networks to repetitively train a specific model or pattern. The term “deep” in DL reflects the use of multiple layers in the network. Deep learning approaches can employ several types of architectures or networks, such as convolutional neural networks (CNNs) in the medical field. Artificial intelligence has been found to improve the accuracy and efficiency of medical tasks and has the potential to revolutionize health care in all fields of medicine.1 Artificial intelligence has been used to improve the accuracy of medical imaging. For example, DL can analyze computed tomography scans and magnetic resonance imaging scans using CNNs to identify patterns that may indicate a specific disease.

Convolutional neural networks in turn use a mathematical operation called convolution in place of general matrix multiplication, which is specifically designed to process pixel data for image recognition and processing. This can help make faster and more accurate diagnoses. Artificial intelligence has also been used to develop predictive models for diseases such as cancer. By analyzing patient data, AI can identify risk factors, predict the likelihood of developing a disease, and develop personalized treatment plans for patients. By analyzing patient data, such as medical history and genetics, AI can identify the most effective treatment options for each patient. This can improve treatment outcomes and reduce the risk of adverse effects. Artificial intelligence has also been used to develop predictive models for treatment response.

By analyzing patient data, AI can predict how a patient will respond to a specific treatment, allowing doctors to tailor treatment plans accordingly. Artificial intelligence has been used to accelerate the drug discovery process.2 By analyzing large datasets of chemical compounds, AI can identify potential drug candidates that are more likely to be effective. This can reduce the time and cost of developing new drugs. Artificial intelligence has also been used to develop predictive models for drug toxicity.

By analyzing the chemical properties of drugs, AI can predict the likelihood of adverse effects, allowing drug developers to prioritize safer compounds.2 Artificial intelligence has also been used to develop predictive models for disease outbreaks and personalized health monitoring systems. By analyzing patient data, such as activity levels and vital signs, AI can provide personalized recommendations for improving health and preventing disease.3,4 

Overall, AI has the potential to revolutionize health care by improving the accuracy and efficiency of medical tasks, developing personalized treatment plans, accelerating the drug discovery process, and predicting disease outbreaks. However, there are also challenges associated with the use of AI in health care, such as the need for large datasets, the potential for bias, and the ethical implications of using AI to make medical decisions.

Epigenetics

Epigenetics includes 3 main heritable regulatory systems that determine chromatin remodeling and gene transcription regulation: DNA methylation, noncoding RNAs, and chromatin remodeling (histone modifications and histone variants), and the functional crosstalk among these epigenetic processes determines cell phenotype. Among epigenetic mechanisms, DNA methylation is the most commonly studied and involved in various biological processes, such as organism development, cancer, cardiovascular, and neurological disorders.5 

DNA methylation involves the transfer of a methyl group onto the C5 position of the cytosine to form 5-methylcytosine. It regulates gene expression by recruiting proteins involved in gene repression or by inhibiting the binding of transcription factors to DNA. It is controlled by DNA methyltransferases, methyl-CpG binding proteins, and other chromatin-remodeling factors. The field of genomics has been revolutionized by genome-wide association studies (GWASs), which are observational studies of a genome-wide set of genetic variants in different individuals to see if any variant is associated with a trait or a phenotype.

Genome-wide association studies typically focus on associations between single-nucleotide polymorphisms and human diseases but can equally be applied to any other genetic variant and any other organism. Similarly, in the field of epigenetics, epigenome-wide association studies (EWASs) investigate the association between a phenotype and epigenetic variants, most commonly the above mentioned DNA methylation.

The concept of EWASs was first introduced in 2011.6 Since then, the decreasing cost of measuring DNA methylation sites in EWAS and the increasing availability of bioinformatic analytical tools have contributed to the exponential rise in published EWAS.7 Accordingly, EWASs have shed light on the pathogenesis of cardiovascular,8 neurological,9 psychiatric10 and many other disorders.

Imagem de Freepik

Artificial Intelligence Applied to Epigenetics

In the AI field, DL and, in particular, CNN analysis of large-scale genetic data, such as GWASs and/or EWASs, is expected to be faster, more efficient and accurate, both for rare diseases11 and common noncommunicable diseases.

In this commentary, I will bring selected examples of the use of AI, ML, and DL approaches to identify and study the molecular pathogenesis of 3 common noncommunicable diseases with a polygenic component: schizophrenia (SZ), Alzheimer disease (AD), and atrial fibrillation (AF).

Alzheimer disease is an age-associated, neurodegenerative disorder. It is the leading cause of dementia and a major public health problem worldwide. It is a complex illness due to not fully elucidated environmental and genetic factors with a heritability of approximately 75%.12 Multiple AD EWAS have identified differential methylated DNA and new genes associated with AD in different regions of the human brain131415; however with limited depth and technical challenges.

Huang et al16 recently developed EWASplus, a computational method that uses a supervised ML strategy to extend EWAS coverage to the entire genome. Application of EWASplus to 6 AD-related traits predicted hundreds of new significant brain CpGs associated with AD. Genes found near top ranked EWASplus loci are enriched for kinases and for genes previously shown to be physically interacting with known AD genes.16 Similar approaches have been employed to predict AD progression.

Chen et al17 benchmarked on longitudinal DNA methylation data collected from the peripheral blood in the Alzheimer’s Disease Neuroimaging Initiative, and proposed new multi-task deep autoencoders, belonging to multi-task learning (a sub-field of ML) that outperform other ML approaches for predicting AD progression.

Chen et al17 recently showed that it is possible to predict AD on the basis of a deep neural network by integrating gene expression and DNA methylation datasets. The most challenging problem in constructing a model to diagnose AD based on a multiomics dataset (but this is valid for any disease) is how to integrate different omics data and how to deal with high-dimensional and low-sample-size data. In their study, Chen et al17 proposed to reduce the number of features based on a differentially expressed gene and a differentially methylated position in the multiomics dataset.

They started using 2 previously published large-scale gene expression profiles and 1 large DNA methylation profile from the postmortem prefrontal cortex of hundreds of individuals. They thus developed a deep neural network-based prediction model that improves performance compared to that of conventional ML algorithms.17 The model finally included 35 genes selected that had discriminative ability to classify the AD status compared to the health status.17 Integrating gene expression and DNA methylation data could improve prediction accuracy.

Schizophrenia is a mental disorder characterized by continuous or relapsing episodes of psychosis. Schizophrenia affects approximately 24 million people, or 1 in 300 people (0.32%) worldwide.18 Males are more often affected and, on average, have an earlier onset. The causes of SZ include genetic and environmental factors. However, the epigenetic dysregulation contributing to the etiology of SZ is unclear. In particular, cell type-specificity of DNA methylation makes population-based epigenetic studies of SZ challenging.

A few studies analyzed whether a blood marker of epigenetic risk for SZ could be derived that is specific for the disease and predicts disease-associated brain function. In a case-control study conducted from 2008 to 2018 in sites in Germany, the United Kingdom, the United States, and Australia, blood DNA methylation data (quantified as poly-methylation score) from whole-blood samples of 7488 participants, of whom 3158 received a diagnosis of SZ, together with GWAS and neuroimaging, were analyzed using ML approaches.

The poly-methylation score signature was significantly associated with SZ across independent data sets (area under the curve [AUC]=0.69-0.78; P=.049-1.24 × 10−7) but not with other psychiatric ailments such as major depressive disorder (AUC=0.51; P=.16), autism (AUC=0.53; P=.66), or bipolar disorder (AUC=0.58; P=.21).19 In the latter work, an updated biologically informed ML approach for epigenetic fingerprints was used. Biologically informed ML is a 2-stage ML approach that first compresses data from individual DNA methylation sites into a pathway-level feature. Then, a second-stage algorithm integrates these pathway-level features into a system-level classifier.

In another recent report, DNA methylation data on whole blood from 414 SZ cases and 433 nonpsychiatric controls were used as training data for a ML-classification algorithm with built-in feature selection, sparse partial least squares discriminate analysis, to calculate a “risk distance” to identify individuals with the highest probability of SZ. The model was then evaluated on an independent data set of 353 SZ cases and 322 nonpsychiatric controls. This model classified 303 individuals as cases with a positive predictive value of 80%, far surpassing the performance of a model based on polygenic risk score, and it was not associated with medication use.20 These AI/ML-based results indicate that systemic epigenetic variants may classify patients with SZ accurately.

Atrial fibrillation is an abnormal heart rhythm (arrhythmia) characterized by the rapid and irregular beating of the atrial chambers of the heart. It is a type of supraventricular tachycardia. In Europe and North America, as of 2014, it affects approximately 2%-3% of the population. Atrial fibrillation is associated with an increased risk of heart failure, dementia, and stroke. In fact, AF is the cause of 20%-30% of all ischemic strokes, and patients with AF have a 5-fold increased risk of developing an ischemic stroke. Application of a CNN analysis to a multiethnic AF network GWAS for early-onset AF, including 6358 subjects from 4 independent cohorts (Korean, Japanese, European, and multiethnic), led to moderate-to-high predictive power and assigned a high saliency score for PITX2 among the AF associated single-nucleotide polymorphisms.21 

Libiseller-Egger et al23 used a previously published CNN22 to predict the cardiovascular age of 36,349 participants of the UK Biobank from their electrocardiograms, and performed a GWAS on the difference between predicted and chronological age (delta age). The analysis identified 8 loci associated with delta age, including SCN5A and TTN.23 Interestingly, PITX2 (a transcription factor that plays a critical role in early development), SCN5A (an integral membrane protein and a voltage-gated sodium channel subunit), and TTN (a giant protein that functions as a molecular spring that is responsible for the passive elasticity of muscle) have been previously and consistently associated to AF by conventional non AI-based genetic linkage studies.24

In summary, AI/ML/DL-driven approaches described in this commentary are increasingly robust and useful tools for detecting common polygenic diseases that affect our communities. By capturing the cumulative effects and interactions with the epigenome, AI finds patterns in constantly growing genetic and epigenetic data sets that relate to the development of diseases (Figure). In particular, DL methods are accurate but will need to undergo further refinement before clinicians can confidently use such tools.

Figure. Integration of genomic and epigenomics data by multiomic analyses bioinformatics tools and modeling by artificial intelligence (AI)-dependent approaches is a promising avenue to identify new therapeutic targets, to be tested at the preclinical and clinical level. DL, deep learning; ML, machine learning.

References

See original publication. This is an excerpt version.

Author and Affiliation

Manlio Vinciguerra MSc, PhD – Department of Translational Stem Cell Biology, Research Institute of the Medical University, Varna, BulgariaLiverpool Centre for Cardiovascular Science, Liverpool John Moores University, Liverpool, United Kingdom

Originally published at https://www.sciencedirect.com

Total
0
Shares
Deixe um comentário

O seu endereço de e-mail não será publicado. Campos obrigatórios são marcados com *

Related Posts

Subscribe

PortugueseSpanishEnglish
Total
0
Share