Centenarian Controls Increase Variant Effect-sizes by an average two-fold in an Extreme Case-Extreme Control Analysis of Alzheimer’s Disease

Mapping Intimacies ◽

10.1101/298018 ◽

2018 ◽

Author(s):

Niccolò Tesi ◽

Sven J. van der Lee ◽

Marc Hulsman ◽

Iris E. Jansen ◽

Najada Stringa ◽

...

Keyword(s):

Alzheimer's Disease ◽

Effect Size ◽

Genetic Variants ◽

Association Studies ◽

Effect Sizes ◽

Small Samples ◽

Control Analysis ◽

Genome Wide Association Studies ◽

Variant Effect

Abstract: The detection of genetic loci associated with Alzheimer’s disease (AD) requires large numbers of cases and controls because variant effect-sizes are mostly small. We hypothesized that variant effect-sizes should increase when individuals who represent the extreme ends of a disease spectrum are considered, as their genomes are assumed to be maximally enriched or depleted with disease-associated genetic variants. We used 1,073 extensively phenotyped AD cases with relatively young age at onset as extreme cases (66.3±7.9 years), 1,664 age-matched controls (66.0±6.5 years) and 255 cognitively healthy centenarians as extreme controls (101.4±1.3 years). We estimated the effect-size of 29 variants that were previously associated with AD in genome-wide association studies. Comparing extreme AD-cases with centenarian-controls increased the variant effect-size relative to published effect-sizes by on average 1.90-fold (SE=0.29, p=9.0×10⁻⁴). The effect-size increase was largest for the rare high-impact TREM2 (R74H) variant (6.5-fold), and significant for variants in/near ECHDC3 (4.6-fold), SLC24A4-RIN3 (4.5-fold), NME8 (3.8-fold), PLCG2 (3.3-fold), APOE-ε2 (2.2-fold) and APOE-ε4 (2.0-fold). Comparing extreme phenotypes enabled us to replicate the AD association for 10 variants (p<0.05) in relatively small samples. The increase in effect-sizes depended mainly on using centenarians as extreme controls: the average variant effect-size was not increased in a comparison of extreme AD cases and age-matched controls (0.94-fold, p=6.8×10⁻¹), suggesting that on average the tested genetic variants did not explain the extremity of the AD-cases. In conclusion, using centenarians as extreme controls in AD case-control studies boosts the variant effect-size by on average two-fold, allowing the replication of disease-association in relatively small samples.
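The "fold" figures in the abstract above compare an effect size observed in one sample with a previously published one. A minimal sketch of that ratio follows, assuming the effect size is the log odds ratio from a 2×2 allele-count table; the abstract does not state the measure used, and the function names and counts below are hypothetical, not taken from the study.

```python
import math

def log_odds_ratio(case_alt, case_ref, control_alt, control_ref):
    """Log odds ratio of carrying the alternate allele in cases vs. controls."""
    return math.log((case_alt * control_ref) / (case_ref * control_alt))

def effect_size_fold_change(observed_log_or, published_log_or):
    """Ratio of the observed effect size to the published effect size."""
    return observed_log_or / published_log_or

# Hypothetical allele counts, for illustration only (not data from the study).
observed = log_odds_ratio(case_alt=300, case_ref=700, control_alt=100, control_ref=900)
published = math.log(1.5)  # hypothetical published odds ratio of 1.5
fold = effect_size_fold_change(observed, published)
print(f"observed log(OR) = {observed:.3f}, fold change = {fold:.2f}")
```

A fold change above 1 means the variant looks stronger in the new comparison than in the published one; a value near 1 means no change.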

Genetic variants influencing human aging from late-onset Alzheimer's disease (LOAD) genome-wide association studies (GWAS)

Neurobiology of Aging ◽

10.1016/j.neurobiolaging.2012.02.014 ◽

2012 ◽

Vol 33 (8) ◽

pp. 1849.e5-1849.e18 ◽

Cited By ~ 23

Author(s):

Hui Shi ◽

Olivia Belbin ◽

Christopher Medway ◽

Kristelle Brown ◽

Noor Kalsheker ◽

...

Keyword(s):

Alzheimer's Disease ◽

Genetic Variants ◽

Late Onset ◽

Association Studies ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Human Aging ◽

Genome Wide

Predicting late-onset Alzheimer’s disease from genomic data using deep neural networks

10.1101/629402 ◽

2019 ◽

Author(s):

Javier de Velasco Oriol ◽

Edgar E. Vallejo ◽

Karol Estrada ◽

Keyword(s):

Neural Networks ◽

Alzheimer's Disease ◽

Genetic Variants ◽

Deep Neural Networks ◽

Late Onset ◽

Association Studies ◽

Genome Wide Association Studies ◽

Clinical Markers ◽

Genome Wide

Abstract: Alzheimer’s disease (AD) is the leading form of dementia. Over 25 million cases have been estimated worldwide and this number is predicted to increase two-fold every 20 years. Even though a variety of clinical markers is available for the diagnosis of AD, the accurate and timely diagnosis of this disease remains elusive. Recently, over a dozen genetic variants predisposing to the disease have been identified by genome-wide association studies. However, these genetic variants explain only a small fraction of the estimated genetic component of the disease. Therefore, useful predictions of AD from genetic data cannot rely on these markers exclusively, as they are not sufficiently informative predictors. In this study, we propose the use of deep neural networks for the prediction of late-onset Alzheimer’s disease from a large number of genetic variants. Experimental results indicate that the proposed model holds promise to produce useful predictions for clinical diagnosis of AD.

Lipid associated polygenic enrichment in Alzheimer’s disease

10.1101/383844 ◽

2018 ◽

Author(s):

Iris J. Broce ◽

Chin Hong Tan ◽

Chun Chieh Fan ◽

Aree Witoelar ◽

Natalie Wen ◽

...

Keyword(s):

Alzheimer's Disease ◽

Genetic Variants ◽

Plasma Lipids ◽

Association Studies ◽

Density Lipoprotein ◽

Genome Wide Association Studies ◽

Nucleotide Polymorphisms ◽

Genetic Pleiotropy ◽

Common Genetic Variants

Abstract: Cardiovascular (CV) and lifestyle associated risk factors (RFs) are increasingly recognized as important for Alzheimer’s disease (AD) pathogenesis. Beyond the ε4 allele of apolipoprotein E (APOE), comparatively little is known about whether CV associated genes also increase risk for AD (genetic pleiotropy). Using large genome-wide association studies (GWASs) (total n > 500,000 cases and controls) and validated tools to quantify genetic pleiotropy, we systematically identified single nucleotide polymorphisms (SNPs) jointly associated with AD and one or more CV RFs, namely body mass index (BMI), type 2 diabetes (T2D), coronary artery disease (CAD), waist hip ratio (WHR), total cholesterol (TC), low-density (LDL) and high-density lipoprotein (HDL). In fold enrichment plots, we observed robust genetic enrichment in AD as a function of plasma lipids (TC, LDL, and HDL); we found minimal AD genetic enrichment conditional on BMI, T2D, CAD, and WHR. Beyond APOE, at conjunction FDR < 0.05 we identified 57 SNPs on 19 different chromosomes that were jointly associated with AD and CV outcomes including APOA4, ABCA1, ABCG5, LIPG, and MTCH2/SPI1. We found that common genetic variants influencing AD are associated with multiple CV RFs, at times with a different directionality of effect. Expression of these AD/CV pleiotropic genes was enriched for lipid metabolism processes, over-represented within astrocytes and vascular structures, highly co-expressed, and differentially altered within AD brains. Beyond APOE, we show that the polygenic component of AD is enriched for lipid associated RFs. Rather than a single causal link between genetic loci, RF and the outcome, we found that common genetic variants influencing AD are associated with multiple CV RFs. Our collective findings suggest that a network of genes involved in lipid biology also influence Alzheimer’s risk.

Functional Genetic Biomarkers of Alzheimer’s Disease and Gene Expression from Peripheral Blood

10.1101/2021.01.15.426891 ◽

2021 ◽

Author(s):

Andrew Ni ◽

Amish Sethi ◽

Keyword(s):

Gene Expression ◽

Alzheimer's Disease ◽

Peripheral Blood ◽

Genetic Variants ◽

Cell Activation ◽

Association Studies ◽

Gene Set Enrichment Analysis ◽

Machine Learning Techniques ◽

Genome Wide Association Studies

Abstract: Detecting Alzheimer’s Disease (AD) at the earliest possible stage is key in advancing AD prevention and treatment but is challenged by normal aging processes in addition to other confounding neurodegenerative diseases. Recent genome-wide association studies (GWAS) have identified associated alleles, but it has been difficult to transition from non-coding genetic variants to underlying mechanisms of AD. Here, we sought to reveal functional genetic variants and diagnostic biomarkers underlying AD using machine learning techniques. We first developed a Random Forest (RF) classifier using microarray gene expression data sampled from the peripheral blood of 744 participants in the Alzheimer’s Disease Neuroimaging Initiative (ADNI) cohort. After initial feature selection, 5-fold cross-validation of the 100-gene RF classifier achieved an accuracy of 99.04%. The high accuracy of the RF classifier supports the possibility of a powerful and minimally invasive tool for screening of AD. Next, unsupervised clustering was used to validate and identify relationships among the differentially expressed genes (DEGs) selected by the RF, revealing 3 distinct AD clusters. Results suggest downregulation of global sulfatase and oxidoreductase activities in AD through mutations in SUMF1 and SMOX respectively. Then, we used Greedy Fast Causal Inference (GFCI) to find potential causes of AD within DEGs. In the causal graph, HLA-DPB1 and CYP4A11 emerge as hub genes, furthering the discussion of the immune system’s role in AD. Finally, we used Gene Set Enrichment Analysis (GSEA) to determine the biological pathways and processes underlying the DEGs that were highly correlated with AD. Cell activation in the immune system, glycosaminoglycan (GAG) binding, vascular dysfunction, oxidative stress, and the neuronal apoptotic process were revealed to be significantly enriched in AD. This study further advances the possibility of low-cost and noninvasive genetic screening for AD while also providing potential gene targets for further experimentation.
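The 5-fold cross-validation mentioned in the abstract above can be sketched in plain Python as follows. This is only an illustration of the evaluation scheme: the helper names are hypothetical, and the majority-class classifier is a stand-in for demonstration, not the Random Forest or the data used by the authors.

```python
import random

def k_fold_indices(n_samples, k=5, seed=0):
    """Shuffle sample indices and deal them into k disjoint folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::k] for i in range(k)]

def cross_validated_accuracy(features, labels, fit, predict, k=5, seed=0):
    """Train on k-1 folds, score on the held-out fold, and average the k accuracies."""
    accuracies = []
    for held_out in k_fold_indices(len(labels), k, seed):
        held = set(held_out)
        train = [i for i in range(len(labels)) if i not in held]
        model = fit([features[i] for i in train], [labels[i] for i in train])
        correct = sum(predict(model, features[i]) == labels[i] for i in held_out)
        accuracies.append(correct / len(held_out))
    return sum(accuracies) / k

# Stand-in classifier (assumption): always predicts the most common training label.
def fit_majority(train_features, train_labels):
    return max(set(train_labels), key=train_labels.count)

def predict_majority(model, sample):
    return model

# Hypothetical toy data, for illustration only.
features = [[0.1 * i] for i in range(20)]
labels = [1] * 15 + [0] * 5
accuracy = cross_validated_accuracy(features, labels, fit_majority, predict_majority)
print(f"mean 5-fold accuracy: {accuracy:.2f}")
```

Any classifier exposing a `fit`/`predict` pair of this shape could be passed in place of the stand-in.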

P1-262: Genetic Variants Influencing Human Longevity from Late-Onset Alzheimer's Disease (LOAD) Genome-Wide Association Studies (GWAS)

Alzheimer's & Dementia ◽

10.1016/j.jalz.2011.05.542 ◽

2011 ◽

Vol 7 ◽

pp. S195-S195

Author(s):

Hui Shi ◽

Christopher Medway ◽

Kristelle Brown ◽

Noor Kalsheker ◽

Alison Goate ◽

...

Keyword(s):

Alzheimer's Disease ◽

Genetic Variants ◽

Late Onset ◽

Association Studies ◽

Genome Wide Association ◽

Human Longevity ◽

Genome Wide Association Studies ◽

Genome Wide

Centenarian controls increase variant effect sizes by an average twofold in an extreme case–extreme control analysis of Alzheimer’s disease

European Journal of Human Genetics ◽

10.1038/s41431-018-0273-5 ◽

2018 ◽

Vol 27 (2) ◽

pp. 244-253 ◽

Cited By ~ 15

Author(s):

Niccolò Tesi ◽

Sven J. van der Lee ◽

Marc Hulsman ◽

Iris E. Jansen ◽

Najada Stringa ◽

...

Keyword(s):

Alzheimer's Disease ◽

Extreme Case ◽

Effect Sizes ◽

Control Analysis ◽

Variant Effect

Deep learning-based identification of genetic variants: Application to Alzheimer's disease classification

10.1101/2021.07.19.21260789 ◽

2021 ◽

Author(s):

Taeho Jo ◽

Kwangsik Nho ◽

Paula Bice ◽

Andrew J Saykin

Keyword(s):

Alzheimer's Disease ◽

Deep Learning ◽

Genetic Variants ◽

Association Studies ◽

Classification Model ◽

High Dimensional ◽

Optimal Size ◽

Genome Wide Association Studies ◽

Genome Wide

Deep learning is a promising tool that uses nonlinear transformations to extract features from high-dimensional data. Although deep learning has been used in several genetic studies, it is challenging in genome-wide association studies (GWAS) with high-dimensional genomic data. Here we propose a novel three-step approach for identification of genetic variants using deep learning to identify phenotype-related single nucleotide polymorphisms (SNPs) and develop accurate classification models. In the first step, we divided the whole genome into non-overlapping fragments of an optimal size and then ran Convolutional Neural Network (CNN) on each fragment to select phenotype-associated fragments. In the second step, using an overlapping window approach, we ran CNN on the selected fragments to calculate phenotype influence scores (PIS) and identify phenotype-associated SNPs based on PIS. In the third step, we ran CNN on all identified SNPs to develop a classification model. We tested our approach using genome-wide genotyping data for Alzheimer's disease (AD) (N=981; cognitively normal older adults (CN) =650 and AD=331). Our approach identified the well-known APOE region as the most significant genetic locus for AD. Our classification model achieved an area under the curve (AUC) of 0.82, which outperformed traditional machine learning approaches, Random Forest and XGBoost. By using a novel deep learning-based GWAS approach, we were able to identify AD-associated SNPs and develop a better classification model for AD.
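The first two steps described above depend on how the genome is cut up: non-overlapping fragments first, then overlapping windows inside the selected fragments. A minimal sketch of that index bookkeeping follows; the function names, fragment size, window size, and stride are hypothetical, and the CNN scoring and phenotype influence score (PIS) computation are not shown because the abstract does not specify them.

```python
def non_overlapping_fragments(n_snps, fragment_size):
    """Step 1 bookkeeping: split SNP indices [0, n_snps) into consecutive fragments."""
    return [(start, min(start + fragment_size, n_snps))
            for start in range(0, n_snps, fragment_size)]

def overlapping_windows(start, end, window_size, stride):
    """Step 2 bookkeeping: slide a window across one selected fragment.

    Windows overlap when stride < window_size. A final window is added
    if the regular steps would leave the end of the fragment uncovered.
    """
    if end - start <= window_size:
        return [(start, end)]
    windows = [(s, s + window_size)
               for s in range(start, end - window_size + 1, stride)]
    if windows[-1][1] < end:
        windows.append((end - window_size, end))
    return windows

# Hypothetical sizes, for illustration only.
fragments = non_overlapping_fragments(n_snps=10, fragment_size=4)
windows = overlapping_windows(0, 10, window_size=4, stride=3)
print(fragments)
print(windows)
```

Each `(start, end)` pair is a half-open index range, so a model would be run on the SNP columns `start` through `end - 1`.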

Alzheimer's disease variant portal (ADVP): a catalog of genetic findings for Alzheimer's disease

10.1101/2020.09.29.20203950 ◽

2020 ◽

Author(s):

Pavel P Kuksa ◽

Chia-Lun Lui ◽

Wei Fu ◽

Liming Qu ◽

Yi Zhao ◽

...

Keyword(s):

Alzheimer's Disease ◽

Functional Genomics ◽

Genetic Variants ◽

Disease Risk ◽

Association Studies ◽

Genome Wide Association Studies ◽

Genetic Associations ◽

Genome Wide ◽

Disease Variant

Background: Alzheimer's disease (AD) genetic findings span progressively larger genome-wide association studies (GWASs) for various outcomes and populations. These genetic findings are obtained from single GWASs or from joint or meta-analyses of multiple GWAS datasets. However, no single resource provides harmonized and searchable information on all AD genetic associations obtained from these analyses, or links the identified genetic variants and reported genes with other supporting functional genomic evidence. Methods: We created the Alzheimer's Disease Variant Portal (ADVP), which provides unified access to a uniquely extensive collection of high-quality GWAS association results for AD. Records in ADVP are curated from the genome-wide significant and suggestive loci reported in AD genetics literature. ADVP contains curated results from all AD GWAS publications by the Alzheimer's Disease Genetics Consortium (ADGC) since 2009 and AD GWAS publications identified from other public catalogs (GWAS catalog). Genetic association information was systematically extracted from these publications, harmonized, and organized into three types of tables. These tables included structured publication, variant, and association categories to ensure consistent representation of all AD genetic findings. All extracted AD genetic associations were further annotated and integrated with NIAGADS Genomics DB in order to provide extensive biological and functional genomics annotations. Results: Currently, ADVP contains 6,990 AD-association records curated from >200 AD GWAS publications corresponding to >900 unique genomic loci and >1,800 unique genetic variants. The ADVP collection contains genetic findings from >80 cohorts and across various populations, including Caucasians, Hispanics, African-Americans, and Asians. Of all the association records, 46% are disease-risk, 13% are related to expression quantitative trait analyses, and 27% are related to AD endophenotypes and neuropathology. The ADVP web interface allows accessing AD association records by individual variants, genes, publications, genomic regions of interest, and genome-wide interactive variant views. ADVP is integrated with the NIAGADS Alzheimer's Genomics Database. Researchers can explore additional biological annotations at the genetic variant or gene level and view cross-reference functional genomics evidence provided by other public resources. Conclusions: ADVP is the largest, most up-to-date, and comprehensive literature-derived collection of AD genetic associations. All records have been systematically curated, harmonized, and comprehensively annotated. ADVP is freely accessible at https://advp.niagads.org/.

A Systems Biology Approach for Hypothesizing the Effect of Genetic Variants on Neuroimaging Features in Alzheimer’s Disease

Journal of Alzheimer's Disease ◽

10.3233/jad-201397 ◽

2021 ◽

Vol 80 (2) ◽

pp. 831-840

Author(s):

Sepehr Golriz Khatami ◽

Daniel Domingo-Fernández ◽

Sarah Mubeen ◽

Charles Tapley Hoyt ◽

Christine Robinson ◽

...

Keyword(s):

Alzheimer's Disease ◽

Genetic Variants ◽

Large Scale ◽

Multiple Scales ◽

Association Studies ◽

Hippocampal Atrophy ◽

Genome Wide Association Studies ◽

Biological Processes ◽

Functional Interpretation

Background: Neuroimaging markers provide quantitative insight into brain structure and function in neurodegenerative diseases, such as Alzheimer’s disease, where we lack mechanistic insights to explain pathophysiology. These mechanisms are often mediated by genes and genetic variations and are often studied through the lens of genome-wide association studies. Linking these two disparate layers (i.e., imaging and genetic variation) through causal relationships between biological entities involved in the disease’s etiology would pave the way to large-scale mechanistic reasoning and interpretation. Objective: We explore how genetic variants may lead to functional alterations of intermediate molecular traits, which can further impact neuroimaging hallmarks over a series of biological processes across multiple scales. Methods: We present an approach in which knowledge pertaining to single nucleotide polymorphisms and imaging readouts is extracted from the literature, encoded in Biological Expression Language, and used in a novel workflow to assist in the functional interpretation of SNPs in a clinical context. Results: We demonstrate our approach in a case scenario which proposes KANSL1 as a candidate gene that accounts for the clinically reported correlation between the incidence of the genetic variants and hippocampal atrophy. We find that the workflow prioritizes multiple mechanisms reported in the literature through which KANSL1 may have an impact on hippocampal atrophy such as through the dysregulation of cell proliferation, synaptic plasticity, and metabolic processes. Conclusion: We have presented an approach that enables pinpointing relevant genetic variants as well as investigating their functional role in biological processes spanning across several, diverse biological scales.

Multi-Omic Analyses Characterize the Ceramide/Sphingomyelin Pathway as a Therapeutic Target in Alzheimer's Disease

10.1101/2021.07.16.21260601 ◽

2021 ◽

Author(s):

Priyanka Baloni ◽

Matthias Arnold ◽

Herman Moreno ◽

Kwangsik Nho ◽

Luna Buitrago ◽

...

Keyword(s):

Alzheimer's Disease ◽

Genetic Variants ◽

Metabolic Flux ◽

Association Studies ◽

Flux Analysis ◽

Genome Wide Association Studies ◽

Imaging Features ◽

Genome Wide ◽

Lipid Species

Dysregulation of sphingomyelin (SM) and ceramide metabolism have been implicated in Alzheimer's Disease (AD). Genome-wide and transcriptome wide association studies have identified various genes and genetic variants in lipid metabolism that are associated with AD. However, the molecular mechanisms of sphingomyelin and ceramide disruption remain to be determined. Evaluation of peripheral lipidomic profiles is useful in providing perspective on metabolic dysregulation in preclinical and clinical AD states. In this study, we focused on the sphingolipid pathway and carried out multi-omic analyses to identify central and peripheral metabolic changes in AD patients and correlate them to imaging features and cognitive performance in amyloidogenic mouse models. Our multi-omic approach was based on (a) 2114 human post-mortem brain transcriptomics to identify differentially expressed genes; (b) in silico metabolic flux analysis on 1708 context-specific metabolic networks to identify differential reaction fluxes; (c) multimodal neuroimaging analysis on 1576 participants to associate genetic variants in SM pathway with AD pathogenesis; (d) plasma metabolomic and lipidomic analysis to identify associations of lipid species with dysregulation in AD; (e) metabolite genome-wide association studies (mGWAS) to define receptors within pathway as potential drug target. Our findings from complementary approaches suggested that depletion of S1P compensated for AD cellular pathology, likely by upregulating the SM pathway, suggesting that modulation of S1P signaling may have protective effects in AD. We tested this hypothesis in APP/PS1 mice and showed that prolonged exposure to fingolimod, an S1P signaling modulator approved for treatment of multiple sclerosis, alleviated the cognitive impairment in mice. Our multi-omic approach identified potential targets in the SM pathway and suggested modulators of S1P metabolism as possible candidates for AD treatment.
