scholarly journals Transcript expression-aware annotation improves rare variant discovery and interpretation

2019 ◽  
Author(s):  
Beryl B. Cummings ◽  
Konrad J. Karczewski ◽  
Jack A. Kosmicki ◽  
Eleanor G. Seaby ◽  
Nicholas A. Watts ◽  
...  

AbstractThe acceleration of DNA sequencing in patients and population samples has resulted in unprecedented catalogues of human genetic variation, but the interpretation of rare genetic variants discovered using such technologies remains extremely challenging. A striking example of this challenge is the existence of disruptive variants in dosage-sensitive disease genes, even in apparently healthy individuals. Through manual curation of putative loss of function (pLoF) variants in haploinsufficient disease genes in the Genome Aggregation Database (gnomAD)(1), we show that one explanation for this paradox involves alternative mRNA splicing, which allows exons of a gene to be expressed at varying levels across cell types. Currently, no existing annotation tool systematically incorporates this exon expression information into variant interpretation. Here, we develop a transcript-level annotation metric, the proportion expressed across transcripts (pext), which summarizes isoform quantifications for variants. We calculate this metric using 11,706 tissue samples from the Genotype Tissue Expression project(2) (GTEx) and show that it clearly differentiates between weakly and highly evolutionarily conserved exons, a proxy for functional importance. We demonstrate that expression-based annotation selectively filters 22.8% of falsely annotated pLoF variants found in haploinsufficient disease genes in gnomAD, while removing less than 4% of high-confidence pathogenic variants in the same genes. Finally, we apply our expression filter to the analysis of de novo variants in patients with autism spectrum disorder (ASD) and developmental disorders and intellectual disability (DD/ID) to show that pLoF variants in weakly expressed regions have effect sizes similar to those of synonymous variants, while pLoF variants in highly expressed exons are most strongly enriched among cases versus controls. Our annotation is fast, flexible, and generalizable, making it possible for any variant file to be annotated with any isoform expression dataset, and will be valuable for rare disease diagnosis, rare variant burden analyses in complex disorders, and curation and prioritization of variants in recall-by-genotype studies.

2021 ◽  
Author(s):  
Brennan H Baker ◽  
Shaoyi Zhang ◽  
Jeremy M Simon ◽  
Sarah M McLarnan ◽  
Wendy K Chung ◽  
...  

De novo mutations contribute to a large proportion of sporadic psychiatric and developmental disorders, yet the potential role of environmental carcinogens as drivers of causal de novo mutations in neurodevelopmental disorders is poorly studied. We demonstrate that several mutagens, including polycyclic aromatic hydrocarbons (PAHs), disproportionately mutate genes related to neurodevelopmental disorders including autism spectrum disorders (ASD), schizophrenia, and attention deficit hyperactivity disorder (ADHD). Other disease genes including amyotrophic lateral sclerosis (ALS), Alzheimers disease, congenital heart disease, orofacial clefts, and coronary artery disease were generally not mutated more than expected. Our findings support a new paradigm of neurodevelopmental disease etiology driven by a contribution of environmentally induced rather than random mutations.


2018 ◽  
Author(s):  
Lucilla Pizzo ◽  
Matthew Jensen ◽  
Andrew Polyak ◽  
Jill A. Rosenfeld ◽  
Katrin Mannik ◽  
...  

AbstractPurposeTo assess the contribution of rare variants in the genetic background towards variability of neurodevelopmental phenotypes in individuals with rare copy-number variants (CNVs) and gene-disruptive mutations.MethodsWe analyzed quantitative clinical information, exome-sequencing, and microarray data from 757 probands and 233 parents and siblings who carry disease-associated mutations.ResultsThe number of rare secondary mutations in functionally intolerant genes (second-hits) correlated with the expressivity of neurodevelopmental phenotypes in probands with 16p12.1 deletion (n=23, p=0.004) and in probands with autism carrying gene-disruptive mutations (n=184, p=0.03) compared to their carrier family members. Probands with 16p12.1 deletion and a strong family history presented more severe clinical features (p=0.04) and higher burden of second-hits compared to those with mild/no family history (p=0.001). The number of secondary variants also correlated with the severity of cognitive impairment in probands carrying pathogenic rare CNVs (n=53) or de novo mutations in disease genes (n=290), and negatively correlated with head size among 80 probands with 16p11.2 deletion. These second-hits involved known disease-associated genes such as SETD5, AUTS2, and NRXN1, and were enriched for genes affecting cellular and developmental processes.ConclusionAccurate genetic diagnosis of complex disorders will require complete evaluation of the genetic background even after a candidate gene mutation is identified.


2021 ◽  
Author(s):  
Xueya Zhou ◽  
Pamela Feliciano ◽  
Tianyun Wang ◽  
Irina Astrovskaya ◽  
Chang Shu ◽  
...  

AbstractDespite the known heritable nature of autism spectrum disorder (ASD), studies have primarily identified risk genes with de novo variants (DNVs). To capture the full spectrum of ASD genetic risk, we performed a two-stage analysis of rare de novo and inherited coding variants in 42,607 ASD cases, including 35,130 new cases recruited online by SPARK. In the first stage, we analyzed 19,843 cases with one or both biological parents and found that known ASD or neurodevelopmental disorder (NDD) risk genes explain nearly 70% of the genetic burden conferred by DNVs. In contrast, less than 20% of genetic risk conferred by rare inherited loss-of-function (LoF) variants are explained by known ASD/NDD genes. We selected 404 genes based on the first stage of analysis and performed a meta-analysis with an additional 22,764 cases and 236,000 population controls. We identified 60 genes with exome-wide significance (p < 2.5e-6), including five new risk genes (NAV3, ITSN1, MARK2, SCAF1, and HNRNPUL2). The association of NAV3 with ASD risk is entirely driven by rare inherited LoFs variants, with an average relative risk of 4, consistent with moderate effect. ASD individuals with LoF variants in the four moderate risk genes (NAV3, ITSN1, SCAF1, and HNRNPUL2, n = 95) have less cognitive impairment compared to 129 ASD individuals with LoF variants in well-established, highly penetrant ASD risk genes (CHD8, SCN2A, ADNP, FOXP1, SHANK3) (59% vs. 88%, p= 1.9e-06). These findings will guide future gene discovery efforts and suggest that much larger numbers of ASD cases and controls are needed to identify additional genes that confer moderate risk of ASD through rare, inherited variants.


2016 ◽  
Vol 113 (52) ◽  
pp. 15054-15059 ◽  
Author(s):  
Xiao Ji ◽  
Rachel L. Kember ◽  
Christopher D. Brown ◽  
Maja Bućan

Autism spectrum disorder (ASD) is a heterogeneous, highly heritable neurodevelopmental syndrome characterized by impaired social interaction, communication, and repetitive behavior. It is estimated that hundreds of genes contribute to ASD. We asked if genes with a strong effect on survival and fitness contribute to ASD risk. Human orthologs of genes with an essential role in pre- and postnatal development in the mouse [essential genes (EGs)] are enriched for disease genes and under strong purifying selection relative to human orthologs of mouse genes with a known nonlethal phenotype [nonessential genes (NEGs)]. This intolerance to deleterious mutations, commonly observed haploinsufficiency, and the importance of EGs in development suggest a possible cumulative effect of deleterious variants in EGs on complex neurodevelopmental disorders. With a comprehensive catalog of 3,915 mammalian EGs, we provide compelling evidence for a stronger contribution of EGs to ASD risk compared with NEGs. By examining the exonic de novo and inherited variants from 1,781 ASD quartet families, we show a significantly higher burden of damaging mutations in EGs in ASD probands compared with their non-ASD siblings. The analysis of EGs in the developing brain identified clusters of coexpressed EGs implicated in ASD. Finally, we suggest a high-priority list of 29 EGs with potential ASD risk as targets for future functional and behavioral studies. Overall, we show that large-scale studies of gene function in model organisms provide a powerful approach for prioritization of genes and pathogenic variants identified by sequencing studies of human disease.


2020 ◽  
Vol 11 ◽  
Author(s):  
Xianru Jiao ◽  
Manuela Morleo ◽  
Vincenzo Nigro ◽  
Annalaura Torella ◽  
Stefano D’Arrigo ◽  
...  

Objective: To establish and broaden the phenotypic spectrum of secretory carrier membrane protein (SCAMP5) associated with epilepsy and neurodevelopmental delay.Methods: A Chinese patient was identified at the First Hospital of Peking University, and the three unrelated patients were recruited from two different countries (Italy and United States) through GeneMatcher. SCAMP5 pathogenic variants were identified by whole exome sequencing; clinical data of the patients were retrospectively collected and analyzed.Result: The onset age of seizures was ranged from 6 to 15 months. Patients had different types of seizures, including focal seizures, generalized tonic-clonic seizures and tonic seizure. One patient showed typical autism spectrum disorder (ASD) symptoms. Electroencephalogram (EEG) findings presented as focal or multifocal discharges, sometimes spreading to generalization. Brain magnetic resonance imaging (MRI) abnormalities were present in each patient. Severe intellectual disability and language and motor developmental disorders were found in our patients, with all patients having poor language development and were nonverbal at last follow-up. All but one of the patients could walk independently in childhood, but the ability to walk independently in one patient had deteriorated with age. All patients had abnormal neurological exam findings, mostly signs of extrapyramidal system involvement. Dysmorphic features were found in 2/4 patients, mainly in the face and trunk. All four unrelated patients were found to have the same heterozygous pathogenic SCAMP5 de novo variant (p. Gly180Trp).Conclusion: Epilepsy, severe developmental delay, abnormal neurological exam findings, with or without ASD or variably dysmorphic features and were common in patients with SCAMP5 variant. The onset time and type of seizure varied greatly. The EEG and brain MRI findings were not consistent, but diverse and nonspecific. The motor ability of patients with heterozygous SCAMP5 variant might have a regressive course; language development was more severely affected.


2020 ◽  
Vol 21 (1) ◽  
pp. 289-304 ◽  
Author(s):  
Caroline M. Dias ◽  
Christopher A. Walsh

Recent advances in understanding the genetic architecture of autism spectrum disorder have allowed for unprecedented insight into its biological underpinnings. New studies have elucidated the contributions of a variety of forms of genetic variation to autism susceptibility. While the roles of de novo copy number variants and single-nucleotide variants—causing loss-of-function or missense changes—have been increasingly recognized and refined, mosaic single-nucleotide variants have been implicated more recently in some cases. Moreover, inherited variants (including common variants) and, more recently, rare recessive inherited variants have come into greater focus. Finally, noncoding variants—both inherited and de novo—have been implicated in the last few years. This work has revealed a convergence of diverse genetic drivers on common biological pathways and has highlighted the ongoing importance of increasing sample size and experimental innovation. Continuing to synthesize these genetic findings with functional and phenotypic evidence and translating these discoveries to clinical care remain considerable challenges for the field.


2018 ◽  
Author(s):  
Virginie Courchet ◽  
Amanda J Roberts ◽  
Peggy Del Carmine ◽  
Tommy L. Lewis ◽  
Franck Polleux ◽  
...  

SUMMARYRecently, numerous rare de novo mutations have been identified in children diagnosed with autism spectrum disorders (ASD). However, despite the predicted loss-of-function nature of some of these de novo mutations, the affected individuals are heterozygous carriers, which would suggest that most of these candidate genes are haploinsufficient and/or that these mutations lead to expression of dominant-negative forms of the protein. Here, we tested this hypothesis with the gene Nuak1, recently identified as a candidate ASD gene and that we previously identified for its role in the development of cortical connectivity. We report that Nuak1 is happloinsufficient in mice in regard to its function in cortical axon branching in vitro and in vivo. Nuak1+/− mice show a combination of abnormal behavioral traits ranging from defective memory consolidation in a spatial learning task, defects in social novelty (but not social preference) and abnormal sensorimotor gating and prepulse inhibition of the startle response. Overall, our results demonstrate that Nuak1 haploinsufficiency leads to defects in the development of cortical connectivity and a complex array of behavorial deficits compatible with ASD, intellectual disability and schizophrenia.


2016 ◽  
Author(s):  
Shahar Shohat ◽  
Eyal Ben-David ◽  
Sagiv Shifman

AbstractGenetic susceptibility to Intellectual disability (ID), autism spectrum disorder (ASD) and schizophrenia (SCZ) often arises from mutations in the same genes, suggesting that they share common mechanisms. We studied genes with de novo mutations in the three disorders and genes implicated by SCZ genome-wide association study (GWAS). Using biological annotations and brain gene expression, we show that mutation class explains enrichment patterns more than specific disorder. Genes with loss of function mutations and genes with missense mutations were enriched with different pathways, shared with genes intolerant to mutations. Specific gene expression patterns were found for each disorder. ID genes were preferentially expressed in fetal cortex, ASD genes also in fetal cerebellum and striatum, and genes associated with SCZ were most significantly enriched in adolescent cortex. Our study suggests that convergence across neuropsychiatric disorders stems from vulnerable pathways to genetic variations, but spatiotemporal activity of genes contributes to specific phenotypes.


2019 ◽  
Author(s):  
Pilar Cacheiro ◽  
Violeta Muñoz-Fuentes ◽  
Stephen A. Murray ◽  
Mary E. Dickinson ◽  
Maja Bucan ◽  
...  

ABSTRACTAlthough genomic sequencing has been transformative in the study of rare genetic diseases, identifying causal variants remains a considerable challenge that can be addressed in part by new gene-specific knowledge. Here, we integrate measures of how essential a gene is to supporting life, as inferred from the comprehensive viability and phenotyping screens performed on knockout mice by the International Mouse Phenotyping Consortium and from human cell line essentiality screens. We propose a novel, cross-species gene classification across the Full Spectrum of Intolerance to Loss-of-function (FUSIL) and demonstrate that genes in five mutually exclusive FUSIL categories have differing characteristics in the biological processes they regulate, tissue expression levels and human mutation rates. Most notably, Mendelian disease genes, particularly those associated with developmental disorders, are highly overrepresented in the developmental lethal category, representing genes not essential for cell survival but required for organism development. Exploiting this finding, we have screened developmental disorder cases from three independent disease sequencing consortia and identified potentially pathogenic, de novo variants shared in different patients for several developmental lethal genes that have not previously been associated with rare disease. We therefore propose FUSIL as an efficient resource for disease gene discovery.


2021 ◽  
Vol 11 (1) ◽  
Author(s):  
Gabrielle L. Sell ◽  
Wendy Xin ◽  
Emily K. Cook ◽  
Mark A. Zbinden ◽  
Thomas B. Schaffer ◽  
...  

AbstractIn humans, loss-of-function mutations in the UBE3A gene lead to the neurodevelopmental disorder Angelman syndrome (AS). AS patients have severe impairments in speech, learning and memory, and motor coordination, for which there is currently no treatment. In addition, UBE3A is duplicated in > 1–2% of patients with autism spectrum disorders—a further indication of the significant role it plays in brain development. Altered expression of UBE3A, an E3 ubiquitin ligase, is hypothesized to lead to impaired levels of its target proteins, but identifying the contribution of individual UBE3A targets to UBE3A-dependent deficits remains of critical importance. Ephexin5 is a putative UBE3A substrate that has restricted expression early in development, regulates synapse formation during hippocampal development, and is abnormally elevated in AS mice, modeled by maternally-derived Ube3a gene deletion. Here, we report that Ephexin5 can be directly ubiquitylated by UBE3A. Furthermore, removing Ephexin5 from AS mice specifically rescued hippocampus-dependent behaviors, CA1 physiology, and deficits in dendritic spine number. Our findings identify Ephexin5 as a key driver of hippocampal dysfunction and related behavioral deficits in AS mouse models. These results demonstrate the exciting potential of targeting Ephexin5, and possibly other UBE3A substrates, to improve symptoms of AS and other UBE3A-related developmental disorders.


Sign in / Sign up

Export Citation Format

Share Document