scholarly journals Genetic regulatory variation in populations informs transcriptome analysis in rare disease

Science ◽  
2019 ◽  
Vol 366 (6463) ◽  
pp. 351-356 ◽  
Author(s):  
Pejman Mohammadi ◽  
Stephane E. Castel ◽  
Beryl B. Cummings ◽  
Jonah Einson ◽  
Christina Sousa ◽  
...  

Transcriptome data can facilitate the interpretation of the effects of rare genetic variants. Here, we introduce ANEVA (analysis of expression variation) to quantify genetic variation in gene dosage from allelic expression (AE) data in a population. Application of ANEVA to the Genotype-Tissues Expression (GTEx) data showed that this variance estimate is robust and correlated with selective constraint in a gene. Using these variance estimates in a dosage outlier test (ANEVA-DOT) applied to AE data from 70 Mendelian muscular disease patients showed accuracy in detecting genes with pathogenic variants in previously resolved cases and led to one confirmed and several potential new diagnoses. Using our reference estimates from GTEx data, ANEVA-DOT can be incorporated in rare disease diagnostic pipelines to use RNA-sequencing data more effectively.

2019 ◽  
Author(s):  
Pejman Mohammadi ◽  
Stephane E. Castel ◽  
Beryl B. Cummings ◽  
Jonah Einson ◽  
Christina Sousa ◽  
...  

AbstractTranscriptome data holds substantial promise for better interpretation of rare genetic variants in basic research and clinical settings. Here, we introduce ANalysis of Expression VAriation (ANEVA) to quantify genetic variation in gene dosage from allelic expression (AE) data in a population. Application to GTEx data showed that this variance estimate is robust across datasets and is correlated with selective constraint in a gene. We next used ANEVA variance estimates in a Dosage Outlier Test (ANEVA-DOT) to identify genes in an individual that are affected by a rare regulatory variant with an unusually strong effect. Applying ANEVA-DOT to AE data form 70 Mendelian muscular disease patients showed high accuracy in detecting genes with pathogenic variants in previously resolved cases, and lead to one confirmed and several potential new diagnoses in cases previously unresolved. Using our reference estimates from GTEx data, ANEVA-DOT can be readily incorporated in rare disease diagnostic pipelines to better utilize RNA-seq data.One Sentence SummaryNew statistical framework for modelling allelic expression characterizes genetic regulatory variation in populations and informs diagnosis in rare disease patients


2018 ◽  
Author(s):  
Jenny Lord ◽  
Giuseppe Gallone ◽  
Patrick J. Short ◽  
Jeremy F. McRae ◽  
Holly Ironfield ◽  
...  

AbstractMutations which perturb normal pre-mRNA splicing are significant contributors to human disease. We used exome sequencing data from 7,833 probands with developmental disorders (DD) and their unaffected parents, as well as >60,000 aggregated exomes from the Exome Aggregation Consortium, to investigate selection around the splice site, and quantify the contribution of splicing mutations to DDs. Patterns of purifying selection, a deficit of variants in highly constrained genes in healthy subjects and excess de novo mutations in patients highlighted particular positions within and around the consensus splice site of greater functional relevance. Using mutational burden analyses in this large cohort of proband-parent trios, we could estimate in an unbiased manner the relative contributions of mutations at canonical dinucleotides (73%) and flanking non-canonical positions (27%), and calculated the positive predictive value of pathogenicity for different classes of mutations. We identified 18 patients with likely diagnostic de novo mutations in dominant DD-associated genes at non-canonical positions in splice sites. We estimate 35-40% of pathogenic variants in non-canonical splice site positions are missing from public databases.


BMJ ◽  
2021 ◽  
pp. n214
Author(s):  
Weedon MN ◽  
Jackson L ◽  
Harrison JW ◽  
Ruth KS ◽  
Tyrrell J ◽  
...  

Abstract Objective To determine whether the sensitivity and specificity of SNP chips are adequate for detecting rare pathogenic variants in a clinically unselected population. Design Retrospective, population based diagnostic evaluation. Participants 49 908 people recruited to the UK Biobank with SNP chip and next generation sequencing data, and an additional 21 people who purchased consumer genetic tests and shared their data online via the Personal Genome Project. Main outcome measures Genotyping (that is, identification of the correct DNA base at a specific genomic location) using SNP chips versus sequencing, with results split by frequency of that genotype in the population. Rare pathogenic variants in the BRCA1 and BRCA2 genes were selected as an exemplar for detailed analysis of clinically actionable variants in the UK Biobank, and BRCA related cancers (breast, ovarian, prostate, and pancreatic) were assessed in participants through use of cancer registry data. Results Overall, genotyping using SNP chips performed well compared with sequencing; sensitivity, specificity, positive predictive value, and negative predictive value were all above 99% for 108 574 common variants directly genotyped on the SNP chips and sequenced in the UK Biobank. However, the likelihood of a true positive result decreased dramatically with decreasing variant frequency; for variants that are very rare in the population, with a frequency below 0.001% in UK Biobank, the positive predictive value was very low and only 16% of 4757 heterozygous genotypes from the SNP chips were confirmed with sequencing data. Results were similar for SNP chip data from the Personal Genome Project, and 20/21 individuals analysed had at least one false positive rare pathogenic variant that had been incorrectly genotyped. For pathogenic variants in the BRCA1 and BRCA2 genes, which are individually very rare, the overall performance metrics for the SNP chips versus sequencing in the UK Biobank were: sensitivity 34.6%, specificity 98.3%, positive predictive value 4.2%, and negative predictive value 99.9%. Rates of BRCA related cancers in UK Biobank participants with a positive SNP chip result were similar to those for age matched controls (odds ratio 1.31, 95% confidence interval 0.99 to 1.71) because the vast majority of variants were false positives, whereas sequence positive participants had a significantly increased risk (odds ratio 4.05, 2.72 to 6.03). Conclusions SNP chips are extremely unreliable for genotyping very rare pathogenic variants and should not be used to guide health decisions without validation.


Cancers ◽  
2021 ◽  
Vol 13 (7) ◽  
pp. 1495
Author(s):  
Tú Nguyen-Dumont ◽  
James G. Dowty ◽  
Robert J. MacInnis ◽  
Jason A. Steen ◽  
Moeen Riaz ◽  
...  

While gene panel sequencing is becoming widely used for cancer risk prediction, its clinical utility with respect to predicting aggressive prostate cancer (PrCa) is limited by our current understanding of the genetic risk factors associated with predisposition to this potentially lethal disease phenotype. This study included 837 men diagnosed with aggressive PrCa and 7261 controls (unaffected men and men who did not meet criteria for aggressive PrCa). Rare germline pathogenic variants (including likely pathogenic variants) were identified by targeted sequencing of 26 known or putative cancer predisposition genes. We found that 85 (10%) men with aggressive PrCa and 265 (4%) controls carried a pathogenic variant (p < 0.0001). Aggressive PrCa odds ratios (ORs) were estimated using unconditional logistic regression. Increased risk of aggressive PrCa (OR (95% confidence interval)) was identified for pathogenic variants in BRCA2 (5.8 (2.7–12.4)), BRCA1 (5.5 (1.8–16.6)), and ATM (3.8 (1.6–9.1)). Our study provides further evidence that rare germline pathogenic variants in these genes are associated with increased risk of this aggressive, clinically relevant subset of PrCa. These rare genetic variants could be incorporated into risk prediction models to improve their precision to identify men at highest risk of aggressive prostate cancer and be used to identify men with newly diagnosed prostate cancer who require urgent treatment.


2020 ◽  
Vol 28 (12) ◽  
pp. 1763-1768
Author(s):  
Thomas Bourinaris ◽  
◽  
Damian Smedley ◽  
Valentina Cipriani ◽  
Isabella Sheikh ◽  
...  

AbstractHereditary spastic paraplegia (HSP) is a group of heterogeneous inherited degenerative disorders characterized by lower limb spasticity. Fifty percent of HSP patients remain yet genetically undiagnosed. The 100,000 Genomes Project (100KGP) is a large UK-wide initiative to provide genetic diagnosis to previously undiagnosed patients and families with rare conditions. Over 400 HSP families were recruited to the 100KGP. In order to obtain genetic diagnoses, gene-based burden testing was carried out for rare, predicted pathogenic variants using candidate variants from the Exomiser analysis of the genome sequencing data. A significant gene-disease association was identified for UBAP1 and HSP. Three protein truncating variants were identified in 13 patients from 7 families. All patients presented with juvenile form of pure HSP, with median age at onset 10 years, showing autosomal dominant inheritance or de novo occurrence. Additional clinical features included parkinsonism and learning difficulties, but their association with UBAP1 needs to be established.


2021 ◽  
Author(s):  
Jet van der Spek ◽  
Joery den Hoed ◽  
Lot Snijders Blok ◽  
Alexander J. M. Dingemans ◽  
Dick Schijven ◽  
...  

Interpretation of next-generation sequencing data of individuals with an apparent sporadic neurodevelopmental disorder (NDD) often focusses on pathogenic variants in genes associated with NDD, assuming full clinical penetrance with limited variable expressivity. Consequently, inherited variants in genes associated with dominant disorders may be overlooked when the transmitting parent is clinically unaffected. While de novo variants explain a substantial proportion of cases with NDDs, a significant number remains undiagnosed possibly explained by coding variants associated with reduced penetrance and variable expressivity. We characterized twenty families with inherited heterozygous missense or protein-truncating variants (PTVs) in CHD3, a gene in which de novo variants cause Snijders Blok-Campeau syndrome, characterized by intellectual disability, speech delay and recognizable facial features (SNIBCPS). Notably, the majority of the inherited CHD3 variants were maternally transmitted. Computational facial and human phenotype ontology-based comparisons demonstrated that the phenotypic features of probands with inherited CHD3 variants overlap with the phenotype previously associated with de novo variants in the gene, while carrier parents are mildly or not affected, suggesting variable expressivity. Additionally, similarly reduced expression levels of CHD3 protein in cells of an affected proband and of related healthy carriers with a CHD3 PTV, suggested that compensation of expression from the wildtype allele is unlikely to be an underlying mechanism. Our results point to a significant role of inherited variation in SNIBCPS, a finding that is critical for correct variant interpretation and genetic counseling and warrants further investigation towards understanding the broader contributions of such variation to the landscape of human disease.


2021 ◽  
Author(s):  
Amein Kadhem AlAli ◽  
Abdulrahman Al-Enazi ◽  
Ahmed Ammar ◽  
Mahmoud Hajj ◽  
Cyril Cyrus ◽  
...  

Abstract Background Epilepsy, a serious chronic neurological condition effecting up to 100 million people globally, has clear genetic underpinnings including common and rare variants. In Saudi Arabia the prevalence of epilepsy is high and caused mainly by perinatal and genetic factors. No whole-exome sequencing (WES) studies have been performed to date in Saudi Arabian Epilepsy cohorts. This offers a unique opportunity for the discovery of rare genetic variants impacting this disease as there is a high rate of consanguinity amongst large tribal pedigrees. Results We performed WES on 144 individuals diagnosed with epilepsy, to interrogate known Epilepsy related genes for known and functional novel variants. We also used an American College of Medical Genetics (ACMG) guideline based variant prioritization approach in an attempt to discover putative causative variants. We identified a 32 potentially causative pathogenic variants across 30 different genes in 44/144 (30%) of these Saudi Epilepsy individuals. We also identified 232 variants of unknown significance (VUS) across 101 different genes in 133/144 (92%) subjects. Strong enrichment of variants of likely pathogenicity were observed in previously described epilepsy-associated loci, and a number of putative pathogenic variants in novel loci are also observed. Conclusion Several putative pathogenic variants known to be epilepsy-related loci were identified for the first time in our population, in addition to several potential new loci have been identified which may be prioritized for further investigation.


2019 ◽  
Vol 11 (1) ◽  
Author(s):  
Elias L. Salfati ◽  
Emily G. Spencer ◽  
Sarah E. Topol ◽  
Evan D. Muse ◽  
Manuel Rueda ◽  
...  

Abstract Background Whole-exome sequencing (WES) has become an efficient diagnostic test for patients with likely monogenic conditions such as rare idiopathic diseases or sudden unexplained death. Yet, many cases remain undiagnosed. Here, we report the added diagnostic yield achieved for 101 WES cases re-analyzed 1 to 7 years after initial analysis. Methods Of the 101 WES cases, 51 were rare idiopathic disease cases and 50 were postmortem “molecular autopsy” cases of early sudden unexplained death. Variants considered for reporting were prioritized and classified into three groups: (1) diagnostic variants, pathogenic and likely pathogenic variants in genes known to cause the phenotype of interest; (2) possibly diagnostic variants, possibly pathogenic variants in genes known to cause the phenotype of interest or pathogenic variants in genes possibly causing the phenotype of interest; and (3) variants of uncertain diagnostic significance, potentially deleterious variants in genes possibly causing the phenotype of interest. Results Initial analysis revealed diagnostic variants in 13 rare disease cases (25.4%) and 5 sudden death cases (10%). Re-analysis resulted in the identification of additional diagnostic variants in 3 rare disease cases (5.9%) and 1 sudden unexplained death case (2%), which increased our molecular diagnostic yield to 31.4% and 12%, respectively. Conclusions The basis of new findings ranged from improvement in variant classification tools, updated genetic databases, and updated clinical phenotypes. Our findings highlight the potential for re-analysis to reveal diagnostic variants in cases that remain undiagnosed after initial WES.


2020 ◽  
Vol 21 (18) ◽  
pp. 6950
Author(s):  
Anastasiya V. Snezhkina ◽  
Dmitry V. Kalinin ◽  
Vladislav S. Pavlov ◽  
Elena N. Lukyanova ◽  
Alexander L. Golovyuk ◽  
...  

Carotid paragangliomas (CPGLs) are rare neuroendocrine tumors often associated with mutations in SDHx genes. The immunohistochemistry of succinate dehydrogenase (SDH) subunits has been considered a useful instrument for the prediction of SDHx mutations in paragangliomas/pheochromocytomas. We compared the mutation status of SDHx genes with the immunohistochemical (IHC) staining of SDH subunits in CPGLs. To identify pathogenic/likely pathogenic variants in SDHx genes, exome sequencing data analysis among 42 CPGL patients was performed. IHC staining of SDH subunits was carried out for all CPGLs studied. We encountered SDHx variants in 38% (16/42) of the cases in SDHx genes. IHC showed negative (5/15) or weak diffuse (10/15) SDHB staining in most tumors with variants in any of SDHx (94%, 15/16). In SDHA-mutated CPGL, SDHA expression was completely absent and weak diffuse SDHB staining was detected. Positive immunoreactivity for all SDH subunits was found in one case with a variant in SDHD. Notably, CPGL samples without variants in SDHx also demonstrated negative (2/11) or weak diffuse (9/11) SDHB staining (42%, 11/26). Obtained results indicate that SDH immunohistochemistry does not fully reflect the presence of mutations in the genes; diagnostic effectiveness of this method was 71%. However, given the high sensitivity of SDHB immunohistochemistry, it could be used for initial identifications of patients potentially carrying SDHx mutations for recommendation of genetic testing.


EP Europace ◽  
2020 ◽  
Vol 22 (Supplement_1) ◽  
Author(s):  
I Rudaka ◽  
D Rots ◽  
O Kalejs ◽  
L Gailite

Abstract Background. Minor part of atrial fibrillation (AF) patients develops the disease without any well-known risk factors, which is a particular form of the disease, known as a lone AF. Rare genetic variants were described as causative for lone AF. The aim of this study was to investigate occurrence of rare genetic variants in lone AF patients. Material and Methods. We performed Mendeliome sequencing for 21 lone AF patients. Lone AF was defined as AF in individuals younger than 65 years in the absence of cardiovascular or structural heart disease, endocrinologic or pulmonary disease, chronic kidney disease, obesity and excessive alcohol consumption. Data analysis was performed by current laboratory pipeline. We analyzed 453 cardiomyopathy, arrhythmias and sudden cardiac death related genes. Results. In eight out of 21 (38%) lone AF patients rare likely pathogenic variants were found (Table 1.). Seven rare truncating TTN variants and one LMNA missense variant were observed. Four unrelated patients were positive for the same TTN variant c.13696 C &gt; T; p.(Gln4566Ter). The same variant was previously found in ARVC patient in our laboratory. Segregation analysis and phenotyping of relatives is ongoing. Conclusions. Rare genetic variants are common causes of the lone atrial fibrillation. TTN gene variant c.13696C &gt; T; p.(Gln4566Ter) is a potential founder variant in the Baltic population. Table 1. Genetic variants in lone AF Gender Age of AF onset Genetic variant Family history Male 53 LMNA: p.(Ser326Thr) AF in mother Male 11 TTN: p.(Trp31854Ter) AF in father Male 30 TTN: p.(GLn4566Ter) AF in uncle Female 45 TTN: p.(GLn4566Ter) Negative Male 37 TTN: p.(GLn4566Ter) AF in father Male 25 TTN: p.(GLn4566Ter) AF in father, maternal and paternal grandmother Female 60 TTN: p.(Arg27414Ter) Sudden cardiac death at the age of 50 in grand father Female 52 TTN: p.(Arg1012Ter) AF in mother


Sign in / Sign up

Export Citation Format

Share Document