scholarly journals First estimation of the scale of canonical 5’ splice site GT>GC mutations generating wild-type transcripts and their medical genetic implications

2018 ◽  
Author(s):  
Jin-Huan Lin ◽  
Xin-Ying Tang ◽  
Arnaud Boulling ◽  
Wen-Bin Zou ◽  
Emmanuelle Masson ◽  
...  

ABSTRACTIt has long been known that canonical 5’ splice site (5’SS) GT>GC mutations may be compatible with normal splicing. However, to date, the true scale of canonical 5’SS GT>GC mutations generating wild-type transcripts, both in the context of the frequency of such mutations and the level of wild-type transcripts generated from the mutation alleles, remain unknown. Herein, combining data derived from a meta-analysis of 45 informative disease-causing 5’SS GT>GC mutations (from 42 genes) and a cell culture-based full-length gene splicing assay of 103 5’SS GT>GC mutations (from 30 genes), we estimate that ∼15-18% of the canonical GT 5’SSs are capable of generating between 1 and 84% normal transcripts as a consequence of the substitution of GT by GC. We further demonstrate that the canonical 5’SSs whose substitutions of GT by GC generated normal transcripts show stronger complementarity to the 5’ end of U1 snRNA than those sites whose substitutions of GT by GC did not lead to the generation of normal transcripts. We also observed a correlation between the generation of wild-type transcripts and a milder than expected clinical phenotype but found that none of the available splicing prediction tools were able to accurately predict the functional impact of 5’SS GT>GC mutations. Our findings imply that 5’SS GT>GC mutations may not invariably cause human disease but should also help to improve our understanding of the evolutionary processes that accompanied GT>GC subtype switching of U2-type introns in mammals.

2021 ◽  
Vol 12 ◽  
Author(s):  
Jin-Huan Lin ◽  
Hao Wu ◽  
Wen-Bin Zou ◽  
Emmanuelle Masson ◽  
Yann Fichou ◽  
...  

Combining data derived from a meta-analysis of human disease-associated 5′ splice site GT>GC (i.e., +2T>C) variants and a cell culture-based full-length gene splicing assay (FLGSA) of forward engineered +2T>C substitutions, we recently estimated that ∼15–18% of +2T>C variants can generate up to 84% wild-type transcripts relative to their wild-type counterparts. Herein, we analyzed the splicing outcomes of 20 +2T>C variants that generate some wild-type transcripts in two minigene assays. We found a high discordance rate in terms of the generation of wild-type transcripts, not only between FLGSA and the minigene assays but also between the different minigene assays. In the pET01 context, all 20 wild-type minigene constructs generated the expected wild-type transcripts; of the 20 corresponding variant minigene constructs, 14 (70%) generated wild-type transcripts. In the pSPL3 context, only 18 of the 20 wild-type minigene constructs generated the expected wild-type transcripts whereas 8 of the 18 (44%) corresponding variant minigene constructs generated wild-type transcripts. Thus, in the context of a particular type of variant, we raise awareness of the limitations of minigene splicing assays and emphasize the importance of sequence context in regulating splicing. Whether or not our findings apply to other types of splice-altering variant remains to be investigated.


2019 ◽  
Author(s):  
Jin-Huan Lin ◽  
Emmanuelle Masson ◽  
Arnaud Boulling ◽  
Matthew Hayden ◽  
David N. Cooper ◽  
...  

ABSTRACTIn the human genome, most 5’ splice sites (~99%) employ the canonical GT dinucleotide whereas a small minority (~1%) use the non-canonical GC dinucleotide. The functionality and pathogenicity of 5’ splice site GT>GC (i.e., +2T>C) variants have been extensively studied but we still know very little about 5’ splice site GC>GT (+2C>T) variants. Herein, we sought to address this deficiency by performing a meta-analysis of identified +2C>T pathogenic variants together with a functional analysis of +2C>T substitutions using a cell culture-based full-length gene splicing assay. Our results establish a proof of concept that +2C>T variants are qualitatively different from +2T>C variants in terms of their functionality and pathogenicity and suggest that, in sharp contrast with +2T>C variants, most if not all +2C>T variants have no pathological relevance. Our findings have important implications for interpreting the clinical relevance of +2C>T variants but might also improve our understanding of the evolutionary basis of switching between GT and GC 5’ splice sites in mammalian genomes.


2019 ◽  
Author(s):  
Jian-Min Chen ◽  
Jin-Huan Lin ◽  
Emmanuelle Masson ◽  
Zhuan Liao ◽  
Claude Férec ◽  
...  

ABSTRACTGT>GC 5’ splice site (or +2T>C) variants have been frequently reported to cause human genetic disease. However, although we have demonstrated that GT>GC variants in human disease genes may not invariably be pathogenic, none of the currently available splicing prediction tools appear to be capable of reliably distinguishing those GT>GC variants that generate wild-type transcripts from those that do not. Recently, SpliceAI, a novel deep residual neural network tool, has been developed for splicing prediction. Methodologically distinct from previous approaches that either rely on human-engineered features and/or which focus on short nucleotide windows adjoining exon-intron boundaries, SpliceAI assesses splicing determinants by evaluating 10,000 nucleotides of flanking contextual sequence to predict the functional role in splicing of each position in the pre-mRNA transcript. Herein, we evaluated the performance of SpliceAI in the context of three datasets of GT>GC variants, all of which had been characterized functionally in terms of their impact on mRNA splicing. The first two datasets refer to our recently described “in vivo” dataset of 45 disease-causing GT>GC variants and the “in vitro” dataset of 103 GT>GC substitutions. The third dataset comprised 12 BRCA1 GT>GC variants that were recently analyzed by saturation genome editing. We processed all GT>GC variants using the default settings of SpliceAI. Comparison of the SpliceAI-predicted and experimentally obtained functional impact assessments of the analyzed GT>GC variants revealed that although SpliceAI performed rather better than other prediction tools, it was still far from perfect. A key issue is that the impact of GT>GC (as well as GT>GA or +2T>A) variants that generated wild-type transcripts represents a quantitative change that can vary from barely detectable to almost full expression of wild-type transcripts, with wild-type transcripts often co-existing with aberrantly spliced transcripts. Our findings highlight the challenges that we still face in attempting to accurately identify splice-altering variants.


2020 ◽  
Vol 21 (1) ◽  
pp. 56-66 ◽  
Author(s):  
Jian-Min Chen ◽  
Jin-Huan Lin ◽  
Emmanuelle Masson ◽  
Zhuan Liao ◽  
Claude Férec ◽  
...  

Introduction: 5' splice site GT>GC or +2T>C variants have been frequently reported to cause human genetic disease and are routinely scored as pathogenic splicing mutations. However, we have recently demonstrated that such variants in human disease genes may not invariably be pathogenic. Moreover, we found that no splicing prediction tools appear to be capable of reliably distinguishing those +2T>C variants that generate wild-type transcripts from those that do not. Methodology: Herein, we evaluated the performance of a novel deep learning-based tool, SpliceAI, in the context of three datasets of +2T>C variants, all of which had been characterized functionally in terms of their impact on pre-mRNA splicing. The first two datasets refer to our recently described “in vivo” dataset of 45 known disease-causing +2T>C variants and the “in vitro” dataset of 103 +2T>C substitutions subjected to full-length gene splicing assay. The third dataset comprised 12 BRCA1 +2T>C variants that were recently analyzed by saturation genome editing. Results: Comparison of the SpliceAI-predicted and experimentally obtained functional impact assessments of these variants (and smaller datasets of +2T>A and +2T>G variants) revealed that although SpliceAI performed rather better than other prediction tools, it was still far from perfect. A key issue was that the impact of those +2T>C (and +2T>A) variants that generated wild-type transcripts represents a quantitative change that can vary from barely detectable to an almost full expression of wild-type transcripts, with wild-type transcripts often co-existing with aberrantly spliced transcripts. Conclusion: Our findings highlight the challenges that we still face in attempting to accurately identify splice-altering variants.


Genetics ◽  
2003 ◽  
Vol 164 (4) ◽  
pp. 1345-1353
Author(s):  
Amber K Bowers ◽  
Jennifer A Keller ◽  
Susan K Dutcher

Abstract To take advantage of available expressed sequence tags and genomic sequence, we have developed 64 PCR-based molecular markers in Chlamydomonas reinhardtii that map to the 17 linkage groups. These markers will allow the rapid association of a candidate gene sequence with previously identified mutations. As proof of principle, we have identified the genes encoded by the ERY1 and ERY2 loci. Mendelian mutations that confer resistance to erythromycin define three unlinked nuclear loci in C. reinhardtii. Candidate genes ribosomal protein L4 (RPL4) and L22 (RPL22) are tightly linked to the ERY1 locus and ERY2 locus, respectively. Genomic DNA for RPL4 from wild type and five mutant ery1 alleles was amplified and sequenced and three different point mutations were found. Two different glycine residues (G102 and G112) are replaced by aspartic acid and both are in the unstructured region of RPL4 that lines the peptide exit tunnel of the chloroplast ribosome. The other two alleles change a splice site acceptor site. Genomic DNA for RPL22 from wild type and three mutant ery2 alleles was amplified and sequenced and revealed three different point mutations. Two alleles have premature stop codons and one allele changes a splice site acceptor site.


2004 ◽  
Vol 72 (11) ◽  
pp. 6589-6596 ◽  
Author(s):  
Ricky L. Ulrich ◽  
David DeShazer ◽  
Harry B. Hines ◽  
Jeffrey A. Jeddeloh

ABSTRACT Numerous gram-negative bacterial pathogens regulate virulence factor expression by using a cell density mechanism termed quorum sensing (QS). An in silico analysis of the Burkholderia mallei ATCC 23344 genome revealed that it encodes at least two luxI and four luxR homologues. Using mass spectrometry, we showed that wild-type B. mallei produces the signaling molecules N-octanoyl-homoserine lactone and N-decanoyl-homoserine lactone. To determine if QS is involved in the virulence of B. mallei, we generated mutations in each putative luxIR homologue and tested the pathogenicities of the derivative strains in aerosol BALB/c mouse and intraperitoneal hamster models. Disruption of the B. mallei QS alleles, especially in RJ16 (bmaII) and RJ17 (bmaI3), which are luxI mutants, significantly reduced virulence, as indicated by the survival of mice who were aerosolized with 104 CFU (10 50% lethal doses [LD50s]). For the B. mallei transcriptional regulator mutants (luxR homologues), mutation of the bmaR5 allele resulted in the most pronounced decrease in virulence, with 100% of the challenged animals surviving a dose of 10 LD50s. Using a Syrian hamster intraperitoneal model of infection, we determined the LD50s for wild-type B. mallei and each QS mutant. An increase in the relative LD50 was found for RJ16 (bmaI1) (>967 CFU), RJ17 (bmaI3) (115 CFU), and RJ20 (bmaR5) (151 CFU) compared to wild-type B. mallei (<13 CFU). These findings demonstrate that B. mallei carries multiple luxIR homologues that either directly or indirectly regulate the biosynthesis of an essential virulence factor(s) that contributes to the pathogenicity of B. mallei in vivo.


2004 ◽  
Vol 287 (4) ◽  
pp. H1730-H1739 ◽  
Author(s):  
Ron Zohar ◽  
Baoqian Zhu ◽  
Peter Liu ◽  
Jaro Sodek ◽  
C. A. McCulloch

Reperfusion-induced oxidative injury to the myocardium promotes activation and proliferation of cardiac fibroblasts and repair by scar formation. Osteopontin (OPN) is a proinflammatory cytokine that is upregulated after reperfusion. To determine whether OPN enhances fibroblast survival after exposure to oxidants, cardiac fibroblasts from wild-type (WT) or OPN-null (OPN−/−) mice were treated in vitro with H2O2to model reperfusion injury. Within 1 h, membrane permeability to propidium iodide (PI) was increased from 5 to 60% in OPN−/−cells but was increased to only 20% in WT cells. In contrast, after 1–8 h of treatment with H2O2, the percent of terminal deoxynucleotidyl transferase-mediated dUTP nick-end labeling (TUNEL)-stained cells was more than twofold higher in WT than OPN−/−cells. Electron microscopy of WT cells treated with H2O2showed chromatin condensation, nuclear fragmentation, and cytoplasmic and nuclear shrinkage, which are consistent with apoptosis. In contrast, H2O2-treated OPN−/−cardiac fibroblasts exhibited cell and nuclear swelling and membrane disruption that are indicative of cell necrosis. Treatment of OPN−/−and WT cells with a cell-permeable caspase-3 inhibitor reduced the percentage of TUNEL staining by more than fourfold in WT cells but decreased staining in OPN−/−cells by ∼30%. Although the percentage of PI-permeable WT cells was reduced threefold, the percent of PI-permeable OPN−/−cells was not altered. Restoration of OPN expression in OPN−/−fibroblasts reduced the percentage of PI-permeable cells but not TUNEL staining after H2O2treatment. Thus H2O2-induced cell death in OPN-deficient cardiac fibroblasts is mediated by a caspase-3-independent, necrotic pathway. We suggest that the increased expression of OPN in the myocardium after reperfusion may promote fibrosis by protecting cardiac fibroblasts from cell death.


2008 ◽  
Vol 295 (1) ◽  
pp. G45-G53 ◽  
Author(s):  
Bin Hu ◽  
Lisa M. Colletti

Stem cell factor (SCF) and its receptor c-kit are important in hematopoiesis and cellular proliferation. c-kit has also been identified as a cell surface marker for progenitor cells. We have previously shown that there is a large reservoir of hepatic SCF, and this molecule plays a significant role in liver regeneration after 70% hepatectomy. In the current study, we further examined the expression of SCF and c-kit in acetaminophen (APAP)-induced liver injury in C57BL/6J mice or SCF-deficient sl-sld mice and their appropriate wild-type controls. Following APAP-induced liver injury, c-kit mRNA expression increased, with peak levels detected 48 h postinjury. Hepatic SCF mRNA levels after APAP injury were also increased, with peak levels seen 16 h post-APAP. The mortality rate in SCF-deficient mice treated with APAP was significantly higher than that of wild-type mice; furthermore, administration of exogenous SCF significantly reduced the mortality of APAP-treated wild-type mice. Bromodeoxyuridine incorporation experiments showed that SCF significantly increased hepatocyte proliferation at 48 and 72 h in APAP-treated mice. SCF inhibited APAP-induced hepatocyte apoptosis and increased Bcl-2 and Bcl-xL expression, suggesting that this decrease in hepatocyte apoptosis is mediated through Bcl-2 and Bcl-xL. In summary, SCF and c-kit expression was increased after APAP-induced liver injury. Administration of exogenous SCF reduces mortality in APAP-treated mice, increases hepatocyte proliferation, and prevents hepatocyte apoptosis induced by APAP, suggesting that these molecules are important in the liver's recovery from these injuries.


2000 ◽  
Vol 191 (8) ◽  
pp. 1281-1292 ◽  
Author(s):  
Raelene J. Grumont ◽  
Steve Gerondakis

In lymphocytes, the Rel transcription factor is essential in establishing a pattern of gene expression that promotes cell proliferation, survival, and differentiation. Here we show that mitogen-induced expression of interferon (IFN) regulatory factor 4 (IRF-4), a lymphoid-specific member of the IFN family of transcription factors, is Rel dependent. Consistent with IRF-4 functioning as a repressor of IFN-induced gene expression, the absence of IRF-4 expression in c-rel−/− B cells coincided with a greater sensitivity of these cells to the antiproliferative activity of IFNs. In turn, enforced expression of an IRF-4 transgene restored IFN modulated c-rel−/− B cell proliferation to that of wild-type cells. This cross-regulation between two different signaling pathways represents a novel mechanism that Rel/nuclear factor κB can repress the transcription of IFN-regulated genes in a cell type–specific manner.


PeerJ ◽  
2018 ◽  
Vol 6 ◽  
pp. e5382 ◽  
Author(s):  
Fernanda Cornejo-Granados ◽  
Luigui Gallardo-Becerra ◽  
Miriam Leonardo-Reza ◽  
Juan Pablo Ochoa-Romo ◽  
Adrian Ochoa-Leyva

The shrimp or prawn is the most valuable traded marine product in the world market today and its microbiota plays an essential role in its development, physiology, and health. The technological advances and dropping costs of high-throughput sequencing have increased the number of studies characterizing the shrimp microbiota. However, the application of different experimental and bioinformatics protocols makes it difficult to compare different studies to reach general conclusions about shrimp microbiota. To meet this necessity, we report the first meta-analysis of the microbiota from freshwater and marine shrimps using all publically available sequences of the 16S ribosomal gene (16S rRNA gene). We obtained data for 199 samples, in which 63.3% were from marine (Alvinocaris longirostris, Litopenaeus vannamei and Penaeus monodon), and 36.7% were from freshwater (Macrobrachium asperulum, Macrobrachium nipponense, Macrobranchium rosenbergii, Neocaridina denticulata) shrimps. Technical variations among studies, such as selected primers, hypervariable region, and sequencing platform showed a significant impact on the microbiota structure. Additionally, the ANOSIM and PERMANOVA analyses revealed that the most important biological factor in structuring the shrimp microbiota was the marine and freshwater environment (ANOSIM R = 0.54, P = 0.001; PERMANOVA pseudo-F = 21.8, P = 0.001), where freshwater showed higher bacterial diversity than marine shrimps. Then, for marine shrimps, the most relevant biological factors impacting the microbiota composition were lifestyle (ANOSIM R = 0.341, P = 0.001; PERMANOVA pseudo-F = 8.50, P = 0.0001), organ (ANOSIM R = 0.279, P = 0.001; PERMANOVA pseudo-F = 6.68, P = 0.001) and developmental stage (ANOSIM R = 0.240, P = 0.001; PERMANOVA pseudo-F = 5.05, P = 0.001). According to the lifestyle, organ, developmental stage, diet, and health status, the highest diversity were for wild-type, intestine, adult, wild-type diet, and healthy samples, respectively. Additionally, we used PICRUSt to predict the potential functions of the microbiota, and we found that the organ had more differentially enriched functions (93), followed by developmental stage (12) and lifestyle (9). Our analysis demonstrated that despite the impact of technical and bioinformatics factors, the biological factors were also statistically significant in shaping the microbiota. These results show that cross-study comparisons are a valuable resource for the improvement of the shrimp microbiota and microbiome fields. Thus, it is important that future studies make public their sequencing data, allowing other researchers to reach more powerful conclusions about the microbiota in this non-model organism. To our knowledge, this is the first meta-analysis that aims to define the shrimp microbiota.


Sign in / Sign up

Export Citation Format

Share Document