scholarly journals A CLEAR pipeline for direct comparison of circular and linear RNA expression

2019 ◽  
Author(s):  
Xu-Kai Ma ◽  
Meng-Ran Wang ◽  
Chu-Xiao Liu ◽  
Rui Dong ◽  
Gordon G. Carmichael ◽  
...  

ABSTRACTSequences of circular RNAs (circRNAs) produced from back-splicing of exon(s) completely overlap with sequences from cognate linear RNAs transcribed from the same gene loci with the exception of their back-splicing junction (BSJ) sites. Examination of global circRNA expression from RNA-seq datasets generally relies on the detection of RNA-seq fragments spanning BSJ sites, but a direct comparison of circular and linear RNA expression from the same gene loci in a genome-wide manner has remained challenging. This is because quantification of BSJ fragments differs from that of linear RNA expression that uses normalized RNA-seq fragments mapped to the whole gene bodies. Here, we have developed a computational pipeline for circular and linear RNA expression analysis from ribosomal-RNA depleted RNA-seq (CLEAR, https://github.com/YangLab/CLEAR). A new quantitation parameter, FPB (fragments per billion mapped bases), is applied to evaluate circular and linear RNA expression individually by fragments mapped to circRNA-specific BSJ sites or to linear RNA-specific splicing junction (SJ) sites. Then, circular and linear RNA expression are directly compared by dividing FPBcirc by FPBlinear to generate a CIRCscore, which indicates the relative circRNA expression using linear RNA expression as the background. Highly-expressed circRNAs with low cognate linear RNA expression background can be identified for further investigation.

2019 ◽  
Vol 9 (1) ◽  
Author(s):  
Junjie Shao ◽  
Liqiang Wang ◽  
Xinyue Liu ◽  
Meng Yang ◽  
Haimei Chen ◽  
...  

Abstract Circular RNAs (circRNAs) play important roles in animals, plants, and fungi. However, no circRNAs have been reported in Ganoderma lucidum. Here, we carried out a genome-wide identification of the circRNAs in G.lucidum using RNA-Seq data, and analyzed their features. In total, 250 and 2193 circRNAs were identified from strand-specific RNA-seq data generated from the polyA(−) and polyA(−)/RNase R-treated libraries, respectively. Six of 131 (4.58%) predicted circRNAs were experimentally confirmed. Across three developmental stages, 731 exonic circRNAs (back spliced read counts ≥ 5) and their parent genes were further analyzed. CircRNAs were preferred originating from exons with flanking introns, and the lengths of the flanking intron were longer than those of the control introns. A total of 200 circRNAs were differentially expressed across the three developmental stages of G. lucidum. The expression profiles of 119 (16.3%) exonic circRNAs and their parent genes showed significant positive correlations (r ≥ 0.9, q < 0.01), whereas 226 (30.9%) exonic circRNAs and their parent genes exhibited significant negative correlations (r ≤ −0.9, q < 0.01), in which 53 parent genes are potentially involved in the transcriptional regulation, polysaccharide biosynthesis etc. Our results indicated that circRNAs are present in G. lucidum, with potentially important regulatory roles.


2021 ◽  
Author(s):  
Taeyoung Hwang ◽  
Sojin Kim ◽  
Tamrin Chowdhury ◽  
Hyeon Jong Yu ◽  
Kyung-Min Kim ◽  
...  

Abstract BackgroundAlu is a primate-specific repeat element in the human genome and has been increasingly appreciated as a regulatory element in many biological processes. But the role of Alu has not been studied comprehensively in brain tumor because an evolutionary perspective has been the subject of little research in brain tumor. We aim to investigate the relevance of Alu to the gliomagenesis.MethodsUsing a total of 41 pairs of neurotypicial brain tissue samples and samples of diverse gliomas, we performed strand-specific RNA-seq and analyzed two Alu-associated post-transcriptional regulations, A-to-I editing and circular RNAs, and Alu expression in a genome-wide way. ResultsWe found that while both A-to-I editing and circular RNA are decreased overall in gliomas, grade 2 oligodendrogliomas do not show this same pattern of global changes. Instead, in comparison with other gliomas, oligodendrogliomas showed a higher proportion of perturbed Alu RNA. Adenosine deaminase acting on RNA 2 (ADAR2) was down-regulated in gliomas other than grade 2 oligodendrogliomas, contributing to the observed Alu-associated perturbation. ConclusionsOur results demonstrate that Alu is associated with glioma development and grade 2 oligodendroglioma exhibits a unique pattern of Alu-associated post-transcriptional regulations, which provides an insight to gliomagenesis from the perspective of an evolutionary genetic element.


2021 ◽  
Author(s):  
Tanzeem Fatima ◽  
Rangachari Krishnan ◽  
Ashutosh Srivastava ◽  
Vageeshbabu S. Hanur ◽  
M. Srinivasa Rao

East Indian Sandalwood (Santalum album L.) is highly valued for its heartwood and its oil. There have been no efforts to comparative study of high and low oil yielding genetically identical sandalwood trees grown in similar climatic condition. Thus we intend to study a genome wide transcriptome analysis to identify the corresponding genes involved in high oil biosynthesis in S. album. In this study, 15 years old S. album (SaSHc and SaSLc) genotypes were targeted for analysis to understand the contribution of genetic background on high oil biosynthesis in S. album. A total of 28,959187 and 25,598869 raw PE reads were generated by the Illumina sequencing. 2.12 million and 1.811 million coding sequences were obtained in respective accessions. Based on the GO terms, functional classification of the CDS 21262, & 18113 were assigned into 26 functional groups of three GO categories; (4,168; 3,641) for biological process (5,758;4,971) cellular component and (5,108;4,441) for molecular functions. Total 41,900 and 36,571 genes were functionally annotated and KEGG pathways of the DEGs resulted 213 metabolic pathways. In this, 14 pathways were involved in secondary metabolites biosynthesis pathway in S. album. Among 237 cytochrome families, nine groups of cytochromes were participated in high oil biosynthesis. 16,665 differentially expressed genes were commonly detected in both the accessions (SaHc and SaSLc). The results showed that 784 genes were upregulated and 339 genes were downregulated in SaHc whilst 635 upregulated 299 downregulated in SaSLc S. album. RNA-Seq results were further validated by quantitative RT-PCR. Maximum Blast hits were found to be against Vitis vinifera. From this study we have identified additional number of cytochrome family in SaHc. The accessibility of a RNA-Seq for high oil yielding sandalwood accessions will have broader associations for the conservation and selection of superior elite samples/populations for further genetic improvement program.


2020 ◽  
Vol 10 (6) ◽  
pp. 2057-2068 ◽  
Author(s):  
Jessica R. Eisenstatt ◽  
Lars Boeckmann ◽  
Wei-Chun Au ◽  
Valerie Garcia ◽  
Levi Bursch ◽  
...  

The evolutionarily conserved centromeric histone H3 variant (Cse4 in budding yeast, CENP-A in humans) is essential for faithful chromosome segregation. Mislocalization of CENP-A to non-centromeric chromatin contributes to chromosomal instability (CIN) in yeast, fly, and human cells and CENP-A is highly expressed and mislocalized in cancers. Defining mechanisms that prevent mislocalization of CENP-A is an area of active investigation. Ubiquitin-mediated proteolysis of overexpressed Cse4 (GALCSE4) by E3 ubiquitin ligases such as Psh1 prevents mislocalization of Cse4, and psh1Δ strains display synthetic dosage lethality (SDL) with GALCSE4. We previously performed a genome-wide screen and identified five alleles of CDC7 and DBF4 that encode the Dbf4-dependent kinase (DDK) complex, which regulates DNA replication initiation, among the top twelve hits that displayed SDL with GALCSE4. We determined that cdc7-7 strains exhibit defects in ubiquitin-mediated proteolysis of Cse4 and show mislocalization of Cse4. Mutation of MCM5 (mcm5-bob1) bypasses the requirement of Cdc7 for replication initiation and rescues replication defects in a cdc7-7 strain. We determined that mcm5-bob1 does not rescue the SDL and defects in proteolysis of GALCSE4 in a cdc7-7 strain, suggesting a DNA replication-independent role for Cdc7 in Cse4 proteolysis. The SDL phenotype, defects in ubiquitin-mediated proteolysis, and the mislocalization pattern of Cse4 in a cdc7-7 psh1Δ strain were similar to that of cdc7-7 and psh1Δ strains, suggesting that Cdc7 regulates Cse4 in a pathway that overlaps with Psh1. Our results define a DNA replication initiation-independent role of DDK as a regulator of Psh1-mediated proteolysis of Cse4 to prevent mislocalization of Cse4.


2020 ◽  
Vol 21 (15) ◽  
pp. 5492 ◽  
Author(s):  
Yu Jin Jung ◽  
Jong Hee Kim ◽  
Hyo Ju Lee ◽  
Dong Hyun Kim ◽  
Jihyeon Yu ◽  
...  

The rice SLR1 gene encodes the DELLA protein (protein with DELLA amino acid motif), and a loss-of-function mutation is dwarfed by inhibiting plant growth. We generate slr1-d mutants with a semi-dominant dwarf phenotype to target mutations of the DELLA/TVHYNP domain using CRISPR/Cas9 genome editing in rice. Sixteen genetic edited lines out of 31 transgenic plants were generated. Deep sequencing results showed that the mutants had six different mutation types at the target site of the TVHYNP domain of the SLR1 gene. The homo-edited plants selected individuals without DNA (T-DNA) transcribed by segregation in the T1 generation. The slr1-d7 and slr1-d8 plants caused a gibberellin (GA)-insensitive dwarf phenotype with shrunken leaves and shortened internodes. A genome-wide gene expression analysis by RNA-seq indicated that the expression levels of two GA-related genes, GA20OX2 (Gibberellin oxidase) and GA3OX2, were increased in the edited mutant plants, suggesting that GA20OX2 acts as a convert of GA12 signaling. These mutant plants are required by altering GA responses, at least partially by a defect in the phytohormone signaling system process and prevented cell elongation. The new mutants, namely, the slr1-d7 and slr1-d8 lines, are valuable semi-dominant dwarf alleles with potential application value for molecule breeding using the CRISPR/Cas9 system in rice.


2020 ◽  
Author(s):  
PENG MA ◽  
Xiao Zhang ◽  
Bowen Luo ◽  
Zhen Chen ◽  
Xuan He ◽  
...  

Abstract Background: Long noncoding RNAs (lncRNAs) play important roles in essential biological processes. However, our understanding of lncRNAs as competing endogenous RNAs (ceRNAs) and their responses to nitrogen stress is still limited.Results: Here, we surveyed the lncRNAs and miRNAs in maize inbred line P178 leaves and roots at the seedling stage under high-nitrogen and low-nitrogen conditions using lncRNA-Seq and small RNA-Seq. A total of 894 differentially expressed lncRNAs and 38 different miRNAs were identified. Co-expression analysis found two lncRNAs and four lncRNA-targets could competitively combine with ZmmiR159 and ZmmiR164, respectively. To dissect the genetic regulatory by which lncRNAs might enable adaptation to limited nitrogen availability. An association mapping panel containing a high-density single–nucleotide polymorphism (SNP) array (56,110 SNPs) combined with variable LN resistance-related phenotypes obtained from hydroponics was used for a genome-wide association study (GWAS). By combining GWAS and RNA-Seq, 170 differently expressed lncRNAs within the range of significant markers were screened. Moreover, 40 consistently LN-responsive genes including those involved in glutamine biosynthesis and nitrogen acquisition in root were identified. Transient expression assays in Nicotiana benthamiana demonstrated LNC_002923 could inhabit ZmmiR159-guided cleavage of Zm00001d015521. Conclusions: These lncRNAs containing trait-associated significant SNPs could consider to be related to root development and nutrient utilization. Taken together, the results of our study can provide new insights into the potential regulatory roles of lncRNAs in response to LN stress, and give valuable information for further screening of candidates as well as the improvement of maize regarding LN-responsive resistance.


2021 ◽  
Author(s):  
Aaron Chuah ◽  
Sean Li ◽  
Andrea Do ◽  
Matt A Field ◽  
T. Daniel Andrews

AbstractSummaryMissense mutations that change protein stability are strongly associated with human inherited genetic disease. With the recent availability of predicted structures for all human proteins generated using the AlphaFold2 prediction model, genome-wide assessment of the stability effects of genetic variation can, for the first time, be easily performed. This facilitates the interrogation of personal genetic variation for potentially pathogenic effects through the application of stability metrics. Here, we present a novel algorithm to prioritise variants predicted to strongly destabilise essential proteins, available as both a standalone software package and a web-based tool. We demonstrate the utility of this tool by showing that at values of the Stability Sort Z-score above 1.6, pathogenic, protein-destabilising variants from ClinVar are detected at a 58% enrichment, over and above the destabilising (but presumably non-pathogenic) variation already present in the HapMap NA12878 genome.Availability and ImplementationStabilitySort is available as both a web service (http://130.56.244.113/StabilitySort/) and can be deployed as a standalone system (https://gitlab.com/baaron/StabilitySort)[email protected]


2021 ◽  
Author(s):  
Nicolas Eugenie ◽  
Yvan Zivanovic ◽  
Gaelle Lelandais ◽  
Genevieve Coste ◽  
Claire Bouthier de la Tour ◽  
...  

Numerous genes are overexpressed in the radioresistant bacterium Deinococcus radiodurans after exposure to radiation or prolonged desiccation. The DdrO and IrrE proteins play a major role in regulating the expression of approximately predicted twenty of these genes. The transcriptional repressor DdrO blocks the expression of these genes under normal growth conditions. After exposure to genotoxic agents, the IrrE metalloprotease cleaves DdrO and relieves gene repression. Bioinformatic analyzes showed that this mechanism seems to be conserved in several species of Deinococcus, but many questions remain as such the number of genes regulated by DdrO. Here, by RNA-seq and CHiP-seq assays performed at a genome-wide scale coupled with bioinformatic analyses, we show that, the DdrO regulon in D. radiodurans includes many other genes than those previously described. These results thus pave the way to better understand the radioresistance mechanisms encoded by this bacterium.


2017 ◽  
Vol 1 (Special Issue) ◽  
pp. 14-14
Author(s):  
Disha Sharma ◽  
Paras Sehgal ◽  
Angom Ramcharan Singh ◽  
Shamsudheen Karuthedath Vellarikkal ◽  
Samatha Mathew ◽  
...  

Cancers ◽  
2019 ◽  
Vol 11 (10) ◽  
pp. 1473 ◽  
Author(s):  
Franz ◽  
Ralla ◽  
Weickmann ◽  
Jung ◽  
Rochow ◽  
...  

Circular RNAs (circRNAs) may act as novel cancer biomarkers. However, a genome-wide evaluation of circRNAs in clear cell renal cell carcinoma (ccRCC) has yet to be conducted. Therefore, the objective of this study was to identify and validate circRNAs in ccRCC tissue with a focus to evaluate their potential as prognostic biomarkers. A genome-wide identification of circRNAs in total RNA extracted from ccRCC tissue samples was performed using microarray analysis. Three relevant differentially expressed circRNAs were selected (circEGLN3, circNOX4, and circRHOBTB3), their circular nature was experimentally confirmed, and their expression—along with that of their linear counterparts—was measured in 99 malignant and 85 adjacent normal tissue samples using specifically established RT-qPCR assays. The capacity of circRNAs to discriminate between malignant and adjacent normal tissue samples and their prognostic potential (with the endpoints cancer-specific, recurrence-free, and overall survival) after surgery were estimated by C-statistics, Kaplan-Meier method, univariate and multivariate Cox regression analysis, decision curve analysis, and Akaike and Bayesian information criteria. CircEGLN3 discriminated malignant from normal tissue with 97% accuracy. We generated a prognostic for the three endpoints by multivariate Cox regression analysis that included circEGLN3, circRHOBT3 and linRHOBTB3. The predictive outcome accuracy of the clinical models based on clinicopathological factors was improved in combination with this circRNA-based signature. Bootstrapping as well as Akaike and Bayesian information criteria confirmed the statistical significance and robustness of the combined models. Limitations of this study include its retrospective nature and the lack of external validation. The study demonstrated the promising potential of circRNAs as diagnostic and particularly prognostic biomarkers in ccRCC patients.


Sign in / Sign up

Export Citation Format

Share Document