scholarly journals iRAP - an integrated RNA-seq Analysis Pipeline

2014 ◽  
Author(s):  
Nuno A. Fonseca ◽  
Robert Petryszak ◽  
John Marioni ◽  
Alvis Brazma

RNA-sequencing (RNA-Seq) has become the technology of choice for whole-transcriptome profiling. However, processing the millions of sequence reads generated requires considerable bioinformatics skills and computational resources. At each step of the processing pipeline many tools are available, each with specific advantages and disadvantages. While using a specific combination of tools might be desirable, integrating the different tools can be time consuming, often due to specificities in the formats of input/output files required by the different programs. Here we present iRAP, an integrated RNA-seq analysis pipeline that allows the user to select and apply their preferred combination of existing tools for mapping reads, quantifying expression, testing for differential expression. iRAP also includes multiple tools for gene set enrichment analysis and generates web browsable reports of the results obtained in the different stages of the pipeline. Depending upon the application, iRAP can be used to quantify expression at the gene, exon or transcript level. iRAP is aimed at a broad group of users with basic bioinformatics training and requires little experience with the command line. Despite this, it also provides more advanced users with the ability to customise the options used by their chosen tools.

Cancers ◽  
2021 ◽  
Vol 13 (15) ◽  
pp. 3683
Author(s):  
Kazuko Sakai ◽  
Marco A. De Velasco ◽  
Yurie Kura ◽  
Kazuto Nishio

Colitis is a risk factor for colorectal cancer (CRC) and can change the dynamics of gut microbiota, leading to dysbiosis and contributing to carcinogenesis. The functional interactions between colitis-associated CRC and microbiota remain unknown. In this study, colitis and CRC were induced in BALB/c mice by the administration of dextran sodium sulfate (DSS) and/or azoxymethane (AOM). Whole transcriptome profiling of normal colon was then performed, and gene set enrichment analysis (GSEA) revealed enriched fatty acid metabolism, oxidative phosphorylation, and PI3K-Akt-mTOR signaling in the tissues from DSS/AOM mice. Additionally, immunohistochemical staining showed increased expression levels of phosphorylated S6 ribosomal protein, a downstream target of the PI3K-Akt-mTOR pathway in the inflamed mucosa of DSS/AOM mice. Fecal microbes were characterized using 16S rDNA gene sequencing. Redundancy analysis demonstrated a significant dissimilarity between the DSS/AOM group and the others. Functional analysis inferred from microbial composition showed enrichments of the sphingolipid signal and lipoarabinomannan biosynthetic pathways. This study provides additional insights into alterations associated with DSS/AOM-induced colitis and associates PI3K-Akt-mTOR, sphingolipid-signaling and lipoarabinomannan biosynthetic pathways in mouse DSS/AOM-induced colitis.


Genes ◽  
2020 ◽  
Vol 11 (4) ◽  
pp. 412
Author(s):  
Zhe Zhang ◽  
Zizengchen Wang ◽  
Yanna Dang ◽  
Jinyang Wang ◽  
Sakthidasan Jayaprakash ◽  
...  

The newly identified liver-enriched gene 1 (LEG1) encodes a protein with a characteristic domain of unknown function 781 (DUF781/LEG1), constituting a protein family with only one member in mammals. A functional study in zebrafish suggested that LEG1 genes are involved in liver development, while the platypus LEG1 homolog, Monotreme Lactation Protein (MLP), which is enriched in the mammary gland and milk, acts as an antibacterial substance. However, no functional studies on eutherian LEG1s have been published to date. Thus, we here report the first functional prediction study at the cellular level. As previously reported, eutherian LEG1s can be classified into three paralogous groups. Pigs have all three LEG1 genes (pLEG1s), while humans and mice have retained only LEG1a. Hence, pLEG1s might represent an ideal model for studying LEG1 gene functions. RNA-seq was performed by the overexpression of pLEG1s and platypus MLP in HepG2 cells. Enrichment analysis showed that pLEG1a and pLEG1b might exhibit little function in liver cells; however, pLEG1c is probably involved in the endoplasmic reticulum (ER) stress response and protein folding. Additionally, gene set enrichment analysis revealed that platypus MLP shows antibacterial activity, confirming the functional study in platypus. Therefore, our study showed from the transcriptomic perspective that mammalian LEG1s have different functions in liver cells due to the subfunctionalization of paralogous genes.


2019 ◽  
Vol 37 (7_suppl) ◽  
pp. 146-146
Author(s):  
William S. Chen ◽  
Rahul Raj Aggarwal ◽  
Li Zhang ◽  
Shuang Zhao ◽  
Tomasz M. Beer ◽  
...  

146 Background: Metastatic castration-resistant prostate cancer (mCRPC) is the lethal form of the disease. Several recent efforts have identified genomic alterations in mCRPC, but the clinical implications of these alterations have not been fully elucidated. We conducted a prospective cohort study (n = 101) using whole genome sequencing (WGS) to analyze the association between key driver gene alterations and overall survival. We also performed whole-transcriptome RNA sequencing (RNA-seq) analyses to identify potential mechanisms of enzalutamide resistance in mCRPC. Methods: Metastasis biopsies were obtained in 101 mCRPC patients as part of the multi-institutional West Coast Prostate Cancer Dream Team project. Samples underwent WGS and RNA-seq. The resulting mutation, copy number, and structural variant calls were integrated to determine functional copy number status of candidate genes for downstream clinical analyses. We performed univariate and multivariable analyses to assess the prognostic significance of candidate genomic events with respect to overall survival. To nominate and investigate genomic pathways associated with enzalutamide resistance, we performed expression-based gene set enrichment analysis followed by cross-sectional enrichment and survival analyses related to the top nominated pathway. Results: RB1 loss was associated with poor overall survival (median 14.1 vs. 42.0 months, p < 0.001). When we compared enzalutamide resistant versus naïve samples using gene set enrichment analysis, we identified the Wnt/beta-catenin pathway as the top differentially expressed pathway in enzalutamide-resistant patients. Furthermore, CTNNB1 (beta-catenin) activating mutations were exclusive to enzalutamide-resistant patients (p = 0.013) and predictive of poor overall survival (median 13.6 vs. 41.7 months, p < 0.001). Conclusions: Impaired survival in mCRPC patients is associated with RB1 loss, identified by integrated genomic analysis of CRPC metastasis biopsies. Among men with mCRPC that was enzalutamide-resistant, the Wnt/beta-catenin pathway is nominated as an important predictive (and potentially therapeutic) pathway.


Author(s):  
Hajime Takase ◽  
Gen Hamanaka ◽  
Ryo Ohtomo ◽  
Hidehiro Ishikawa ◽  
Kelly K. Chung ◽  
...  

White matter damage caused by cerebral hypoperfusion is a major hallmark of subcortical ischemic vascular dementia (SIVD), which is the most common subtype of vascular cognitive impairment and dementia (VCID) syndrome. In an aging society, the number of SIVD patients is expected to increase; however, effective therapies have yet to be developed. To understand the pathological mechanisms, we analyzed the profiles of the cells of the corpus callosum after cerebral hypoperfusion in a preclinical SIVD model. We prepared cerebral hypoperfused mice by subjecting 2-month old male C57BL/6J mice to bilateral carotid artery stenosis (BCAS) operation. BCAS-hypoperfusion mice exhibited cognitive deficits at 4 weeks after cerebral hypoperfusion, assessed by novel object recognition test. RNA samples from the corpus callosum region of sham- or BCAS-operated mice were then processed using RNA sequencing. A gene set enrichment analysis using differentially expressed genes between sham and BCAS-operated mice showed activation of oligodendrogenesis pathways along with angiogenic responses. This database of transcriptomic profiles of BCAS-hypoperfusion mice will be useful for future studies to find a therapeutic target for SIVD.


2020 ◽  
Author(s):  
Xiao-Han Cui ◽  
Qiu-Ju Peng ◽  
Peng Gao ◽  
Xu-Dong Zhang ◽  
Ren-Zhi Li ◽  
...  

Abstract Background: Cancer is one of the most common causes of death, and the morbidity and mortality are gradually increasing in the world. KIF20A plays an important role in tumors, but its immune relevance in pan-cancer needs to be further studied.Methods: KIF20A-related information was download from The Cancer Genome Atlas (TCGA). Collecting RNA-seq data is fragments per kilobase million (FPKM) style data. The ESTIMATE algorithm was used for estimating the stromal and immune scores for 33 tumors. Then, we analyzed the correlation between KIF20A in pan-cancer and immune checkpoints and performed gene set enrichment analysis (GSEA) analysis on the co-expressed genes of KIF20A in pan-cancer.Results: We have confirmed that the expression of KIF20A has a intensive correlation with prognosis in 33 kinds of tumors. Its expression of KIF20A was related to a variety of immune cells and immune checkpoints. Based on the results of GSEA for further analysis, in multiple tumors, KIF20A is related to immune-related pathways.Conclusion: We have demonstrated that KIF20A played an important role in pan-cancer and could affect the occurrence or development of a variety of tumors. Moreover, KIF20A was related to immunity, and KIF20A- related immune research in pan-cancer also needs to be further demonstrate.


2019 ◽  
Author(s):  
Ludwig Geistlinger ◽  
Gergely Csaba ◽  
Mara Santarelli ◽  
Marcel Ramos ◽  
Lucas Schiffer ◽  
...  

AbstractBackgroundAlthough gene set enrichment analysis has become an integral part of high-throughput gene expression data analysis, the assessment of enrichment methods remains rudimentary and ad hoc. In the absence of suitable gold standards, evaluations are commonly restricted to selected data sets and biological reasoning on the relevance of resulting enriched gene sets. However, this is typically incomplete and biased towards the goals of individual investigations.ResultsWe present a general framework for standardized and structured benchmarking of enrichment methods based on defined criteria for applicability, gene set prioritization, and detection of relevant processes. This framework incorporates a curated compendium of 75 expression data sets investigating 42 different human diseases. The compendium features microarray and RNA-seq measurements, and each dataset is associated with a precompiled GO/KEGG relevance ranking for the corresponding disease under investigation. We perform a comprehensive assessment of 10 major enrichment methods on the benchmark compendium, identifying significant differences in (i) runtime and applicability to RNA-seq data, (ii) fraction of enriched gene sets depending on the type of null hypothesis tested, and (iii) recovery of the a priori defined relevance rankings. Based on these findings, we make practical recommendations on (i) how methods originally developed for microarray data can efficiently be applied to RNA-seq data, (ii) how to interpret results depending on the type of gene set test conducted, and (iii) which methods are best suited to effectively prioritize gene sets with high relevance for the phenotype investigated.ConclusionWe carried out a systematic assessment of existing enrichment methods, and identified best performing methods, but also general shortcomings in how gene set analysis is currently conducted. We provide a directly executable benchmark system for straightforward assessment of additional enrichment methods.Availabilityhttp://bioconductor.org/packages/GSEABenchmarkeR


2020 ◽  
Vol 64 (4-5-6) ◽  
pp. 331-341
Author(s):  
Shainy Ojha ◽  
Madhu G. Tapadia

Drosophila metamorphosis is associated with substantial metabolic activity involving cell death and cell proliferation leading to differentiation of adult tissues and structures. Unlike other larval tissues, Malpighian tubules (MTs) exhibit apoptotic immunity and do not undergo cell death but are carried over to the adult with some cell reorganisation. They persist despite the fact that they express apoptotic proteins and caspases. In the present study, we analysed the global transcription changes in MTs and compared with salivary glands, to decipher the biology of MTs. Gene set enrichment analysis indicated reduced expression of many ecdysone induced genes, including the critical regulator of cell death, E93 in MTs. We hypothesize that reduction of E93 could be because of over expression of ecdysone oxidase, which is high in MTs and is responsible for regulation of hormone titer by degradation of ecdysone. Ectopic expression of E93 in MTs results in cell death through autophagy. Fork head, which is crucial for survival, is enriched in the MT transcriptome, and its down regulation in MTs could be consequent to over expression of E93. Together our data suggests that the cascade of events initiated by ecdysone mediates survival of MTs through concerted action of multiple factors.


PLoS ONE ◽  
2016 ◽  
Vol 11 (11) ◽  
pp. e0165919 ◽  
Author(s):  
Sora Yoon ◽  
Seon-Young Kim ◽  
Dougu Nam

Author(s):  
Ludwig Geistlinger ◽  
Gergely Csaba ◽  
Mara Santarelli ◽  
Marcel Ramos ◽  
Lucas Schiffer ◽  
...  

Abstract Motivation Although gene set enrichment analysis has become an integral part of high-throughput gene expression data analysis, the assessment of enrichment methods remains rudimentary and ad hoc. In the absence of suitable gold standards, evaluations are commonly restricted to selected datasets and biological reasoning on the relevance of resulting enriched gene sets. Results We develop an extensible framework for reproducible benchmarking of enrichment methods based on defined criteria for applicability, gene set prioritization and detection of relevant processes. This framework incorporates a curated compendium of 75 expression datasets investigating 42 human diseases. The compendium features microarray and RNA-seq measurements, and each dataset is associated with a precompiled GO/KEGG relevance ranking for the corresponding disease under investigation. We perform a comprehensive assessment of 10 major enrichment methods, identifying significant differences in runtime and applicability to RNA-seq data, fraction of enriched gene sets depending on the null hypothesis tested and recovery of the predefined relevance rankings. We make practical recommendations on how methods originally developed for microarray data can efficiently be applied to RNA-seq data, how to interpret results depending on the type of gene set test conducted and which methods are best suited to effectively prioritize gene sets with high phenotype relevance. Availability http://bioconductor.org/packages/GSEABenchmarkeR Contact [email protected]


Sign in / Sign up

Export Citation Format

Share Document