scholarly journals Transcriptome analysis provides a blueprint of coral egg and sperm functions

PeerJ ◽  
2020 ◽  
Vol 8 ◽  
pp. e9739
Author(s):  
Julia Van Etten ◽  
Alexander Shumaker ◽  
Tali Mass ◽  
Hollie M. Putnam ◽  
Debashish Bhattacharya

Background Reproductive biology and the evolutionary constraints acting on dispersal stages are poorly understood in many stony coral species. A key piece of missing information is egg and sperm gene expression. This is critical for broadcast spawning corals, such as our model, the Hawaiian species Montipora capitata, because eggs and sperm are exposed to environmental stressors during dispersal. Furthermore, parental effects such as transcriptome investment may provide a means for cross- or trans-generational plasticity and be apparent in egg and sperm transcriptome data. Methods Here, we analyzed M. capitata egg and sperm transcriptomic data to address three questions: (1) Which pathways and functions are actively transcribed in these gametes? (2) How does sperm and egg gene expression differ from adult tissues? (3) Does gene expression differ between these gametes? Results We show that egg and sperm display surprisingly similar levels of gene expression and overlapping functional enrichment patterns. These results may reflect similar environmental constraints faced by these motile gametes. We find significant differences in differential expression of egg vs. adult and sperm vs. adult RNA-seq data, in contrast to very few examples of differential expression when comparing egg vs. sperm transcriptomes. Lastly, using gene ontology and KEGG orthology data we show that both egg and sperm have markedly repressed transcription and translation machinery compared to the adult, suggesting a dependence on parental transcripts. We speculate that cell motility and calcium ion binding genes may be involved in gamete to gamete recognition in the water column and thus, fertilization.

2016 ◽  
Author(s):  
Huijuan Feng ◽  
Tingting Li ◽  
Xuegong Zhang

AbstractBackgroundAlternative splicing is a ubiquitous post-transcriptional process in most eukaryotic genes. Aberrant splicing isoforms and abnormal isoform ratios can contribute to cancer development. Kinase genes are key regulators of various cellular processes. Many kinases are found to be oncogenic and have been intensively investigated in the study of cancer and drugs. RNA-Seq provides a powerful technology for genome-wide study of alternative splicing in cancer besides the conventional gene expression profiling. But this potential has not been fully demonstrated yet.MethodsHere we characterized the transcriptome profile of prostate cancer using RNA-Seq data from viewpoints of both differential expression and differential splicing, with an emphasis on kinase genes and their splicing variations. We built up a pipeline to conduct differential expression and differential splicing analysis. Further functional enrichment analysis was performed to explore functional interpretation of the genes. With focus on kinase genes, we performed kinase domain analysis to identify the functionally important candidate kinase gene in prostate cancer. We further calculated the expression level of isoforms to explore the function of isoform switching of kinase genes in prostate cancer.ResultsWe identified distinct gene groups from differential expression and splicing analysis, which suggested that alternative splicing adds another level to gene expression regulation. Enriched GO terms of differentially expressed and spliced kinase genes were found to play different roles in regulation of cellular metabolism. Function analysis on differentially spliced kinase genes showed that differentially spliced exons of these genes are significantly enriched in protein kinase domains. Among them, we found that gene CDK5 has isoform switching between prostate cancer and benign tissues, which may affect cancer development by changing androgen receptor (AR) phosphorylation. The observation was validated in another RNA-Seq dataset of prostate cancer cell lines.ConclusionsOur work characterized the expression and splicing profile of kinase genes in prostate cancer and proposed a hypothetical model on isoform switching of CDK5 and AR phosphorylation in prostate cancer. These findings bring new understanding to the role of alternatively spliced kinases in prostate cancer and demonstrate the use of RNA-Seq data in studying alternative splicing in cancer.


2021 ◽  
Vol 15 (1) ◽  
Author(s):  
Weitong Cui ◽  
Huaru Xue ◽  
Lei Wei ◽  
Jinghua Jin ◽  
Xuewen Tian ◽  
...  

Abstract Background RNA sequencing (RNA-Seq) has been widely applied in oncology for monitoring transcriptome changes. However, the emerging problem that high variation of gene expression levels caused by tumor heterogeneity may affect the reproducibility of differential expression (DE) results has rarely been studied. Here, we investigated the reproducibility of DE results for any given number of biological replicates between 3 and 24 and explored why a great many differentially expressed genes (DEGs) were not reproducible. Results Our findings demonstrate that poor reproducibility of DE results exists not only for small sample sizes, but also for relatively large sample sizes. Quite a few of the DEGs detected are specific to the samples in use, rather than genuinely differentially expressed under different conditions. Poor reproducibility of DE results is mainly caused by high variation of gene expression levels for the same gene in different samples. Even though biological variation may account for much of the high variation of gene expression levels, the effect of outlier count data also needs to be treated seriously, as outlier data severely interfere with DE analysis. Conclusions High heterogeneity exists not only in tumor tissue samples of each cancer type studied, but also in normal samples. High heterogeneity leads to poor reproducibility of DEGs, undermining generalization of differential expression results. Therefore, it is necessary to use large sample sizes (at least 10 if possible) in RNA-Seq experimental designs to reduce the impact of biological variability and DE results should be interpreted cautiously unless soundly validated.


2020 ◽  
Vol 12 (8) ◽  
pp. 1277-1301
Author(s):  
Mark J Nolte ◽  
Peicheng Jing ◽  
Colin N Dewey ◽  
Bret A Payseur

Abstract Island populations repeatedly evolve extreme body sizes, but the genomic basis of this pattern remains largely unknown. To understand how organisms on islands evolve gigantism, we compared genome-wide patterns of gene expression in Gough Island mice, the largest wild house mice in the world, and mainland mice from the WSB/EiJ wild-derived inbred strain. We used RNA-seq to quantify differential gene expression in three key metabolic organs: gonadal adipose depot, hypothalamus, and liver. Between 4,000 and 8,800 genes were significantly differentially expressed across the evaluated organs, representing between 20% and 50% of detected transcripts, with 20% or more of differentially expressed transcripts in each organ exhibiting expression fold changes of at least 2×. A minimum of 73 candidate genes for extreme size evolution, including Irs1 and Lrp1, were identified by considering differential expression jointly with other data sets: 1) genomic positions of published quantitative trait loci for body weight and growth rate, 2) whole-genome sequencing of 16 wild-caught Gough Island mice that revealed fixed single-nucleotide differences between the strains, and 3) publicly available tissue-specific regulatory elements. Additionally, patterns of differential expression across three time points in the liver revealed that Arid5b potentially regulates hundreds of genes. Functional enrichment analyses pointed to cell cycling, mitochondrial function, signaling pathways, inflammatory response, and nutrient metabolism as potential causes of weight accumulation in Gough Island mice. Collectively, our results indicate that extensive gene regulatory evolution in metabolic organs accompanied the rapid evolution of gigantism during the short time house mice have inhabited Gough Island.


2017 ◽  
Author(s):  
Jóhannes Guðbrandsson ◽  
Sigríður Rut Franzdóttir ◽  
Bjarni Kristófer Kristjánsson ◽  
Ehsan Pashay Ahi ◽  
Valerie Helene Maier ◽  
...  

Phenotypic differences between closely related taxa or populations can arise through genetic variation or be environmentally induced, in both cases leading to altered transcription of genes during the structural and functional development of the body. Comparative developmental studies of closely related species or variable populations of the same species can help to elucidate the molecular mechanisms related to population divergence and speciation. Studies of Arctic charr (Salvelinus alpinus) and related salmonids have revealed considerable phenotypic variation among populations and in Arctic charr many cases of extensive variation within lakes (resource polymorphism) have been recorded. One example is the four Arctic charr morphs in the ~10.000 year old Lake Thingvallavatn, which differ in numerous morphological and life history traits. We set out to investigate the molecular and developmental roots of this polymorphism by studying gene expression in embryos of three of the morphs reared in a common garden set-up. We performed RNA-sequencing, de-novo transcriptome assembly and compared gene expression among morphs during a timeframe in early development. Expectedly, developmental time was the predominant explanatory variable. As the data were affected by RNA-degradation, an estimate of 3’-bias was the second most common explanatory variable. Morph, both as a independent variable and as interaction with developmental time, affected the expression of numerous transcripts. The majority of transcripts with significant morph effects separated the limnetic and the benthic morphs. However, gene ontology analyses did not reveal clear functional enrichment of transcripts between groups. Verification via qPCR confirmed differential expression of several genes between the morphs, including regulatory genes such as Arid4a and Tsn. The data are consistent with a scenario where genetic divergence has contributed to differential expression of multiple genes and systems during early development of these sympatric Arctic charr morphs.


Blood ◽  
2014 ◽  
Vol 124 (21) ◽  
pp. 1894-1894
Author(s):  
Hogune Im ◽  
Varsha Rao ◽  
Kunju Joshi Sridhar ◽  
Rui Chen ◽  
George Mias ◽  
...  

Abstract Background: Prior studies using microarray platforms have shown alterations of gene expression profiles (GEPs) in MDS CD34+ marrow cells related to clinical outcomes (Sridhar et al, Blood 2009, Pellagatti et al, JCO 2013). Given the increased sensitivity and accuracy of high-throughput RNA sequencing (RNA-Seq) (Mortazavi et al, Nat Meth 2008, Soon et al, Mol Syst Bio 2012) for detecting and quantifying mRNA transcripts, we applied this methodology for evaluating differential gene expression between MDS and normal CD34+ marrow cells. Methods:RNA was isolated from magnetic bead affinity-enriched CD34+ (>90%) marrow aspirate cells (Miltenyi Biotec, Auburn, CA) and amplified using the Smarter Kit (Clontech, Mt View, CA). The amplified product (ds DNA) was fragmented to a size distribution of ~200-300bp using the E220 Focused Ultrasonicator (Covaris Inc, Woburn, MA). End repair, adapter ligation and PCR amplification were performed using the NEBNext Ultra RNA library prep kit for Illumina (New England Biolabs, Ipswich, MA). The indexed cDNA libraries were sequenced (paired end, 100bp) on an Illumina HiSeq2000 platform with median read counts of 69 million. The sequences were aligned to Human Reference sequence hg19 using DNAnexus mapper with gene detection focused on known annotated genes. The differential expression was analyzed using edgeR. DAVID and Ingenuity IPA programs were used for pathway analyses. Gene Set Enrichment Analysis (GSEA) was used to identify biologic processes in our dataset present across phenotypes. Results: Correlations of RNA-Seq data from unamplified to amplified transcripts demonstrated high fidelity of transcripts obtained (Pearson and Spearman R2 = 0.80). After filtering samples for adequate read counts, 12,323 genes were evaluated. Differential expression analysis yielded 719 differentially expressed genes (DEGs) in MDS (n=30) vs normal (n=21) with FDR <.05. Among the DEGs, 548 and 171 were over- and under-expressed ≥2 fold in MDS vs Normal, respectively: 20% of the overexpressed genes were present in >50% of the patients. Hierarchical cluster analysis using these DEGs confirmed clear separation of MDS patients from normals, with 2 differential expression clusters—one region overexpressed and one underexpressed. A distinctive trend toward clustering of the patients was seen which related to their IPSS categories and marrow blast %. In functional pathway analysis of the 2 distinctive gene clusters which distinguished MDS from normal, the underexpressed MDS DEGs demonstrated enrichment of inflammatory cytokines, oxidative stress and interleukin signaling pathways, plus mitochondrial calcium transport; whereas the MDS overexpressed DEG cluster showed enrichment of adherens junction/cytokeletal remodeling, cell cycle control of chromosome replication and DNA damage response pathways. Using GSEA analysis, significantly increased numbers of genes in MDS vs normal, common to those in gene sets present within curated public databases, were involved with TP53 targets and mTOR signaling pathways. Conclusions: Our study demonstrated that RNA-Seq methodology, a high-throughput and more comprehensive technique than most gene expression microarrays, was capable of showing significant and distinctive differences in gene expression between MDS and normal marrow CD34+ cells. Specific clustering of the DEGs was demonstrated to distinguish patient subsets associated with their major clinical features. Further, the stringently identified DEGs shown to be engaged in functional pathways and biologic processes highly relevant for MDS were extant within the patients’ CD34+ cells. These transcriptomic data provide information complementary to exomic mutational findings contributing to improved understanding of biologic mechanisms underlying MDS. Disclosures No relevant conflicts of interest to declare.


2017 ◽  
Author(s):  
Jóhannes Guðbrandsson ◽  
Sigríður Rut Franzdóttir ◽  
Bjarni Kristófer Kristjánsson ◽  
Ehsan Pashay Ahi ◽  
Valerie Helene Maier ◽  
...  

Phenotypic differences between closely related taxa or populations can arise through genetic variation or be environmentally induced, in both cases leading to altered transcription of genes during the structural and functional development of the body. Comparative developmental studies of closely related species or variable populations of the same species can help to elucidate the molecular mechanisms related to population divergence and speciation. Studies of Arctic charr (Salvelinus alpinus) and related salmonids have revealed considerable phenotypic variation among populations and in Arctic charr many cases of extensive variation within lakes (resource polymorphism) have been recorded. One example is the four Arctic charr morphs in the ~10.000 year old Lake Thingvallavatn, which differ in numerous morphological and life history traits. We set out to investigate the molecular and developmental roots of this polymorphism by studying gene expression in embryos of three of the morphs reared in a common garden set-up. We performed RNA-sequencing, de-novo transcriptome assembly and compared gene expression among morphs during a timeframe in early development. Expectedly, developmental time was the predominant explanatory variable. As the data were affected by RNA-degradation, an estimate of 3’-bias was the second most common explanatory variable. Morph, both as a independent variable and as interaction with developmental time, affected the expression of numerous transcripts. The majority of transcripts with significant morph effects separated the limnetic and the benthic morphs. However, gene ontology analyses did not reveal clear functional enrichment of transcripts between groups. Verification via qPCR confirmed differential expression of several genes between the morphs, including regulatory genes such as Arid4a and Tsn. The data are consistent with a scenario where genetic divergence has contributed to differential expression of multiple genes and systems during early development of these sympatric Arctic charr morphs.


2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Federico Marini ◽  
Annekathrin Ludt ◽  
Jan Linke ◽  
Konstantin Strauch

Abstract Background The interpretation of results from transcriptome profiling experiments via RNA sequencing (RNA-seq) can be a complex task, where the essential information is distributed among different tabular and list formats—normalized expression values, results from differential expression analysis, and results from functional enrichment analyses. A number of tools and databases are widely used for the purpose of identification of relevant functional patterns, yet often their contextualization within the data and results at hand is not straightforward, especially if these analytic components are not combined together efficiently. Results We developed the software package, which serves as a comprehensive toolkit for streamlining the interpretation of functional enrichment analyses, by fully leveraging the information of expression values in a differential expression context. is implemented in R and Shiny, leveraging packages that enable HTML-based interactive visualizations for executing drilldown tasks seamlessly, viewing the data at a level of increased detail. is integrated with the core classes of existing Bioconductor workflows, and can accept the output of many widely used tools for pathway analysis, making this approach applicable to a wide range of use cases. Users can effectively navigate interlinked components (otherwise available as flat text or spreadsheet tables), bookmark features of interest during the exploration sessions, and obtain at the end a tailored HTML report, thus combining the benefits of both interactivity and reproducibility. Conclusion is distributed as an R package in the Bioconductor project (https://bioconductor.org/packages/GeneTonic/) under the MIT license. Offering both bird’s-eye views of the components of transcriptome data analysis and the detailed inspection of single genes, individual signatures, and their relationships, aims at simplifying the process of interpretation of complex and compelling RNA-seq datasets for many researchers with different expertise profiles.


Blood ◽  
2019 ◽  
Vol 134 (Supplement_1) ◽  
pp. 1237-1237 ◽  
Author(s):  
Shalini Sankar ◽  
Miriam Guillen Navarro ◽  
Frida Ponthan ◽  
Simon Bomken ◽  
Sirintra Nakjang ◽  
...  

To identify potential regulators of propagation and self-renewal of Acute Lymphoblastic Leukaemia (ALL), we performed an explorative genome-wide RNAi screen followed by CRISPR ex vivo and in vivo validation screens in the t(4;11)-positive ALL cell line SEM. These screens identified the splicing factor PHF5A as a crucial component of the leukemic program. PHF5A is a subunit of the SF3b protein complex, which directs alternative splicing by binding to the branchpoint of pre-mRNA. Mutations in members of this complex including SF3B1 have been implicated in several haematological malignancies. Functional perturbation experiments demonstrated that PHF5A depletion impairs proliferation, viability and clonogenicity in a range of ALL and AML cell lines strongly suggesting that PHF5A is required for leukemic propagation and self-renewal. To identify genetic programs affected by PHF5A inhibition, we performed RNA-seq followed by analysis of differential gene expression and splicing events. We identified 473 genes with differential expression upon PHF5A knockdown. In addition, we performed in-depth analysis of splicing patterns by examining both differential exon/intron usage and exon junction formation. These analyses demonstrated that loss of PHF5A affects splicing of more than 2500 genes with exon skipping and intron retention being the most frequent splicing events. In order to identify processes and pathways affected by PHF5A, we performed gene set enrichment analysis (GSEA) on both differential expression and splicing. While gene sets associated with RNA processing including splicing, turnover and translation were enriched in both data sets, the differential gene expression signature was also linked to DNA repair processes including base excision, mismatch and homologous recombination repair. In line with these findings, knockdown of either PHF5A or its partner protein SF3B1 induced DNA strand breaks as indicated by comet assay and increased y-H2AX levels. Furthermore, both PHF5A and SF3B1 depletion sensitized ALL cells towards the DNA crosslinking agent mitomycin C. Closer inspection of RNA-seq datasets revealed reduced FANCD2 expression and skipping of exon 22 associated with impaired mono-ubiquitination of the FANCD2 protein as a consequence of PHF5A and SF3B1 knockdown. Furthermore, expression of RAD51, a key component of double strand break repair, also decreased upon PHF5A and SF3B1 knockdown. Notably, in vitro pharmacological inhibition of SF3b complex activity using H3B-8800 (or Pladienolide B) showed a very similar effect on FANCD2 expression, and ubiquitination as well as decrease of RAD51 and an increase in y-H2AX levels on a dose and time-dependent manner. This strongly suggests a mechanistic link between impaired RNA splicing and the repair of DNA double-strand breaks. These combined data show that leukemic cells are highly dependent on a functional SF3b splicing complex. Interference with its function results in DNA damage and also sensitizes towards DNA damaging agents pointing towards a possible benefit of the combined application of inhibitors targeting the SF3b complex with more conventional chemotherapy. Disclosures Ponthan: Epistem Ltd: Employment. Zwaan:Sanofi: Consultancy; Incyte: Consultancy; BMS: Research Funding; Roche: Consultancy; Janssen: Consultancy; Daiichi Sankyo: Consultancy; Servier: Consultancy; Jazz Pharmaceuticals: Other: Travel support; Pfizer: Research Funding; Celgene: Consultancy, Research Funding. Vormoor:Abbvie (uncompensated): Consultancy, Honoraria; Novartis: Consultancy, Honoraria; Amgen: Consultancy, Honoraria; Roche/Genentech: Consultancy, Honoraria, Research Funding; AstraZeneca: Research Funding.


2020 ◽  
Author(s):  
Thomas J. Hall ◽  
Michael P. Mullen ◽  
Gillian P. McHugo ◽  
Kate E. Killick ◽  
Siobhán C. Ring ◽  
...  

Abstract BackgroundBovine TB (BTB), caused by infection with Mycobacterium bovis, is a major endemic disease affecting global cattle production, particularly in many developing countries. The key innate immune that first encounters the pathogen is the alveolar macrophage, previously shown to be substantially reprogrammed during intracellular infection by the pathogen. Here we use differential expression, and correlation- and interaction-based network approaches to analyse the host response to infection with M. bovis at the transcriptome level to identify core infection response pathways and gene modules. These outputs were then integrated with genome-wide association study (GWAS) data sets to enhance detection of genomic variants for susceptibility/resistance to M. bovis infection.ResultsThe host gene expression data consisted of bovine RNA-seq data from alveolar macrophages infected with M. bovis at 24 and 48 hours post-infection. These RNA-seq data were analysed using three distinct analysis pipelines and novel response pathways and modules were further refined using cross-comparison and integration of the results. First, a differential expression analysis was carried out to determine the most significantly differentially expressed (DE) genes between conditions at each time point. Second, two networks were constructed at each time point using gene correlation patterns to determine changes in expression across conditions. Functional sub-modules within each correlation network were selected by statistical criteria for modularity. Third, a base gene interaction network of the mammalian host response to mycobacterial infection was generated using the GeneCards database and InnateDB. Differential gene expression data were superimposed on this base network to extract functional modules of interconnected DE genes.ConclusionsBovine GWAS data was obtained from a published BTB susceptibility/resistance study. The results from the three parallel analyses were integrated with this data to determine which of the three approaches identified genes significantly enriched for SNPs associated with susceptibility/resistance to M. bovis infection. Results indicate distinct and significant overlap in SNP discovery, demonstrating that network-based integration of biologically relevant transcriptomics data can leverage substantial additional information from GWAS data sets.


Author(s):  
Dionysios Fanidis ◽  
Panagiotis Moulos

Abstract The study of differential gene expression patterns through RNA-Seq comprises a routine task in the daily lives of molecular bioscientists, who produce vast amounts of data requiring proper management and analysis. Despite widespread use, there are still no widely accepted golden standards for the normalization and statistical analysis of RNA-Seq data, and critical biases, such as gene lengths and problems in the detection of certain types of molecules, remain largely unaddressed. Stimulated by these unmet needs and the lack of in-depth research into the potential of combinatorial methods to enhance the analysis of differential gene expression, we had previously introduced the PANDORA P-value combination algorithm while presenting evidence for PANDORA’s superior performance in optimizing the tradeoff between precision and sensitivity. In this article, we present the next generation of the algorithm along with a more in-depth investigation of its capabilities to effectively analyze RNA-Seq data. In particular, we show that PANDORA-reported lists of differentially expressed genes are unaffected by biases introduced by different normalization methods, while, at the same time, they comprise a reliable input option for downstream pathway analysis. Additionally, PANDORA outperforms other methods in detecting differential expression patterns in certain transcript types, including long non-coding RNAs.


Sign in / Sign up

Export Citation Format

Share Document