scholarly journals Bayesian Framework for Detecting Gene Expression Outliers in Individual Samples

2020 ◽  
pp. 160-170
Author(s):  
John Vivian ◽  
Jordan M. Eizenga ◽  
Holly C. Beale ◽  
Olena M. Vaske ◽  
Benedict Paten

PURPOSE Many antineoplastics are designed to target upregulated genes, but quantifying upregulation in a single patient sample requires an appropriate set of samples for comparison. In cancer, the most natural comparison set is unaffected samples from the matching tissue, but there are often too few available unaffected samples to overcome high intersample variance. Moreover, some cancer samples have misidentified tissues of origin or even composite-tissue phenotypes. Even if an appropriate comparison set can be identified, most differential expression tools are not designed to accommodate comparisons to a single patient sample. METHODS We propose a Bayesian statistical framework for gene expression outlier detection in single samples. Our method uses all available data to produce a consensus background distribution for each gene of interest without requiring the researcher to manually select a comparison set. The consensus distribution can then be used to quantify over- and underexpression. RESULTS We demonstrate this method on both simulated and real gene expression data. We show that it can robustly quantify overexpression, even when the set of comparison samples lacks ideally matched tissue samples. Furthermore, our results show that the method can identify appropriate comparison sets from samples of mixed lineage and rediscover numerous known gene-cancer expression patterns. CONCLUSION This exploratory method is suitable for identifying expression outliers from comparative RNA sequencing (RNA-seq) analysis for individual samples, and Treehouse, a pediatric precision medicine group that leverages RNA-seq to identify potential therapeutic leads for patients, plans to explore this method for processing its pediatric cohort.

2019 ◽  
Author(s):  
John Vivian ◽  
Jordan Eizenga ◽  
Holly C. Beale ◽  
Olena Morozova-Vaske ◽  
Benedict Paten

ABSTRACTObjectiveMany antineoplastics are designed to target upregulated genes, but quantifying upregulation in a single patient sample requires an appropriate set of samples for comparison. In cancer, the most natural comparison set is unaffected samples from the matching tissue, but there are often too few available unaffected samples to overcome high inter-sample variance. Moreover, some cancer samples have misidentified tissues or origin, or even composite-tissue phenotypes. Even if an appropriate comparison set can be identified, most differential expression tools are not designed to accommodate comparing to a single patient sample.Materials and MethodsWe propose a Bayesian statistical framework for gene expression outlier detection in single samples. Our method uses all available data to produce a consensus background distribution for each gene of interest without requiring the researcher to manually select a comparison set. The consensus distribution can then be used to quantify over- and under-expression.ResultsWe demonstrate this method on both simulated and real gene expression data. We show that it can robustly quantify overexpression, even when the set of comparison samples lacks ideally matched tissues samples. Further, our results show that the method can identify appropriate comparison sets from samples of mixed lineage and rediscover numerous known gene-cancer expression patterns.ConclusionsThis exploratory method is suitable for identifying expression outliers from comparative RNA-seq analysis for individual samples and Treehouse, a pediatric precision medicine group that leverages RNA-seq to identify potential therapeutic leads for patients, plans to explore this method for processing their pediatric cohort.


Foods ◽  
2021 ◽  
Vol 10 (2) ◽  
pp. 360
Author(s):  
Guodong Rao ◽  
Jianguo Zhang ◽  
Xiaoxia Liu ◽  
Xue Li ◽  
Chenhe Wang

Olive oil has been favored as high-quality edible oil because it contains balanced fatty acids (FAs) and high levels of minor components. The contents of FAs and minor components are variable in olive fruits of different color at harvest time, which render it difficult to determine the optimal harvest strategy for olive oil producing. Here, we combined metabolome, Pacbio Iso-seq, and Illumina RNA-seq transcriptome to investigate the association between metabolites and gene expression of olive fruits at harvest time. A total of 34 FAs, 12 minor components, and 181 other metabolites (including organic acids, polyols, amino acids, and sugars) were identified in this study. Moreover, we proposed optimal olive harvesting strategy models based on different production purposes. In addition, we used the combined Pacbio Iso-seq and Illumina RNA-seq gene expression data to identify genes related to the biosynthetic pathways of hydroxytyrosol and oleuropein. These data lay the foundation for future investigations of olive fruit metabolism and gene expression patterns, and provide a method to obtain olive harvesting strategies for different production purposes.


Author(s):  
VG LeBlanc ◽  
D Trinh ◽  
M Hughes ◽  
I Luthra ◽  
D Livingstone ◽  
...  

Glioblastomas (GBMs) account for nearly half of all primary malignant brain tumours, and current therapies are often only marginally effective. Our understanding of the underlying biology of these tumours and the development of new therapies have been complicated in part by widespread inter- and intratumoural heterogeneity. To characterize this heterogeneity, we performed regional subsampling of primary glioblastomas and derived organoids from these tissue samples. We then performed single-cell RNA-sequencing (scRNA-seq) on these primary regional subsamples and 1-3 matched organoids per sample. We have profiled samples from six tumour sets to date and have obtained sequencing data for 21,234 primary tissue cells and 14,742 organoid cells. While the most apparent differences in gene expression appear to be between individual tumours, we were also able to identify similar cellular subpopulations across tissue samples and across organoids. Importantly, organoids derived from the same tissue sample appeared to be composed of similar cellular subpopulations and were highly comparable to each other, indicating that replicate organoids faithfully represent the original tumour tissue. Overall, our scRNA-seq approach will help evaluate the utility of tumour-derived organoids as model systems for GBM and will aid in identifying cellular subpopulations defined by gene expression patterns, both in primary GBM regional subsamples and their associated organoids. These analyses will allow for the characterization of clonal or subclonal populations that are likely to respond to different therapeutic approaches and may also uncover novel therapeutic targets previously unrevealed through bulk analyses.


2019 ◽  
Vol 37 (15_suppl) ◽  
pp. e13533-e13533
Author(s):  
Ella L Kim ◽  
Anton Buzdin ◽  
Maxim Sorokin ◽  
Elena Poddubskaya ◽  
Artem Poddubskiy ◽  
...  

e13533 Background: This study developed molecular guided tools for individualized selection of chemotherapeutics for recurrent glioblastoma (rGB). A consortium involving clinical neurooncologists, molecular biologists and bioinformaticians identified gene expression patterns in rGB and quantitatively analyzed pathways involved in response to FDA approved oncodrugs. Methods: From2016 to 2018 biopsies from GB were collected using a multisampling approach. Biopsy material was used to isolate glioma stem-like cells and examined by RNA-sequencing. RNA-seq results were subjected to differential expression (DE) analysis and Oncobox analysis – a bioinformatic tool for quantitative pathway activation analysis. Results for newly diagnosed (nGB) and rGB (tissue samples and cell cultures) were compared. Oncobox analysis was further used to examine differential activation of pathways involved in response to existing chemotherapeutics. Results: 128 tissue samples and 28 cell cultures from a total of 44 GBs including 23 nGB, 19 rGB and 2 second-recurrent GBs were analyzed. 14 patient-matched pairs of nGB and rGB were obtained. DE analysis of nGB and rGB, showed a distinct “signature” associated with rGB. Oncobox analysis found down regulation of pathways related to cell cycle and DNA repair and upregulation of immune response pathways in rGB vs corresponding nGB. Specifically, pathways targeted by temozolomide, which is the first line chemotherapy for GB, were found down regulated in rGB. Among the top pathways upregulated in rGB were the pathways targeted by durvalumab and pomalidomide currently under investigation in phase II or III trials for GB. Conclusions: Specific pathway analysis revealed regional and clinical stage-associated differences in the transcriptional landscapes of nGB and rGB. Our results support a concept of treatment-induced resistance to cytotoxic therapeutics and indicate that temozolomide and radiation treatment have important impacts on gene expression changes associated with GB recurrence. Systematic molecular profiling of rGB is a promising avenue towards predicting sensitivity to targeted therapeutics in rGBs on an individual basis.


2019 ◽  
Vol 37 (15_suppl) ◽  
pp. e13032-e13032 ◽  
Author(s):  
Anton Buzdin ◽  
Andrew Garazha ◽  
Maxim Sorokin ◽  
Alex Glusker ◽  
Alexey Aleshin ◽  
...  

e13032 Background: Intracellular molecular pathways (IMPs) control all major events in the living cell. They are considered hotspots in contemporary oncology because knowledge of IMPs activation is essential for understanding mechanisms of molecular pathogenesis in oncology. Profiling IMPs requires RNA-seq data for tumors and for a collection of reference normal tissues. However, there is a shortage now in such profiles for normal tissues from healthy human donors, uniformly profiled in a single series of experiments. Access to the largest dataset of normal profiles GTEx is only partly available through the dbGaP. In TCGA database, norms are adjacent to surgically removed tumors and may be affected by tumor-linked growth factors, inflammation and altered vascularization. ENCODE datasets were for the autopsies of normal tissues, but they can’t form statistically significant reference groups. Methods: Tissue samples representing 20 organs were taken from post-mortal human healthy donors killed in road accidents no later than 36 hours after death, blood samples were taken from healthy volunteers. Gene expression was profiled in RNA-seq experiments using the same reagents, equipment and protocols. Bioinformatic algorithms for IMP analysis were developed and validated using experimental and public gene expression datasets. Results: From original sequencing data we constructed the biggest fully open reference expression database of normal human tissues including 465 profiles termed Oncobox Atlas of Normal Tissue Expression (ANTE, original data: GSE120795). We next developed a method termed Oncobox for interrogating activation of IMPs in human cancers. It includes modules of expression data harmonization and comparison and an algorithm for automatic annotation of molecular pathways. The Oncobox system enables accurate scoring of thousands molecular pathways using RNA-seq data. Oncobox pathway analysis is also applicable for quantitative proteomics and microRNA data in oncology. Conclusions: The Oncobox system can be used for a plethora of applications in cancer research including finding differentially regulated genes and IMPs, and for discovery of new pathway-related diagnostic and prognostic biomarkers.


2017 ◽  
Vol 2017 ◽  
pp. 1-10 ◽  
Author(s):  
Chuanhui Sun ◽  
Changsong Han ◽  
Peng Wang ◽  
Yinji Jin ◽  
Yanan Sun ◽  
...  

The purpose of this study was to investigate the HOX gene expression profile in laryngeal squamous cell carcinoma (LSCC) and assess whether some genes are associated with the clinicopathological features and prognosis in LSCC patients. The HOX gene levels were tested by microarray and validated by qRT-PCR in paired cancerous and adjacent noncancerous LSCC tissue samples. The microarray testing data of 39 HOX genes revealed 15 HOX genes that were at least 2-fold upregulated and 2 that were downregulated. After qRT-PCR evaluation, the three most upregulated genes (HOXB9, HOXB13, and HOXD13) were selected for tissue microarray (TMA) analysis. The correlations between the HOXB9, HOXB13, and HOXD13 expression levels and both clinicopathological features and prognosis were analyzed. Three HOX gene expression levels were markedly increased in LSCC tissues compared with adjacent noncancerous tissues (P<0.001). HOXB9 was found to correlate with histological grade (P<0.01) and prognosis (P<0.01) in LSCC. In conclusion, this study revealed that HOXB9, HOXB13, and HOXD13 were upregulated and may play important roles in LSCC. Moreover, HOXB9 may serve as a novel marker of poor prognosis and a potential therapeutic target in LSCC patients.


2015 ◽  
Author(s):  
Carl J Schmdt ◽  
Elizabeth M Pritchett ◽  
Liang Sun ◽  
Richard V.N. Davis ◽  
Allen Hubbard ◽  
...  

Transcriptome analysis by RNA-seq has emerged as a high-throughput, cost-effective means to evaluate the expression pattern of genes in organisms. Unlike other methods, such as microarrays or quantitative PCR, RNA-seq is a target free method that permits analysis of essentially any RNA that can be amplified from a cell or tissue. At its most basic, RNA-seq can determine individual gene expression levels by counting the number of times a particular transcript was found in the sequence data. Transcript levels can be compared across multiple samples to identify differentially expressed genes and infer differences in biological states between the samples. We have used this approach to examine gene expression patterns in chicken and human cells, with particular interest in determining response to heat stress.


2020 ◽  
Author(s):  
Hiroto Yamamoto ◽  
Yutaro Uchida ◽  
Tomoki Chiba ◽  
Ryota Kurimoto ◽  
Takahide Matsushima ◽  
...  

AbstractBackgroundsSevoflurane is a most frequently used volatile anaesthetics, but its molecular mechanisms of action remain unclear. We hypothesized that specific genes play regulatory roles in whole brain exposed to sevoflurane. Thus, we aimed to evaluate the effects of sevoflurane inhalation and identify potential regulatory genes by RNA-seq analysis.MethodsEight-week old mice were exposed to sevoflurane. RNA from four medial prefrontal cortex, striatum, hypothalamus, and hippocampus were analysed using RNA-seq. Differently expressed genes were extracted. Their gene ontology terms and the transcriptome array data of the cerebral cortex of sleeping mice were analysed using Metascape, and the gene expression patterns were compared. Finally, the activities of transcription factors were evaluated using a weighted parametric gene set analysis (wPGSA). JASPAR was used to confirm the existence of binding motifs in the upstream sequences of the differently expressed genes.ResultsThe gene ontology term enrichment analysis result suggests that sevoflurane inhalation upregulated angiogenesis and downregulated neural differentiation in the whole brain. The comparison with the brains of sleeping mice showed that the gene expression changes were specific to anaesthetized mice. Sevoflurane induced Klf4 upregulation in the whole brain. The transcriptional analysis result suggests that KLF4 is a potential transcriptional regulator of angiogenesis and neural development.ConclusionsKlf4 was upregulated by sevoflurane inhalation in whole brain. KLF4 might promote angiogenesis and cause the appearance of undifferentiated neural cells by transcriptional regulation. The roles of KLF4 might be key to elucidating the mechanisms of sevoflurane induced functional modification in the brain.


2021 ◽  
Author(s):  
Jakub Jankowski ◽  
Hye Kyung Lee ◽  
Julia Wilflingseder ◽  
Lothar Hennighausen

SummaryRecently, a short, interferon-inducible isoform of Angiotensin-Converting Enzyme 2 (ACE2), dACE2 was identified. ACE2 is a SARS-Cov-2 receptor and changes in its renal expression have been linked to several human nephropathies. These changes were never analyzed in context of dACE2, as its expression was not investigated in the kidney. We used Human Primary Proximal Tubule (HPPT) cells to show genome-wide gene expression patterns after cytokine stimulation, with emphasis on the ACE2/dACE2 locus. Putative regulatory elements controlling dACE2 expression were identified using ChIP-seq and RNA-seq. qRT-PCR differentiating between ACE2 and dACE2 revealed 300- and 600-fold upregulation of dACE2 by IFNα and IFNβ, respectively, while full length ACE2 expression was almost unchanged. JAK inhibitor ruxolitinib ablated STAT1 and dACE2 expression after interferon treatment. Finally, with RNA-seq, we identified a set of genes, largely immune-related, induced by cytokine treatment. These gene expression profiles provide new insights into cytokine response of proximal tubule cells.


Sign in / Sign up

Export Citation Format

Share Document