scholarly journals Analysis of Genome-Wide RNA-Sequencing Data Suggests Age of the CEPH/Utah (CEU) Lymphoblastoid Cell Lines Systematically Biases Gene Expression Profiles

2015 ◽  
Vol 5 (1) ◽  
Author(s):  
Yuan Yuan ◽  
Lei Tian ◽  
Dongsheng Lu ◽  
Shuhua Xu
2020 ◽  
Author(s):  
Weimiao Wu ◽  
Qile Dai ◽  
Yunqing Liu ◽  
Xiting Yan ◽  
Zuoheng Wang

AbstractSingle-cell RNA sequencing provides an opportunity to study gene expression at single-cell resolution. However, prevalent dropout events result in high data sparsity and noise that may obscure downstream analyses. We propose a novel method, G2S3, that imputes dropouts by borrowing information from adjacent genes in a sparse gene graph learned from gene expression profiles across cells. We applied G2S3 and other existing methods to seven single-cell datasets to compare their performance. Our results demonstrated that G2S3 is superior in recovering true expression levels, identifying cell subtypes, improving differential expression analyses, and recovering gene regulatory relationships, especially for mildly expressed genes.


BMC Genomics ◽  
2010 ◽  
Vol 11 (1) ◽  
pp. 96 ◽  
Author(s):  
Josine L Min ◽  
Amy Barrett ◽  
Tim Watts ◽  
Fredrik H Pettersson ◽  
Helen E Lockstone ◽  
...  

2021 ◽  
Vol 17 (5) ◽  
pp. e1009029
Author(s):  
Weimiao Wu ◽  
Yunqing Liu ◽  
Qile Dai ◽  
Xiting Yan ◽  
Zuoheng Wang

Single-cell RNA sequencing technology provides an opportunity to study gene expression at single-cell resolution. However, prevalent dropout events result in high data sparsity and noise that may obscure downstream analyses in single-cell transcriptomic studies. We propose a new method, G2S3, that imputes dropouts by borrowing information from adjacent genes in a sparse gene graph learned from gene expression profiles across cells. We applied G2S3 and ten existing imputation methods to eight single-cell transcriptomic datasets and compared their performance. Our results demonstrated that G2S3 has superior overall performance in recovering gene expression, identifying cell subtypes, reconstructing cell trajectories, identifying differentially expressed genes, and recovering gene regulatory and correlation relationships. Moreover, G2S3 is computationally efficient for imputation in large-scale single-cell transcriptomic datasets.


2021 ◽  
Vol 13 (1) ◽  
Author(s):  
Trine B. Mattesen ◽  
Claus L. Andersen ◽  
Jesper B. Bramsen

Abstract Background Transcriptional analysis is widely used to study the molecular biology of cancer and hold great biomarker potential for clinical patient stratification. Yet, accurate transcriptional profiling requires RNA of a high quality, which often cannot be retrieved from formalin-fixed, paraffin-embedded (FFPE) tumor tissue that is routinely collected and archived in clinical departments. To overcome this roadblock to clinical testing, we previously developed MethCORR, a method that infers gene expression from DNA methylation data, which is robustly retrieved from FFPE tissue. MethCORR was originally developed for colorectal cancer and with this study, we aim to: (1) extend the MethCORR method to 10 additional cancer types and (2) to illustrate that the inferred gene expression is accurate and clinically informative. Results Regression models to infer gene expression information from DNA methylation were developed for ten common cancer types using matched RNA sequencing and DNA methylation profiles (HumanMethylation450 BeadChip) from The Cancer Genome Atlas Project. Robust and accurate gene expression profiles were inferred for all cancer types: on average, the expression of 11,000 genes was modeled with good accuracy and an intra-sample correlation of R2 = 0.90 between inferred and measured gene expression was observed. Molecular pathway analysis and transcriptional subtyping were performed for breast, prostate, and lung cancer samples to illustrate the general usability of the inferred gene expression profiles: overall, a high correlation of r = 0.96 (Pearson) in pathway enrichment scores and a 76% correspondence in molecular subtype calls were observed when using measured and inferred gene expression as input. Finally, inferred expression from FFPE tissue correlated better with RNA sequencing data from matched fresh-frozen tissue than did RNA sequencing data from FFPE tissue (P < 0.0001; Wilcoxon rank-sum test). Conclusions In all cancers investigated, MethCORR enabled DNA methylation-based transcriptional analysis, thus enabling future analysis of cancer in situations where high-quality DNA, but not RNA, is available. Here, we provide the framework and resources for MethCORR modeling of ten common cancer types, thereby widely expanding the possibilities for transcriptional studies of archival FFPE material.


2021 ◽  
Vol 22 (12) ◽  
pp. 6556
Author(s):  
Junjun Huang ◽  
Xiaoyu Li ◽  
Xin Chen ◽  
Yaru Guo ◽  
Weihong Liang ◽  
...  

ATP-binding cassette (ABC) transporter proteins are a gene super-family in plants and play vital roles in growth, development, and response to abiotic and biotic stresses. The ABC transporters have been identified in crop plants such as rice and buckwheat, but little is known about them in soybean. Soybean is an important oil crop and is one of the five major crops in the world. In this study, 255 ABC genes that putatively encode ABC transporters were identified from soybean through bioinformatics and then categorized into eight subfamilies, including 7 ABCAs, 52 ABCBs, 48 ABCCs, 5 ABCDs, 1 ABCEs, 10 ABCFs, 111 ABCGs, and 21 ABCIs. Their phylogenetic relationships, gene structure, and gene expression profiles were characterized. Segmental duplication was the main reason for the expansion of the GmABC genes. Ka/Ks analysis suggested that intense purifying selection was accompanied by the evolution of GmABC genes. The genome-wide collinearity of soybean with other species showed that GmABCs were relatively conserved and that collinear ABCs between species may have originated from the same ancestor. Gene expression analysis of GmABCs revealed the distinct expression pattern in different tissues and diverse developmental stages. The candidate genes GmABCB23, GmABCB25, GmABCB48, GmABCB52, GmABCI1, GmABCI5, and GmABCI13 were responsive to Al toxicity. This work on the GmABC gene family provides useful information for future studies on ABC transporters in soybean and potential targets for the cultivation of new germplasm resources of aluminum-tolerant soybean.


2021 ◽  
Vol 22 (11) ◽  
pp. 5798
Author(s):  
Shoko Tokumoto ◽  
Yugo Miyata ◽  
Ruslan Deviatiiarov ◽  
Takahiro G. Yamada ◽  
Yusuke Hiki ◽  
...  

The Pv11, an insect cell line established from the midge Polypedilum vanderplanki, is capable of extreme hypometabolic desiccation tolerance, so-called anhydrobiosis. We previously discovered that heat shock factor 1 (HSF1) contributes to the acquisition of desiccation tolerance by Pv11 cells, but the mechanistic details have yet to be elucidated. Here, by analyzing the gene expression profiles of newly established HSF1-knockout and -rescue cell lines, we show that HSF1 has a genome-wide effect on gene regulation in Pv11. The HSF1-knockout cells exhibit a reduced desiccation survival rate, but this is completely restored in HSF1-rescue cells. By comparing mRNA profiles of the two cell lines, we reveal that HSF1 induces anhydrobiosis-related genes, especially genes encoding late embryogenesis abundant proteins and thioredoxins, but represses a group of genes involved in basal cellular processes, thus promoting an extreme hypometabolism state in the cell. In addition, HSF1 binding motifs are enriched in the promoters of anhydrobiosis-related genes and we demonstrate binding of HSF1 to these promoters by ChIP-qPCR. Thus, HSF1 directly regulates the transcription of anhydrobiosis-related genes and consequently plays a pivotal role in the induction of anhydrobiotic ability in Pv11 cells.


BMC Genomics ◽  
2021 ◽  
Vol 22 (1) ◽  
Author(s):  
Yanan Ren ◽  
Ting-You Wang ◽  
Leah C. Anderton ◽  
Qi Cao ◽  
Rendong Yang

Abstract Background Long non-coding RNAs (lncRNAs) are a growing focus in cancer research. Deciphering pathways influenced by lncRNAs is important to understand their role in cancer. Although knock-down or overexpression of lncRNAs followed by gene expression profiling in cancer cell lines are established approaches to address this problem, these experimental data are not available for a majority of the annotated lncRNAs. Results As a surrogate, we present lncGSEA, a convenient tool to predict the lncRNA associated pathways through Gene Set Enrichment Analysis of gene expression profiles from large-scale cancer patient samples. We demonstrate that lncGSEA is able to recapitulate lncRNA associated pathways supported by literature and experimental validations in multiple cancer types. Conclusions LncGSEA allows researchers to infer lncRNA regulatory pathways directly from clinical samples in oncology. LncGSEA is written in R, and is freely accessible at https://github.com/ylab-hi/lncGSEA.


Oncogene ◽  
2002 ◽  
Vol 21 (42) ◽  
pp. 6549-6556 ◽  
Author(s):  
Jiafu Ji ◽  
Xin Chen ◽  
Suet Yi Leung ◽  
Jen-Tsan A Chi ◽  
Kent Man Chu ◽  
...  

Sign in / Sign up

Export Citation Format

Share Document