How well do RNA-Seq differential gene expression tools perform in a eukaryote with a complex transcriptome?

Mapping Intimacies ◽

10.1101/090753 ◽

2016 ◽

Cited By ~ 4

Author(s):

Kimon Froussios ◽

Nick J. Schurch ◽

Katarzyna Mackinnon ◽

Marek Gierliński ◽

Céline Duc ◽

...

Keyword(s):

Gene Expression ◽

Differential Gene Expression ◽

Negative Binomial Distribution ◽

Binomial Distribution ◽

Negative Binomial ◽

False Positive Rate ◽

Rna Seq ◽

Underlying Distribution ◽

Differential Gene ◽

Log Normal

AbstractRNA-seq experiments are usually carried out in three or fewer replicates. In order to work well with so few samples, Differential Gene Expression (DGE) tools typically assume the form of the underlying distribution of gene expression. A recent highly replicated study revealed that RNA-seq gene expression measurements in yeast are best represented as being drawn from an underlying negative binomial distribution. In this paper, the statistical properties of gene expression in the higher eukaryote Arabidopsis thaliana are shown to be essentially identical to those from yeast despite the large increase in the size and complexity of the transcriptome: Gene expression measurements from this model plant species are consistent with being drawn from an underlying negative binomial or log-normal distribution and the false positive rate performance of nine widely used DGE tools is not strongly affected by the additional size and complexity of the A. thaliana transcriptome. For RNA-seq data, we therefore recommend the use of DGE tools that are based on the negative binomial distribution.

Download Full-text

How well do RNA-Seq differential gene expression tools perform in a complex eukaryote? A case study in Arabidopsis thaliana

Bioinformatics ◽

10.1093/bioinformatics/btz089 ◽

2019 ◽

Vol 35 (18) ◽

pp. 3372-3377 ◽

Cited By ~ 2

Author(s):

Kimon Froussios ◽

Nick J Schurch ◽

Katarzyna Mackinnon ◽

Marek Gierliński ◽

Céline Duc ◽

...

Keyword(s):

Gene Expression ◽

Arabidopsis Thaliana ◽

Normal Distribution ◽

Differential Gene Expression ◽

Negative Binomial Distribution ◽

Binomial Distribution ◽

Negative Binomial ◽

Supplementary Information ◽

Rna Seq ◽

Differential Gene

Abstract Motivation RNA-seq experiments are usually carried out in three or fewer replicates. In order to work well with so few samples, differential gene expression (DGE) tools typically assume the form of the underlying gene expression distribution. In this paper, the statistical properties of gene expression from RNA-seq are investigated in the complex eukaryote, Arabidopsis thaliana, extending and generalizing the results of previous work in the simple eukaryote Saccharomyces cerevisiae. Results We show that, consistent with the results in S.cerevisiae, more gene expression measurements in A.thaliana are consistent with being drawn from an underlying negative binomial distribution than either a log-normal distribution or a normal distribution, and that the size and complexity of the A.thaliana transcriptome does not influence the false positive rate performance of nine widely used DGE tools tested here. We therefore recommend the use of DGE tools that are based on the negative binomial distribution. Availability and implementation The raw data for the 17 WT Arabidopsis thaliana datasets is available from the European Nucleotide Archive (E-MTAB-5446). The processed and aligned data can be visualized in context using IGB (Freese et al., 2016), or downloaded directly, using our publicly available IGB quickload server at https://compbio.lifesci.dundee.ac.uk/arabidopsisQuickload/public_quickload/ under ‘RNAseq>Froussios2019’. All scripts and commands are available from github at https://github.com/bartongroup/KF_arabidopsis-GRNA. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

Inference and Sample Size Calculations Based on Statistical Tests in a Negative Binomial Distribution for Differential Gene Expression in RNAseq Data

Journal of Biometrics & Biostatistics ◽

10.4172/2155-6180.1000332 ◽

2017 ◽

Vol 08 (01) ◽

Author(s):

Xiaohong Li ◽

Nigel GF Cooper ◽

Yu Shyr ◽

Dongfeng Wu ◽

Eric C Rouchka ◽

...

Keyword(s):

Gene Expression ◽

Sample Size ◽

Differential Gene Expression ◽

Negative Binomial Distribution ◽

Binomial Distribution ◽

Negative Binomial ◽

Statistical Tests ◽

Rnaseq Data ◽

Sample Size Calculations ◽

Differential Gene

Download Full-text

The NBP Negative Binomial Model for Assessing Differential Gene Expression from RNA-Seq

Statistical Applications in Genetics and Molecular Biology ◽

10.2202/1544-6115.1637 ◽

2011 ◽

Vol 10 (1) ◽

Cited By ~ 59

Author(s):

Yanming Di ◽

Daniel W Schafer ◽

Jason S Cumbie ◽

Jeff H Chang

Keyword(s):

Gene Expression ◽

Differential Gene Expression ◽

Negative Binomial ◽

Small Sample Size ◽

Small Sample ◽

Dispersion Parameter ◽

Additional Parameter ◽

Rna Seq ◽

Data Set ◽

Differential Gene

We propose a new statistical test for assessing differential gene expression using RNA sequencing (RNA-Seq) data. Commonly used probability distributions, such as binomial or Poisson, cannot appropriately model the count variability in RNA-Seq data due to overdispersion. The small sample size that is typical in this type of data also prevents the uncritical use of tools derived from large-sample asymptotic theory. The test we propose is based on the NBP parameterization of the negative binomial distribution. It extends an exact test proposed by Robinson and Smyth (2007, 2008). In one version of Robinson and Smyth’s test, a constant dispersion parameter is used to model the count variability between biological replicates. We introduce an additional parameter to allow the dispersion parameter to depend on the mean. Our parametric method complements nonparametric regression approaches for modeling the dispersion parameter. We apply the test we propose to an Arabidopsis data set and a range of simulated data sets. The results show that the test is simple, powerful and reasonably robust against departures from model assumptions.

Download Full-text

A Novel Bayesian Outlier Score Based on the Negative Binomial Distribution for Detecting Aberrantly Expressed Genes in RNA-Seq Gene Expression Count Data

IEEE Access ◽

10.1109/access.2021.3082311 ◽

2021 ◽

pp. 1-1

Author(s):

Edin Salkovic ◽

Halima Bensmail

Keyword(s):

Gene Expression ◽

Negative Binomial Distribution ◽

Count Data ◽

Binomial Distribution ◽

Negative Binomial ◽

Rna Seq

Download Full-text

Faculty Opinions recommendation of Scotty: a web tool for designing RNA-Seq experiments to measure differential gene expression.

Faculty Opinions – Post-Publication Peer Review of the Biomedical Literature ◽

10.3410/f.717971189.793469500 ◽

2013 ◽

Author(s):

Stephen Turner

Keyword(s):

Gene Expression ◽

Differential Gene Expression ◽

Rna Seq ◽

Web Tool ◽

Differential Gene

Download Full-text

Differential Gene Expression by RNA-Seq Analysis of the Primo Vessel in the Rabbit Lymph

Journal of Acupuncture and Meridian Studies ◽

10.1016/j.jams.2018.10.008 ◽

2019 ◽

Vol 12 (1) ◽

pp. 11-19 ◽

Cited By ~ 1

Author(s):

Jun-Young Shin ◽

Sang-Heon Choi ◽

Da-Woon Choi ◽

Ye-Jin An ◽

Jae-Hyuk Seo ◽

...

Keyword(s):

Gene Expression ◽

Differential Gene Expression ◽

Rna Seq ◽

Differential Gene

Download Full-text

A Unified Model for Joint Normalization and Differential Gene Expression Detection in RNA-Seq Data

IEEE/ACM Transactions on Computational Biology and Bioinformatics ◽

10.1109/tcbb.2018.2790918 ◽

2019 ◽

Vol 16 (2) ◽

pp. 442-454 ◽

Cited By ~ 5

Author(s):

Kefei Liu ◽

Jieping Ye ◽

Yang Yang ◽

Li Shen ◽

Hui Jiang

Keyword(s):

Gene Expression ◽

Differential Gene Expression ◽

Unified Model ◽

Rna Seq ◽

Differential Gene

Download Full-text

Comprehensive evaluation of differential gene expression analysis methods for RNA-seq data

Genome Biology ◽

10.1186/gb-2013-14-9-r95 ◽

2013 ◽

Vol 14 (9) ◽

pp. R95 ◽

Cited By ~ 408

Author(s):

Franck Rapaport ◽

Raya Khanin ◽

Yupu Liang ◽

Mono Pirun ◽

Azra Krek ◽

...

Keyword(s):

Gene Expression ◽

Differential Gene Expression ◽

Expression Analysis ◽

Gene Expression Analysis ◽

Comprehensive Evaluation ◽

Rna Seq ◽

Differential Gene Expression Analysis ◽

Analysis Methods ◽

Differential Gene

Download Full-text

Identification and Comparative Analysis of Differential Gene Expression in Soybean Leaf Tissue under Drought and Flooding Stress Revealed by RNA-Seq

Frontiers in Plant Science ◽

10.3389/fpls.2016.01044 ◽

2016 ◽

Vol 7 ◽

Cited By ~ 44

Author(s):

Wei Chen ◽

Qiuming Yao ◽

Gunvant B. Patil ◽

Gaurav Agarwal ◽

Rupesh K. Deshmukh ◽

...

Keyword(s):

Gene Expression ◽

Comparative Analysis ◽

Differential Gene Expression ◽

Leaf Tissue ◽

Rna Seq ◽

Flooding Stress ◽

Soybean Leaf ◽

Differential Gene

Download Full-text

Challenges and strategies in transcriptome assembly and differential gene expression quantification. A comprehensivein silicoassessment of RNA-seq experiments

Molecular Ecology ◽

10.1111/mec.12014 ◽

2012 ◽

Vol 22 (3) ◽

pp. 620-634 ◽

Cited By ~ 167

Author(s):

Nagarjun Vijay ◽

Jelmer W. Poelstra ◽

Axel Künstner ◽

Jochen B. W. Wolf

Keyword(s):

Gene Expression ◽

Differential Gene Expression ◽

Transcriptome Assembly ◽

Rna Seq ◽

Gene Expression Quantification ◽

Differential Gene ◽

Expression Quantification ◽

Challenges And Strategies

Download Full-text