A Comparison of Two Classes of Methods for Estimating False Discovery Rates in Microarray Studies

Scientifica ◽

10.6064/2012/519394 ◽

2012 ◽

Vol 2012 ◽

pp. 1-9 ◽

Cited By ~ 2

Author(s):

Emily Hansen ◽

Kathleen F. Kerr

Keyword(s):

Differentially Expressed Genes ◽

Null Distribution ◽

Differentially Expressed ◽

Test Statistics ◽

False Discovery Rates ◽

Model Method ◽

False Discovery ◽

Microarray Studies ◽

Discovery Rates

The goal of many microarray studies is to identify genes that are differentially expressed between two classes or populations. Many data analysts choose to estimate the false discovery rate (FDR) associated with the list of genes declared differentially expressed. Estimating an FDR largely reduces to estimatingπ1, the proportion of differentially expressed genes among all analyzed genes. Estimatingπ1is usually done throughP-values, but computingP-values can be viewed as a nuisance and potentially problematic step. We evaluated methods for estimatingπ1directly from test statistics, circumventing the need to computeP-values. We adapted existing methodology for estimatingπ1fromt- andz-statistics so thatπ1could be estimated from other statistics. We compared the quality of these estimates to estimates generated by two established methods for estimatingπ1fromP-values. Overall, methods varied widely in bias and variability. The least biased and least variable estimates ofπ1, the proportion of differentially expressed genes, were produced by applying the “convest” mixture model method toP-values computed from a pooled permutation null distribution. Estimates computed directly from test statistics rather thanP-values did not reliably perform well.

Download Full-text

A general method for accurate estimation of false discovery rates in identification of differentially expressed genes

Bioinformatics ◽

10.1093/bioinformatics/btu124 ◽

2014 ◽

Vol 30 (14) ◽

pp. 2018-2025 ◽

Cited By ~ 13

Author(s):

Yuan-De Tan ◽

Hongyan Xu

Keyword(s):

Differentially Expressed Genes ◽

Differentially Expressed ◽

Accurate Estimation ◽

False Discovery Rates ◽

False Discovery ◽

Discovery Rates ◽

General Method

Download Full-text

Random rotation for identifying differentially expressed genes with linear models following batch effect correction

Bioinformatics ◽

10.1093/bioinformatics/btab063 ◽

2021 ◽

Author(s):

Peter Hettegger ◽

Klemens Vierlinger ◽

Andreas Weinhaeusel

Keyword(s):

Data Analysis ◽

Supplementary Information ◽

Dependence Structure ◽

Test Statistics ◽

False Discovery Rates ◽

P Values ◽

Null Distributions ◽

False Discovery ◽

Random Rotation ◽

Discovery Rates

Abstract Motivation Data generated from high-throughput technologies such as sequencing, microarray and bead-chip technologies are unavoidably affected by batch effects (BEs). Large effort has been put into developing methods for correcting these effects. Often, BE correction and hypothesis testing cannot be done with one single model, but are done successively with separate models in data analysis pipelines. This potentially leads to biased P-values or false discovery rates due to the influence of BE correction on the data. Results We present a novel approach for estimating null distributions of test statistics in data analysis pipelines where BE correction is followed by linear model analysis. The approach is based on generating simulated datasets by random rotation and thereby retains the dependence structure of genes adequately. This allows estimating null distributions of dependent test statistics, and thus the calculation of resampling-based P-values and false-discovery rates following BE correction while maintaining the alpha level. Availability The described methods are implemented as randRotation package on Bioconductor: https://bioconductor.org/packages/randRotation/ Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

Identification of differentially expressed genes and false discovery rate in microarray studies

Current Opinion in Lipidology ◽

10.1097/mol.0b013e3280895d6f ◽

2007 ◽

Vol 18 (2) ◽

pp. 187-193 ◽

Cited By ~ 27

Author(s):

Arief Gusnanto ◽

Stefano Calza ◽

Yudi Pawitan

Keyword(s):

False Discovery Rate ◽

Differentially Expressed Genes ◽

Differentially Expressed ◽

False Discovery ◽

Microarray Studies

Download Full-text

Simple estimators of false discovery rates given as few as one or two p-values without strong parametric assumptions

Statistical Applications in Genetics and Molecular Biology ◽

10.1515/sagmb-2013-0003 ◽

2013 ◽

Vol 12 (4) ◽

Cited By ~ 7

Author(s):

David R. Bickel

Keyword(s):

False Discovery Rates ◽

P Values ◽

False Discovery ◽

Discovery Rates

Download Full-text

False discovery rates and copy number variation

Biometrika ◽

10.1093/biomet/asr018 ◽

2011 ◽

Vol 98 (2) ◽

pp. 251-271 ◽

Cited By ~ 11

Author(s):

Bradley Efron ◽

Nancy R. Zhang

Keyword(s):

Copy Number Variation ◽

Copy Number ◽

False Discovery Rates ◽

False Discovery ◽

Number Variation ◽

Discovery Rates

Download Full-text

Signal identification for rare and weak features: higher criticism or false discovery rates?

Biostatistics ◽

10.1093/biostatistics/kxs030 ◽

2012 ◽

Vol 14 (1) ◽

pp. 129-143 ◽

Cited By ~ 14

Author(s):

Bernd Klaus ◽

Korbinian Strimmer

Keyword(s):

Signal Identification ◽

False Discovery Rates ◽

Higher Criticism ◽

False Discovery ◽

Discovery Rates

Download Full-text

False discovery rates: a new deal

Biostatistics ◽

10.1093/biostatistics/kxw041 ◽

2016 ◽

pp. kxw041 ◽

Cited By ~ 66

Author(s):

Matthew Stephens

Keyword(s):

New Deal ◽

False Discovery Rates ◽

False Discovery ◽

Discovery Rates

Download Full-text

Large and ancient linguistic areas

Language Dispersal, Diversification, and Contact ◽

10.1093/oso/9780198723813.003.0005 ◽

2020 ◽

pp. 78-100

Author(s):

Balthasar Bickel

Keyword(s):

Regression Models ◽

Large Scale ◽

Population History ◽

False Discovery Rates ◽

Ancient Population ◽

False Discovery ◽

Language Universals ◽

Discovery Rates ◽

Pacific Area

Large-scale areal patterns point to ancient population history and form a well-known confound for language universals. Despite their importance, demonstrating such patterns remains a challenge. This chapter argues that large-scale area hypotheses are better tested by modeling diachronic family biases than by controlling for genealogical relations in regression models. A case study of the Trans-Pacific area reveals that diachronic bias estimates do not depend much on the amount of phylogenetic information that is used when inferring them. After controlling for false discovery rates, about 39 variables in WALS and AUTOTYP show diachronic biases that differ significantly inside vs. outside the Trans-Pacific area. Nearly three times as many biases hold outside than inside the Trans-Pacific area, indicating that the Trans-Pacific area is not so much characterized by the spread of biases but rather by the retention of earlier diversity, in line with earlier suggestions in the literature.

Download Full-text

Improving sensitivity in proteome studies by analysis of false discovery rates for multiple search engines

PROTEOMICS ◽

10.1002/pmic.200800473 ◽

2009 ◽

Vol 9 (5) ◽

pp. 1220-1229 ◽

Cited By ~ 65

Author(s):

Andrew R. Jones ◽

Jennifer A. Siepen ◽

Simon J. Hubbard ◽

Norman W. Paton

Keyword(s):

Search Engines ◽

False Discovery Rates ◽

False Discovery ◽

Discovery Rates

Download Full-text

False Discovery Rates in Identifying Functional DNA Motifs

2007 IEEE 7th International Symposium on BioInformatics and BioEngineering ◽

10.1109/bibe.2007.4375592 ◽

2007 ◽

Author(s):

Osman Abul ◽

Geir Kjetil Sandve ◽

Finn Drablos

Keyword(s):

False Discovery Rates ◽

Dna Motifs ◽

False Discovery ◽

Functional Dna ◽

Discovery Rates

Download Full-text