optimal discovery procedure Latest Research Papers

Abstract Motivation Analysis of biological data often involves the simultaneous testing of thousands of genes. This requires two key steps: the ranking of genes and the selection of important genes based on a significance threshold. One such testing procedure, called the optimal discovery procedure (ODP), leverages information across different tests to provide an optimal ranking of genes. This approach can lead to substantial improvements in statistical power compared to other methods. However, current applications of the ODP have only been established for simple study designs using microarray technology. Here, we extend this work to the analysis of complex study designs and RNA-sequencing studies. Results We apply our extended framework to a static RNA-sequencing study, a longitudinal study, an independent sampling time-series study,and an independent sampling dose–response study. Our method shows improved performance compared to other testing procedures, finding more differentially expressed genes and increasing power for enrichment analysis. Thus, the extended ODP enables a favorable significance analysis of genome-wide gene expression studies. Availability and implementation The algorithm is implemented in our freely available R package called edge and can be downloaded at https://www.bioconductor.org/packages/release/bioc/html/edge.html. Supplementary information Supplementary data are available at Bioinformatics online.

Download Full-text

The optimal discovery procedure for significance analysis of general gene expression studies

10.1101/571992 ◽

2019 ◽

Cited By ~ 1

Author(s):

Andrew J. Bass ◽

John D. Storey

Keyword(s):

Rna Sequencing ◽

Statistical Power ◽

Enrichment Analysis ◽

R Package ◽

Testing Procedure ◽

Biological Data ◽

Testing Procedures ◽

Significance Analysis ◽

Study Designs ◽

Optimal Discovery Procedure

Analysis of biological data often involves the simultaneous testing of thousands of genes. This requires two key steps: the ranking of genes and the selection of important genes based on a significance threshold. One such testing procedure, called the "optimal discovery procedure" (ODP), leverages information across different tests to provide an optimal ranking of genes. This approach can lead to substantial improvements in statistical power compared to other methods. However, current applications of the ODP have only been established for simple study designs using microarray technology. Here we extend this work to the analysis of complex study designs and RNA sequencing studies. We then apply our extended framework to a static RNA sequencing study, a longitudinal and an independent sampling time-series study, and an independent sampling dose-response study. We find that our method shows improved performance compared to other testing procedures, finding more differentially expressed genes and increasing power for enrichment analysis. Thus the extended ODP enables a superior significance analysis of genomic studies. The algorithm is implemented in our freely available R package called edge.

Download Full-text

Multi-subgroup gene screening using semi-parametric hierarchical mixture models and the optimal discovery procedure: Application to a randomized clinical trial in multiple myeloma

Biometrics ◽

10.1111/biom.12716 ◽

2017 ◽

Vol 74 (1) ◽

pp. 313-320 ◽

Cited By ~ 4

Author(s):

Shigeyuki Matsui ◽

Hisashi Noma ◽

Pingping Qu ◽

Yoshio Sakai ◽

Kota Matsui ◽

...

Keyword(s):

Clinical Trial ◽

Multiple Myeloma ◽

Randomized Clinical Trial ◽

Mixture Models ◽

Gene Screening ◽

Hierarchical Mixture Models ◽

Optimal Discovery Procedure ◽

Subgroup Gene

Download Full-text

Evaluations of the Optimal Discovery Procedure for Multiple Testing

The International Journal of Biostatistics ◽

10.1515/ijb-2015-0027 ◽

2016 ◽

Vol 12 (1) ◽

pp. 21-29 ◽

Cited By ~ 2

Author(s):

Daniel B. Rubin

Keyword(s):

Error Rate ◽

Optimality Theory ◽

Multiple Testing ◽

Type I Error ◽

Type I ◽

Test Statistics ◽

Testing Procedures ◽

Type I Error Rate ◽

Rate Measure ◽

Optimal Discovery Procedure

Abstract The Optimal Discovery Procedure (ODP) is a method for simultaneous hypothesis testing that attempts to gain power relative to more standard techniques by exploiting multivariate structure [1]. Specializing to the example of testing whether components of a Gaussian mean vector are zero, we compare the power of the ODP to a Bonferroni-style method and to the Benjamini-Hochberg method when the testing procedures aim to respectively control certain Type I error rate measures, such as the expected number of false positives or the false discovery rate. We show through theoretical results, numerical comparisons, and two microarray examples that when the rejection regions for the ODP test statistics are chosen such that the procedure is guaranteed to uniformly control a Type I error rate measure, the technique is generally less powerful than competing methods. We contrast and explain these results in light of previously proven optimality theory for the ODP. We also compare the ordering given by the ODP test statistics to the standard rankings based on sorting univariate p-values from smallest to largest. In the cases we considered the standard ordering was superior, and ODP rankings were adversely impacted by correlation.

Download Full-text

Consistent variable selection via the optimal discovery procedure in multiple testing

Communication in Statistics- Theory and Methods ◽

10.1080/03610926.2015.1069351 ◽

2016 ◽

Vol 46 (13) ◽

pp. 6303-6322

Author(s):

Li Wang ◽

Xingzhong Xu

Keyword(s):

Variable Selection ◽

Multiple Testing ◽

Optimal Discovery Procedure

Download Full-text

An Empirical Bayes Optimal Discovery Procedure Based on Semiparametric Hierarchical Mixture Models

Computational and Mathematical Methods in Medicine ◽

10.1155/2013/568480 ◽

2013 ◽

Vol 2013 ◽

pp. 1-9

Author(s):

Hisashi Noma ◽

Shigeyuki Matsui

Keyword(s):

Mixture Model ◽

Multiple Testing ◽

Empirical Bayes ◽

Gene Selection ◽

Fixed Number ◽

Test Statistic ◽

Microarray Experiments ◽

False Discovery ◽

Genome Wide ◽

Optimal Discovery Procedure

Multiple testing has been widely adopted for genome-wide studies such as microarray experiments. For effective gene selection in these genome-wide studies, the optimal discovery procedure (ODP), which maximizes the number of expected true positives for each fixed number of expected false positives, was developed as a multiple testing extension of the most powerful test for a single hypothesis by Storey (Journal of the Royal Statistical Society, Series B,vol. 69, no. 3, pp. 347–368, 2007). In this paper, we develop an empirical Bayes method for implementing the ODP based on a semiparametric hierarchical mixture model using the “smoothing-by-roughening" approach. Under the semiparametric hierarchical mixture model, (i) the prior distribution can be modeled flexibly, (ii) the ODP test statistic and the posterior distribution are analytically tractable, and (iii) computations are easy to implement. In addition, we provide a significance rule based on the false discovery rate (FDR) in the empirical Bayes framework. Applications to two clinical studies are presented.

Download Full-text

The optimal discovery procedure in multiple significance testing: an empirical Bayes approach

Statistics in Medicine ◽

10.1002/sim.4375 ◽

2011 ◽

Vol 31 (2) ◽

pp. 165-176 ◽

Cited By ~ 11

Author(s):

Hisashi Noma ◽

Shigeyuki Matsui

Keyword(s):

Empirical Bayes ◽

Significance Testing ◽

Empirical Bayes Approach ◽

Optimal Discovery Procedure ◽

Bayes Approach

Download Full-text

Application of the Optimal Discovery Procedure to Genetic Case-Control Studies: Comparison with p Values and Asymptotic Bayes Factors

Human Heredity ◽

10.1159/000323518 ◽

2011 ◽

Vol 71 (1) ◽

pp. 37-49

Author(s):

Ioanna Tachmazidou ◽

Maria De Iorio ◽

Frank Dudbridge

Keyword(s):

Case Control ◽

Bayes Factors ◽

Case Control Studies ◽

P Values ◽

Optimal Discovery Procedure

Download Full-text

A computationally efficient modular optimal discovery procedure

Bioinformatics ◽

10.1093/bioinformatics/btq701 ◽

2010 ◽

Vol 27 (4) ◽

pp. 509-515 ◽

Cited By ~ 12

Author(s):

Sangsoon Woo ◽

Jeffrey T. Leek ◽

John D. Storey

Keyword(s):

Computationally Efficient ◽

Optimal Discovery Procedure

Download Full-text

Bayesian optimal discovery procedure for simultaneous significance testing

BMC Bioinformatics ◽

10.1186/1471-2105-10-5 ◽

2009 ◽

Vol 10 (1) ◽

Cited By ~ 15

Author(s):

Jing Cao ◽

Xian-Jin Xie ◽

Song Zhang ◽

Angelique Whitehurst ◽

Michael A White

Keyword(s):

Significance Testing ◽

Optimal Discovery Procedure

Download Full-text

optimal discovery procedure
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

The optimal discovery procedure for significance analysis of general gene expression studies

The optimal discovery procedure for significance analysis of general gene expression studies

Multi-subgroup gene screening using semi-parametric hierarchical mixture models and the optimal discovery procedure: Application to a randomized clinical trial in multiple myeloma

Evaluations of the Optimal Discovery Procedure for Multiple Testing

Consistent variable selection via the optimal discovery procedure in multiple testing

An Empirical Bayes Optimal Discovery Procedure Based on Semiparametric Hierarchical Mixture Models

The optimal discovery procedure in multiple significance testing: an empirical Bayes approach

Application of the Optimal Discovery Procedure to Genetic Case-Control Studies: Comparison with p Values and Asymptotic Bayes Factors

A computationally efficient modular optimal discovery procedure

Bayesian optimal discovery procedure for simultaneous significance testing

Export Citation Format

optimal discovery procedureRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

The optimal discovery procedure for significance analysis of general gene expression studies

The optimal discovery procedure for significance analysis of general gene expression studies

Multi-subgroup gene screening using semi-parametric hierarchical mixture models and the optimal discovery procedure: Application to a randomized clinical trial in multiple myeloma

Evaluations of the Optimal Discovery Procedure for Multiple Testing

Consistent variable selection via the optimal discovery procedure in multiple testing

An Empirical Bayes Optimal Discovery Procedure Based on Semiparametric Hierarchical Mixture Models

The optimal discovery procedure in multiple significance testing: an empirical Bayes approach

Application of the Optimal Discovery Procedure to Genetic Case-Control Studies: Comparison with p Values and Asymptotic Bayes Factors

A computationally efficient modular optimal discovery procedure

Bayesian optimal discovery procedure for simultaneous significance testing

optimal discovery procedure
Recently Published Documents