GENERALIZED p-VALUE APPROACH FOR MULTIPLE HYPOTHESIS TESTING IN MICROARRAY

Bindu Punathumparambath; Kannan Vadakkadath Meethal

doi:10.17654/bs017020443

A combined p-value test for multiple hypothesis testing

Journal of Statistical Planning and Inference ◽

10.1016/j.jspi.2012.10.004 ◽

2013 ◽

Vol 143 (4) ◽

pp. 764-770 ◽

Cited By ~ 12

Author(s):

Shunpu Zhang ◽

Huann-Sheng Chen ◽

Ruth M. Pfeiffer

Keyword(s):

Hypothesis Testing ◽

Multiple Hypothesis Testing ◽

P Value ◽

Multiple Hypothesis

Download Full-text

AdaFDR: a Fast, Powerful and Covariate-Adaptive Approach to Multiple Hypothesis Testing

10.1101/496372 ◽

2018 ◽

Author(s):

Martin J. Zhang ◽

Fei Xia ◽

James Zou

Keyword(s):

Hypothesis Testing ◽

Data Science ◽

Multiple Hypothesis Testing ◽

P Value ◽

Eqtl Analysis ◽

Computationally Efficient ◽

Additional Information ◽

False Discovery ◽

Multiple Hypothesis ◽

False Discoveries

Multiple hypothesis testing is an essential component of modern data science. Its goal is to maximize the number of discoveries while controlling the fraction of false discoveries. In many settings, in addition to the p-value, additional information/covariates for each hypothesis are available. For example, in eQTL studies, each hypothesis tests the correlation between a variant and the expression of a gene. We also have additional covariates such as the location, conservation and chromatin status of the variant, which could inform how likely the association is to be due to noise. However, popular multiple hypothesis testing approaches, such as Benjamini-Hochberg procedure (BH) and independent hypothesis weighting (IHW), either ignore these covariates or assume the covariate to be univariate. We introduce AdaFDR, a fast and flexible method that adaptively learns the optimal p-value threshold from covariates to significantly improve detection power. On eQTL analysis of the GTEx data, AdaFDR discovers 32% and 27% more associations than BH and IHW, respectively, at the same false discovery rate. We prove that AdaFDR controls false discovery proportion, and show that it makes substantially more discoveries while controlling FDR in extensive experiments. AdaFDR is computationally efficient and can process more than 100 million hypotheses within an hour and allows multi-dimensional covariates with both numeric and categorical values. It also provides exploratory plots for the user to interpret how each covariate affects the significance of hypotheses, making it broadly useful across many applications.

Download Full-text

Visualizing the Costs and Benefits of Correcting P-Values for Multiple Hypothesis Testing in Omics Data

10.1101/2021.09.09.459558 ◽

2021 ◽

Author(s):

Steven R. Shuken ◽

Margaret W. McNerney

Keyword(s):

Hypothesis Testing ◽

Web Application ◽

Multiple Testing ◽

Test Group ◽

Multiple Hypothesis Testing ◽

P Value ◽

Critical Element ◽

Costs And Benefits ◽

Multiple Hypothesis ◽

R Shiny

AbstractThe multiple hypothesis testing problem is inherent in high-throughput quantitative genomic, transcriptomic, proteomic, and other “omic” screens. The correction of p-values for multiple testing is a critical element of quantitative omic data analysis, yet many researchers are unfamiliar with the sensitivity costs and false discovery rate (FDR) benefits of p-value correction. We developed models of quantitative omic experiments, modeled the costs and benefits of p-value correction, and visualized the results with color-coded volcano plots. We developed an R Shiny web application for further exploration of these models which we call the Simulator of P-value Multiple Hypothesis Correction (SIMPLYCORRECT). We modeled experiments in which no analytes were truly differential between the control and test group (all null hypotheses true), all analytes were differential, or a mixture of differential and non-differential analytes were present. We corrected p-values using the Benjamini-Hochberg (BH), Bonferroni, and permutation FDR methods and compared the costs and benefits of each. By manipulating variables in the models, we demonstrated that increasing sample size or decreasing variability can reduce or eliminate the sensitivity cost of p-value correction and that permutation FDR correction can yield more hits than BH-adjusted and even unadjusted p-values in strongly differential data. SIMPLYCORRECT can serve as a tool in education and research to show how p-value adjustment and various parameters affect the results of quantitative omics experiments.

Download Full-text

EU ETS Market Interactions: A Multiple Hypothesis Testing Approach

SSRN Electronic Journal ◽

10.2139/ssrn.2161246 ◽

2012 ◽

Author(s):

Mark Cummins

Keyword(s):

Hypothesis Testing ◽

Multiple Hypothesis Testing ◽

Eu Ets ◽

Market Interactions ◽

Multiple Hypothesis ◽

Testing Approach ◽

Hypothesis Testing Approach

Download Full-text

An algorithm for construction of multiple hypothesis testing

Computational Statistics ◽

10.1007/s001800100057 ◽

2001 ◽

Vol 16 (1) ◽

pp. 165-171 ◽

Cited By ~ 7

Author(s):

Koon-Shing Kwong

Keyword(s):

Hypothesis Testing ◽

Multiple Hypothesis Testing ◽

Multiple Hypothesis

Download Full-text

Least Conservative Critical Boundaries of Multiple Hypothesis Testing in a Range of Correlation Values

Statistics in Biopharmaceutical Research ◽

10.1080/19466315.2021.1873842 ◽

2021 ◽

pp. 1-9

Author(s):

Jiangtao Gou

Keyword(s):

Hypothesis Testing ◽

Multiple Hypothesis Testing ◽

Multiple Hypothesis

Download Full-text

MULTIPLE HYPOTHESIS TESTING AND THE DECLINING-POPULATION PARADIGM IN STELLER SEA LIONS

Ecological Applications ◽

10.1890/07-1254.1 ◽

2008 ◽

Vol 18 (8) ◽

pp. 1932-1955 ◽

Cited By ~ 22

Author(s):

Nicholas Wolf ◽

Marc Mangel

Keyword(s):

Hypothesis Testing ◽

Multiple Hypothesis Testing ◽

Steller Sea Lions ◽

Sea Lions ◽

Multiple Hypothesis

Download Full-text

Blind Source Separation in the Time-Frequency Domain Based on Multiple Hypothesis Testing

IEEE Transactions on Signal Processing ◽

10.1109/tsp.2007.914316 ◽

2008 ◽

Vol 56 (6) ◽

pp. 2267-2279 ◽

Cited By ~ 10

Author(s):

L. Cirillo ◽

A. Zoubir ◽

M. Amin

Keyword(s):

Hypothesis Testing ◽

Frequency Domain ◽

Blind Source Separation ◽

Source Separation ◽

Multiple Hypothesis Testing ◽

Time Frequency ◽

Multiple Hypothesis

Download Full-text

System identification by sequential multiple hypothesis testing

10.1109/cdc.1977.271538 ◽

1977 ◽

Author(s):

H. Nebeker ◽

R. Tomanek ◽

D. Wiberg

Keyword(s):

System Identification ◽

Hypothesis Testing ◽

Multiple Hypothesis Testing ◽

Multiple Hypothesis

Download Full-text

Genome-wide association studies of rheumatoid arthritis data via multiple hypothesis testing methods for correlated tests

BMC Proceedings ◽

10.1186/1753-6561-3-s7-s38 ◽

2009 ◽

Vol 3 (Suppl 7) ◽

pp. S38 ◽

Cited By ~ 4

Author(s):

Guolian Kang ◽

Douglas K Childers ◽

Nianjun Liu ◽

Kui Zhang ◽

Guimin Gao

Keyword(s):

Rheumatoid Arthritis ◽

Hypothesis Testing ◽

Association Studies ◽

Multiple Hypothesis Testing ◽

Genome Wide Association ◽

Genome Wide Association Studies ◽

Testing Methods ◽

Multiple Hypothesis ◽

Genome Wide ◽

Rheumatoid Arthritis Data

Download Full-text