JOURNAL ARTICLE

Optimal false discovery rate control for large scale multiple testing with auxiliary information

Hongyuan CaoJun ChenXianyang Zhang

Year: 2022 Journal:   The Annals of Statistics Vol: 50 (2)Pages: 807-857   Publisher: Institute of Mathematical Statistics

Abstract

Large-scale multiple testing is a fundamental problem in high dimensional statistical inference. It is increasingly common that various types of auxiliary information, reflecting the structural relationship among the hypotheses, are available. Exploiting such auxiliary information can boost statistical power. To this end, we propose a framework based on a two-group mixture model with varying probabilities of being null for different hypotheses a priori, where a shape-constrained relationship is imposed between the auxiliary information and the prior probabilities of being null. An optimal rejection rule is designed to maximize the expected number of true positives when average false discovery rate is controlled. Focusing on the ordered structure, we develop a robust EM algorithm to estimate the prior probabilities of being null and the distribution of p-values under the alternative hypothesis simultaneously. We show that the proposed method has better power than state-of-the-art competitors while controlling the false discovery rate, both empirically and theoretically. Extensive simulations demonstrate the advantage of the proposed method. Datasets from genome-wide association studies are used to illustrate the new methodology.

Keywords:
False discovery rate Statistical hypothesis testing Mathematics Null hypothesis Multiple comparisons problem Null distribution False positives and false negatives A priori and a posteriori Inference Null (SQL) Statistical power False positive paradox Statistical inference Type I and type II errors Data mining Algorithm Statistics Test statistic Computer science Artificial intelligence

Metrics

28
Cited By
10.02
FWCI (Field Weighted Citation Impact)
56
Refs
0.97
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Statistical Methods in Clinical Trials
Physical Sciences →  Mathematics →  Statistics and Probability
Gene expression and cancer classification
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Molecular Biology Techniques and Applications
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology

Related Documents

JOURNAL ARTICLE

Weighted False Discovery Rate Control in Large-Scale Multiple Testing

Pallavi BasuTommaso CaiKiranmoy DasWenguang Sun

Journal:   Journal of the American Statistical Association Year: 2017 Vol: 113 (523)Pages: 1172-1183
JOURNAL ARTICLE

False Discovery Control in Large-Scale Spatial Multiple Testing

Wenguang SunBrian J. ReichTommaso CaiMichele GuindaniArmin Schwartzman

Journal:   Journal of the Royal Statistical Society Series B (Statistical Methodology) Year: 2014 Vol: 77 (1)Pages: 59-83
JOURNAL ARTICLE

Directional False Discovery Rate Control in Large‐Scale Multiple Testing Under Data Dependence

Wendong LiJianqing ShiYi WangDongdong Xiang

Journal:   Applied Stochastic Models in Business and Industry Year: 2025 Vol: 41 (5)
JOURNAL ARTICLE

Directional false discovery rate control in large-scale multiple comparisons

Wenjuan LiangDongdong XiangYajun MeiWendong Li

Journal:   Journal of Applied Statistics Year: 2024 Vol: 51 (15)Pages: 3195-3214
© 2026 ScienceGate Book Chapters — All rights reserved.