JOURNAL ARTICLE

False discovery rate estimation for high-dimensional regression models

Abstract

A genome-wide association study (GWAS) aims to determine genetic variants statistically associated with phenotypes. However, because of linkage disequilibrium (LD), a characteristic of large-scale genomic datasets referring to the strong local dependencies between single-nucleotide polymorphisms (SNPs), it is usually challenging to identify the actual causal variants among their associated proxies. In this work, we propose a Bayesian variable selection method called the sparse mixed Gaussian prior for generalized linear models (SMG-GLM). It is an efficient high-dimensional Bayesian variable selection approach designed for arbitrary relationships between variants and phenotypes. Besides, it calibrates the selection uncertainty, which many popular variable selection methods do not address, by estimating posterior inclusion probabilities. We additionally combine SMG-GLM with knockoffs, named SMG-knockoffs, to account for the collinearity problem caused by LD. The SMG-knockoffs method can make inferences on the variable selection result and control the false discovery rate at an expected level. Its competence in discovering causal variables while controlling a desired false discovery rate has been shown in simulation studies conducted on a GWAS dataset.

Keywords:
False discovery rate Feature selection Collinearity Bayesian probability Posterior probability Selection (genetic algorithm) Prior probability Linkage disequilibrium Model selection

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.44
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Mycorrhizal Fungi and Plant Interactions
Life Sciences →  Agricultural and Biological Sciences →  Plant Science
Genomics and Phylogenetic Studies
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Plant Pathogens and Fungal Diseases
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Cell Biology

Related Documents

JOURNAL ARTICLE

Hypothesis testing for high-dimensional multivariate regression with false discovery rate control

Yunlong Zhu

Journal:   Communication in Statistics- Theory and Methods Year: 2021 Vol: 51 (21)Pages: 7476-7495
JOURNAL ARTICLE

Ah-knockoff: false discovery rate control in high-dimensional additive hazards models

Yan DongXin Zhou

Journal:   Journal of the Korean Statistical Society Year: 2025 Vol: 54 (3)Pages: 739-759
JOURNAL ARTICLE

False discovery rate procedures for high-dimensional data

KI Kyung In Kim

Journal:   Data Archiving and Networked Services (DANS) Year: 2008
JOURNAL ARTICLE

Improving false discovery rate estimation

Stanley PoundsCheng Cheng

Journal:   Bioinformatics Year: 2004 Vol: 20 (11)Pages: 1737-1745
© 2026 ScienceGate Book Chapters — All rights reserved.