JOURNAL ARTICLE

Controlling the False Discovery Rate for Feature Selection in High‐resolution NMR Spectra

Seoung Bum KimVictoria C. P. ChenYoungja ParkThomas R. ZieglerDean P. Jones

Year: 2008 Journal:   Statistical Analysis and Data Mining The ASA Data Science Journal Vol: 1 (2)Pages: 57-66   Publisher: Wiley

Abstract

Abstract Successful implementation of feature selection in nuclear magnetic resonance (NMR) spectra not only improves classification ability, but also simplifies the entire modeling process and, thus, reduces computational and analytical efforts. Principal component analysis (PCA) and partial least squares (PLS) have been widely used for feature selection in NMR spectra. However, extracting meaningful metabolite features from the reduced dimensions obtained through PCA or PLS is complicated because these reduced dimensions are linear combinations of a large number of the original features. In this paper, we propose a multiple testing procedure controlling false discovery rate (FDR) as an efficient method for feature selection in NMR spectra. The procedure clearly compensates for the limitation of PCA and PLS and identifies individual metabolite features necessary for classification. In addition, we present orthogonal signal correction to improve classification and visualization by removing unnecessary variations in NMR spectra. Our experimental results with real NMR spectra showed that classification models constructed with the features selected by our proposed procedure yielded smaller misclassification rates than those with all features. Copyright © 2008 Wiley Periodicals, Inc., A Wiley Company Statistical Analy Data Mining 1: 000‐000, 2008

Keywords:
False discovery rate Principal component analysis Pattern recognition (psychology) Feature selection Partial least squares regression Artificial intelligence Feature (linguistics) NMR spectra database Computer science Selection (genetic algorithm) Visualization Data mining Spectral line Chemistry Machine learning Physics

Metrics

21
Cited By
0.92
FWCI (Field Weighted Citation Impact)
33
Refs
0.75
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Spectroscopy and Chemometric Analyses
Physical Sciences →  Chemistry →  Analytical Chemistry
Metabolomics and Mass Spectrometry Studies
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Fault Detection and Control Systems
Physical Sciences →  Engineering →  Control and Systems Engineering

Related Documents

JOURNAL ARTICLE

Federated feature selection with false discovery rate control

Jie HuJiayi TongYang NingCheng Yong TangJason H. MooreRunze LiYong Chen

Journal:   Journal of the Royal Statistical Society Series B (Statistical Methodology) Year: 2025
JOURNAL ARTICLE

Optimal Bayesian Feature Selection with Bounded False Discovery Rate

Ali Foroughi pourLori A. Dalton

Journal:   2018 52nd Asilomar Conference on Signals, Systems, and Computers Year: 2018 Vol: 45 Pages: 1202-1206
JOURNAL ARTICLE

Hierarchical False Discovery Rate–Controlling Methodology

Daniel Yekutieli

Journal:   Journal of the American Statistical Association Year: 2008 Vol: 103 (481)Pages: 309-316
© 2026 ScienceGate Book Chapters — All rights reserved.