JOURNAL ARTICLE

Tight conditions for consistent variable selection in high dimensional nonparametric regression

Laëtitia CommingesArnak S. Dalalyan

Year: 2011 Journal:   arXiv (Cornell University) Pages: 187-206   Publisher: Cornell University

Abstract

We address the issue of variable selection in the regression model with very high ambient dimension, i.e., when the number of covariates is very large. The main focus is on the situation where the number of relevant covariates, called intrinsic dimension, is much smaller than the ambient dimension. Without assuming any parametric form of the underlying regression function, we get tight conditions making it possible to consistently estimate the set of relevant variables. These conditions relate the intrinsic dimension to the ambient dimension and to the sample size. The procedure that is provably consistent under these tight conditions is simple and is based on comparing the empirical Fourier coefficients with an appropriately chosen threshold value.

Keywords:
Covariate Dimension (graph theory) Intrinsic dimension Mathematics Nonparametric statistics Statistics Regression analysis Regression Parametric statistics Nonparametric regression Variable (mathematics) Feature selection Sufficient dimension reduction Sample size determination Computer science Mathematical analysis Artificial intelligence Combinatorics

Metrics

14
Cited By
0.00
FWCI (Field Weighted Citation Impact)
27
Refs
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Statistical Methods and Inference
Physical Sciences →  Mathematics →  Statistics and Probability
Statistical Methods and Bayesian Inference
Physical Sciences →  Mathematics →  Statistics and Probability
Bayesian Methods and Mixture Models
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.