JOURNAL ARTICLE

Sure Independence Screening for Ultrahigh Dimensional Feature Space

Jianqing Fan, Jinchi Lv

Year: 2008   Journal: Journal of the Royal Statistical Society Series B (Statistical Methodology)   Vol: 70 (5)   Pages: 849-911   Publisher: Oxford University Press

Abstract

Variable selection plays an important role in high dimensional statistical modelling which nowadays appears in many areas and is key to various scientific discoveries. For problems of large scale or dimensionality p, accuracy of estimation and computational cost are two top concerns. Recently, Candes and Tao have proposed the Dantzig selector using L1-regularization and showed that it achieves the ideal risk up to a logarithmic factor log(p). Their innovative procedure and remarkable result are challenged when the dimensionality is ultrahigh as the factor log(p) can be large and their uniform uncertainty principle can fail. Motivated by these concerns, we introduce the concept of sure screening and propose a sure screening method that is based on correlation learning, called sure independence screening, to reduce dimensionality from high to a moderate scale that is below the sample size. In a fairly general asymptotic framework, correlation learning is shown to have the sure screening property for even exponentially growing dimensionality. As a methodological extension, iterative sure independence screening is also proposed to enhance its finite sample performance. With dimension reduced accurately from high to below sample size, variable selection can be improved on both speed and accuracy, and can then be accomplished by a well-developed method such as smoothly clipped absolute deviation, the Dantzig selector, lasso or adaptive lasso. The connections between these penalized least squares methods are also elucidated.
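
The screening step described in the abstract (rank features by marginal correlation with the response, then keep a subset smaller than the sample size) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the function names `sis_screen` and `make_toy_data`, the toy data, and the default submodel size d = floor(n / log n) are assumptions made for this example and are not stated in the abstract.

```python
import math
import random


def sis_screen(X, y, d=None):
    """Keep the d features with the largest absolute marginal correlation with y.

    X is a list of n rows (each a list of p feature values); y is a list of n responses.
    Returns feature indices, strongest marginal correlation first.
    """
    n, p = len(X), len(X[0])
    if d is None:
        # Assumed default for this sketch: a submodel size below n.
        d = max(1, int(n / math.log(n)))
    d = min(d, p)

    y_mean = sum(y) / n
    y_centered = [v - y_mean for v in y]
    y_norm = math.sqrt(sum(v * v for v in y_centered))

    scores = []
    for j in range(p):
        col = [row[j] for row in X]
        c_mean = sum(col) / n
        c_centered = [v - c_mean for v in col]
        c_norm = math.sqrt(sum(v * v for v in c_centered))
        if c_norm == 0.0 or y_norm == 0.0:
            # A constant column (or constant response) carries no marginal signal.
            scores.append(0.0)
            continue
        dot = sum(a * b for a, b in zip(c_centered, y_centered))
        scores.append(abs(dot / (c_norm * y_norm)))

    ranked = sorted(range(p), key=lambda j: scores[j], reverse=True)
    return ranked[:d]


def make_toy_data(n=60, p=500, seed=0):
    """Hypothetical sparse linear model: only features 0, 1 and 2 affect y."""
    rng = random.Random(seed)
    X = [[rng.gauss(0.0, 1.0) for _ in range(p)] for _ in range(n)]
    y = [5.0 * r[0] - 4.0 * r[1] + 3.0 * r[2] + rng.gauss(0.0, 1.0) for r in X]
    return X, y


X, y = make_toy_data()
kept = sis_screen(X, y)  # indices of the retained features, fewer than n of them
```

After screening, the retained columns `kept` would be handed to one of the penalized methods named in the abstract (for example lasso or smoothly clipped absolute deviation) to do the final variable selection on a problem whose dimension is now below the sample size.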

Keywords:
Curse of dimensionality; Independence (probability theory); Sample size determination; Lasso (programming language); Dimension (graph theory); Logarithm; Dimensionality reduction; Regularization (linguistics); Mathematics; Feature selection; Distance correlation; Scale (ratio); Mathematical optimization; Computer science; Algorithm; Artificial intelligence; Statistics; Random variable

Metrics

Cited By: 2709
FWCI (Field Weighted Citation Impact): 56.52
Refs: 170
Citation Normalized Percentile: 1.00
Is in top 1%
Is in top 10%

Topics

Statistical Methods and Inference
Physical Sciences → Mathematics → Statistics and Probability
Control Systems and Identification
Physical Sciences → Engineering → Control and Systems Engineering
Advanced Statistical Methods and Models
Physical Sciences → Mathematics → Statistics and Probability

Related Documents

JOURNAL ARTICLE

Sure independence screening in ultrahigh dimensional generalized additive models

Guangren Yang, Weixin Yao, Sijia Xiang

Journal: Journal of Statistical Planning and Inference   Year: 2018   Vol: 199   Pages: 126-135

JOURNAL ARTICLE

Robust sure independence screening for ultrahigh dimensional non-normal data

Wei Zhong

Journal: Acta Mathematica Sinica English Series   Year: 2014   Vol: 30 (11)   Pages: 1885-1896

JOURNAL ARTICLE

ExSIS: Extended sure independence screening for ultrahigh-dimensional linear models

Talal Ahmed, Waheed U. Bajwa

Journal: Signal Processing   Year: 2019   Vol: 159   Pages: 33-48

JOURNAL ARTICLE

Ultrahigh-Dimensional Multiclass Linear Discriminant Analysis by Pairwise Sure Independence Screening

Rui Pan, Hansheng Wang, Runze Li

Journal: Journal of the American Statistical Association   Year: 2015   Vol: 111 (513)   Pages: 169-179