JOURNAL ARTICLE

Sparse Projected Averaged Regression for High-Dimensional Data

Abstract

We examine the linear regression problem in a challenging high-dimensional setting with correlated predictors to explain and predict relevant quantities, with explicitly allowing the regression coefficient to vary from sparse to dense. Most classical high-dimensional regression estimators require some degree of sparsity. We discuss the more recent concepts of variable screening and random projection as computationally fast dimension reduction tools, and propose a new random projection matrix tailored to the linear regression problem with a theoretical bound on the gain in expected prediction error over conventional random projections. Around this new random projection, we built the Sparse Projected Averaged Regression (SPAR) method combining probabilistic variable screening steps with the random projection steps to obtain an ensemble of small linear models. In difference to existing methods, we introduce a thresholding parameter to obtain some degree of sparsity. In extensive simulations and two real data applications we guide through the elements of this method and compare prediction and variable selection performance to various competitors. For prediction, our method performs at least as good as the best competitors in most settings with a high number of truly active variables, while variable selection remains a hard task for all methods in high dimensions.

Keywords:
Feature selection Linear regression Estimator Design matrix Linear predictor function Projection (relational algebra) Random projection Regression Random variable Thresholding

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
0.43
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Statistical Methods and Inference
Physical Sciences →  Mathematics →  Statistics and Probability
Stochastic Gradient Optimization Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Gaussian Processes and Bayesian Inference
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Sparse sliced inverse regression for high dimensional data analysis

Haileab HilafuSandra E. Safo

Journal:   BMC Bioinformatics Year: 2022 Vol: 23 (1)Pages: 168-168
JOURNAL ARTICLE

Sparse data-driven random projection in regression for high-dimensional data

Roman ParzerPeter FilzmoserLaura Vana-Gür

Journal:   Journal of Data Science Statistics and Visualisation Year: 2025 Vol: 5 (5)
JOURNAL ARTICLE

Online sparse sliced inverse regression for high-dimensional streaming data

Jianjun XuWenquan CuiHaoyang Cheng

Journal:   International Journal of Wavelets Multiresolution and Information Processing Year: 2022 Vol: 21 (02)
© 2026 ScienceGate Book Chapters — All rights reserved.