JOURNAL ARTICLE

Sparse data-driven random projection in regression for high-dimensional data

Roman ParzerPeter FilzmoserLaura Vana-Gür

Year: 2025 Journal:   Journal of Data Science Statistics and Visualisation Vol: 5 (5)

Abstract

We examine the linear regression problem in a challenging high-dimensional setting with correlated predictors where the degree of sparsity of the coefficients is unknown and can vary from sparse to dense. In this setting, we propose a combination of probabilistic variable screening with random projection tools as a computationally efficient approach. In particular, we introduce a new data-driven random projection for dimension reduction in linear regression,which is motivated by a theoretical bound on the gain in expected prediction error over conventional random projections when using information about the true coefficient. The variables to be included in the projection are screened by considering the correlation of the predictors. To reduce the dependence on fine-tuning choices, we aggregate over an ensemble of linear models. A threshold parameter is introduced to obtain a higher degree of sparsity, which can be chosen together with the number of models in the ensemble by cross-validation.In extensive simulations, we compare the proposed method with other random projection tools and with well-known methods, and show that it is competitive in terms of prediction in a variety of scenarios with different sparsity and predictor covariance settings, while most competitors are targeted at either sparse or dense settings.Finally, we illustrate the method on two data applications.

Keywords:
Random projection Regression Projection (relational algebra) Computer science Statistics Data mining Mathematics Artificial intelligence Pattern recognition (psychology) Algorithm

Metrics

1
Cited By
4.82
FWCI (Field Weighted Citation Impact)
32
Refs
0.93
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Gaussian Processes and Bayesian Inference
Physical Sciences →  Computer Science →  Artificial Intelligence
Face and Expression Recognition
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
© 2026 ScienceGate Book Chapters — All rights reserved.