JOURNAL ARTICLE

Selection of optimal regression models via cross‐validation

David W. Osten

Year: 1988 Journal:   Journal of Chemometrics Vol: 2 (1)Pages: 39-48   Publisher: Wiley

Abstract

Abstract A general problem arising in the development of regression models is the selection of the optimal model. Whenever a feature selection procedure, such as step forward, backward elimination, best subset or all possible combinations, or when a data compression approach, such as principal components or partial least‐squares regression, is used, the question of how many regression terms to include in the final model must be addressed. This work describes the evaluation of four different criteria for selection of the optimal predictive regression model using cross‐validation. The results obtained in this work illustrate the problems which can arise in the analysis of small or inadequately sampled data sets. The common approach, selecting the model which yields the absolute minimum in the predictive residual error sum of squares (PRESS), was found to have particularly poor statistical properties. A very simple change to a criterion based on the first local minimum in PRESS will provide a significant improvement in the cross‐validation result. A criterion based on testing the significance of incremental changes in PRESS with an F ‐test may provide more robust performance than the local minimum in PRESS method.

Keywords:
Cross-validation Residual Selection (genetic algorithm) Model selection Regression Feature selection Regression analysis Partial least squares regression Computer science Mathematics Statistics Linear regression Principal component regression Residual sum of squares Total least squares Algorithm Artificial intelligence

Metrics

271
Cited By
8.34
FWCI (Field Weighted Citation Impact)
8
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Statistical Methods and Models
Physical Sciences →  Mathematics →  Statistics and Probability
Spectroscopy and Chemometric Analyses
Physical Sciences →  Chemistry →  Analytical Chemistry
Statistical and numerical algorithms
Physical Sciences →  Mathematics →  Applied Mathematics

Related Documents

JOURNAL ARTICLE

Cross-Validation of Regression Models

Richard R. PicardR. Dennis Cook

Journal:   Journal of the American Statistical Association Year: 1984 Vol: 79 (387)Pages: 575-575
JOURNAL ARTICLE

Cross-Validation of Regression Models

Richard R. PicardR. Dennis Cook

Journal:   Journal of the American Statistical Association Year: 1984 Vol: 79 (387)Pages: 575-583
JOURNAL ARTICLE

Procrustes cross-validation of multivariate regression models

Sergey KucheryavskiyOxana Ye. RodionovaAlexey L. Pomerantsev

Journal:   Analytica Chimica Acta Year: 2023 Vol: 1255 Pages: 341096-341096
JOURNAL ARTICLE

Parameterized cross-validation for nonlinear regression models

Imhoi KooNamgil LeeRhee Man Kil

Journal:   Neurocomputing Year: 2008 Vol: 71 (16-18)Pages: 3089-3095
© 2026 ScienceGate Book Chapters — All rights reserved.