JOURNAL ARTICLE

Incorporating prior knowledge into regularized regression

Chubing ZengDuncan C. ThomasJuan Pablo Lewinger

Year: 2020 Journal:   Bioinformatics Vol: 37 (4)Pages: 514-521   Publisher: Oxford University Press

Abstract

Abstract Motivation Associated with genomic features like gene expression, methylation and genotypes, used in statistical modeling of health outcomes, there is a rich set of meta-features like functional annotations, pathway information and knowledge from previous studies, that can be used post hoc to facilitate the interpretation of a model. However, using this meta-feature information a priori rather than post hoc can yield improved prediction performance as well as enhanced model interpretation. Results We propose a new penalized regression approach that allows a priori integration of external meta-features. The method extends LASSO regression by incorporating individualized penalty parameters for each regression coefficient. The penalty parameters are, in turn, modeled as a log-linear function of the meta-features and are estimated from the data using an approximate empirical Bayes approach. Optimization of the marginal likelihood on which the empirical Bayes estimation is performed using a fast and stable majorization–minimization procedure. Through simulations, we show that the proposed regression with individualized penalties can outperform the standard LASSO in terms of both parameters estimation and prediction performance when the external data is informative. We further demonstrate our approach with applications to gene expression studies of bone density and breast cancer. Availability and implementation The methods have been implemented in the R package xtune freely available for download from https://cran.r-project.org/web/packages/xtune/index.html.

Keywords:
Regression Computer science Regression analysis Artificial intelligence Machine learning Statistics Mathematics

Metrics

23
Cited By
3.93
FWCI (Field Weighted Citation Impact)
48
Refs
0.94
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Statistical Methods and Inference
Physical Sciences →  Mathematics →  Statistics and Probability
Control Systems and Identification
Physical Sciences →  Engineering →  Control and Systems Engineering
Gaussian Processes and Bayesian Inference
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

DISSERTATION

Incorporating prior knowledge into regularized regression

Zeng, Chubing (author)

University:   University of Southern California Digital Library Year: 2020
JOURNAL ARTICLE

Incorporating Prior Knowledge into Kernel Based Regression

Zhe SunZengke ZhangHuangang Wang

Journal:   ACTA AUTOMATICA SINICA Year: 2009 Vol: 34 (12)Pages: 1515-1521
JOURNAL ARTICLE

Incorporating Prior Knowledge into Kernel Based Regression

Zhe SunZengke ZhangHuangang Wang

Journal:   Acta Automatica Sinica Year: 2008 Vol: 34 (12)Pages: 1515-1521
JOURNAL ARTICLE

Incorporating prior knowledge in support vector regression

Fabien LauerGérard Bloch

Journal:   Machine Learning Year: 2007 Vol: 70 (1)Pages: 89-118
© 2026 ScienceGate Book Chapters — All rights reserved.