JOURNAL ARTICLE

Learning Sparse Gaussian Bayesian Network Structure by Variable Grouping

Abstract

Bayesian networks (BNs) are popular for modeling conditional distributions of variables and causal relationships, especially in biological settings such as protein interactions, gene regulatory networks and microbial interactions. Previous BN structure learning algorithms treat variables with similar tendency separately. In this paper, we propose a grouped sparse Gaussian BN (GSGBN) structure learning algorithm which creates BN based on three assumptions: (i) variables follow a multivariate Gaussian distribution, (ii) the network only contains a few edges (sparse), (iii) similar variables have less-divergent sets of parents, while not-so-similar ones should have divergent sets of parents (variable grouping). We use L 1 regularization to make the learned network sparse, and another term to incorporate shared information among variables. For similar variables, GSGBN tends to penalize the differences of similar variables' parent sets more, compared to those not-so-similar variables' parent sets. The similarity of variables is learned from the data by alternating optimization, without prior domain knowledge. Based on this new definition of the optimal BN, a coordinate descent algorithm and a projected gradient descent algorithm are developed to obtain edges of the network and also similarity of variables. Experimental results on both simulated and real datasets show that GSGBN has substantially superior prediction performance for structure learning when compared to several existing algorithms.

Keywords:
Bayesian network Artificial intelligence Gaussian Coordinate descent Similarity (geometry) Latent variable Computer science Multivariate normal distribution Gradient descent Variable (mathematics) Bayesian probability Regularization (linguistics) Machine learning Prior probability Conditional independence Mathematics Algorithm Multivariate statistics Artificial neural network Image (mathematics)

Metrics

1
Cited By
0.00
FWCI (Field Weighted Citation Impact)
39
Refs
0.06
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Bayesian Modeling and Causal Inference
Physical Sciences →  Computer Science →  Artificial Intelligence
Gaussian Processes and Bayesian Inference
Physical Sciences →  Computer Science →  Artificial Intelligence
Machine Learning and Data Classification
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.