JOURNAL ARTICLE

Renewable Estimation and Incremental Inference in Generalized Linear Models with Streaming Data Sets

Lan LuoPeter X.‐K. Song

Year: 2019 Journal:   Journal of the Royal Statistical Society Series B (Statistical Methodology) Vol: 82 (1)Pages: 69-97   Publisher: Oxford University Press

Abstract

Summary The paper presents an incremental updating algorithm to analyse streaming data sets using generalized linear models. The method proposed is formulated within a new framework of renewable estimation and incremental inference, in which the maximum likelihood estimator is renewed with current data and summary statistics of historical data. Our framework can be implemented within a popular distributed computing environment, known as Apache Spark, to scale up computation. Consisting of two data-processing layers, the rho architecture enables us to accommodate inference-related statistics and to facilitate sequential updating of the statistics used in both estimation and inference. We establish estimation consistency and asymptotic normality of the proposed renewable estimator, in which the Wald test is utilized for an incremental inference. Our methods are examined and illustrated by various numerical examples from both simulation experiments and a real world data analysis.

Keywords:
Inference Estimator Computer science Consistency (knowledge bases) Statistical inference Wald test Data mining Algorithm Statistical hypothesis testing Mathematics Statistics Artificial intelligence

Metrics

94
Cited By
3.99
FWCI (Field Weighted Citation Impact)
49
Refs
0.95
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Gaussian Processes and Bayesian Inference
Physical Sciences →  Computer Science →  Artificial Intelligence
Data Stream Mining Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Statistical Methods and Inference
Physical Sciences →  Mathematics →  Statistics and Probability

Related Documents

JOURNAL ARTICLE

Online inference in high-dimensional generalized linear models with streaming data

Lan LuoRuijian HanYuanyuan LinJian Huang

Journal:   Electronic Journal of Statistics Year: 2023 Vol: 17 (2)Pages: 3443-3471
JOURNAL ARTICLE

Renewable estimation in expectile regression model with streaming data sets

Yingli PanJun LiuZhan Liu

Journal:   Journal of Statistical Computation and Simulation Year: 2024 Vol: 94 (17)Pages: 3767-3787
JOURNAL ARTICLE

Constrained inference for generalized linear models with incomplete covariate data

Karelyn DavisSanjoy K. SinhaChul Gyu Park

Journal:   Journal of Statistical Computation and Simulation Year: 2013 Vol: 85 (4)Pages: 693-710
© 2026 ScienceGate Book Chapters — All rights reserved.