JOURNAL ARTICLE

Cafe: Improved Federated Data Imputation by Leveraging Missing Data Heterogeneity

Sitao MinHafiz AsifXinyue WangJaideep Vaidya

Year: 2025 Journal:   IEEE Transactions on Knowledge and Data Engineering Vol: 37 (5)Pages: 2266-2281   Publisher: IEEE Computer Society

Abstract

Federated learning (FL), a decentralized machine learning approach, offers great performance while alleviating autonomy and confidentiality concerns. Despite FL's popularity, how to deal with missing values in a federated manner is not well understood. In this work, we initiate a study of federated imputation of missing values, particularly in complex scenarios, where missing data heterogeneity exists and the state-of-the-art (SOTA) approaches for federated imputation suffer from significant loss in imputation quality. We propose Cafe, a personalized FL approach for missing data imputation. Cafe is inspired from the observation that heterogeneity can induce differences in observable and missing data distribution across clients, and that these differences can be leveraged to improve the imputation quality. Cafe computes personalized weights that are automatically calibrated for the level of heterogeneity, which can remain unknown, to develop personalized imputation models for each client. An extensive empirical evaluation over a variety of settings demonstrates that Cafe matches the performance of SOTA baselines in homogeneous settings while significantly outperforming the baselines in heterogeneous settings.

Keywords:
Computer science Imputation (statistics) Missing data Data mining Data modeling Information retrieval Database Machine learning

Metrics

6
Cited By
28.92
FWCI (Field Weighted Citation Impact)
83
Refs
0.99
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Privacy-Preserving Technologies in Data
Physical Sciences →  Computer Science →  Artificial Intelligence
Data Quality and Management
Social Sciences →  Decision Sciences →  Management Science and Operations Research
Cryptography and Data Security
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

SICE: an improved missing data imputation technique

Shahidul Islam KhanAhasanul Hoque

Journal:   Greater South Information System Year: 2020
JOURNAL ARTICLE

SICE: an improved missing data imputation technique

Shahidul Islam KhanAhasanul Hoque

Journal:   Greater South Information System Year: 2020
JOURNAL ARTICLE

SICE: an improved missing data imputation technique

Shahidul Islam KhanAbu Sayed Md. Latiful Hoque

Journal:   Journal Of Big Data Year: 2020 Vol: 7 (1)Pages: 37-37
BOOK-CHAPTER

Missing Data Imputation

Chapman & Hall/CRC biostatistics series Year: 2011 Pages: 275-290
© 2026 ScienceGate Book Chapters — All rights reserved.