JOURNAL ARTICLE

Multiple imputation for high-dimensional mixed incomplete continuous and binary data

Ren HeThomas R. Belin

Year: 2014 Journal:   Statistics in Medicine Vol: 33 (13)Pages: 2251-2262   Publisher: Wiley

Abstract

It is common in applied research to have large numbers of variables measured on a modest number of cases. Even with low rates of missingness of individual variables, such data sets can have a large number of incomplete cases with a mix of data types. Here, we propose a new joint modeling approach to address the high-dimensional incomplete data with a mix of continuous and binary data. Specifically, we propose a multivariate normal model encompassing both continuous variables and latent variables corresponding to binary variables. We apply a parameter-extended Metropolis–Hastings algorithm to generate the covariance matrix of a mixture of continuous and binary variables. We also introduce prior distribution families for unstructured covariance matrices to reduce the dimension of the parameter space. In several simulation settings, the method is compared with available-case analysis, a rounding method, and a sequential regression method.

Keywords:
Binary data Missing data Rounding Imputation (statistics) Covariance Latent variable Binary number Computer science Multivariate statistics Statistics Multivariate normal distribution Covariance matrix Mathematics

Metrics

8
Cited By
1.29
FWCI (Field Weighted Citation Impact)
0
Refs
0.82
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Statistical Methods and Bayesian Inference
Physical Sciences →  Mathematics →  Statistics and Probability
Bayesian Modeling and Causal Inference
Physical Sciences →  Computer Science →  Artificial Intelligence
Bayesian Methods and Mixture Models
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.