JOURNAL ARTICLE

Reinforced Risk Prediction With Budget Constraint Using Irregularly Measured Data From Electronic Health Records

Yinghao PanEric B. LaberMaureen A. SmithYing‐Qi Zhao

Year: 2021 Journal:   Journal of the American Statistical Association Vol: 118 (542)Pages: 1090-1101

Abstract

Uncontrolled glycated hemoglobin (HbA1c) levels are associated with adverse events among complex diabetic patients. These adverse events present serious health risks to affected patients and are associated with significant financial costs. Thus, a high-quality predictive model that could identify high-risk patients so as to inform preventative treatment has the potential to improve patient outcomes while reducing healthcare costs. Because the biomarker information needed to predict risk is costly and burdensome, it is desirable that such a model collect only as much information as is needed on each patient so as to render an accurate prediction. We propose a sequential predictive model that uses accumulating patient longitudinal data to classify patients as: high-risk, low-risk, or uncertain. Patients classified as high-risk are then recommended to receive preventative treatment and those classified as low-risk are recommended to standard care. Patients classified as uncertain are monitored until a high-risk or low-risk determination is made. We construct the model using claims and enrollment files from Medicare, linked with patient Electronic Health Records (EHR) data. The proposed model uses functional principal components to accommodate noisy longitudinal data and weighting to deal with missingness and sampling bias. The proposed method demonstrates higher predictive accuracy and lower cost than competing methods in a series of simulation experiments and application to data on complex patients with diabetes.

Keywords:
Weighting Missing data Medicine Computer science Data mining Health care Emergency medicine Machine learning

Metrics

2
Cited By
0.28
FWCI (Field Weighted Citation Impact)
35
Refs
0.64
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Machine Learning in Healthcare
Physical Sciences →  Computer Science →  Artificial Intelligence
Diabetes, Cardiovascular Risks, and Lipoproteins
Health Sciences →  Medicine →  Endocrinology, Diabetes and Metabolism
Diabetes Management and Research
Health Sciences →  Medicine →  Endocrinology, Diabetes and Metabolism
© 2026 ScienceGate Book Chapters — All rights reserved.