JOURNAL ARTICLE

Tuning Random Forests for Causal Inference under Cluster-Level Unmeasured Confounding

Abstract

Interest in machine learning methods for causal inference has grown because they can model the propensity score and the outcome model automatically and flexibly. However, almost all of these methods have been studied under the assumption of no unmeasured confounding, and there is little work on handling omitted or unmeasured variable bias. This paper focuses on Causal Forests, a machine learning method based on random forests, and presents five simple modifications for tuning Causal Forests so that they are robust to cluster-level unmeasured confounding. Our simulation study finds that adjusting the default tuning procedure with the propensity score from fixed effects logistic regression, or using variables that are centered at their cluster means, produces estimates that are more robust to cluster-level unmeasured confounding. Also, when these parametric propensity score models are misspecified, our modified machine learning methods remain robust to bias from cluster-level unmeasured confounders, compared with existing parametric approaches based on propensity score weighting. We conclude by demonstrating our proposals in a real data study, drawn from the Early Childhood Longitudinal Study, of the effect of taking an eighth-grade algebra course on math achievement scores.
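The abstract mentions using variables centered at their cluster means. As an illustration only, the following is a minimal sketch of that generic preprocessing step, not the authors' implementation: the function name `center_within_clusters` and the toy data are hypothetical. Centering removes between-cluster differences in a variable, so only within-cluster variation remains.

```python
from collections import defaultdict


def center_within_clusters(values, cluster_ids):
    """Subtract each cluster's mean from the values observed in that cluster."""
    if len(values) != len(cluster_ids):
        raise ValueError("values and cluster_ids must have the same length")

    # Accumulate per-cluster sums and counts in one pass.
    totals = defaultdict(float)
    counts = defaultdict(int)
    for value, cluster in zip(values, cluster_ids):
        totals[cluster] += value
        counts[cluster] += 1

    # Cluster means, then the demeaned (within-cluster) values.
    means = {cluster: totals[cluster] / counts[cluster] for cluster in totals}
    return [value - means[cluster] for value, cluster in zip(values, cluster_ids)]


# Hypothetical toy data: one covariate for five students in two schools.
scores = [10.0, 12.0, 14.0, 30.0, 34.0]
schools = ["A", "A", "A", "B", "B"]
centered = center_within_clusters(scores, schools)
print(centered)
```

In this toy example the two schools differ greatly in their average score, yet the centered values are on the same scale, because each value is now expressed relative to its own school's mean.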

Keywords:
Causal inference; Propensity score matching; Random forest; Instrumental variable; Confounding; Inference; Outcome (game theory); Logistic regression; Causal model

Metrics

Cited By: 0
FWCI (Field Weighted Citation Impact): 0.00
Refs: 0
Citation Normalized Percentile: 0.22

Topics

Advanced Causal Inference Techniques
Physical Sciences → Mathematics → Statistics and Probability
Bayesian Modeling and Causal Inference
Physical Sciences → Computer Science → Artificial Intelligence
Psychometric Methodologies and Testing
Social Sciences → Decision Sciences → Management Science and Operations Research

Related Documents

JOURNAL ARTICLE

Tuning Random Forests for Causal Inference under Cluster-Level Unmeasured Confounding

Youmi Suk, Hyunseung Kang

Journal: Multivariate Behavioral Research Year: 2022 Vol: 58 (2) Pages: 408-440

JOURNAL ARTICLE

Causal Inference with Unmeasured Confounding: A Minimax Perspective

SÉRGIO DE ANDRADE, PAULO

Journal: Zenodo (CERN European Organization for Nuclear Research) Year: 2025