Causal Discovery in High-Dimensional Time Series with Latent Confounders via Score-Based Diffusion Models

Jeffrey A. Torres

doi:10.71465/csb167

ScienceGate Book Chapters

JOURNAL ARTICLE

Causal Discovery in High-Dimensional Time Series with Latent Confounders via Score-Based Diffusion Models

Jeffrey A. Torres

Year: 2025 Journal: Computer Science Bulletin Vol: 8 (01)Pages: 375-384

DOI: 10.71465/csb167

Get Full-Text PDF Get Analytical Report

Abstract

The identification of causal relationships from observational time series data constitutes a fundamental challenge across scientific disciplines, ranging from climate science to econometrics and systems biology. While classical constraint-based and score-based methods have achieved success in low-dimensional settings, they frequently falter when applied to high-dimensional data, particularly in the presence of latent confounders—unobserved variables that influence two or more observed variables, leading to spurious correlations. This paper introduces a novel framework, Causal-Diff, which leverages the generative power of score-based diffusion models to address these limitations. By modeling the time-dependent evolution of the data distribution via stochastic differential equations, we approximate the score function (the gradient of the log-density) to disentangle observed temporal dependencies from hidden confounding factors. Unlike traditional structural equation models that rely on rigid parametric assumptions, our approach utilizes the flexibility of deep neural networks to learn complex, non-linear causal mechanisms. We theoretically demonstrate that the score matching objective, when augmented with appropriate sparsity constraints and temporal masking, allows for the identifiability of the causal graph even under partial observability. Extensive experiments on both synthetic datasets and real-world functional magnetic resonance imaging (fMRI) data reveal that Causal-Diff significantly outperforms state-of-the-art baselines in terms of structural Hamming distance and orientation accuracy.

Keywords:

Spurious relationship Identifiability Latent variable Series (stratigraphy) Time series Synthetic data Matching (statistics) Generative model Parametric statistics Parametric model

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.78

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Functional Brain Connectivity Studies

Life Sciences → Neuroscience → Cognitive Neuroscience

Bayesian Modeling and Causal Inference

Physical Sciences → Computer Science → Artificial Intelligence

Machine Learning in Healthcare

Physical Sciences → Computer Science → Artificial Intelligence

Causal Discovery in High-Dimensional Time Series with Latent Confounders via Score-Based Diffusion Models

Abstract

Metrics

Topics

Related Documents

LPCMCI: Causal Discovery in Time Series with Latent Confounders

Characterization of causal ancestral graphs for time series with latent confounders

Causal Discovery from Markov Properties Under Latent Confounders

CAUSAL DISCOVERY FROM MARKOV PROPERTIES UNDER LATENT CONFOUNDERS

CUTS+: High-Dimensional Causal Discovery from Irregular Time-Series