Mine Melodi Caliskan, Saeed Ghoorchian, Setareh Maghsudi
State-adversarial perturbations, arising from sensor spoofing, environmental interference, or targeted attacks, corrupt observations and invalidate the state-wise optimality assumptions commonly made in inverse reinforcement learning (IRL). We study IRL in state-adversarial MDPs (SA-MDPs), where only perturbed states are observable, and propose SAMM-IRL, a max-margin IRL framework that operates purely in the belief (perturbed) space without access to clean states. In place of point-wise, state-wise optimality, we adopt a robust optimality notion based on the expected return over the initial-state distribution, which remains well-posed under adversarial observation mappings. We prove (i) the existence of robust optimal policies in SA-MDPs, (ii) contraction properties of the intermediate RL operators under fixed and adaptive adversaries, and (iii) iteration bounds for the SAMM-IRL max-margin updates in belief space. Empirically, on a discrete GridWorld and continuous-control tasks, SAMM-IRL achieves stronger reward recovery and imitation performance under adversarial observations than baseline methods while maintaining stable policy updates. We also report perturbation parameters and ablation results in the main text to support reproducibility and practical deployment.
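The abstract does not spell out the belief-space max-margin updates, so the following is only a minimal sketch: it applies the classical max-margin projection method of Abbeel and Ng to a toy chain SA-MDP with a fixed adversary, with both expert and learner feature expectations estimated from perturbed observations. All names (step, perturb, solve_reward, EPS_ADV, and the tabular Q-iteration stand-in for the robust inner RL step) are illustrative, not from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy SA-MDP: a 5-state chain; the adversary shifts the *observed* state.
N_S, N_A, GAMMA = 5, 2, 0.9   # states, actions (left/right), discount
EPS_ADV = 0.2                 # probability an observation is perturbed

def step(s, a):
    """True dynamics: action 0 moves left, action 1 moves right."""
    return max(0, s - 1) if a == 0 else min(N_S - 1, s + 1)

def perturb(s):
    """Fixed adversary: with prob EPS_ADV, shift the observed state by +/-1."""
    if rng.random() < EPS_ADV:
        return int(np.clip(s + rng.choice([-1, 1]), 0, N_S - 1))
    return s

def phi(obs):
    """One-hot features of the (perturbed) observation."""
    f = np.zeros(N_S)
    f[obs] = 1.0
    return f

def feature_expectations(policy, n_rollouts=200, horizon=30):
    """Monte-Carlo discounted feature expectations in the belief (perturbed) space."""
    mu = np.zeros(N_S)
    for _ in range(n_rollouts):
        s = 0
        for t in range(horizon):
            obs = perturb(s)             # the agent only sees the perturbed state
            mu += (GAMMA ** t) * phi(obs)
            s = step(s, policy[obs])     # but the true state evolves underneath
    return mu / n_rollouts

def solve_reward(w, n_iter=100):
    """Inner RL step: tabular Q-iteration for reward w^T phi.
    (A simple stand-in for the paper's robust policy-optimization step.)"""
    Q = np.zeros((N_S, N_A))
    for _ in range(n_iter):
        for s in range(N_S):
            for a in range(N_A):
                s2 = step(s, a)
                Q[s, a] = w[s] + GAMMA * Q[s2].max()
    return Q.argmax(axis=1)

# Expert always moves right; its feature expectations are also estimated
# through the perturbed channel, matching the belief-space setting.
mu_E = feature_expectations(np.ones(N_S, dtype=int))

# Max-margin projection loop (Abbeel & Ng style), run purely in belief space.
policy = rng.integers(N_A, size=N_S)
mu_bar = feature_expectations(policy)
for i in range(10):
    w = mu_E - mu_bar                    # margin direction = reward weights
    if np.linalg.norm(w) < 1e-2:
        break
    policy = solve_reward(w)
    mu = feature_expectations(policy)
    d = mu - mu_bar
    if d @ d < 1e-12:
        break
    mu_bar = mu_bar + (d @ (mu_E - mu_bar)) / (d @ d) * d   # projection step
    print(f"iter {i}: margin = {np.linalg.norm(mu_E - mu_bar):.4f}")
```

Note that the expert demonstrations are matched through the same perturbed observation channel the learner faces, so the margin shrinks entirely in belief space; this is the sense in which the sketch mirrors the abstract's "no access to clean states" setting.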