JOURNAL ARTICLE

Online model learning in adversarial Markov decision processes

Doran ChakrabortyPeter Stone

Year: 2010 Journal:   Adaptive Agents and Multi-Agents Systems Pages: 1583-1584

Abstract

Consider, for example, the well-known game of Roshambo (Figure 1), or rock-paper-scissors, in which two players select one of three actions simultaneously. One may know that the adversary will base its next action on some bounded sequence of the past joint actions, but may be unaware of its exact strategy. For example, one may notice that every time it selects P, the adversary selects S in the next step; or perhaps whenever it selects R in three of the last four steps, the adversary selects P 90% of the time in the next step. The challenge is that to begin with, neither the adversary function that maps action histories to future actions (may be stochastic), nor even how far back it looks back in the action history (other than an upper bound) may be known. At a high level, this paper is concerned with automatically building such predictive models of an adversary's future actions as a function of past interactions.

Keywords:
Adversary Adversarial system Computer science Notice Action (physics) Markov decision process Sequence (biology) Function (biology) Adversary model Artificial intelligence Markov process Theoretical computer science Mathematical economics Computer security Mathematics Law

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
6
Refs
0.09
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Reinforcement Learning in Robotics
Physical Sciences →  Computer Science →  Artificial Intelligence
Markov Chains and Monte Carlo Methods
Physical Sciences →  Mathematics →  Statistics and Probability
Adversarial Robustness in Machine Learning
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Online Convex Optimization in Adversarial Markov Decision Processes

Aviv RosenbergYishay Mansour

Journal:   arXiv (Cornell University) Year: 2019 Pages: 5478-5486
JOURNAL ARTICLE

Learning Adversarial Markov Decision Processes with Delayed Feedback

Tal LancewickiAviv RosenbergYishay Mansour

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2022 Vol: 36 (7)Pages: 7281-7289
JOURNAL ARTICLE

Online Markov Decision Processes

Eyal Even-DarSham M. KakadeYishay Mansour

Journal:   Mathematics of Operations Research Year: 2009 Vol: 34 (3)Pages: 726-736
JOURNAL ARTICLE

Online Learning in Weakly Coupled Markov Decision Processes

Xiaohan WeiHao YuMichael J. Neely

Journal:   ACM SIGMETRICS Performance Evaluation Review Year: 2018 Vol: 46 (1)Pages: 56-58
© 2026 ScienceGate Book Chapters — All rights reserved.