We consider the problem of learning in a factored-state Markov decision process whose structure permits a compact representation. We show that the well-known algorithm factored Rmax performs near-optimally on all but a number of timesteps that is polynomial in the size of the compact representation, which is often exponentially smaller than the number of states. This matches the result obtained by Kearns and Koller for their DBN-E3 algorithm, except that we conduct the analysis in a more general setting. We also extend the results to a new algorithm, factored IE, which uses the interval-estimation approach to exploration and can be expected to outperform factored Rmax on most domains.
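To illustrate the difference in exploration strategies mentioned above, here is a minimal sketch (not the paper's implementation) contrasting an Rmax-style optimistic value with an interval-estimation (IE) style upper confidence bound for a single state-action pair. The reward bound `R_MAX`, the visit threshold `M`, the confidence parameter `delta`, and the Hoeffding-style width are all assumptions for illustration:

```python
import math

R_MAX = 1.0  # assumed upper bound on reward (assumption)
M = 10       # Rmax visit threshold (assumption)

def rmax_value(rewards):
    """Rmax-style: treat an under-sampled action as maximally rewarding."""
    if len(rewards) < M:
        return R_MAX
    return sum(rewards) / len(rewards)

def ie_value(rewards, delta=0.05):
    """IE-style: empirical mean plus a Hoeffding-style confidence width."""
    n = len(rewards)
    if n == 0:
        return R_MAX
    mean = sum(rewards) / n
    width = R_MAX * math.sqrt(math.log(2.0 / delta) / (2.0 * n))
    return min(R_MAX, mean + width)
```

Under this sketch, Rmax keeps the value pinned at the optimistic maximum until a fixed visit count is reached, whereas IE's bonus shrinks smoothly with the number of samples, which is why IE can stop over-exploring an action sooner.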