JOURNAL ARTICLE

Automatic Feature Selection for Model-Based Reinforcement Learning in Factored MDPs

Abstract

Feature selection is an important challenge in machine learning. Unfortunately, most methods for automating feature selection are designed for supervised learning tasks and are thus either inapplicable or impractical for reinforcement learning. This paper presents a new approach to feature selection specifically designed for the challenges of reinforcement learning. In our method, the agent learns a model, represented as a dynamic Bayesian network, of a factored Markov decision process, deduces a minimal feature set from this network, and efficiently computes a policy on this feature set using dynamic programming methods. Experiments in a stock-trading benchmark task demonstrate that this approach can reliably deduce minimal feature sets and that doing so can substantially improve performance and reduce the computational expense of planning.

Keywords:
Reinforcement learning Computer science Artificial intelligence Markov decision process Machine learning Feature selection Feature (linguistics) Benchmark (surveying) Markov blanket Learning classifier system Set (abstract data type) Markov process Markov model Markov chain Variable-order Markov model

Metrics

41
Cited By
2.29
FWCI (Field Weighted Citation Impact)
48
Refs
0.92
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Reinforcement Learning in Robotics
Physical Sciences →  Computer Science →  Artificial Intelligence
Evolutionary Algorithms and Applications
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Bandit Algorithms Research
Social Sciences →  Decision Sciences →  Management Science and Operations Research

Related Documents

JOURNAL ARTICLE

Algorithm-Directed Exploration for Model-Based Reinforcement Learning in Factored MDPs

Carlos GuestrinRelu PatrascuDale Schuurmans

Journal:   International Conference on Machine Learning Year: 2002 Vol: 29 (3)Pages: 235-242
BOOK-CHAPTER

TeXDYNA: Hierarchical Reinforcement Learning in Factored MDPs

Olga KozlovaOlivier SigaudChristophe Meyer

Lecture notes in computer science Year: 2010 Pages: 489-500
JOURNAL ARTICLE

Near-optimal Reinforcement Learning in Factored MDPs

Ian OsbandBenjamin Van Roy

Journal:   arXiv (Cornell University) Year: 2014 Vol: 27 Pages: 604-612
© 2026 ScienceGate Book Chapters — All rights reserved.