DISSERTATION

A sufficient condition for sample-efficient reinforcement learning with general function approximation

Abstract

In this paper, we study reinforcement learning (RL) with general function approximation, where either the value function or the model dynamics is approximated by a given abstract hypothesis space. We propose the generalized eluder coefficient (GEC), which measures the hardness of generalization from the historical in-sample error to the prediction error, and further serves to measure the hardness of learning an RL problem. In terms of the algorithmic design, we propose an optimization-based framework for RL with general function approximation, following the general principle of “Optimism in the Face of Uncertainty” (OFU). Compared to existing algorithms, the proposed framework does not explicitly maintain the confidence set, and neatly handles both model-free and model-based problems wi...[ Read more ]

Keywords:
Reinforcement learning Generalization Bellman equation Function approximation Computer science Function (biology) Mathematical optimization Sample (material) Set (abstract data type) Space (punctuation) Applied mathematics Measure (data warehouse) Face (sociological concept) Mathematics Artificial intelligence Artificial neural network Mathematical analysis

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
0
Refs
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Reinforcement Learning in Robotics
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.