A sufficient condition for sample-efficient reinforcement learning with general function approximation

Wei Xiong

doi:10.14711/thesis-991013222953603412

DISSERTATION

Year: 2023

Abstract

In this paper, we study reinforcement learning (RL) with general function approximation, where either the value function or the model dynamics is approximated by a given abstract hypothesis space. We propose the generalized eluder coefficient (GEC), which measures the hardness of generalization from the historical in-sample error to the prediction error, and further serves to measure the hardness of learning an RL problem. In terms of the algorithmic design, we propose an optimization-based framework for RL with general function approximation, following the general principle of “Optimism in the Face of Uncertainty” (OFU). Compared to existing algorithms, the proposed framework does not explicitly maintain the confidence set, and neatly handles both model-free and model-based problems wi...
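The idea of being optimistic without explicitly maintaining a confidence set can be illustrated with a toy sketch. The example below is not the dissertation's algorithm: it assumes a one-step (bandit-style) problem, a small finite hypothesis class of candidate mean-reward vectors, and a squared-error in-sample loss. The names `select_hypothesis`, `run`, and the weight `eta` are hypothetical and chosen only for illustration. At each round the learner picks the hypothesis that maximizes its own promised value minus `eta` times its cumulative error on the data seen so far, then acts greedily with respect to that hypothesis.

```python
import random


def select_hypothesis(hypotheses, history, eta):
    """Return the hypothesis maximizing (promised value) - eta * (in-sample error).

    hypotheses: sequence of tuples, one predicted mean reward per action.
    history: list of (action, observed_reward) pairs collected so far.
    eta: assumed trade-off weight between optimism and data fit.
    """
    def objective(f):
        value = max(f)  # value f promises under its own greedy action
        loss = sum((f[a] - r) ** 2 for a, r in history)  # historical squared error
        return value - eta * loss

    return max(hypotheses, key=objective)


def run(hypotheses, true_means, rounds=200, eta=1.0, noise=0.1, seed=0):
    """Interact with a toy one-step environment and return the actions taken."""
    rng = random.Random(seed)
    history, actions = [], []
    for _ in range(rounds):
        f = select_hypothesis(hypotheses, history, eta)
        a = max(range(len(f)), key=lambda i: f[i])  # greedy action under f
        r = true_means[a] + rng.gauss(0.0, noise)   # noisy reward from the true model
        history.append((a, r))
        actions.append(a)
    return actions


if __name__ == "__main__":
    candidates = [(0.9, 0.1), (0.2, 0.8), (0.5, 0.5)]
    print(run(candidates, true_means=(0.2, 0.8), rounds=10))
```

In this sketch an over-optimistic hypothesis is tried first, its in-sample error then grows once its prediction is contradicted by data, and the regularized objective moves the learner to a hypothesis that fits the observations. No set of plausible hypotheses is ever constructed; the penalty term plays that role.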

Keywords:

Reinforcement learning; Generalization; Bellman equation; Function approximation; Computer science; Function (biology); Mathematical optimization; Sample (material); Set (abstract data type); Space (punctuation); Applied mathematics; Measure (data warehouse); Face (sociological concept); Mathematics; Artificial intelligence; Artificial neural network; Mathematical analysis

Topics

Reinforcement Learning in Robotics

Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

Towards Sample Efficient Reinforcement Learning with Function Approximation

Sample-Efficient Constrained Reinforcement Learning with General Parameterization

Provably Efficient Reinforcement Learning with Linear Function Approximation

Provably Efficient Reinforcement Learning with Multinomial Logit Function Approximation

Randomized Exploration for Reinforcement Learning with General Value Function Approximation