Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU

Mohammad Babaeizadeh; Iuri Frosio; Stephen Tyree; Jason Clemons; Jan Kautz

doi:10.48550/arxiv.1611.06256

ScienceGate Book Chapters

JOURNAL ARTICLE

Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU

Mohammad Babaeizadeh Iuri Frosio Stephen Tyree Jason Clemons Jan Kautz

Year: 2016 Journal: arXiv (Cornell University) Pages: 1-12 Publisher: Cornell University

DOI: 10.48550/arxiv.1611.06256

Get Full-Text PDF Get Analytical Report

Abstract

We introduce a hybrid CPU/GPU version of the Asynchronous Advantage Actor-Critic (A3C) algorithm, currently the state-of-the-art method in reinforcement learning for various gaming tasks. We analyze its computational traits and concentrate on aspects critical to leveraging the GPU's computational power. We introduce a system of queues and a dynamic scheduling strategy, potentially helpful for other asynchronous algorithms as well. Our hybrid CPU/GPU version of A3C, based on TensorFlow, achieves a significant speed up compared to a CPU implementation; we make it publicly available to other researchers at https://github.com/NVlabs/GA3C .

Keywords:

Computer science Reinforcement learning Asynchronous communication Queue Scheduling (production processes) Parallel computing Central processing unit Artificial intelligence Distributed computing Programming language Operating system Mathematical optimization

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Reinforcement Learning in Robotics

Physical Sciences → Computer Science → Artificial Intelligence

Evolutionary Algorithms and Applications

Physical Sciences → Computer Science → Artificial Intelligence

Parallel Computing and Optimization Techniques

Physical Sciences → Computer Science → Hardware and Architecture

Reinforcement Learning through Asynchronous Advantage Actor-Critic on a GPU

Abstract

Metrics

Citation History

Topics

Related Documents

Quantum Advantage Actor-Critic for Reinforcement Learning

Application of Improved Asynchronous Advantage Actor Critic Reinforcement Learning Model on Anomaly Detection

Locating algorithm of steel stock area with asynchronous advantage actor-critic reinforcement learning

Variational value learning in advantage actor-critic reinforcement learning

An Asynchronous Advantage Actor-Critic Reinforcement Learning Method for Stock Selection and Portfolio Management