Contrastive Learning Methods for Deep Reinforcement Learning

Di Wang; Mengqi Hu

doi:10.1109/access.2023.3312383

ScienceGate Book Chapters

JOURNAL ARTICLE

Contrastive Learning Methods for Deep Reinforcement Learning

Di Wang Mengqi Hu

Year: 2023 Journal: IEEE Access Vol: 11 Pages: 97107-97117 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/access.2023.3312383

Get Full-Text PDF Get Analytical Report

Abstract

Deep reinforcement learning (DRL) has shown promising performance in various application areas (e.g., games and autonomous vehicles). Experience replay buffer strategy and parallel learning strategy are widely used to boost the performances of offline and online deep reinforcement learning algorithms. However, state-action distribution shifts lead to bootstrap errors. Experience replay buffer learns policies with elder experience trajectories, limiting its application to off-policy algorithms. Balancing the new and the old experience is challenging. Parallel learning strategies can train policies with online experiences. However, parallel environmental instances organize the agent pool inefficiently with higher simulation or physical costs. To overcome these shortcomings, we develop four lightweight and effective DRL algorithms, instance-actor, parallel-actor, instance-critic, and parallel-critic methods, to contrast different-age trajectory experiences. We train the contrast DRL according to the received rewards and proposed contrast loss, which is calculated by designed positive/negative keys. Our benchmark experiments using PyBullet robotics environments show that our proposed algorithm matches or is better than the state-of-the-art DRL algorithms.

Keywords:

Reinforcement learning Computer science Benchmark (surveying) Contrast (vision) Artificial intelligence Limiting Trajectory Robotics Deep learning Action (physics) Machine learning Robot

Metrics

Cited By

3.07

FWCI (Field Weighted Citation Impact)

Refs

0.90

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Reinforcement Learning in Robotics

Physical Sciences → Computer Science → Artificial Intelligence

Advanced Bandit Algorithms Research

Social Sciences → Decision Sciences → Management Science and Operations Research

Mobile Crowdsensing and Crowdsourcing

Physical Sciences → Computer Science → Computer Science Applications

Contrastive Learning Methods for Deep Reinforcement Learning

Abstract

Metrics

Citation History

Topics

Related Documents

Contrastive Learning-Based Agent Modeling for Deep Reinforcement Learning

Deep reinforcement learning methods

Deep reinforcement learning methods

CARL: Efficient Deep Reinforcement Learning via Contrastive Replay

Generalized Representation Learning Methods for Deep Reinforcement Learning