JOURNAL ARTICLE

Contrastive Learning Methods for Deep Reinforcement Learning

Di WangMengqi Hu

Year: 2023 Journal:   IEEE Access Vol: 11 Pages: 97107-97117   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Deep reinforcement learning (DRL) has shown promising performance in various application areas (e.g., games and autonomous vehicles). Experience replay buffer strategy and parallel learning strategy are widely used to boost the performances of offline and online deep reinforcement learning algorithms. However, state-action distribution shifts lead to bootstrap errors. Experience replay buffer learns policies with elder experience trajectories, limiting its application to off-policy algorithms. Balancing the new and the old experience is challenging. Parallel learning strategies can train policies with online experiences. However, parallel environmental instances organize the agent pool inefficiently with higher simulation or physical costs. To overcome these shortcomings, we develop four lightweight and effective DRL algorithms, instance-actor, parallel-actor, instance-critic, and parallel-critic methods, to contrast different-age trajectory experiences. We train the contrast DRL according to the received rewards and proposed contrast loss, which is calculated by designed positive/negative keys. Our benchmark experiments using PyBullet robotics environments show that our proposed algorithm matches or is better than the state-of-the-art DRL algorithms.

Keywords:
Reinforcement learning Computer science Benchmark (surveying) Contrast (vision) Artificial intelligence Limiting Trajectory Robotics Deep learning Action (physics) Machine learning Robot

Metrics

12
Cited By
3.07
FWCI (Field Weighted Citation Impact)
76
Refs
0.90
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Reinforcement Learning in Robotics
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Bandit Algorithms Research
Social Sciences →  Decision Sciences →  Management Science and Operations Research
Mobile Crowdsensing and Crowdsourcing
Physical Sciences →  Computer Science →  Computer Science Applications

Related Documents

JOURNAL ARTICLE

Contrastive Learning-Based Agent Modeling for Deep Reinforcement Learning

Wenhao MaYu‐Cheng ChangJie YangYu–Kai WangChin‐Teng Lin

Journal:   IEEE Transactions on Emerging Topics in Computational Intelligence Year: 2025 Vol: 9 (5)Pages: 3719-3726
JOURNAL ARTICLE

Deep reinforcement learning methods

Λαζαρίδης, Αριστοτέλης Λ.

Journal:   Aristotle University of Thessaloniki Year: 2023
BOOK-CHAPTER

CARL: Efficient Deep Reinforcement Learning via Contrastive Replay

Jitender KumarB. GopiK. E. Srinivasa Desikan

Lecture notes on data engineering and communications technologies Year: 2025 Pages: 36-46
© 2026 ScienceGate Book Chapters — All rights reserved.