JOURNAL ARTICLE

Offline Quantum Reinforcement Learning in a Conservative Manner

Zhihao ChengKaining ZhangLi ShenDacheng Tao

Year: 2023 Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Vol: 37 (6)Pages: 7148-7156   Publisher: Association for the Advancement of Artificial Intelligence

Abstract

Recently, to reap the quantum advantage, empowering reinforcement learning (RL) with quantum computing has attracted much attention, which is dubbed as quantum RL (QRL). However, current QRL algorithms employ an online learning scheme, i.e., the policy that is run on a quantum computer needs to interact with the environment to collect experiences, which could be expensive and dangerous for practical applications. In this paper, we aim to solve this problem in an offline learning manner. To be more specific, we develop the first offline quantum RL (offline QRL) algorithm named CQ2L (Conservative Quantum Q-learning), which learns from offline samples and does not require any interaction with the environment. CQ2L utilizes variational quantum circuits (VQCs), which are improved with data re-uploading and scaling parameters, to represent Q-value functions of agents. To suppress the overestimation of Q-values resulting from offline data, we first employ a double Q-learning framework to reduce the overestimation bias; then a penalty term that encourages generating conservative Q-values is designed. We conduct abundant experiments to demonstrate that the proposed method CQ2L can successfully solve offline QRL tasks that the online counterpart could not.

Keywords:
Reinforcement learning Computer science Upload Offline learning Quantum computer Quantum Online and offline Scheme (mathematics) Q-learning Theoretical computer science Artificial intelligence Algorithm Online learning Mathematics Quantum mechanics

Metrics

6
Cited By
0.87
FWCI (Field Weighted Citation Impact)
78
Refs
0.66
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Quantum Computing Algorithms and Architecture
Physical Sciences →  Computer Science →  Artificial Intelligence
Quantum Information and Cryptography
Physical Sciences →  Computer Science →  Artificial Intelligence
Neural Networks and Reservoir Computing
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Conservative network for offline reinforcement learning

Zhiyong PengYadong LiuHaoqiang ChenZongtan Zhou

Journal:   Knowledge-Based Systems Year: 2023 Vol: 282 Pages: 111101-111101
BOOK-CHAPTER

Stable Conservative Q-Learning for Offline Reinforcement Learning

Zhenyuan Ji

Advances in computer science research Year: 2023 Pages: 175-184
© 2026 ScienceGate Book Chapters — All rights reserved.