JOURNAL ARTICLE

Data-based Optimal Control for Discrete-time Systems via Deep Deterministic Policy Gradient Adaptive Dynamic Programming

Abstract

This paper addresses the model-free optimal control problem for discrete-time systems using a deep deterministic policy gradient adaptive dynamic programming (DDPGADP) algorithm. System data are collected through off-policy learning, and the control law is updated by policy gradient. Convergence of the DDPGADP algorithm is established by showing that the Q-function sequence is monotonically non-increasing and converges to the optimum. To implement the method, an actor-critic neural network structure is built, and the target network technique from deep Q-learning is adopted during network training. Finally, simulation examples verify the effectiveness of the proposed method.
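
The ingredients named in the abstract (off-policy data, a critic that estimates the Q-function, a policy-gradient actor update, and slowly moving target copies) can be illustrated with a toy sketch. This is not the paper's implementation. It assumes a scalar linear plant x_next = a*x + b*u, a stage cost x^2 + u^2, a quadratic critic and a linear actor u = -k*x in place of the neural networks. All function names, parameters, and step sizes are hypothetical choices made for illustration. The learner uses only the sampled tuples (x, u, cost, x_next); the plant constants a and b serve only to generate data.

```python
import random


def soft_update(target, online, tau):
    """Target-network update: move the target a fraction tau toward the online value."""
    return (1.0 - tau) * target + tau * online


def q_value(w, x, u):
    """Quadratic critic Q(x, u) = w0*x^2 + w1*x*u + w2*u^2 (assumed form)."""
    return w[0] * x * x + w[1] * x * u + w[2] * u * u


def riccati_gain(a, b, iters=1000):
    """Model-based reference gain for cost x^2 + u^2 (scalar Riccati iteration)."""
    p = 1.0
    for _ in range(iters):
        p = 1.0 + a * a * p - (a * b * p) ** 2 / (1.0 + b * b * p)
    return a * b * p / (1.0 + b * b * p)


def train(a=0.9, b=0.5, steps=20000, tau=0.05,
          lr_critic=0.1, lr_actor=0.01, seed=0):
    rng = random.Random(seed)
    w = [0.0, 0.0, 0.0]      # online critic weights
    w_targ = list(w)         # target critic weights
    k = 0.0                  # online actor gain, u = -k * x
    k_targ = 0.0             # target actor gain
    for _ in range(steps):
        # Off-policy data: state and action come from a random behavior
        # policy, not from the actor being learned.
        x = rng.uniform(-1.0, 1.0)
        u = rng.uniform(-1.0, 1.0)
        x_next = a * x + b * u          # plant response (data generation only)
        cost = x * x + u * u

        # Critic: temporal-difference step toward a target built from the
        # target critic and target actor (undiscounted, an assumption).
        y = cost + q_value(w_targ, x_next, -k_targ * x_next)
        delta = y - q_value(w, x, u)
        phi = (x * x, x * u, u * u)
        w = [wi + lr_critic * delta * pi for wi, pi in zip(w, phi)]

        # Actor: deterministic policy gradient, chain rule dQ/du * du/dk,
        # descending because Q is a cost here.
        dq_du = w[1] * x + 2.0 * w[2] * (-k * x)
        k -= lr_actor * dq_du * (-x)

        # Target copies track the online parameters slowly.
        w_targ = [soft_update(t, o, tau) for t, o in zip(w_targ, w)]
        k_targ = soft_update(k_targ, k, tau)
    return w, k


if __name__ == "__main__":
    weights, gain = train()
    print("learned gain:", gain, "reference gain:", riccati_gain(0.9, 0.5))
```

In this toy setting the learned gain can be compared against the model-based Riccati gain, which gives a simple sanity check that the data-driven updates head toward the optimal control law.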

Keywords:
Computer science; Monotonic function; Artificial neural network; Dynamic programming; Convergence (economics); Reinforcement learning; Process (computing); Mathematical optimization; Function (biology); Gradient method; Optimal control; Discrete time and continuous time; Sequence (biology); Adaptive control; Control theory (sociology); Control (management); Algorithm; Artificial intelligence; Mathematics

Metrics

Cited By: 0
FWCI (Field Weighted Citation Impact): 0.00
Refs: 24
Citation Normalized Percentile: 0.15

Topics

Adaptive Dynamic Programming Control
Physical Sciences →  Computer Science →  Computational Theory and Mathematics
Reinforcement Learning in Robotics
Physical Sciences →  Computer Science →  Artificial Intelligence
Mechanical Circulatory Support Devices
Physical Sciences →  Engineering →  Biomedical Engineering

Related Documents

JOURNAL ARTICLE

Twin Deterministic Policy Gradient Adaptive Dynamic Programming for Optimal Control of Affine Nonlinear Discrete-time Systems

Jiahui Xu, Jingcheng Wang, Jun Rao, Yanjiu Zhong, Shangwei Zhao

Journal: International Journal of Control, Automation and Systems, Year: 2022, Vol: 20 (9), Pages: 3098-3109

JOURNAL ARTICLE

Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control

Biao Luo, Derong Liu, Huai‐Ning Wu, Ding Wang, Frank L. Lewis

Journal: IEEE Transactions on Cybernetics, Year: 2016, Vol: 47 (10), Pages: 3341-3354

JOURNAL ARTICLE

Event-Triggered Control of Discrete-Time Zero-Sum Games via Deterministic Policy Gradient Adaptive Dynamic Programming

Yongwei Zhang, Bo Zhao, Derong Liu, Shunchao Zhang

Journal: IEEE Transactions on Systems, Man, and Cybernetics: Systems, Year: 2021, Vol: 52 (8), Pages: 4823-4835