JOURNAL ARTICLE

Policy Gradient Adaptive Dynamic Programming for Model-Free Multi-Objective Optimal Control

Hao ZhangYan LiZhuping WangYi DingHuaicheng Yan

Year: 2023 Journal:   IEEE/CAA Journal of Automatica Sinica Vol: 11 (4)Pages: 1060-1062   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Dear Editor, In this letter, the multi-objective optimal control problem of nonlinear discrete-time systems is investigated. A data-driven policy gradient algorithm is proposed in which the action-state value function is used to evaluate the policy. In the policy improvement process, the policy gradient based method is employed, which can improve the performance of the system and finally derive the optimal policy in the Pareto sense. The actor-critic structure is established to implement the algorithm. In order to improve the efficiency of data usage and enhance the learning effect, the experience replay technology is used during the training process, with both offline data and online data. Finally, simulation is given to illustrate the effectiveness of the method.

Keywords:
Computer science Reinforcement learning Bellman equation Dynamic programming Process (computing) Mathematical optimization Optimal control Function (biology) Control (management) Data-driven Control theory (sociology) Artificial intelligence Algorithm Mathematics

Metrics

3
Cited By
0.93
FWCI (Field Weighted Citation Impact)
10
Refs
0.73
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Adaptive Dynamic Programming Control
Physical Sciences →  Computer Science →  Computational Theory and Mathematics
Reinforcement Learning in Robotics
Physical Sciences →  Computer Science →  Artificial Intelligence
Mechanical Circulatory Support Devices
Physical Sciences →  Engineering →  Biomedical Engineering
© 2026 ScienceGate Book Chapters — All rights reserved.