Policy Gradient Adaptive Dynamic Programming for Model-Free Multi-Objective Optimal Control

Hao Zhang; Yan Li; Zhuping Wang; Yi Ding; Huaicheng Yan

doi:10.1109/jas.2023.123381

ScienceGate Book Chapters

JOURNAL ARTICLE

Policy Gradient Adaptive Dynamic Programming for Model-Free Multi-Objective Optimal Control

Hao Zhang Yan Li Zhuping Wang Yi Ding Huaicheng Yan

Year: 2023 Journal: IEEE/CAA Journal of Automatica Sinica Vol: 11 (4)Pages: 1060-1062 Publisher: Institute of Electrical and Electronics Engineers

DOI: 10.1109/jas.2023.123381

Get Full-Text PDF Get Analytical Report

Abstract

Dear Editor, In this letter, the multi-objective optimal control problem of nonlinear discrete-time systems is investigated. A data-driven policy gradient algorithm is proposed in which the action-state value function is used to evaluate the policy. In the policy improvement process, the policy gradient based method is employed, which can improve the performance of the system and finally derive the optimal policy in the Pareto sense. The actor-critic structure is established to implement the algorithm. In order to improve the efficiency of data usage and enhance the learning effect, the experience replay technology is used during the training process, with both offline data and online data. Finally, simulation is given to illustrate the effectiveness of the method.

Keywords:

Computer science Reinforcement learning Bellman equation Dynamic programming Process (computing) Mathematical optimization Optimal control Function (biology) Control (management) Data-driven Control theory (sociology) Artificial intelligence Algorithm Mathematics

Metrics

Cited By

0.93

FWCI (Field Weighted Citation Impact)

Refs

0.73

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Adaptive Dynamic Programming Control

Physical Sciences → Computer Science → Computational Theory and Mathematics

Reinforcement Learning in Robotics

Physical Sciences → Computer Science → Artificial Intelligence

Mechanical Circulatory Support Devices

Physical Sciences → Engineering → Biomedical Engineering

Policy Gradient Adaptive Dynamic Programming for Model-Free Multi-Objective Optimal Control

Abstract

Metrics

Citation History

Topics

Related Documents

Deterministic policy gradient adaptive dynamic programming for model-free optimal control

Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control

Optimal consensus control for multi‐agent systems: Multi‐step policy gradient adaptive dynamic programming method

Policy Iteration Adaptive Dynamic Programming for Optimal Control of Multi-Player Stackelberg-Nash Games

Model-Free Off-Policy Iterative Adaptive Dynamic Programming for Nitrate-Nitrogen Concentration Control