JOURNAL ARTICLE

Nondominated Policy-Guided Learning in Multi-Objective Reinforcement Learning

Man-Je KimHyunsoo ParkChang Wook Ahn

Year: 2022 Journal:   Electronics Vol: 11 (7)Pages: 1069-1069   Publisher: Multidisciplinary Digital Publishing Institute

Abstract

Control intelligence is a typical field where there is a trade-off between target objectives, and researchers in this field have longed for artificial intelligence that achieves the target objectives. Multi-objective deep reinforcement learning was sufficient to satisfy this need. In particular, multi-objective deep reinforcement learning methods based on policy optimization are leading the optimization of control intelligence. However, multi-objective reinforcement learning has difficulties when finding various Pareto optimals of multi-objectives due to the greedy nature of reinforcement learning. We propose a method of policy assimilation to solve this problem. This method was applied to MO-V-MPO, one of preference-based multi-objective reinforcement learning, to increase diversity. The performance of this method has been verified through experiments in a continuous control environment.

Keywords:
Reinforcement learning Artificial intelligence Computer science Reinforcement Control (management) Machine learning Learning classifier system Pareto principle Mathematical optimization Engineering Mathematics

Metrics

2
Cited By
0.39
FWCI (Field Weighted Citation Impact)
22
Refs
0.59
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Reinforcement Learning in Robotics
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Multi-Objective Optimization Algorithms
Physical Sciences →  Computer Science →  Computational Theory and Mathematics
Adaptive Dynamic Programming Control
Physical Sciences →  Computer Science →  Computational Theory and Mathematics
© 2026 ScienceGate Book Chapters — All rights reserved.