Nondominated Policy-Guided Learning in Multi-Objective Reinforcement Learning

Man-Je Kim; Hyunsoo Park; Chang Wook Ahn

doi:10.3390/electronics11071069

ScienceGate Book Chapters

JOURNAL ARTICLE

Nondominated Policy-Guided Learning in Multi-Objective Reinforcement Learning

Man-Je Kim Hyunsoo Park Chang Wook Ahn

Year: 2022 Journal: Electronics Vol: 11 (7)Pages: 1069-1069 Publisher: Multidisciplinary Digital Publishing Institute

DOI: 10.3390/electronics11071069

Get Full-Text PDF Get Analytical Report

Abstract

Control intelligence is a typical field where there is a trade-off between target objectives, and researchers in this field have longed for artificial intelligence that achieves the target objectives. Multi-objective deep reinforcement learning was sufficient to satisfy this need. In particular, multi-objective deep reinforcement learning methods based on policy optimization are leading the optimization of control intelligence. However, multi-objective reinforcement learning has difficulties when finding various Pareto optimals of multi-objectives due to the greedy nature of reinforcement learning. We propose a method of policy assimilation to solve this problem. This method was applied to MO-V-MPO, one of preference-based multi-objective reinforcement learning, to increase diversity. The performance of this method has been verified through experiments in a continuous control environment.

Keywords:

Reinforcement learning Artificial intelligence Computer science Reinforcement Control (management) Machine learning Learning classifier system Pareto principle Mathematical optimization Engineering Mathematics

Metrics

Cited By

0.39

FWCI (Field Weighted Citation Impact)

Refs

0.59

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Reinforcement Learning in Robotics

Physical Sciences → Computer Science → Artificial Intelligence

Advanced Multi-Objective Optimization Algorithms

Physical Sciences → Computer Science → Computational Theory and Mathematics

Adaptive Dynamic Programming Control

Physical Sciences → Computer Science → Computational Theory and Mathematics

Nondominated Policy-Guided Learning in Multi-Objective Reinforcement Learning

Abstract

Metrics

Citation History

Topics

Related Documents

Prediction Guided Meta-Learning for Multi-Objective Reinforcement Learning

Reinforcement Learning Guided Multi-Objective Exam Paper Generation

Multi-objective safe reinforcement learning: the relationship between multi-objective reinforcement learning and safe reinforcement learning

Learning-guided nondominated sorting genetic algorithm II for multi-objective satellite range scheduling problem

Multi-objective Reinforcement Learning with Path Integral Policy Improvement