Multi-objective Reinforcement Learning with Path Integral Policy Improvement

Ryo Ariizumi; Hayato Sago; Toru Asai; Shun‐ichi Azuma

doi:10.23919/sice59929.2023.10354223

JOURNAL ARTICLE

Year: 2023 Pages: 1418-1423

Abstract

Multi-objective reinforcement learning (MORL) for robot motion learning is a challenging problem, not only because data are scarce but also because the state and action spaces are high-dimensional and continuous. Most existing MORL algorithms are inadequate in this regard. In single-objective reinforcement learning, however, policy-based algorithms have solved the problem of high-dimensional and continuous state and action spaces. Among such algorithms is policy improvement with path integrals ($\mathrm{PI}^{2}$), which has been successful in robot motion learning. $\mathrm{PI}^{2}$ is similar to evolution strategies (ES), and multi-objective optimization is an active research topic for ES algorithms. This paper proposes a MORL algorithm based on $\mathrm{PI}^{2}$ and multi-objective ES that can handle the problems associated with robot motion learning. Its effectiveness is shown via numerical simulations.
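
To illustrate why a path-integral update resembles an evolution strategy, the following is a minimal sketch and is not taken from the paper. It assumes a simplified, black-box form of the update in which whole parameter vectors are perturbed and averaged with exponentiated-cost weights. The names `pi2_style_update` and `dominates`, and the parameters `n_rollouts`, `sigma`, and `h`, are hypothetical and chosen only for this example. The `dominates` helper shows the Pareto comparison that multi-objective ES methods typically build on; how the proposed algorithm combines the two is not specified in the abstract.

```python
import math
import random


def pi2_style_update(theta, cost_fn, n_rollouts=20, sigma=0.1, h=10.0, rng=None):
    """One reward-weighted averaging step in a simplified PI^2 style (sketch)."""
    if rng is None:
        rng = random.Random()
    # Sample Gaussian perturbations of the policy parameters, as an ES would.
    eps = [[rng.gauss(0.0, sigma) for _ in theta] for _ in range(n_rollouts)]
    # Evaluate the scalar cost of each perturbed parameter vector.
    costs = [cost_fn([t + e for t, e in zip(theta, ek)]) for ek in eps]
    s_min, s_max = min(costs), max(costs)
    spread = (s_max - s_min) or 1.0  # avoid division by zero when all costs tie
    # Exponentiated, normalized costs: low-cost rollouts get most of the weight.
    weights = [math.exp(-h * (s - s_min) / spread) for s in costs]
    total = sum(weights)
    probs = [w / total for w in weights]
    # New parameters: old ones plus the probability-weighted mean perturbation.
    return [
        t + sum(p * ek[i] for p, ek in zip(probs, eps))
        for i, t in enumerate(theta)
    ]


def dominates(a, b):
    """True if cost vector a Pareto-dominates b (no worse in all, better in one)."""
    return all(x <= y for x, y in zip(a, b)) and any(x < y for x, y in zip(a, b))


if __name__ == "__main__":
    rng = random.Random(0)
    theta = [1.0, -1.0]
    quadratic = lambda th: sum(t * t for t in th)  # toy single-objective cost
    for _ in range(100):
        theta = pi2_style_update(theta, quadratic, rng=rng)
    print("final cost:", quadratic(theta))
    print("dominates:", dominates([1.0, 2.0], [2.0, 2.0]))
```

In this sketch the only difference from a basic ES step is the weighting rule: every sampled perturbation contributes in proportion to its exponentiated cost instead of being selected or discarded by rank.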

Keywords:

Reinforcement learning; Computer science; Motion (physics); Action (physics); Path (computing); Robot; State (computer science); Mathematical optimization; Artificial intelligence; Q-learning; Path integral formulation; Machine learning; Algorithm; Mathematics

Metrics

FWCI (Field Weighted Citation Impact): 0.31

Citation Normalized Percentile: 0.59

Topics

Advanced Multi-Objective Optimization Algorithms

Physical Sciences → Computer Science → Computational Theory and Mathematics

Viral Infectious Diseases and Gene Expression in Insects

Life Sciences → Biochemistry, Genetics and Molecular Biology → Molecular Biology

Reinforcement Learning in Robotics

Physical Sciences → Computer Science → Artificial Intelligence

Related Documents

Multi-objective path integral policy improvement for learning robotic motion

Automatic Temperature Parameter Tuning for Reinforcement Learning Using Path Integral Policy Improvement

Multi-objective Path Finding Using Reinforcement Learning

Knowledge transfer in multi-objective multi-agent reinforcement learning via generalized policy improvement

Multi-Objective Dynamic Path Planning with Multi-Agent Deep Reinforcement Learning