JOURNAL ARTICLE

Multi-objective Reinforcement Learning with Path Integral Policy Improvement

Abstract

Multi-objective reinforcement learning (MORL) for robot motion learning is a challenging problem not only because of the scarcity of the data but also of the high-dimensional and continuous state and action spaces. Most existing MORL algorithms are inadequate in this regard. However, in single-objective reinforcement learning, policy-based algorithms have solved the problem of high-dimensional and continuous state and action spaces. Among such algorithms is policy improvement with path integral (PI2), which has been successful in robot motion learning. P$\mathrm{I}^{2}$ is similar to evolution strategies (ES), and multi-objective optimization is a hot topic in ES algorithms. This paper proposes a MORL algorithm based on P$\mathrm{I}^{2}$ and multi-objective ES, which can handle the problem related to robot motion learning. The effectiveness is shown via numerical simulations.

Keywords:
Reinforcement learning Computer science Motion (physics) Action (physics) Path (computing) Robot State (computer science) Mathematical optimization Artificial intelligence Q-learning Path integral formulation Machine learning Algorithm Mathematics

Metrics

1
Cited By
0.31
FWCI (Field Weighted Citation Impact)
16
Refs
0.59
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Advanced Multi-Objective Optimization Algorithms
Physical Sciences →  Computer Science →  Computational Theory and Mathematics
Viral Infectious Diseases and Gene Expression in Insects
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Reinforcement Learning in Robotics
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Multi-objective path integral policy improvement for learning robotic motion

Hayato SagoRyo AriizumiToru AsaiShun‐ichi Azuma

Journal:   Artificial Life and Robotics Year: 2025 Vol: 30 (3)Pages: 534-545
JOURNAL ARTICLE

Automatic Temperature Parameter Tuning for Reinforcement Learning Using Path Integral Policy Improvement

Hiroyasu NakanoRyo AriizumiToru AsaiShun‐ichi Azuma

Journal:   IEEE Transactions on Neural Networks and Learning Systems Year: 2023 Vol: 35 (12)Pages: 18200-18211
JOURNAL ARTICLE

Knowledge transfer in multi-objective multi-agent reinforcement learning via generalized policy improvement

Virgı́lio AlmeidaLucas N. AlegreAna L. C. Bazzan

Journal:   Computer Science and Information Systems Year: 2023 Vol: 21 (1)Pages: 335-362
JOURNAL ARTICLE

Multi-Objective Dynamic Path Planning with Multi-Agent Deep Reinforcement Learning

Tao MengQiang LiJiabin Yu

Journal:   Journal of Marine Science and Engineering Year: 2024 Vol: 13 (1)Pages: 20-20
© 2026 ScienceGate Book Chapters — All rights reserved.