Multi-objective path integral policy improvement for learning robotic motion

Hayato Sago; Ryo Ariizumi; Toru Asai; Shun‐ichi Azuma

doi:10.1007/s10015-025-01027-z

ScienceGate Book Chapters

JOURNAL ARTICLE

Multi-objective path integral policy improvement for learning robotic motion

Hayato Sago Ryo Ariizumi Toru Asai Shun‐ichi Azuma

Year: 2025 Journal: Artificial Life and Robotics Vol: 30 (3)Pages: 534-545 Publisher: Springer Science+Business Media

DOI: 10.1007/s10015-025-01027-z

Get Full-Text PDF Get Analytical Report

Abstract

Abstract This paper proposes a new multi-objective reinforcement learning (MORL) algorithm for robotics by extending policy improvement with path integral ( $$\text {PI}^2$$ PI 2 ) algorithm. For a robot motion acquisition problem, most existing MORL algorithms are hard to apply, because of the high-dimensional and continuous state and action spaces. However, policy-based algorithms such as $$\text {PI}^2$$ PI 2 can be applied to solve this problem in single-objective cases. Based on the similarity of $$\text {PI}^2$$ PI 2 and evolution strategies (ESs) and the fact that ESs are well-suited for multi-objective optimization, we propose an extension of $$\text {PI}^2$$ PI 2 and some techniques to speed up the learning. The effectiveness is shown via numerical simulations.

Keywords:

Computer science Path (computing) Motion (physics) Artificial intelligence Path integral formulation Computer vision Physics Computer network

Metrics

Cited By

0.00

FWCI (Field Weighted Citation Impact)

Refs

0.14

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Topics

Robot Manipulation and Learning

Physical Sciences → Engineering → Control and Systems Engineering

Robotic Mechanisms and Dynamics

Physical Sciences → Engineering → Control and Systems Engineering

Robotic Path Planning Algorithms

Physical Sciences → Computer Science → Computer Vision and Pattern Recognition

Multi-objective path integral policy improvement for learning robotic motion

Abstract

Metrics

Topics

Related Documents

Multi-objective Reinforcement Learning with Path Integral Policy Improvement

Multi-Agent Stochastic Control using Path Integral Policy Improvement

Multi-Agent Stochastic Control using Path Integral Policy Improvement

Path Integral Policy Improvement With Population Adaptation

Automatic Temperature Parameter Tuning for Reinforcement Learning Using Path Integral Policy Improvement