Model-based imitation learning by probabilistic trajectory matching

Péter Englert; Alexandros Paraschos; Jan Peters; Marc Peter Deisenroth

doi:10.1109/icra.2013.6630832

ScienceGate Book Chapters

JOURNAL ARTICLE

Model-based imitation learning by probabilistic trajectory matching

Péter Englert Alexandros Paraschos Jan Peters Marc Peter Deisenroth

Year: 2013 Pages: 1922-1927

DOI: 10.1109/icra.2013.6630832

Get Full-Text PDF Get Analytical Report

Abstract

One of the most elegant ways of teaching new skills to robots is to provide demonstrations of a task and let the robot imitate this behavior. Such imitation learning is a non-trivial task: Different anatomies of robot and teacher, and reduced robustness towards changes in the control task are two major difficulties in imitation learning. We present an imitation-learning approach to efficiently learn a task from expert demonstrations. Instead of finding policies indirectly, either via state-action mappings (behavioral cloning), or cost function learning (inverse reinforcement learning), our goal is to find policies directly such that predicted trajectories match observed ones. To achieve this aim, we model the trajectory of the teacher and the predicted robot trajectory by means of probability distributions. We match these distributions by minimizing their Kullback-Leibler divergence. In this paper, we propose to learn probabilistic forward models to compute a probability distribution over trajectories. We compare our approach to model-based reinforcement learning methods with hand-crafted cost functions. Finally, we evaluate our method with experiments on a real compliant robot.

Keywords:

Computer science Robot Probabilistic logic Artificial intelligence Reinforcement learning Trajectory Robustness (evolution) Machine learning Divergence (linguistics)

Metrics

Cited By

6.35

FWCI (Field Weighted Citation Impact)

Refs

0.96

Citation Normalized Percentile

Is in top 1%

Is in top 10%

Citation History

Topics

Robot Manipulation and Learning

Physical Sciences → Engineering → Control and Systems Engineering

Reinforcement Learning in Robotics

Physical Sciences → Computer Science → Artificial Intelligence

Robotic Locomotion and Control

Physical Sciences → Engineering → Biomedical Engineering

Model-based imitation learning by probabilistic trajectory matching

Abstract

Metrics

Citation History

Topics

Related Documents

Probabilistic model-based imitation learning

GeoGail: A Model-Based Imitation Learning Framework for Human Trajectory Synthesizing

Imitation Learning and Model Integrated Excavator Trajectory Planning

Model-Based Imitation Learning

Tidy-Up Tasks Using Trajectory-based Imitation Learning