JOURNAL ARTICLE

Model-based imitation learning by probabilistic trajectory matching

Abstract

One of the most elegant ways of teaching new skills to robots is to provide demonstrations of a task and let the robot imitate this behavior. Such imitation learning is non-trivial: differing anatomies of robot and teacher and reduced robustness to changes in the control task are two major difficulties. We present an imitation-learning approach that efficiently learns a task from expert demonstrations. Instead of finding policies indirectly, either via state-action mappings (behavioral cloning) or via cost-function learning (inverse reinforcement learning), we find policies directly such that predicted trajectories match observed ones. To this end, we model the teacher's trajectory and the predicted robot trajectory as probability distributions and match these distributions by minimizing their Kullback-Leibler divergence. We propose to learn probabilistic forward models to compute a probability distribution over trajectories. We compare our approach to model-based reinforcement learning methods with hand-crafted cost functions. Finally, we evaluate our method with experiments on a real compliant robot.
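
The trajectory-matching objective described in the abstract can be illustrated with a minimal sketch. It assumes, purely for illustration, that both trajectory distributions are represented by a Gaussian marginal at each time step and that the divergence is taken from the teacher's distribution to the robot's predicted distribution; the abstract states neither detail. The function names `gaussian_kl` and `trajectory_kl` are hypothetical and are not taken from the paper.

```python
import numpy as np

def gaussian_kl(mu_p, cov_p, mu_q, cov_q):
    """Closed-form KL(p || q) for Gaussians p = N(mu_p, cov_p), q = N(mu_q, cov_q)."""
    mu_p, mu_q = np.asarray(mu_p, float), np.asarray(mu_q, float)
    cov_p, cov_q = np.asarray(cov_p, float), np.asarray(cov_q, float)
    d = mu_p.shape[0]
    diff = mu_q - mu_p
    # Solve linear systems instead of forming the inverse of cov_q explicitly.
    trace_term = np.trace(np.linalg.solve(cov_q, cov_p))
    maha_term = diff @ np.linalg.solve(cov_q, diff)
    _, logdet_p = np.linalg.slogdet(cov_p)
    _, logdet_q = np.linalg.slogdet(cov_q)
    return 0.5 * (trace_term + maha_term - d + logdet_q - logdet_p)

def trajectory_kl(expert, predicted):
    """Sum of per-time-step KL terms (assumption: time steps treated independently).

    Each trajectory is a sequence of (mean, covariance) pairs, one per time step.
    """
    return sum(
        gaussian_kl(mu_e, cov_e, mu_r, cov_r)
        for (mu_e, cov_e), (mu_r, cov_r) in zip(expert, predicted)
    )

# Toy two-step trajectories in a 2-D state space (made-up numbers).
expert = [(np.zeros(2), np.eye(2)), (np.ones(2), np.eye(2))]
predicted = [(np.zeros(2), np.eye(2)), (np.zeros(2), np.eye(2))]
mismatch = trajectory_kl(expert, predicted)
```

In this toy case the first time step matches exactly and the second differs only in its mean, so `mismatch` evaluates to 1.0. A policy search along the lines of the abstract would adjust the policy parameters so that the predicted marginals reduce this quantity.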

Keywords:
Computer science; Robot; Probabilistic logic; Artificial intelligence; Reinforcement learning; Trajectory; Robustness (evolution); Machine learning; Divergence (linguistics)

Metrics

Cited By: 46
FWCI (Field Weighted Citation Impact): 6.35
Refs: 52
Citation Normalized Percentile: 0.96

Topics

Robot Manipulation and Learning
Physical Sciences → Engineering → Control and Systems Engineering
Reinforcement Learning in Robotics
Physical Sciences → Computer Science → Artificial Intelligence
Robotic Locomotion and Control
Physical Sciences → Engineering → Biomedical Engineering

Related Documents

JOURNAL ARTICLE

Probabilistic model-based imitation learning

Péter Englert, Alexandros Paraschos, Marc Peter Deisenroth, Jan Peters

Journal: Adaptive Behavior; Year: 2013; Vol: 21 (5); Pages: 388-403

JOURNAL ARTICLE

GeoGail: A Model-Based Imitation Learning Framework for Human Trajectory Synthesizing

Yuchen Wu, Huandong Wang, Changzheng Gao, Depeng Jin, Yong Li

Journal: ACM Transactions on Knowledge Discovery from Data; Year: 2024; Vol: 19 (1); Pages: 1-23

JOURNAL ARTICLE

Imitation Learning and Model Integrated Excavator Trajectory Planning

Qiangqiang Guo, Zhixian Ye, Liyang Wang, Liangjun Zhang

Journal: 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS); Year: 2022; Pages: 5737-5743

BOOK-CHAPTER

Model-Based Imitation Learning

Robert Babuška

Year: 2012; Pages: 2298-2299

JOURNAL ARTICLE

Tidy-Up Tasks Using Trajectory-based Imitation Learning

Doo-Jun Kim, Hyun-Jun Jo, Jae-Bok Song

Journal: 2021 21st International Conference on Control, Automation and Systems (ICCAS); Year: 2021; Pages: 496-499