JOURNAL ARTICLE

Zero-Shot Policy Transfer with Disentangled Task Representation of Meta-Reinforcement Learning

Abstract

Humans are capable of abstracting various tasks as different combinations of multiple attributes. This perspective of compositionality is vital for human rapid learning and adaption since previous experiences from related tasks can be combined to generalize across novel compositional settings. In this work, we aim to achieve zero-shot policy generalization of Reinforcement Learning (RL) agents by leveraging the task compositionality. Our proposed method is a meta-RL algorithm with disentangled task representation, explicitly encoding different aspects of the tasks. Policy generalization is then performed by inferring unseen compositional task representations via the obtained disentanglement without extra exploration. The evaluation is conducted on three simulated tasks and a challenging real-world robotic insertion task. Experimental results demonstrate that our proposed method achieves policy generalization to unseen compositional tasks in a zero-shot manner.

Keywords:
Principle of compositionality Reinforcement learning Generalization Computer science Task (project management) Representation (politics) Artificial intelligence Zero (linguistics) Perspective (graphical) Task analysis Encoding (memory) Machine learning Mathematics

Metrics

7
Cited By
1.74
FWCI (Field Weighted Citation Impact)
41
Refs
0.81
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Robot Manipulation and Learning
Physical Sciences →  Engineering →  Control and Systems Engineering
Reinforcement Learning in Robotics
Physical Sciences →  Computer Science →  Artificial Intelligence
Prosthetics and Rehabilitation Robotics
Physical Sciences →  Engineering →  Biomedical Engineering

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.