Alessandro Trapasso, Anders Jönsson
Coordinating and synchronizing multiple agents in reinforcement learning (RL) presents significant challenges, particularly when concurrent actions and shared objectives are required. We propose a novel framework that integrates Reward Machines (RMs) with Partial-Order Planning (POP) to enhance coordination in multiagent reinforcement learning (MARL). By transforming high-level POP strategies into individual RMs for each agent, our approach explicitly captures action dependencies and concurrency requirements, enabling agents to learn and execute coordinated plans effectively in complex environments. We validate our approach in a grid-based multiagent domain in which agents have to synchronize actions such as jointly accessing limited pathways or collaboratively manipulating objects. The explicit representation of action dependencies and synchronization points in RMs provides a scalable and flexible mechanism to model concurrent actions, enabling agents to focus on relevant tasks and reducing exploration.
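The core mechanism described above, a Reward Machine whose states encode action dependencies and synchronization points, can be sketched minimally. The class and event names below (`RewardMachine`, `step`, the door-synchronization events) are illustrative assumptions, not taken from the paper.

```python
# Minimal sketch of a Reward Machine (RM): a finite-state machine over
# high-level events that emits rewards on transitions. Names here are
# illustrative, not from the paper.

class RewardMachine:
    """Tracks task progress via events; rewards transitions between RM states."""

    def __init__(self, initial_state, transitions):
        # transitions: {(state, event): (next_state, reward)}
        self.state = initial_state
        self.transitions = transitions

    def step(self, event):
        """Advance on an observed event; unknown events leave the state unchanged."""
        next_state, reward = self.transitions.get(
            (self.state, event), (self.state, 0.0)
        )
        self.state = next_state
        return reward


# Hypothetical example: an agent must wait at a synchronization point
# ("at_door") until its partner arrives ("partner_at_door"), then the
# two pass through jointly ("through_door").
rm = RewardMachine(
    initial_state="u0",
    transitions={
        ("u0", "at_door"): ("u1", 0.0),          # reached the sync point
        ("u1", "partner_at_door"): ("u2", 0.0),  # partner arrived; synchronized
        ("u2", "through_door"): ("u3", 1.0),     # joint action completed
    },
)

for ev in ["at_door", "partner_at_door", "through_door"]:
    r = rm.step(ev)
print(rm.state)  # RM state after the coordinated sequence
```

In a multiagent setting, each agent would receive its own RM compiled from the partial-order plan, with synchronization events shared across the agents' machines so that reward is only granted when the concurrency constraints are respected.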