JOURNAL ARTICLE

Individual versus Difference Rewards on Reinforcement Learning for Route Choice

Abstract

In transportation systems, drivers usually choose their routes based on their own knowledge about the network. Such a knowledge is obtained from drivers' previous trips. When drivers are faced with jams they may change their routes to take a faster path. But this re-routing may not be a good choice because other drivers can proceed in the same way. Furthermore, such behaviour can create jams in other links. On the other hand, if drivers build their routes aiming at maximizing the overall travel time (system's utility), rather than their individual travel time (agents' utility), the whole system may benefit. This work presents two reinforcement learning algorithms for solving the route choice problem in road networks. The IQ-learning uses an individual reward function, which aims at finding a policy that maximizes the agents' utility. On the other hand, DQ-learning algorithm shapes the agents' reward based on difference rewards function, and aims at finding a route that maximizes the system's utility. Through experiments we show that DQ-learning is able to reduce the overall travel time when compared to other methods.

Keywords:
Reinforcement learning Computer science TRIPS architecture Function (biology) Path (computing) Temporal difference learning Routing (electronic design automation) Work (physics) Artificial intelligence Operations research Engineering

Metrics

20
Cited By
1.85
FWCI (Field Weighted Citation Impact)
15
Refs
0.89
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Transportation Planning and Optimization
Social Sciences →  Social Sciences →  Transportation
Transportation and Mobility Innovations
Physical Sciences →  Engineering →  Automotive Engineering
Traffic control and management
Physical Sciences →  Engineering →  Control and Systems Engineering

Related Documents

JOURNAL ARTICLE

A deep inverse reinforcement learning approach to route choice modeling with context-dependent rewards

Zhan ZhaoYuebing Liang

Journal:   Transportation Research Part C Emerging Technologies Year: 2023 Vol: 149 Pages: 104079-104079
JOURNAL ARTICLE

Learning Individual Potential-Based Rewards in Multiagent Reinforcement Learning

Chen YangPei XuJunge Zhang

Journal:   IEEE Transactions on Games Year: 2024 Vol: 17 (2)Pages: 334-345
JOURNAL ARTICLE

Potential-based difference rewards for multiagent reinforcement learning

Sam DevlinLogan YliniemiDaniel Kudenko⋆Kagan Tumer

Journal:   Adaptive Agents and Multi-Agents Systems Year: 2014 Pages: 165-172
JOURNAL ARTICLE

Approximate Difference Rewards for Scalable Multigent Reinforcement Learning.

Arambam James SinghAkshat KumarHoong Chuin Lau

Journal:   Autonomous Agents and Multi-Agent Systems Year: 2021 Pages: 1655-1657
JOURNAL ARTICLE

Reinforcement learning of route choice considering traveler’s preference

Xueqin LongJianxu MaoZhongbao QiaoPeng LiWei He

Journal:   Transportation Letters Year: 2023 Vol: 16 (7)Pages: 658-671
© 2026 ScienceGate Book Chapters — All rights reserved.