JOURNAL ARTICLE

Research on Multi-Agent Reinforcement Learning Traffic Control

Abstract

Optimizing the traffic control system is of great significance for improving people's livelihood and promoting economic development. To realize an intelligent traffic control system that is different from the fixed timing scheme based on historical traffic flow and the adjustment of green signal ratio, maddpg and qmix reinforcement learning methods are tried to be applied to two different four intersection traffic network files with restricted traffic routes and the whole route. The total reward value of the algorithm is compared; the stability of the two methods applied to the traffic control system is evaluated by the growth speed of the reward items, the reduction speed of the penalty items and the convergence speed of the algorithm. Finally, through two groups of comparative experiments, the convergence speed of maddpg is slightly slower than that of qmix in the case of the whole route and the restricted route, but the total reward is significantly higher than that of qmix algorithm. At the same time, the growth rate of maddpg in reward items and the decline rate of punishment items are faster than qmix. Through the comparison of the two methods in different road environments, it is found that maddpg is more suitable for optimizing traffic control. Optimizing the traffic control system is of great significance for improving people's livelihood and promoting economic development. To realize an intelligent traffic control system that is different from the fixed timing scheme based on historical traffic flow and the adjustment of green signal ratio, maddpg and qmix reinforcement learning methods are tried to be applied to two different four intersection traffic network files with restricted traffic routes and the whole route. The total reward value of the algorithm is compared; the stability of the two methods applied to the traffic control system is evaluated by the growth speed of the reward items, the reduction speed of the penalty items and the convergence speed of the algorithm. Finally, through two groups of comparative experiments, the convergence speed of maddpg is slightly slower than that of qmix in the case of the whole route and the restricted route, but the total reward is significantly higher than that of qmix algorithm; at the same time, the growth rate of maddpg in reward items and the decline rate of punishment items are faster than qmix. Through the comparison of the two methods in different road environments, it is found that maddpg is more suitable for optimizing traffic control.

Keywords:
Reinforcement learning Intersection (aeronautics) Traffic flow (computer networking) Computer science Convergence (economics) Control (management) Scheme (mathematics) Intelligent transportation system Rate of convergence Real-time computing Artificial intelligence Engineering Transport engineering Telecommunications Computer network Channel (broadcasting) Mathematics Economics

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
6
Refs
0.06
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Traffic control and management
Physical Sciences →  Engineering →  Control and Systems Engineering
Traffic Prediction and Management Techniques
Physical Sciences →  Engineering →  Building and Construction
Transportation Planning and Optimization
Social Sciences →  Social Sciences →  Transportation

Related Documents

JOURNAL ARTICLE

Multi-Agent Reinforcement Learning for Traffic Signal Control

Kaïs AmmariGhazi Bel MuftiMohammed Safwen Markou

Journal:   IFAC-PapersOnLine Year: 2025 Vol: 59 (13)Pages: 58-63
JOURNAL ARTICLE

Multi-Agent Reinforcement Learning for Traffic Signal Control

Wu, Chunliang

Journal:   OPAL (Open@LaTrobe) (La Trobe University) Year: 2022
JOURNAL ARTICLE

Multi-Agent Reinforcement Learning for Traffic Signal Control

Wu, Chunliang

Journal:   Monash University Year: 2022
JOURNAL ARTICLE

Fair Multi-Agent Reinforcement Learning for Traffic Control

Umer SiddiquePeilang LiYongcan Cao

Journal:   ACM Journal on Autonomous Transportation Systems Year: 2025 Vol: 3 (2)Pages: 1-24
© 2026 ScienceGate Book Chapters — All rights reserved.