Research on Multi-Agent Reinforcement Learning Traffic Control

Xinpeng Fu; Simin Chen; Qixian Liang; Yueqiao Li

doi:10.1109/iccect57938.2023.10140678

JOURNAL ARTICLE

Year: 2023 Vol: 22 Pages: 231-239

Abstract

Optimizing traffic control systems is of great significance for improving people's livelihood and promoting economic development. To realize an intelligent traffic control system that goes beyond fixed timing schemes based on historical traffic flow and adjustment of the green signal ratio, the MADDPG and QMIX reinforcement learning methods are applied to two different four-intersection traffic network files, one with restricted traffic routes and one with the whole route. The total reward of each algorithm is compared, and the stability of the two methods in the traffic control system is evaluated by the growth speed of the reward items, the reduction speed of the penalty items, and the convergence speed of the algorithm. Across two groups of comparative experiments, MADDPG converges slightly more slowly than QMIX in both the whole-route and restricted-route cases, but its total reward is significantly higher. MADDPG's reward items also grow faster, and its penalty items decline faster, than those of QMIX. Comparing the two methods in different road environments shows that MADDPG is more suitable for optimizing traffic control.
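The comparison criteria named in the abstract (total reward and convergence speed) can be illustrated with a minimal Python sketch. The abstract does not define how convergence speed was measured, so the definition used here (the first episode after which the smoothed reward stays within a tolerance band of its final value) is an assumption, as are the function names `moving_average`, `convergence_episode`, and `summarize`. The two reward curves are made-up placeholders, not results from the paper.

```python
from typing import Sequence


def moving_average(values: Sequence[float], window: int) -> list[float]:
    """Trailing moving average; early entries average over fewer points."""
    out = []
    running = 0.0
    for i, v in enumerate(values):
        running += v
        if i >= window:
            running -= values[i - window]  # drop the value leaving the window
        out.append(running / min(i + 1, window))
    return out


def convergence_episode(rewards: Sequence[float], window: int = 3,
                        tolerance: float = 0.05) -> int:
    """Assumed definition: first episode index from which the smoothed
    reward stays within `tolerance` (relative) of its final value."""
    smoothed = moving_average(rewards, window)
    final = smoothed[-1]
    band = tolerance * max(abs(final), 1e-12)
    idx = len(smoothed) - 1
    # Walk backwards while the curve is still inside the band.
    while idx > 0 and abs(smoothed[idx - 1] - final) <= band:
        idx -= 1
    return idx


def summarize(rewards: Sequence[float], window: int = 3,
              tolerance: float = 0.05) -> dict:
    """Total reward and assumed convergence episode for one training run."""
    return {
        "total_reward": sum(rewards),
        "convergence_episode": convergence_episode(rewards, window, tolerance),
    }


if __name__ == "__main__":
    # Placeholder curves for illustration only (not data from the paper).
    slower_but_higher = [0, 2, 4, 6, 8, 10, 10, 10, 10, 10]
    faster_but_lower = [0, 3, 6, 6, 6, 6, 6, 6, 6, 6]
    print(summarize(slower_but_higher, window=1))
    print(summarize(faster_but_lower, window=1))
```

Under this assumed definition, a method can converge later yet still accumulate a higher total reward, which is the kind of trade-off the abstract reports between MADDPG and QMIX.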

Keywords:

Reinforcement learning; Intersection (aeronautics); Traffic flow (computer networking); Computer science; Convergence (economics); Control (management); Scheme (mathematics); Intelligent transportation system; Rate of convergence; Real-time computing; Artificial intelligence; Engineering; Transport engineering; Telecommunications; Computer network; Channel (broadcasting); Mathematics; Economics

Metrics

FWCI (Field Weighted Citation Impact): 0.00

Citation Normalized Percentile: 0.06

Topics

Traffic control and management

Physical Sciences → Engineering → Control and Systems Engineering

Traffic Prediction and Management Techniques

Physical Sciences → Engineering → Building and Construction

Transportation Planning and Optimization

Social Sciences → Social Sciences → Transportation
