Reinforcement learning (RL) is widely used in the study of intersection signal control and has shown excellent performance in improving traffic efficiency and reducing fuel consumption, mainly at individual intersections. However, to achieve coordination in a multi-intersection arterial traffic signal control (ATSC) system, RL-based control methods must confront the training difficulties of multi-agent systems, such as the curse of dimensionality, which is aggravated by the delayed-reward property of the multi-intersection artery (the overall traffic-flow state does not change instantly with signal operations). In this paper, the Delayed Reward's Multi-Agent Arterial Signal Control (DMAS) method is proposed. Considering the topology of the communication network of the ATSC system in the connected-vehicle (CV) environment, DMAS adopts a multi-agent actor-critic training regimen based on MADDPG, wherein the individual signal controllers act as actor agents and the central controller of the artery operates as an integrated evaluator with multiple critic agents. Furthermore, to address the delayed-reward property, we enhance MADDPG by embedding a return decomposition module through which DMAS converts the delayed reward into immediate rewards. We introduce a dynamic delayed-reward prediction model to implement the contribution analysis from RUDDER: the total delayed reward is assigned to each step according to the difference between the prediction values of adjacent steps. Simulation results show that the average reward of MADDPG is 18% higher than that of DDPG and is further increased by 3% with the return decomposition module. Moreover, the training curves indicate more stable performance.
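As a concrete illustration of the return decomposition step described above, the following Python sketch shows one way a RUDDER-style redistribution could be realized: each step receives the change in a predictor's estimate of the episode return between adjacent steps. The function name, the predictor outputs, and the residual handling are illustrative assumptions, not the paper's implementation; the prediction model itself is not shown.

```python
import numpy as np

def redistribute_delayed_reward(return_predictions, delayed_return):
    """Sketch of RUDDER-style return decomposition (illustrative only).

    `return_predictions[t]` is assumed to be a learned model's prediction
    of the episode return given the trajectory up to step t. Each step is
    credited with the change in predicted return caused by that step, so
    a single delayed reward is spread over the episode as immediate rewards.
    """
    preds = np.asarray(return_predictions, dtype=float)
    # Contribution of step t = prediction after step t minus prediction before it.
    immediate = np.diff(preds, prepend=0.0)
    # Assign any prediction error to the final step so the redistributed
    # rewards sum exactly to the observed delayed return (one design choice;
    # spreading the residual uniformly over the steps is another).
    immediate[-1] += delayed_return - immediate.sum()
    return immediate

# Toy example: a five-step episode whose only reward, 10.0, arrives at the end.
print(redistribute_delayed_reward([1.0, 2.5, 6.0, 8.0, 9.5], delayed_return=10.0))
```

Under this scheme the immediate rewards sum to the original delayed return by construction, so the redistribution preserves the optimal policy while giving each signal-control step a dense training signal.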