The Research On Traffic Signal Timing Based On Risk-sensitive Q-learning

Posted on:2015-02-23

Degree:Master

Type:Thesis

Country:China

Candidate:Y F Mao

Full Text:PDF

GTID:2272330461496821

Subject:Transportation planning and management

Abstract/Summary:

At present, urban traffic problems have become an important factor restricting the development of the urban economy. Relieving traffic congestion and keeping the traffic system running smoothly and in order has become a top priority in the government’s work. However, because space is limited and because of economic and environmental constraints, expanding traffic infrastructure is not feasible. Under these conditions, developing intelligent transportation has become the only way to relieve congestion. Building on a review of research on intelligent transportation systems in China and abroad, this thesis studies the application of risk-sensitive theory and Q-learning theory to traffic signal control optimization. The main research contents of this thesis are as follows:

1. Online signal timing optimization model based on risk-avoiding Q-learning

Most existing signal timing models use risk-neutral reinforcement learning. These models suffer from instability, low robustness, and long computing times. To address these problems, the thesis formulates an online risk-avoiding reinforcement learning model that uses the queue length difference as its performance index. Using a VISSIM-Excel VBA-Matlab simulation platform, we analyze the effect of the risk-avoidance parameter on signal timing and convergence, and compare the proposed model with a risk-neutral reinforcement learning model. The results show that the proposed model converges quickly, is more stable, and achieves almost the same performance. Finally, we propose that an incremental risk-avoiding reinforcement learning method is suitable for signal timing optimization; that is, the risk-avoidance parameter should be increased in small steps.

2. Online signal timing optimization model based on risk-seeking Q-learning

Traffic is random and uncertain, and transport planners cannot fully anticipate it. Sometimes, therefore, every situation that may arise has to be considered, even when doing so carries a higher risk. The thesis goes on to build an online signal timing optimization model based on risk-seeking Q-learning, again with the queue length difference as the performance index. To allow a fair comparison with the risk-avoiding model, all modeling conditions are kept the same. Using the VISSIM-Excel VBA-Matlab simulation platform, we analyze the effect of the risk parameter on signal timing and convergence, and compare the proposed model with the risk-neutral reinforcement learning model. The results show that the proposed model converges quickly. Finally, we compare the proposed model with the risk-avoiding reinforcement learning model. The results show that the risk-seeking model explores more widely and trains a larger number of actions, but the performance of its timing plan is unstable, sometimes good and sometimes bad.
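
The abstract does not state the update rule used in the thesis. As a rough illustration only, the sketch below assumes one common formulation of risk-sensitive Q-learning, in which the temporal-difference error is weighted asymmetrically by a parameter kappa in (-1, 1): kappa > 0 gives a risk-avoiding learner, kappa < 0 a risk-seeking one, and kappa = 0 the risk-neutral case. The function names, the reward definition based on the queue length difference, and the step schedule for kappa are all hypothetical and are not taken from the thesis.

```python
from collections import defaultdict


def risk_transform(td_error, kappa):
    """Weight the TD error asymmetrically (assumed formulation).

    kappa > 0 shrinks positive surprises and enlarges negative ones
    (risk-avoiding); kappa < 0 does the opposite (risk-seeking).
    """
    if td_error > 0:
        return (1.0 - kappa) * td_error
    return (1.0 + kappa) * td_error


def q_update(q, state, action, reward, next_state, actions,
             kappa, alpha=0.1, gamma=0.9):
    """One tabular Q-learning step using the transformed TD error."""
    best_next = max(q[(next_state, a)] for a in actions)
    td_error = reward + gamma * best_next - q[(state, action)]
    q[(state, action)] += alpha * risk_transform(td_error, kappa)
    return q[(state, action)]


def queue_difference_reward(queue_lengths):
    """Hypothetical reward: negative gap between longest and shortest queue."""
    return -(max(queue_lengths) - min(queue_lengths))


def kappa_schedule(episode, step=0.05, every=100, kappa_max=0.9):
    """Hypothetical incremental schedule: raise kappa by a small step
    every `every` episodes, capped at kappa_max."""
    return min(kappa_max, step * (episode // every))


if __name__ == "__main__":
    q = defaultdict(float)          # Q-table, all entries start at 0.0
    actions = ["extend_green", "switch_phase"]   # illustrative action set
    r = queue_difference_reward([3, 7, 5, 2])    # queues on four approaches
    kappa = kappa_schedule(episode=250)
    new_q = q_update(q, "s0", "extend_green", r, "s1", actions, kappa)
    print(r, kappa, new_q)
```

With kappa set to a negative value the same update becomes risk-seeking, which is one way the two models in the abstract could share all other modeling conditions.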

Keywords/Search Tags:

incremental risk-sensitive, online Q-learning, queue length difference, signal timing, simulation

Related items

1	The Single Intersection Online Q Learning Control Model For The Queue Length Management
2	Traffic Signal Timing Optimization For Congested Hot Spots In Multi-source Data Environment
3	Research On Control Methods Of Signalized Intersections Under Queue Length Restriction
4	Dynamic Queue Length Estimation Using GPS And LPR Data
5	Optimal Signal Timing For Urban Intersection Based On Vehicle Emission
6	Research On Signal Control Method Of Bottleneck Intersection
7	Optimization And Application Of Intersection Signal Timing Of The Bridge Considering Emission
8	Function Approximation Type Reinforcement Learning Models For Signal Timing Optimization Of A Single Intersection
9	Research On Vehicle Queue Length Detection Method Based On Image Processing
10	Design And Implementation Of Vehicles Queue Length Dynamic Prediction System