Research On IoV Spectrum Efficiency Optimization Based On Deep Reinforcement Learning

Posted on:2021-04-01

Degree:Master

Type:Thesis

Country:China

Candidate:Q F Lin

Full Text:PDF

GTID:2392330614950096

Subject:Information and Communication Engineering

Abstract/Summary:

The Internet of Vehicles (IoV) is an important application of fifth-generation mobile communication. An IoV network contains two kinds of communication links: vehicle-to-infrastructure (V2I) links and vehicle-to-vehicle (V2V) links. To improve spectrum utilization, many studies have focused on interference management through resource allocation algorithms. However, these methods assume that the base station can obtain the channel state information (CSI) of all vehicles. In practice, the high-speed movement of vehicles makes it difficult for the base station to obtain accurate CSI. To address this problem, this thesis applies deep reinforcement learning (DRL).

First, the thesis studies the resource allocation problem with a single V2V link as the agent. The state consists of the real-time CSI available at that V2V link and the interference it receives from other vehicles, the action is the choice of channel and transmission power, and the reward is the system's spectrum efficiency. The resulting reinforcement learning problem is solved with a deep Q-network (DQN). Next, all V2V links in the system are treated as agents in a multi-agent reinforcement learning model, in which each agent continuously updates its own policy to maximize a shared reward. Simulations show that the single-agent algorithm outperforms the random selection algorithm and improves the system's spectrum efficiency. Because the multi-agent algorithm is based on a cooperative model, it outperforms the single-agent algorithm and improves spectrum efficiency further.

Second, the thesis studies the resource allocation problem with both V2V links and V2I links as agents. To resolve the inconsistency in action selection between V2I links and V2V links, channels are first allocated to the V2V links, and multi-agent deep deterministic policy gradient (MADDPG) is then used to solve the power allocation problem for the V2I and V2V links. The simulation results show that the MADDPG-based resource allocation algorithm handles the continuous variable well and improves the system's spectrum efficiency.

Finally, to handle discrete and continuous variables simultaneously, the thesis studies resource allocation at the base station. The overall system optimization problem is decomposed into two sub-problems: power allocation, which is solved with a linear search algorithm, and channel allocation, which is solved with DQN. A comparison with the depth-first search algorithm verifies that DQN reduces algorithmic complexity while maintaining resource allocation performance. To further improve robustness, the thesis proposes an intelligent branch and bound algorithm in which DQN guides the pruning strategy of branch and bound. This preserves the effectiveness of the traversal while greatly reducing the algorithm's complexity, and the resulting algorithm is robust.
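
The single-agent formulation in the abstract (channel and transmission power as the action, system spectrum efficiency as the reward) can be illustrated with a minimal sketch. The abstract does not give the exact definitions used in the thesis, so everything below is an assumption made for illustration: the function names, the discretization of transmission power into a few levels, the linear-scale gain matrix `gains`, and the use of the Shannon formula log2(1 + SINR) as the per-link spectrum efficiency.

```python
import math
from itertools import product


def spectral_efficiency(signal_power, interference_power, noise_power):
    """Per-link spectral efficiency in bit/s/Hz, assumed to be log2(1 + SINR)."""
    sinr = signal_power / (interference_power + noise_power)
    return math.log2(1.0 + sinr)


def build_action_space(num_channels, power_levels):
    """Enumerate the joint (channel, power) actions of one agent.

    Discretizing power into `power_levels` is an assumption that makes
    the action set finite, as a DQN-style agent requires.
    """
    return list(product(range(num_channels), power_levels))


def system_reward(links, gains, noise_power):
    """Sum of per-link spectral efficiencies, used here as the reward.

    links:  list of (channel, tx_power), one entry per link.
    gains:  gains[i][j] is the linear power gain from transmitter j
            to receiver i (hypothetical input, not from the thesis).
    """
    total = 0.0
    for i, (ch_i, p_i) in enumerate(links):
        # Only links that share the same channel interfere with link i.
        interference = sum(
            p_j * gains[i][j]
            for j, (ch_j, p_j) in enumerate(links)
            if j != i and ch_j == ch_i
        )
        total += spectral_efficiency(p_i * gains[i][i], interference, noise_power)
    return total


if __name__ == "__main__":
    gains = [[1.0, 0.5], [0.5, 1.0]]
    actions = build_action_space(num_channels=2, power_levels=[0.5, 1.0])
    print(len(actions), "joint actions per agent")
    # Two links on separate channels versus the same channel.
    print(system_reward([(0, 1.0), (1, 1.0)], gains, noise_power=1.0))
    print(system_reward([(0, 1.0), (0, 1.0)], gains, noise_power=1.0))
```

In this toy setting, placing the two links on separate channels removes mutual interference and yields a higher reward than sharing one channel, which is the kind of signal a learning agent would use to prefer one channel and power choice over another.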

Keywords/Search Tags:

Internet of Vehicles, spectrum efficiency, deep Q network, intelligent branch and bound algorithm

Related items

1	On Scheduling Of Satellite In Production Based On Branch And Bound Algorithm
2	Research On Joint Resource Management Algorithm Of Communication, Caching And Computing For Internet Of Vehicles Based On Deep Reinforcement Learning
3	Research On Spectrum Sensing Algorithm Of Cognitive Internet Of Vehicles Based On Collaboration
4	A Branch And Bound Algorithm For Flow Shop Scheduling Problem
5	A Study On 3PL Transportation Schedule Problem Based On Lagrangian Relaxation And Branch-and-Bound Method
6	Research On Resource Management Strategy Of UAV-assisted Edge Internet Of Vehicles For Intelligent Transportation
7	Research On The Control Algorithm Of Virtual Intelligent Traffic Lights In The Internet Of Vehicles
8	On Dynamic Flow Shop Makespan Problems Based On Branch And Bound Algorithm
9	Research On Intelligent And Adaptive Driving Strategy Oriented To The Internet Of Vehicles
10	Research On Efficiency Dissemination Of Information In Internet Of Vehicles Based On Phantom DAG Blockchain And Its Consensus Algorithm