Application Of Markov Decision Process In Wireless Caching Networks

Posted on:2021-05-08

Degree:Master

Type:Thesis

Country:China

Candidate:B J Lv

Full Text:PDF

GTID:2370330611498035

Subject:Information and Communication Engineering

Abstract/Summary:

PDF Full Text Request

With the development of wireless communication technology,wireless transmission rate is getting faster.People’s demand for wireless data transmission is also increasing.At the same time,content-centric data(video,audio,etc.)has gradually become the mainstream of wireless data transmission.Wireless cache technology is to store these content-centric data in cache nodes at the edge of the network,thereby improving the overall performance of the network.In this paper,the scheduling of downlink file transmission in one cell with the assistance of cache nodes with finite cache space is studied.Specifically,requesting users arrive randomly and the base station(BS)reactively multicasts files to the requesting users and selected cache nodes.The latter can offload the traffic in their coverage areas from the BS.We consider the joint optimization of the abovementioned file placement and delivery within a finite lifetime subject to the cache space constraint.Within the lifetime,the allocation of multicast power and symbol number for each file transmission at the BS is formulated as a dynamic programming problem with a random stage number.Note that there are no existing solutions to this problem.We develop an asymptotically optimal solution framework by transforming the original problem to an equivalent finite-horizon Markov decision process(MDP)with a fixed stage number.A novel approximation approach is then proposed to address the curse of dimensionality,where the analytical expressions of approximate value functions are provided.We also derive analytical bounds on the exact value function and approximation error.Based on the expression of approximate value function,this paper presents a low complexity online resource allocation algorithm.The approximate value functions depend on some system statistics,e.g.,requesting users’ distribution.One reinforcement learning algorithm is proposed for the scenario where these statistics are unknown.Numerical simulations show that the low-complexity algorithm based on the approximation function proposed in this paper can significantly reduce the average transmission cost of the base station compared with some benchmark schemes.

Keywords/Search Tags:

wireless caching networks, markov decision process, reinforcement learning, approximate algorithm

PDF Full Text Request

Related items

1	Modeling And Optimization Of Wireless Communication Networks Based On Mobility-aware Caching
2	Partial Observation Of Memory-based Reinforcement Learning Problems In Markov Decision Process
3	State Estimation And Policy Learning In Partially Observable Markov Decision Processes
4	Research On Multi-level Inverted Pendulum Balance Control Based On Deep Reinforcement Learning
5	Research On Approximate Programming Methods In Partially Observable Markov Decision Problems
6	Research On Intelligent Decision Model Based On Deep Reinforcement Learning
7	Research And Realization Of The Real-time Bidding Model Based On A Constrained Markov Decision Process
8	Research On Autonomous Driving On Highway Roads Based On Reinforcement Learning And Vehicle Dynamics
9	Some New Algorithms Of Reinforcement Learning And Their Theoretical Study
10	Research On Resource Allocation And Reliable Transmission Of Wireless Body Area Networks Based On Reinforcement Learning