Research On Imperfect Information Machine Game Based On Deep Reinforcement Learning

Posted on:2018-04-07

Degree:Master

Type:Thesis

Country:China

Candidate:P C Wang

Full Text:PDF

GTID:2348330533969251

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Since the concept of artificial intelligence has been proposed,the machine game has been one of its most challenging research directions.Machine game can be divided into perfect information machine game and imperfect information machine game.The characteristic of imperfect information machine game is that agent can’t get all information in the game process.Many real-world decision-making problems can be abstracted as imperfect information game problems,such as airport planning,network security,financial energy and other problems.Therefore,it is great practical significance to study the imperfect information machine game.The traditional method of solving the imperfect game machine game problem is partially observed Markov decision process model and reinforcement learning algorithm.However,the reinforcement learning algorithm can’t guarantee convergence in imperfect state and high latitude state space.Only through limited data and repeated testing can’t traverse all the state.In this paper,deep reinforcement learning algorithm is used to solve the game of imperfect information machine,and the state-action value function in reinforcement learning is replaced by a deep learning network.Aiming at the problem that historical information can ’t be considered in the decision-making process of deep reinforcement learning algorithm,we propose to add the long-short term memory model to the deep reinforcement learning algorithm.In this paper,a reward function based on Monte Carlo tree search is pro posed.By comparing the return of the game and expected reward of the Monte Carlo game tree search,we can judge whether the agent should be rewarded or be punished.Traditional methods need to extract features manually.It is difficult to find the internal relations between features.Besides,training requires a lot of domain knowledge,which makes poorly scalability.This paper proposes a poker modeling method,which is suitable for pattern matching algorithms such as deep reinforcement learning.This coding method can apply the same network structure to different poker games with very little domain knowledge.Finally,this paper applies the improved deep reinforcement learning algorithm to the Texas poker game system.Learning from end-to-end avoids the complex process of extracting features manually.Comparing with the traditional reinforcement learning algorithm,it can achieve a higher level of intelligent.Improved deep reinforcement learning can provide a feasible method for the realization of large-scale machine game system and provide the possibility for extending to real life.

Keywords/Search Tags:

deep reinforcement learning, imperfect information machine game, long-short-term memory model, poker modeling

PDF Full Text Request

Related items

1	Research On Imperfect Information Machine Game Based On Deep Reinforcement Learning In 3D Game
2	Research On Game Algorithm Of Imperfect Information 3D Video Game Based On Deep Reinforcement Learning
3	Research On Texas Poker Game System Based On Risk Model
4	Research On The Fusion Of Different Memory Networks In Deep Reinforcement Learning
5	Lstm Based Short Message Service(SMS) Modeling For Spam Classification
6	Research On Agent Autonomous Navigation Technology Based On Reinforcement Learning
7	Research And Application Of The Short-term Memory Network For Adjusting Gate Length
8	Research On Fall Detection Based On Long Short-term Memory Artificial Neural Network And Wrist Sensor
9	Research On Texas Poker Game Based On Counterfactual Regret Minimization Algorithm
10	Research On Imperfect Information Game Based On Counterfactual Regret Minimization Algorithm