Research On Cooperative Confrontation Of Multiple Agents Based On Deep Reinforcement Learning

Posted on:2024-05-13

Degree:Master

Type:Thesis

Country:China

Candidate:D Teng

GTID:2568307079976299

Subject:Electronic information

Abstract/Summary:

With the rapid development of deep reinforcement learning, game AI has achieved good results in a variety of game environments. Many factors affect the capability of a game AI model, including the core training algorithm, the feature processing method, the neural network structure, and the reward function design. This thesis studies how to optimize game AI models in multi-agent environments from three of these aspects: the core training algorithm, the feature processing method, and the neural network structure.

First, for the core training algorithm, the thesis studies the Factored Multi-Agent Centralised Policy Gradients (FACMAC) algorithm, which is based on value decomposition. FACMAC combines the advantages of value function decomposition algorithms and deep policy gradient algorithms. It can be applied in both continuous and discrete action spaces, and it performs better than other algorithms. However, like other deep policy gradient algorithms, it overestimates the Q function. To address this, the thesis proposes the Double Factored Multi-Agent Centralised Policy Gradients (DFACMAC) algorithm, which uses two Q functions. The thesis shows, both theoretically and experimentally, that DFACMAC resolves the Q function overestimation error and that its performance in practice is better than that of other algorithms.

Second, two methods for improving model capability are proposed from the perspective of feature processing and neural network structure. The first adds communication between agents during feature extraction. Communication lets agents obtain information about allies and enemies in time to make appropriate decisions, which strengthens cooperation between agents. The second is a customized neural network designed for the specific environment and task, which maximizes the learning ability of the agent and improves the model's performance in confrontation. The experimental results show that both improvements help increase the capability of the model.

In conclusion, combining theory and practical experience, the thesis conducts in-depth research on the theoretical optimization of deep reinforcement learning algorithms and on the practical performance of game AI models in multi-agent environments, and obtains better experimental results in different types of game environments.
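The overestimation problem and the two-Q-function remedy mentioned in the abstract can be illustrated with a small toy sketch. The abstract does not say how DFACMAC combines its two Q functions, so the code below assumes the common "take the smaller of the two estimates" rule known from clipped double Q-learning; it is a single-agent illustration, not the thesis's factored multi-agent critic. All function names (`td_target`, `double_q_target`, `estimate_bias`) and parameter values are hypothetical.

```python
import random


def td_target(reward, gamma, next_q, done):
    """One-step TD target: r + gamma * Q(s', a'), with no bootstrap at episode end."""
    return reward if done else reward + gamma * next_q


def double_q_target(reward, gamma, next_q1, next_q2, done):
    """ASSUMPTION (not from the thesis): bootstrap from the smaller of two critics."""
    return td_target(reward, gamma, min(next_q1, next_q2), done)


def estimate_bias(num_actions=5, trials=10_000, noise=1.0, seed=0):
    """Toy check: every action has true value 0, so any nonzero mean is pure bias.

    The greedy action is chosen with the first noisy critic. Reading its value
    from that same critic overestimates; taking the minimum with a second,
    independently noisy critic lowers the estimate.
    """
    rng = random.Random(seed)
    single_sum = 0.0
    double_sum = 0.0
    for _ in range(trials):
        q1 = [rng.gauss(0.0, noise) for _ in range(num_actions)]
        q2 = [rng.gauss(0.0, noise) for _ in range(num_actions)]
        best = max(range(num_actions), key=q1.__getitem__)  # greedy w.r.t. critic 1
        single_sum += q1[best]
        double_sum += min(q1[best], q2[best])
    return single_sum / trials, double_sum / trials


if __name__ == "__main__":
    single_bias, double_bias = estimate_bias()
    print(f"single critic bias: {single_bias:+.3f}")
    print(f"two critics bias:   {double_bias:+.3f}")
```

In this toy setting the single-critic estimate is biased upward because the same noisy values are used both to pick the action and to evaluate it. The minimum over two critics removes that upward bias, possibly at the cost of some underestimation; whether DFACMAC makes the same trade-off is not stated in the abstract.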

Keywords/Search Tags:

Game AI, Deep Reinforcement Learning Algorithms, Multi-Agent Environments

Related items

1	Research On Multi-Agent Pursuit-Evasion Based On Deep Reinforcement Learning
2	A Research Of Hierarchical Multi-agents Deep Reinforcement Learning For Action Game
3	AI Research Of MOBA Game Based On Deep Reinforcement Learning
4	Research On Antagonistic Strategies Based On Deep Reinforcement Learning
5	Research On Deep Reinforcement Learning Technology For Multi-agent Collaboration
6	Research On Multi-agent System Decision Algorithm Based On Deep Reinforcement Learning
7	Research On Multi-objective Workflow Scheduling With Deep-Q-network-based Multi-agent Reinforcement Learning
8	Research On The Key Technology Of Multi-agent Collaborative Algorithm Based On Deep Reinforcement Learning
9	A Multi-agent Reinforcement Learning Algorithm Based On Stackelberg Game
10	Research On Multi-Agent Cooperative Algorithm Based On Deep Reinforcement Learning