In multi-agent systems, only part of the information in an agent's original observation is crucial to selecting the optimal policy; the irrelevant remainder acts as noise that interferes with this selection. However, agents struggle to learn effective attention over the different parts of the observation and thus cannot reduce the negative impact of irrelevant information. In complex settings, the size of the observation space grows exponentially with the number of agents, and this large-scale observation space aggravates the redundancy in the original observation. Irrelevant and redundant information degrades the learned policies and hinders the RL agent from learning stably and efficiently. In this thesis, we propose a novel network architecture, the partial observation division and policy mixing network (ODPM), to address the negative impact of irrelevant information. ODPM uses an end-to-end trained policy network to divide the agent's original observation into groups. For the representation of each group, a local value estimation module computes the value corresponding to that group's information, and ODPM then applies an attention mechanism to aggregate these values into a correction to the original policy used for the agent's interaction with the environment. In this way, ODPM lets agents pay fine-grained attention to key information while combining irrelevant information at a coarse granularity, reducing the negative influence of irrelevant and redundant information on the current policy and improving training stability and performance. We conduct experiments in two classic multi-agent settings, MAgent Battle and SMAC. Experimental results show that ODPM improves the performance of state-of-the-art DRL approaches compared with several attention-based network architectures.
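The division-then-mixing pipeline described above can be illustrated with a minimal sketch, assuming a fixed equal-size partition of the observation into groups and a discrete action space; all names below (ODPMSketch, group_value, attn, and so on) are illustrative assumptions, not the implementation from the thesis.

import torch
import torch.nn as nn
import torch.nn.functional as F

class ODPMSketch(nn.Module):
    """Sketch of observation division + attention-weighted policy mixing."""

    def __init__(self, obs_dim, n_groups, hidden_dim, n_actions):
        super().__init__()
        assert obs_dim % n_groups == 0, "assumes equal-sized observation groups"
        self.n_groups = n_groups
        group_dim = obs_dim // n_groups
        # Base policy over the full original observation.
        self.policy = nn.Sequential(
            nn.Linear(obs_dim, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, n_actions),
        )
        # Local value estimation: a shared network mapping each
        # observation group to per-action values.
        self.group_value = nn.Sequential(
            nn.Linear(group_dim, hidden_dim), nn.ReLU(),
            nn.Linear(hidden_dim, n_actions),
        )
        # Attention scores decide how strongly each group's values
        # contribute to the policy correction.
        self.attn = nn.Linear(group_dim, 1)

    def forward(self, obs):
        # obs: (batch, obs_dim)
        logits = self.policy(obs)                         # base policy logits
        groups = obs.view(obs.size(0), self.n_groups, -1) # divide observation
        values = self.group_value(groups)                 # (batch, n_groups, n_actions)
        scores = F.softmax(self.attn(groups), dim=1)      # (batch, n_groups, 1)
        correction = (scores * values).sum(dim=1)         # attention-weighted mix
        return logits + correction                        # corrected policy logits

The key design point this sketch captures is that the correction is additive: groups the attention deems irrelevant receive small weights and are folded in coarsely, while high-weight groups contribute fine-grained value information, and the whole path remains differentiable for end-to-end training.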