Research On Robot Manipulation Skills Learning Based On Deep Reinforcement Learning

Posted on:2023-08-27

Degree:Master

Type:Thesis

Country:China

Candidate:Y Zhang

Full Text:PDF

GTID:2568306848462174

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

Robot manipulation skills learning is an important branch in the field of robotics research.With the rapid development of artificial intelligence technology,deep reinforcement learning algorithms have been initially applied to solve this problem.However,due to the sparse reward characteristics of robot manipulation skills learning tasks,existing algorithms have training problems such as low efficiency and non-convergence of training.Aiming at the problem of sparse reward,this paper improves the existing deep reinforcement learning algorithm and designs a more efficient robot manipulation skills learning algorithm to improve the robot’s perception ability and autonomous decisionmaking ability in complex environments.The specific research contents are as follows.Firstly,in view of the low training efficiency of deep reinforcement learning in the learning task of robot manipulation skills in a sparse reward environment,an Adaptive Temperature Hindsight Experience Replay(ATHER)algorithm is proposed.First,an adaptive temperature parameter is introduced,which can dynamically adjust the exploratory nature of the agent according to the environment to adapt to different tasks and different stages of the task,which improves the performance and robustness of the algorithm.Then,a simplified value function calculation is proposed.The method uses the action value function to approximate the state value function,which simplifies the algorithm model and improves the training efficiency of the algorithm.Finally,the target network mechanism and the dual network partial inheritance mechanism improve the stability of the algorithm training and solve the problem of overestimation.the problem of deviation.Secondly,aiming at the low utilization rate of data samples in the application of the post-experience replay method,a Meta-Learning Double Experience Replay Buffer Hindsight Experience Replay(MDHER)algorithm is proposed.First,without changing the overall size of the experience pool,the original experience pool is divided into two parts:the sampling experience pool and the virtual experience pool.Then,batches of data are extracted from the two experience pools in proportion to train the algorithm model,so as to ensure that both kinds of data exist in each training.Finally,based on the meta-learning method,the proportional hyperparameter is learned by using the knowledge learned in the past,and the ratio of the two data is dynamically adjusted according to the value of the two data,so as to achieve the purpose of improving the value of the training samples.Finally,the ATHER algorithm and the MDHER algorithm are implemented in the Fetch manipulator environment and the Hand manipulator environment using the Mu Jo Co simulation environment,and compared with the existing algorithms in 8 multi-goal and sparse reward robot manipulation skills learning tasks respectively,which verifies the effectiveness of the method proposed in this paper.

Keywords/Search Tags:

deep reinforcement learning, robot manipulation skills, hindsight experience replay, information entropy, meta-learning

PDF Full Text Request

Related items

1	Study Of Robot Manipulation Skills Learning Algorithm Based On Demonstration Data
2	Research On Robot Path Planning Based On Deep Reinforcement Learnin
3	Application Research Of Reinforcement Learning In Robot Indoor Navigation
4	Study Of Robot Arm Control Based On Deep Reinforcement Learning
5	Robot Push And Grasp Manipulation Skills Learning Based On Deep Reinforcement Learning
6	Research On Optimization Method Of Deep Reinforcement Learning Experience Replay
7	Research On Experience Replay Method For Deep Reinforcement Learning
8	Research On Robot Operation Skills Learning Based On Improved Deep Reinforcement Learning Algorithm
9	Research On The Learning Method Of Service Robot Arm Manipulation Skills Based On Behavior Tree And Reinforcement Learning
10	Deep Reinforcement Learning With Experience Replay