Research And Application Of Visual Information Reconstruction Method For Manipulator Operation

Posted on:2023-03-14

Degree:Master

Type:Thesis

Country:China

Candidate:Q Y Jia

Full Text:PDF

GTID:2568306836964629

Subject:Engineering

Abstract/Summary:

PDF Full Text Request

The skill teaching of manipulator has always been one of the research hotspots in the field of artificial intelligence.Current skill teaching method mainly constructs a virtual space through three-dimensional reconstruction technology for manipulator to simulate and train.However,the current reconstruction method has complicated calculation process,large reconstruction error,high time cost and poor environmental adaptation.In addition,current reconstruction method also has some problems,such as the different size of the object constructed in the virtual space and the real world,and the different viewing angles between human and manipulator,which makes it difficult for the grasping skills in the virtual space to be transformed into the real operation of manipulator in the real world.To solve the above problems,a visual information reconstruction method for manipulator operation is proposed in this thesis.The main research contents are as follows:(1)Aiming at the problems of cumbersome calculation process and poor environmental adaptation of current three-dimensional real-time reconstruction methods,we propose a visual space and operation space mapping method DL-VOM based on deep learning.Firstly,a single RGB-D camera is used to collect visual information,and then the convolutional neural network is used to extract the features of visual information.Finally,the extracted feature values are used to calculate the three-dimensional position information of the corresponding manipulator operation space by fully connected layers.Experiments show that this method only need a single RGB-D camera and the convolutional neural network to complete the mapping,which is simple and fast,and the error is about 15 mm.(2)Aiming at the problems of large reconstruction error and time cost of current three-dimensional real-time reconstruction methods,we propose a three-dimensional real-time reconstruction method DL-3D based on DL-VOM monocular vision.Firstly,the visual information collected by a single RGB-D camera in real time is extracted through Mask R-CNN network,and then the extracted visual information is mapped by DL-VOM method.Finally,a cluster center distance limited outlier adjustment method BCC-Drop is proposed to reduce the error,and a symmetrical virtual space is built through Open GL.Experiments show that the reconstruction speed of this method can reach 16 fps,the absolute error of reconstruction is about 5.73 mm,and the relative error of reconstruction is about 5.23%.(3)Aiming at the problems that the grasping skills shown in the current virtual space are difficult to be transformed into the real-world operation of the manipulator in the real world,we propose a novel grasping strategy and establish a man-machine cooperative manipulator skill teaching system based on DL-3D.In this system,we record the state changes of virtual objects to guide the motion planning of manipulator in the real world.Experiments show that compared with the traditional system,the placement error of the system is reduced by about 1.7%,the placement angle error is reduced by about 2.1%,and the time consumed is reduced by about 3.6s.

Keywords/Search Tags:

skill teaching, convolutional neural network, real-time three-dimensional reconstruction, Mask R-CNN network, man-machine coordinated

PDF Full Text Request

Related items

1	Research On Three-dimensional Real-time Reconstruction Algorithms Of Binocular Stereovision Based On CNN
2	Design Of Real Time Video Super-resolution Reconstruction System Based On Convolutional Neural Network
3	Research On Building Segmentation Of Aerial Images Based On Convolutional Neural Network
4	Design Of Binary Neural Network System For One-dimensional Time Series Signals
5	Multi-Dimensional Domain Coordinated Intelligent Detection Of Surface Targets
6	Research On Reconstruction Algorithm Of Deep Convolutional Network For Lensless Coded-mask Imaging
7	Research On Real-time Video Action Classification Based On Three-Dimensional Convolutional Neural Network
8	Real-time Target Detection Based On Regional Convolutional Neural Network
9	Research On Real-Time Semantic Segmentation Methods Based On Convolutional Neural Network
10	Motion Foreground Detection Based On Time Series Feature Analysis