Research On 3D Object Detection Algorithm Based On Point Clou

Posted on:2024-05-16

Degree:Master

Type:Thesis

Country:China

Candidate:Y J Hou

Full Text:PDF

GTID:2532307148463114

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

Autonomous driving technology is an important component of future intelligent transportation systems,and three-dimensional object detection is one of the core technologies in autonomous driving.3D target detection aims to extract the position,size,shape and other information of objects from 3D point cloud data of laser radar,so as to help the auto drive system recognize and predict objects in the surrounding environment,which is of great significance to improve the safety and reliability of autopilot.This thesis proposes two 3D object detection algorithms based on the sparsity of Li DAR point clouds using deep learning methods,and conducts theoretical analysis,method validation,and result analysis.The main research content is as follows:(1)In response to the differences in target feature distribution caused by different sparsity of 3D point cloud targets in existing Li DARs,as well as the insufficient flexibility of existing algorithm feature sampling strategies,this thesis proposes a 3D object detection algorithm based on deformable voxel feature pyramid.Firstly,an adaptive voxel feature pyramid module was proposed,which selects the corresponding feature layer for feature aggregation based on the sparsity of non empty voxels of the region of interest(Ro I).Secondly,a deformable voxel Ro I pooling operation is proposed,which extracts non empty voxel dense regions through deformable grid points to enrich semantic features and gather relevant background information outside of Ro I to improve positioning accuracy.The experiment shows that in the widely used KITTI dataset vehicle classification,our method outperforms the 3D object detection method PV-RCNN in terms of average precision(AP),which increases by 0.47%,1.63%,and 1.34% in three levels: simple,medium,and difficult,respectively.(2)To address the issue of insufficient single stream feature information in single point cloud or single voxel 3D object detection algorithms,this thesis proposes a 3D object detection algorithm based on multi stream feature aggregation,which utilizes point cloud representations from different perspectives for feature complementarity to achieve accurate 3D object detection.Firstly,multi-stream features are extracted from the point cloud through three representations: original points,voxels,and perspective images.Then,a multi stream feature fusion module based on channel attention was proposed,which effectively utilizes the correlation between different features and eliminates redundant information.The experiment shows that in the widely used KITTI dataset,the proposed method outperforms its baseline method H23D-RCNN in terms of rider classification,with an AP improvement of 5.56%,4.73%,and 5.16% in simple,medium,and difficult levels,respectively.

Keywords/Search Tags:

3D point cloud, Feature pyramid, Deformable voxel feature, Multi stream feature fusion

PDF Full Text Request

Related items

1	Research On 3-D Object Detection Algorithms Based On Point Cloud Feature Enhancement
2	Research On CNN Fault Diagnosis Method Based On Multi-Scale And Two-Stream Feature Fusion
3	Multi-Channel Fusion Feature Vehicle Detection Based On Static Image
4	3D Object Detection Based On Point Cloud And Image Fusion
5	A Pyramid For Multi-Task Feature Selection And Its Application In Electric Power Equipment Detection
6	The Research On Classification Of Defects In Transmission Hub Based On Feature Point Detection
7	Research On Traffic Sign Detection Method Based On Multi-scale Convolutional Neural Network
8	Algorithm Research On Multi-Imformation Feature Fusion
9	Research And Application Of Point Cloud Recognition Algorithm Based On Feature Matching In Workpiece Picking
10	Research On Three Dimensional Object Detection Based On Attention Fusion And Cross-module