Research On Cityscape Scene Parsing Method For Automatic Driving

Posted on:2024-07-28

Degree:Master

Type:Thesis

Country:China

Candidate:J F Shi

Full Text:PDF

GTID:2542306932980349

Subject:Software engineering

Abstract/Summary:

Since the concept of autonomous driving was born, it has been an important technology that countries compete to develop. However, current self-driving cars still cannot analyze and recognize their surroundings accurately and efficiently, especially on complex urban streets. Scene parsing requires a complete understanding of the scene: it predicts the label and location of every pixel in the image, and classifies and localizes the object each pixel belongs to. Parsing complex cityscape scenes poses two key difficulties: maintaining high resolution and acquiring high-level semantic information. Based on deep learning, this paper addresses these two difficulties by proposing an improved high-resolution network combined with atrous spatial pyramid pooling (ASPP) and a high-resolution scene parsing network, aiming to provide efficient, high-precision cityscape scene parsing algorithms for autonomous vehicles. The main research content of this paper is as follows:

(1) The two key difficulties in cityscape scene parsing, namely the preservation of high resolution and the acquisition of high-level semantic information, are studied in depth. An in-depth study of convolutional neural network design shows that feature maps near the input have high resolution and rich edge and detail information but lack high-level semantic information, whereas feature maps near the output have low resolution and lose edge and detail information but acquire rich semantic information; neither alone meets the needs of scene parsing. Therefore, this paper proposes a convolutional neural network that combines atrous convolution and pyramid pooling to address both difficulties.

(2) For the problem of multi-scale object segmentation, an improved high-resolution network combined with atrous spatial pyramid pooling is proposed. First, half of the residual modules of the high-resolution network are improved with depthwise separable convolution, and the feature extraction network is built on this basis. Second, an ASPP module with an improved stacking of dilation rates is designed to obtain receptive fields at multiple scales, which enhances the model's ability to extract features of multi-scale objects. Finally, a dual-channel parallel decoder structure is designed to aggregate these advantages and produce high-quality segmentation maps. With 34.8M parameters, the network achieves 82.1% mIoU on the Cityscapes dataset at a forward inference speed of 10 FPS.

(3) Building on the improved high-resolution network combined with atrous spatial pyramid pooling, a high-resolution scene parsing network is proposed for the context information aggregation problem. First, the high-resolution network is improved with atrous separable convolutions stacked at three dilation rates, and the feature extraction network is built on this basis. Second, a multi-stage fusion upsampling structure is designed to make full use of the output feature maps of the four parallel branches of the improved high-resolution network. Finally, an improved pyramid pooling module that adapts to input images of different sizes is designed to aggregate global and local context information at different scales, refine the segmentation results, and produce high-quality segmentation maps. With 16.4M parameters, the network achieves 83.3% mIoU on the Cityscapes dataset at a forward inference speed of 14 FPS.

(4) The above algorithms are applied in practice: a deep-learning-based cityscape scene parsing system for autonomous driving is designed and implemented, with system design and implementation carried out according to software engineering specifications. This paper describes in detail the requirements analysis, design, implementation, and testing of the system. The scene parsing system includes modules for user management, model training, scene parsing, dataset management, and algorithm management, and meets users' needs for scene parsing on different datasets.
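
As a rough illustration of why the two building blocks named in the abstract are attractive, the sketch below counts the weights of a standard convolution versus a depthwise separable one, and computes the receptive field of stacked atrous convolutions. The channel counts and dilation rates are illustrative assumptions, not values taken from the thesis, and the helper names are hypothetical.

```python
def conv_params(k, c_in, c_out):
    """Weights in a standard k x k convolution (bias omitted)."""
    return k * k * c_in * c_out


def depthwise_separable_params(k, c_in, c_out):
    """Weights in a k x k depthwise conv followed by a 1 x 1 pointwise conv."""
    return k * k * c_in + c_in * c_out


def effective_kernel(k, rate):
    """Span along one axis covered by a k-tap kernel with dilation `rate`."""
    return k + (k - 1) * (rate - 1)


def stacked_receptive_field(k, rates):
    """Receptive field of stride-1 atrous convolutions applied in sequence."""
    rf = 1
    for rate in rates:
        rf += effective_kernel(k, rate) - 1
    return rf


if __name__ == "__main__":
    # Assumed sizes: 3 x 3 kernels, 256 input and output channels.
    print(conv_params(3, 256, 256))                 # 589824
    print(depthwise_separable_params(3, 256, 256))  # 67840
    # Assumed dilation rates, chosen only for illustration.
    print(effective_kernel(3, 6))                   # 13
    print(stacked_receptive_field(3, [1, 2, 4]))    # 15
```

Under these assumed sizes the separable layer needs roughly one ninth of the weights, while dilation widens the receptive field without adding any weights, which is consistent with the lower parameter count reported for the second network.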

Keywords/Search Tags:

Deep learning, Convolutional neural network, Semantic segmentation, Scene parsing, Atrous convolution

Related items

1	Urban Scene Semantic Segmentation Based On Convolutional Neural Network
2	Research On Road Scene Semantic Segmentation Algorithm Based On Fully Convolutional Neural Network
3	Research On Semantic Segmentation Method Of Traffic Scene Image Based On Deep Learning
4	Research On Video Semantic Segmentation Based On Deep Convolution Neural Networks
5	Research On Semantics Segmentation Algorithm Of Road Scene Based On Convolution Neural Network
6	Research On Target Semantic Segmentation Method Of Multi-Component Parts In Industrial Scene
7	Semantic Segmentation Of Remote Sensing Images Based On Convolution Neural Network
8	Research And Implementation Of Parts Recognition Of Aerial Target Based On Semantic Segmentation
9	Research On Semantic Segmentation Of Road Scene Images Based On Deep Learning
10	Research On Image Semantic Segmentation Algorithm Based On Deep Learning