Real Time Semantic Segmentation Algorithm Via Attention Based Feature Fusion

Posted on:2022-12-06

Degree:Master

Type:Thesis

Country:China

Candidate:B L Chen

Full Text:PDF

GTID:2568306326975779

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

Deep learning based computer vision algorithms are widely developed in recent years due to the availability of a large number of image data sets.Beyond image-level classification and recognition,state-of-the-art methods can also perform pixel-level localization and semantic labeling,facilitating a large variety of applications such as intelligent driving,medical diagnosis,face segmentation and recognition and other fields.Driven from both academic research and industrial potential,deep learning based semantic segmentation has attracted ever-increasing research attention in the past decade.Most existing semantic segmentation algorithms improve the model accuracy from the perspective of expanding receptive field,such as using spatial pyramid module to expand receptive field.However using hole convolution with large void ratio will slow down the calculation speed,while the smaller void ratio will lead to too small receptive field.To solve this problem,we propose a smooth version of spatial pyramid module,which expands the receptive field without increasing the amount of computation,and ensures real-time model performance.At present,the cutting-edge semantic segmentation networks directly performs Concat or Add operations on two feature maps when doing feature fusion.Since the two feature maps with different resolutions are quite different,directly weighting the two feature maps will cause serious accuracy drop.To handle this problem,in this thesis we propose a feature fusion module based on the attention mechanism to fully integrate shallow spatial information and deep semantic information,and improve the accuracy of the semantic segmentation model.A series of experiments were carried out on Cityscapes,VOC2012 and Sun-RGBD data sets.The experimental platform is NVIDIA-1080Ti graphics card,in which 73.75%MIOU is achieved on Cityscapes data sets,and the speed is 89.29 FPS.On the Sun-RGBD data sets,58.81%MIOU was obtained for thirteen classifications,71.42%MIOU was obtained for five classifications,and the speed was 76.92 FPS.The experimental results show that the proposed real-time semantic segmentation algorithm using attention-based mechanism feature fusion achieves better accuracy and real-time inference speed on benchmark data sets.

Keywords/Search Tags:

real time semantic segmentation, lightweight network, spatial pyramid module, attention mechanism, feature fusion

PDF Full Text Request

Related items

1	Research On Real-time Semantic Segmentation Network Technology
2	Research On Real-Time Semantic Segmentation Method Based On Feature Fusion
3	Research On Real-Time Semantic Segmentation Method Based On Adaptive Attention Mechanism And Boundary Assistance
4	Pyramid Pooling And Spatial Attention Optimized Deep Semantic Segmentation
5	Research On Real-time Semantic Segmentation Based On Contextual Feature Aggregation Learning
6	Semantic Segmentation Of Urban Scenes Based On Lightweight Networks
7	Research On The Semantic Segmentation Algorithm Of Low-Visibility Images In Foggy Scenes
8	Research On Real-Time Semantic Segmentation Methods Based On Convolutional Neural Network
9	Image Semantic Segmentation Based On Pyramid Pooling And Attention Mechanism
10	Real-time Semantic Segmentation In Street Scene Based On Double Branch Structure