Research On Semi-supervised Video Object Segmentation

Posted on: 2023-10-25

Degree: Master

Type: Thesis

Country: China

Candidate: B C Gao

GTID: 2568307070484124

Subject: Engineering

Abstract/Summary:

Semi-supervised video object segmentation is a challenging task with broad applications in video editing, autonomous driving, and other fields. However, videos often contain complex conditions such as deformation, occlusion, and rapid motion, which limit the segmentation speed and accuracy of existing methods. To address this, this thesis proposes a semi-supervised video object segmentation model based on multi-level target appearance information. Building on it, a second model based on a spatio-temporal memory network is designed to address insufficient segmentation stability when local information is confused. The main work and contributions of this thesis are as follows:

(1) To segment target objects in video sequences quickly and accurately, this thesis proposes a semi-supervised video object segmentation method based on Multi-level Target Models and Feature Integration (MTMFI). First, a multi-level target appearance model built from lightweight convolutional structures enriches target appearance details while preserving inference speed. In addition, a feature integration module captures the dynamic changes of the target object across video frames to further improve segmentation accuracy. The model balances segmentation speed and accuracy, segmenting target objects accurately at a higher inference speed.

(2) To address the performance degradation that most methods suffer when local information is confused, this thesis proposes a semi-supervised video object segmentation method based on a Spatial-Temporal Memory network with a Top-K filter and ASPP (TA-STM). First, a Top-K filtering mechanism is added to the spatio-temporal memory network to filter out global noise and capture the local similarity of target objects. In addition, an atrous convolutional spatial pooling pyramid (ASPP) module is added to prevent the loss of local information while capturing multi-level appearance information of the target objects. The model maintains segmentation stability, and its accuracy is not significantly affected by complex factors.

Both methods have been evaluated on the video object segmentation datasets DAVIS-2016, DAVIS-2017, and YouTube-VOS 2018, and the experimental comparisons show that both achieve competitive results.
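
The Top-K filtering idea in contribution (2) can be illustrated with a minimal sketch. The abstract does not give implementation details, so the following is an assumption-laden illustration rather than the thesis's method: it assumes dot-product similarity between query and memory keys, a softmax restricted to the k best-matching memory entries, and a weighted sum of memory values. The function name `topk_memory_read` and its parameters are hypothetical, and plain Python lists stand in for feature tensors.

```python
import math


def topk_memory_read(query_keys, memory_keys, memory_values, k):
    """Hypothetical top-k memory read (illustrative only).

    query_keys:    list of Q key vectors for the current frame
    memory_keys:   list of M key vectors stored from past frames
    memory_values: list of M value vectors paired with memory_keys
    Returns one read-out vector per query key.
    """
    if not 1 <= k <= len(memory_keys):
        raise ValueError("k must be between 1 and the number of memory entries")
    dim = len(memory_values[0])
    readouts = []
    for q in query_keys:
        # Assumed similarity: dot product between the query and each memory key.
        scores = [sum(a * b for a, b in zip(q, m)) for m in memory_keys]
        # Keep the k best-matching memory entries; all others get zero weight.
        top = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)[:k]
        # Softmax over the retained entries only (shifted by the max for stability).
        peak = max(scores[i] for i in top)
        exps = {i: math.exp(scores[i] - peak) for i in top}
        total = sum(exps.values())
        # Read-out is the weighted sum of the retained memory values.
        out = [0.0] * dim
        for i, e in exps.items():
            weight = e / total
            for d in range(dim):
                out[d] += weight * memory_values[i][d]
        readouts.append(out)
    return readouts
```

With k equal to the number of memory entries, this reduces to an ordinary softmax-weighted read over the whole memory. Smaller values of k discard weakly matching entries, which is one way to read the abstract's "filter out global noise".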

Keywords/Search Tags:

Semi-supervised Video Object Segmentation, Multi-level Target Models, Feature Integration Module, Spatial-Temporal Memory Network, Top-K Filter, Atrous Convolutional Spatial Pooling Pyramid Module

Related items

1	Research On Image Panoptic Segmentation Based On Spatial Clustering Module And Multi-layer Feature Fusion
2	Research On Semi-supervised Video Object Segmentation Via Pyramid Network Modulation
3	Research On Semi-Supervised Video Object Segmentation Method Based On Deep Learning
4	Research On Semantic Segmentation Algorithm Based On Fully Convolutional Neural Network
5	Pyramid Pooling And Spatial Attention Optimized Deep Semantic Segmentation
6	Research On Weakly Supervised Semantic Segmentation Algorithm Based On Image-level Label
7	Domain Adaptation For Semantic Segmentation
8	Research On Kidney Segmentation Of CT Images Based On Deep Learning
9	Research On Algorithms For Defect Detection Of Printed Circuit Boards Based On Deep Learning
10	Research On Video Object Segmentation Algorithm Based On Learning Attention Modulation Network