Research Based On Few-shot Object Detection Methods

Posted on:2023-06-23

Degree:Master

Type:Thesis

Country:China

Candidate:Y Hou

Full Text:PDF

GTID:2568306848967289

Subject:Engineering

Abstract/Summary:

PDF Full Text Request

Traditional object models usually require large-scale labeled samples for pre-training,which is difficult to solve related detection tasks under the condition of insufficient sample size.Therefore,in order to solve the accuracy loss caused by a small number of samples,this paper proposes a new few-shot object model.Depends on large-scale labeled data.The main work is as follows:First,a new detection model Swin T-FSOD(Swin Transformer based Few-Shot Object Detection)is proposed based on the idea of Swin Transformer.The model designs a feature learner based on the Swin Transformer and feature pyramid to extract multi-scale metafeatures containing global contextual information from the query set to detect new categories of objects.Based on the channel attention mechanism,a weight adjustment module is designed to convert the support set samples into weight coefficients with class attributes,so as to adjust the distribution of meta-features and extract the class-attribute features of the region of interest.The prediction module directly uses the detection head of the YOLOv5 framework to make predictions.Secondly,based on the Swin T-FSOD model,the SA-FSOD(Semantic Alignment based Few-Shot Object Detection)model is proposed to further improve the detection accuracy of small samples.The model proposes a new feature learner and a new weight adjustment module.In order to ensure the accuracy of feature extraction performed by the feature learner as the backbone network,a simple semantic alignment method is used to align highlevel semantic information with low-level semantic information,so that the feature extraction network no longer depends on the semantics of specific categories;use path aggregation The network performs bidirectional multi-scale fusion to solve the problem of lack of shallow location information.In the weight adjustment module,a method of adaptively adjusting the feature weight coefficient is used to enhance the interest category features in each scale meta-feature respectively.And use a task-general meta-learning initialization weight training method to further improve the generalization ability of the model.Finally,based on the above proposed few-shot object detection model,experiments are carried out on multiple data sets,and compared with the traditional few-shot detection model on various indicators.

Keywords/Search Tags:

Few-Shot learning, Object detection, Metric learning, Attention mechanism, Meta-learning

PDF Full Text Request

Related items

1	Research On Few-Shot Image Classification Algorithm Based On Meta-Learning
2	Research On Few-shot Object Detection Algorithm
3	Research On Few-Shot Learning Method Based On Meta-Learning
4	Research On The Metric Few Shot Classification Based On Meta Learning Optimization
5	Research On Few-Shot Classification Model With Attention Mechanism
6	Research On Few-shot Image Classification And Recognition Based On Deep Metric Learning
7	Research And Implementation On Meta-learning Few-shot Object Detection System
8	Research On Key Technologies Of Few-Shot Object Detection
9	Research On Image Classification Based On Few-Shot Learning
10	Research And Implementation Of Object Detection System Based On Few-shot Learning