Visual Tracking Based On Deep Detection Architecture

Posted on:2019-01-10

Degree:Master

Type:Thesis

Country:China

Candidate:P Zhang

Full Text:PDF

GTID:2428330542999174

Subject:Control Science and Engineering

Abstract/Summary:

PDF Full Text Request

Visual tracking is one of the fundamental problems in the field of computer vision and has a variety of applications,such as human-computer interactions,smart surveil-lance systems and autonomous driving.Due to uncertainty of the target and diversity of the scene in tracking task,it remains a challenging problem on how to keep the accuracy,real-time and robustness of the tracker under different challenges.This dissertation,considering the practical situation,aims to build a tracker which balances robustness and real-time performance.With the deep research of some difficult problems in visual tracking,a few results have been achieved for a value of applications.The main work and academic contributions can be summarized as follows:1.We propose a tracker based on object detection architecture Single-Shot Multi-Box Detector(SSD)for target feature extraction and location by applying similarity learning in visual tracking.This tracker uses three loss function for joint learning in optimizing strategy,which overcomes the disadvantages of heavy computation and hard training in similarity learning,and effectively accelerates convergence speed of network and improves performance of the tracker.Experimental results prove the effectiveness and feasibility of the tracker.2.A non-local mean based Discriminative Correlation Filter(DCF)tracker is pro-posed for visual tracking.This tracker uses a fully-convolutional DCF layer,which is composed of two separate convolution sequences,for capturing(target's)local response map and a non-local layer for(background's)global response map.High-resolution finial response map is an additive combination of local and global response map,which improves the ability of the tracker to distinguish the target from the background.The results on the benchmarks show that this tracker gains much improvement in accuracy and robustness when compared with the baseline.3.A feature pyramid based method is presented for size estimation.This method replaces image pyramid with feature pyramid in the process of target size estimation,saving much redundant computation in feature extraction caused by image pyramid and speed up tracking efficiency.Also,applying more dense feature pyramid in feature extraction for better performance will not significantly increases computation.Experi-mental results show that this tracker spends less time in the process of size estimation and keeps the same accuracy after feature pyramid is applied.

Keywords/Search Tags:

Visual Tracking, Similarity Learning, Joint Learning, Correlation Filtering, Non-local Mean, Feature Pyramid

PDF Full Text Request

Related items

1	Visual Tracking Via Local And Nonlocal Similarity Learning
2	A Research On Visual Object Tracking Based On Dynamic Feature Learning
3	Research On Object Tracking Algorithm Based On Correlation Filter
4	Research On Tracking Algorithms Based On Collaborative Feature Learning And Regular Correlation Filtering
5	Research On Correlation Perception And Feature Enhancement For Visual Tracking
6	Visual Object Tracking Via Sparse Learning
7	Research On Visual Tracking Algorithms Based On Feature Fusion And Joint Decision
8	Research Of Visual Tracking Based On Correlation Filter
9	Research On Robust Visual Object Tracking Algorithm
10	The Study Of Deep Learning Based Human Pose Estimation