Research On Asynchronous Feature Representation And Spatio-temporal Regularization Algorithm For Dynamic Gesture Recognition

Posted on:2023-12-28

Degree:Master

Type:Thesis

Country:China

Candidate:H Cui


GTID:2568306815462514

Subject:Software engineering

Abstract/Summary:


Dynamic gesture recognition plays an important role in augmented reality, human-computer interaction, sign language recognition, and related fields. In recent years, deep learning has brought new vitality to pattern recognition and computer vision. However, current deep-learning-based dynamic gesture recognition algorithms still face the following problems: (1) The variability of dynamic gesture appearance and the randomness of gesture duration make recognition more difficult. (2) Traditional regularization algorithms and data augmentation cannot effectively solve the overfitting problem when applied to spatio-temporal models and action data. To address these problems, this thesis carries out the following research and experiments.

1. To address problem (1), the thesis proposes an asynchronous spatio-temporal feature extraction method. First, we construct an asynchronous spatio-temporal feature extraction module from a lightweight 3D convolutional network. This module extracts gesture features with multi-scale spatio-temporal characteristics, which preserves recognition accuracy for gestures of different appearance sizes and temporal rates. Then, we improve the Long Short-Term Memory (LSTM) network and use it to learn stable long-term features from the short-term asynchronous spatio-temporal features. Finally, we fuse the spatio-temporal features of each time step for the final dynamic gesture recognition.

2. To address problem (2), the thesis proposes a spatio-temporal drop regularization method called the Label-Guided Spatio-Temporal Drop strategy (LGST-Drop). It not only drops neurons in a structured way at the frame level, but also regularizes the motion information in the channel and temporal dimensions. Moreover, the drop mask of LGST-Drop is generated from temporary labels guided by the network, which reduces the randomness of selecting drop regions and improves the stability of the spatio-temporal regularization process.

Experimental comparison with other mainstream methods demonstrates that the proposed model, based on multi-temporal asynchronous spatio-temporal features, significantly improves gesture recognition performance and shows stable results on several typical data sets. In addition, the proposed LGST-Drop method is applied to a variety of recognition networks and compared experimentally with other typical regularization algorithms. The results show that the LGST-Drop algorithm is competitive.
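The abstract does not give the details of LGST-Drop, so the following is only a minimal NumPy sketch of the general idea of a guided, frame-level structured drop, not the thesis's algorithm. It assumes a feature tensor of shape (channels, frames, height, width) and a per-frame guidance map (for example, an activation map derived from a temporary predicted label). The function name `guided_block_drop`, the `block` parameter, and the choice to zero one square block around each frame's most salient location are all hypothetical.

```python
import numpy as np

def guided_block_drop(features, guidance, block=3):
    """Illustrative guided structured drop (an assumption, not LGST-Drop itself).

    features: array of shape (C, T, H, W)
    guidance: array of shape (T, H, W); larger values mark more salient positions
    block:    side length of the square region dropped in each frame
    Returns the regularized features and the binary keep-mask of shape (T, H, W).
    """
    c, t, h, w = features.shape
    mask = np.ones((t, h, w), dtype=features.dtype)
    half = block // 2
    for k in range(t):
        # Locate the most salient position in frame k instead of sampling at random.
        y, x = np.unravel_index(np.argmax(guidance[k]), (h, w))
        y0, y1 = max(0, y - half), min(h, y + half + 1)
        x0, x1 = max(0, x - half), min(w, x + half + 1)
        mask[k, y0:y1, x0:x1] = 0  # drop a contiguous block, shared by all channels
    keep = mask.mean()
    # Rescale so the expected activation magnitude is preserved, as in dropout.
    scale = 1.0 / keep if keep > 0 else 0.0
    return features * mask[None] * scale, mask
```

Because the dropped region follows the guidance map rather than a random draw, repeated calls with the same inputs give the same mask, which is one simple way to picture the reduced randomness the abstract describes. A real implementation would apply this only during training and would likely also vary the mask across channels and time.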

Keywords/Search Tags:

Dynamic gesture, action recognition, spatio-temporal network, regularization, spatio-temporal regularization


Related items

1	Research On Spatio-Temporal Indexing Mechanism And Querying Strategy
2	Video Action Recognition Based On 2D Convolution Network Under Spatio-Temporal Feature Enhancement Mechanism
3	Research On Human Skeleton Action Recognition Method Based On Graph Convolutional Network
4	The Research And Implementation Of Spatio-Temporal Data Operations And Query Optimization In Spatio-Temporal Database
5	Research On Spatio-Temporal Action Detection Based On Self-Attention
6	Research On Video Action Recognition Technology Based On Spatiotemporal Feature Extraction
7	Research On Surveillance Video Synopsis Based On Spatio-Temporal Slice
8	Research On Action Recognition Based On Deep Network Learning Of Spatio-temporal Features
9	A Study Of Human Action Recognition Based On Spatio-temporal Features
10	Research On Action Recognition Algorithm Based On Spatio-Temporal Feature Representation