Research On Yoga Action Recognition Based On STF-ResNet Model

Posted on:2024-05-25

Degree:Master

Type:Thesis

Country:China

Candidate:W J Yao

Full Text:PDF

GTID:2557307094479194

Subject:Pattern Recognition and Intelligent Systems

Abstract/Summary:

PDF Full Text Request

Human actions is one of the most intuitive ways to express their true intentions,and accurate recognition of actions can help computers accurately understand the information conveyed by humans,and further support human-computer interaction systems to achieve a more immersive experience.With the continuous maturity of deep learning technology,motion technology detection based on motion video is one of the key applications of computer vision.Nowadays,yoga is a fashionable and convenient aerobic exercise that improves the body’s immune function and relieves anxiety.Usually people choose to search the Internet for resources to learn yoga on their own,but non-standard postures can cause joint damage,contrary to the original purpose of exercise.The significance of studying yoga movement recognition is to improve its recognition accuracy,and use existing resources to combine artificial intelligence with sports to promote the development of intelligent sports.In order to improve the accuracy of action recognition in yoga videos,based on the traditional two-stream convolutional network,this thesis combines the residual structure and proposes a spatial-temporal fusion residual network(STF-ResNet)to solve the problem of yoga action recognition in complex scenes.The main work of this thesis is as follows:(1)Acquisition and production of datasets.Currently,public datasets for motion recognition include HMDB51 and UCF101,which are based on daily human actions,and there is no public dataset on basic yoga action.In this thesis,we collect as well as process yoga action videos through multiple channels to create a yoga action dataset.(2)A spatial-temporal residual fusion network(STF-ResNet)is proposed.By converting the RGB and optical streams of the target region data and feeding them into the STF-ResNet network to extract video spatial and temporal features,the spatial-temporal features are complemented by mixing the spatial and temporal stream features with residuals,and the information loss of the high-level features is compensated by the low-level features;the convolutional block attention module(CBAM)is added before the mixing,the yoga action characteristics are again filtered from both channel and space dimensions.Finally,through experimental analysis,the model in this thesis improves the average recognition accuracy by 6.3% compared with the traditional two-stream convolutional neural network model,in addition,the method also shows good performance on public datasets.(3)For the study of STF-ResNet network model,this thesis designs and implements a yoga action recognition system.The system combines the functions of algorithm analysis as well as data analysis to detect the existing and simulated datasets,and can effectively identify yoga behaviors.By uploading videos of basic yoga action,it provides services such as action recognition and evaluation for yoga practitioners to help them find deficiencies and improve their actions,which is valuable in practical applications.

Keywords/Search Tags:

Yoga poses identification, two-stream networks, spatial-temporal feature mixing, convolutional block attention module

PDF Full Text Request

Related items

1	Research On Stroke Recognition In Badminton Videos Based On Highlights Extraction
2	Experiment Of Decoupled Operators On Two-Stream Convolutional Neural Networks
3	Multi-Stream Heterogeneous Graph Convolutional Network And Its Application In Text Classification
4	Image Classification Algorithm Of Convolutional Neural Network Based On Spatial Pyramid Pooling
5	A Study On The Temporal And Spatial Evolution Characteristics And Knowledge Relevance Of Knowledge Networks In Urban Agglomerations
6	A Study On MOOC Dropout Prediction Based On Learning Behavior Feature Mining
7	Research On Position Recommendation Based On Attention And Convolutional Networks
8	Recognition Of Basic Table Tennis Techniques Based On Spatial Temporal Graph Convolutional
9	Spatial Temporal Feature Of Population Migration And Analysis Of Influencing Factors In Liaoning
10	Research On The Identification,Temporal And Spatial Characteristics And Decision Mechanism Of Population Shrinkage In The Three Provinces Of Northeast China