Research On Human Action Recognition Method Based On 3D Convolutional Neural Network

Posted on:2021-02-20

Degree:Master

Type:Thesis

Country:China

Candidate:Y X Fan

Full Text:PDF

GTID:2428330614461456

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

Video-based human action recognition,as a hot research topic in the field of vision in recent years,is widely used in intelligent human-computer interaction and virtual reality,intelligent video surveillance and content-based video retrieval,smart medical treatment and nursing and other fields.However,how to extract more robust features from complex and changeable human action is a research difficulty in the field of action recognition under the real environment of cluttered background,occlusion and lighting changes.Traditional methods usually require manual design of features and rely on sufficient prior knowledge to achieve a high rate of action recognition.Thanks to the successful application of CNN in visual tasks such as image classification and target detection,many excellent deep learning methods are also gradually used in action recognition research,and some significant progress has been made.This thesis conducts an in-depth study of action recognition based on the 3D CNN architecture.The main work contents are as follows:(1)Due to the high complexity of the existing 3D CNN architecture,which makes it difficult to learn more rich and abstract deep features,a lightweight multi-scale convolution model is proposed.The model increases the local receptive field range in each layer of the network by embedding a lightweight multi-scale convolution module in the 3D convolution residual network.While significantly reducing the complexity of the model,it also extracts the multi-scale features of the target which significantly enhances the ability to represent the target.Finally,the channel attention mechanism is applied to the multi-scale features to extract key features.Experimental results show that the model in this paper not only has a high action recognition rate,but also has the advantage of reducing the complexity of the model.(2)Considering that the RGB image contains rich appearance information,which can describe the details and texture of human action well.While the Flow image contains important action information such as the speed and direction of the moving target.Therefore,an action recognition method based on multi-modal image input is proposed.The method generates intermediate images by fusing useful information in RGB and Flow images,and then forms multi-modal images with RGB images to increase network multi-source input.The time stage of fusing the two modal image features is studied to further improve the network performance.Experimental results show that this method is superior to other 3D CNN architecture methods in action recognition rate.

Keywords/Search Tags:

Human action recognition, lightweight multiscale convolutional module, multiscale features, channel attention mechanism, multi-mode image

PDF Full Text Request

Related items

1	Human Action Recognition Based On Convolutional Neural Networks
2	Research On Human Action Recognition Algorithm Based On Two Stream Convolutional Neural Network
3	Research On Human Skeleton Action Recognition Based On Graph Convolutional Networks And Attention Mechanism
4	Research On Human Action Recognition Method Based On Skeleton Features
5	Research On Face Recognition Method Based On Multiscale Feature Dimension And Manifold Learning
6	Human Action Recognition Based On Two-stream Convolutional Network
7	Research On Human Posture Estimation Based On Simple Baseline Deep Convolution Neural Network
8	Image Segmentation Method Based On The Theory Of Multi-scale Study
9	Research On Human Behavior Recognition Based On Multi-Scale Learning And Attention Mechanism
10	Research On Multi-modal Human Action Recognition Based On Features Fusion And Attention Mechanisms