Research On Dimensional Emotion Recognition Based On Deep Learning

Posted on:2024-01-28

Degree:Master

Type:Thesis

Country:China

Candidate:C X Liu

Full Text:PDF

GTID:2568307136992169

Subject:Electronic information

Abstract/Summary:

PDF Full Text Request

The dimensional emotion recognition task enables computers to automatically recognise human mental states by recognising a given image,video or voice message to quantitatively describe the human mental state,which enables computers to provide a more meaningful service to humans.Traditional methods using manual features lack generalisation modelling capabilities for large scale data,while deep neural network based methods have significant advantages in feature extraction and high dimensional feature processing,hence dimensional emotion recognition tasks are now usually based on neural networks.The main research contents of this paper are as follows:(1)A dimensional sentiment recognition method based on Res Net and optimized RNN is proposed.The optimized RNN is divided into LSTM and GRU,and feature extraction is completed using cascade structure to avoid the model complexity and pre-training difficulties caused by increasing dimensionality.Experiments are conducted on the Aff-Wild2 and AVEC2016 datasets to verify the validity of the proposed method,including a comparative analysis of the two methods.(2)A dimensional emotion recognition method based on Res Net and optimized TCN is proposed,in which a temporal attention module is added to the TCN network to capture the relationship of feature points within the sequence,and proposes a dual dilated convolutional structure to solve the problems of the small perceptual field of shallow network and the inability of deep network to perceive local information of TCN.The experimental results show that the method has good performance in emotion recognition.(3)A multi-task and multi-modal dimensional emotion recognition algorithm is proposed to exploit the intrinsic correlation between different emotion representations to assist the dimensional emotion recognition task with an emotion classification task.And a feature fusion method based on Leader-Follower idea and attention mechanism is used to fuse audio features as well as video features.The experimental results show that the method achieves excellent recognition results.The network model proposed in the paper has been tested on Aff-Wild2 and AVEC2016 with average recognition accuracies of 0.624 and 0.701 respectively,and compared with other mainstream dimensional emotion recognition models,the model in the paper is found to have higher recognition accuracy.

Keywords/Search Tags:

Dimensional Emotion Recognition, Spatio-temporal Feature Extraction, Temporal Attention, Multi-modality, Multi-task

PDF Full Text Request

Related items

1	Bi-Modal Emotion Recognition Based On Speech And Visual Cues
2	Research On Feature Fusion Method Of Speech Emotion Recognition Based On Deep Learning
3	Research On Video Action Recognition Technology Based On Spatiotemporal Feature Extraction
4	Research On Speech Emotion Recognition Based On Deep Learning
5	Research On Video Action Recognition Algorithm Based On Spatio-Temporal Features With 2D Convolutional Neural Networks Framework
6	Research On Spatio-temporal Correlation Feature Extraction And Recognition Of Multi-modal Tactile Signals
7	Research Of Speech Emotion Recognition Based On Deep Spatio-Temporal Representation
8	Video Action Recognition Based On 2D Convolution Network Under Spatio-Temporal Feature Enhancement Mechanism
9	Research On Speech Emotion Recognition Method Based On Multi-feature And Multi-modal Fusion
10	Human Action Recognition Based On Attention Mechanism And Multi-Modality Feature Fusion