Music Genre Recognition Research Based On Improved Deep Convolutional Neural Network

Posted on:2022-02-27

Degree:Master

Type:Thesis

Country:China

Candidate:J M Wang

Full Text:PDF

GTID:2505306722968159

Subject:Software engineering

Abstract/Summary:

PDF Full Text Request

The genre expression of audio features and the design of feature extraction in the field of music genre classification seriously affect the accuracy and generalization of the classification method.Therefore,a DCNN-AFC model is proposed of dual attention fusion extraction features,and Enhance the genre expression of the features in the audio power spectrogram.Firstly,to consider the diversity of music genre features in the audio power spectrogram,the Mel spectrum method is used in the feature extraction stage to effectively filter the audio signal by simulating the human ear auditory system,and the dimensionality of the signal after Mel filtering is restored.To ensure the effective retention of the genre feature attributes of the audio signal,and deepen the feature difference extracted between different genres.To reduce the input size and expand the model training scale in the model input stage,the audio power spectrogram obtained by feature extraction is cut into 128×128×1 and input the model,which further improves the calculation speed and training effect of the model.Secondly,The model’s deficiencies in the design of genre feature extraction are designed to design a residual module for deep learning that replaces the original ordinary convolution operation with a residual structure and enhances the depth of the network.While enhancing the highly abstract extraction of genre features,it avoids model degradation such as gradient disappearance.The problem is to ensure the effectiveness of the model;Finally,the channel attention mechanism and spatial attention mechanism are merged,and the genre features of the audio power spectrogram are coordinated and calibrated in the channel domain and the spatial domain,and the features extracted by the dual attention fusion Information input residual module,through the residual characteristics in the module to describe the features extracted by dual attention fusion in detail,improve the differential expression of features between different genres,thereby enhancing the directionality of genre feature extraction,and then improving the classification of music genres effect.Experiments show that the DCNN-AFC model is superior to other commonly used deep learning models in enhancing the feature extraction effect of music genres and the efficiency of genre classification.It also improves the accuracy of music genre classification by 6.27% to 11.35% compared with other deep learning models.Verify the effectiveness and advancement of the DCNN-AFC model.There are 37 figures,12 tables and 54 references in this paper.

Keywords/Search Tags:

music genre classification, deep convolutional neural network, audio frequency spectrum characteristics, structural residual, dual attention fusion

PDF Full Text Request

Related items

1	Research On Music Audio Classification Based On Deep Learning
2	Research On Music Genre Classification Algorithm Based On Deep Learning
3	Research On Music Genre Classification Model Based On Convolutional Neural Network
4	Research On Music Genre Classification And Generation Technology Based On Deep Learning
5	A Feature Fusion Method Based On Multi-level Local Feature Coding And Its Application To Music Genre Recognition
6	Research On Music Genre Classification And Cover Song Identification Based On Deep Convolutional Networks
7	Chinese Painting Image Classification Research Based On Deep Learning
8	Research On Electronic Music Feature Analysis And Genre Classification
9	Research On Artwork Classification Method Based On Multimodal Fusion
10	Research On Multi-modal Music Emotion Classification Based On Audio And Lyric