Research On Bird Sound Recognition Method Based On Transforme

Posted on:2024-08-10

Degree:Master

Type:Thesis

Country:China

Candidate:J H Wang

Full Text:PDF

GTID:2530307106977029

Subject:Electronic information

Abstract/Summary:

PDF Full Text Request

As an important part of the natural ecosystem,bird species provide an important basis for understanding regional biodiversity changes and climate and environmental changes.As an important individual feature of birds,birdsong has a high degree of recognition and has been widely used in the research of bird species identification.In recent years,with the rapid development of signal processing and sound recognition technology,bird species monitoring based on birdsong recognition has shown great application prospects due to its advantages of low cost,wide detection range and small restrictions.Under this research background,this paper aims at the needs of bird species monitoring and analysis,and studies the problems of single feature extraction method,insufficient use of feature information,and insufficient amount of bird sound data collected from field monitoring.The main work of this paper is as follows:(1)Aiming at the problem of low classification accuracy caused by single feature extraction method and insufficient use of feature information in traditional bird sound recognition algorithm,a bird sound recognition method combining Transformer network and convolution neural network is proposed.The time and frequency domain information in the spectrogram feature is extracted by using the strong capture ability of the convolutional neural network to the local feature,and the global time sequence information in the Mel differential feature is extracted by using the correlation ability of the multi-head attention mechanism in the Transformer encoder network to the context information.Finally,the local feature and the global feature are fused and input into the Softmax classifier to obtain the test results.Experiments were carried out on the Birdsdata dataset and Xeno-canto database,and the highest accuracy rates of 97.81% and 89.47% were obtained,respectively.The results show that the birdsong feature parameters obtained after feature fusion can obtain better results in the birdsong recognition test.(2)Aiming at the problem of uneven distribution of bird sound audio data collected from field bird monitoring and low recognition accuracy caused by over-fitting of neural network training,a small sample optimized bird sound recognition method based on bridging transformer is proposed.The model takes the Mel spectrogram as the input feature.On the basis of(1),the attention module and convolution module form a bridge transformer module,which preserves the interactive use of local and global information of the model,while optimizing the overall complexity of the model;Finally,we use the cross-attention mechanism of the sample loss optimization module to model the relationship between the output features and complete the internal sample expansion.The method was tested on the Birdsdata dataset and Xeno-canto database after small sample processing,and the highest accuracy rates were 91.34% and 82.63%,respectively.The experimental results showed that the model optimized the bird sound recognition method in the small sample environment and improved the recognition efficiency.

Keywords/Search Tags:

Bird sound recognition, Feature extraction, Transformer neural network, Convolution neural network, Attention mechanism

PDF Full Text Request

Related items

1	Study On Bird Intelligent Recognition Based On Sound Feature Fusion And Attention Neural Network
2	Research On Bird Species Image Recognition Based On Convolution Neural Network
3	Emotion Recognition Based On EEG Differential Entropy And Attention Mechanism Convolutional Neural Network
4	Single-channel Aliased Bird Sound Separation Based On Attention Mechanism And Empirical Mode Decomposition
5	Research On Recognition Of Combination Motions Based On Timing Sequence Signal Of EMG
6	Research On EEG Based Emotion Recognition Using Deep Learning Method
7	Research And Application Of Homogeneous Graph Node Classification Method Based On Graph Convolution Neural Network
8	Research On Infant Cry Classification Method Based On Graph Convolutional Neural Network And Transformer Representation
9	A Research Of Neural Network Modeling And Disease Prediction Of Heart Sound Signal
10	Auditory Attention Research Based On EEG And Convolutional Neural Network