Dialect Species Recognition Based On Wavenet

Posted on:2021-04-14

Degree:Master

Type:Thesis

Country:China

Candidate:X Li

Full Text:PDF

GTID:2518306470969219

Subject:Software engineering

Abstract/Summary:

PDF Full Text Request

Under the environment in which the country is fully promoting Mandarin,the use of dialects has gradually decreased.As a unique language form,dialects have rich cultural heritage.Based on the cultural forms of different regions,China has formed local dialects with different forms.How to use deep learning to identify species in various dialects has become the focus of current research.However,most of the existing technologies are aimed at language recognition,and there are few studies on the recognition of Chinese dialect species and the recognition rate needs to be improved.In addition,the boundary between dialects and dialects is relatively vague,and the extraction of dialect features also encountered bottlenecks.In response to the above problems,this article carried out the following work:(1)Collected six types of dialect audio from different channels.Based on the six kinds of experimental data in the data set,the differences between the audios of dialects are analyzed.It is proposed to combine the pitch features in Chinese dialects with artificial acoustic features as the object of feature extraction.(2)Establish a dialect species recognition model based on Wavenet.The dilated causal convolution method is used for convolution operation,and the gating activation mechanism and residual network are introduced to establish a recognition model.Compared with other classification algorithms,the accuracy rate is 1.57% higher than that of Google NET.(3)To solve the problem of model scale,use knowledge distillation technology to compress the model.Transfer the knowledge acquired by the teacher's network to a streamlined model to guide it in network training.Thus,a smaller model is obtained,and the loss of recognition accuracy does not exceed 2%.This method can accelerate model training while ensuring performance.(4)Analysis and design of dialect species identification system.Apply the proposed classification algorithm to the system.According to the detailed program design,the function of the entire system is realized.The test results show that the response time of the system to recognition is stable within 3s,and the recognition rate reaches 98.33%.The dialect species recognition system implemented in this paper can classify the dialect audio in detail,providing strong technical support for the continuation of dialect culture.As a front-end processing technology for dialect recognition,it will pave the way for future text recognition.The model compression technology used at the same time provides convenience for application deployment to mobile devices.

Keywords/Search Tags:

Wavenet, Feature Extraction, Dialect Species Recognition, Model Compression, Knowledge Distillation

PDF Full Text Request

Related items

1	Research And Implementation Of Compression Algorithm For Multilevel Knowledge Distillation Model Based On Feature Map
2	Design And Implementation Of Compression And Acceleration System Of Lip Reading Model Based On Knowledge Distillation
3	Research On Stage-by-Stage Knowledge Distillation And Assistant Model Based Knowledge Distillation
4	Research And Application Of Model Compression Algorithm Based On Pruning-quantization-knowledge Distillation
5	Research On Dialect Accent Speech Recognition Based On Articulatory Information
6	Research On Knowledge Distillation Algorithm Based On YOLOv3
7	Research On Compression Approach Of Network Of General Object Detection Based On Knowledge Distillation
8	Research And Application Of Lip Reading Recognition And Model Compression Based On Deep Learning
9	Study On Plant Species Recognition Based On Leaf Characteristics And Realization Of Recognition System
10	A Study Of Model Compression Approaches To Deep Learning-based Sequence Models