Research On Voice Conversion From Tibetan Amdo To U-tsang Dialect Based On Deep Learning

Posted on:2021-02-08

Degree:Master

Type:Thesis

Country:China

Candidate:X T Xing

Full Text:PDF

GTID:2415330623982076

Subject:Intelligent information processing

Abstract/Summary:

There are great differences in pronunciation among Tibetan dialects,which makes it difficult for people in different dialects to communicate face to face.In recent years,great progress has been made in the voice conversion of Chinese and English.However,Tibetan voice conversion technology is still in its early stage.At present,there is only one implementation of Tibetan voice conversion based on the five-degree tone model.This method merely uses the parametric method to modify the pitch curve directly,and the converted sound quality is poor.The deep neural network(DNN)is used to complete the voice conversion from Amdo dialect to U-tsang dialect by using parallel and non-parallel corpus respectively.The main research work and innovations are as follows:Firstly,linguistic differences between dialects are analyzed to design parallel and non-parallel corpus respectively.Secondly,the voice conversion from Amdo dialect to U-tsang dialect is realized by using parallel corpus.In the training stage,the acoustic parameters are extracted to train conversion model through the use of DNN.In the conversion stage,the model is used to convert the acoustic parameters of Amdo dialect into that of U-tsang dialect.Then,the U-tsang speech can be synthesized by using vocoder.Furthermore,the voice conversion from Amdo dialect to U-tsang dialect is realized by using non-parallel corpus method.According to the different pronunciation of the two dialects,the pronunciation mapping table is designed.The pronunciation dictionary in the recognition stage and the context-related labels in the synthesis stage are designed according to the pronunciation mapping table.In this method,DNN is used as a network model for speech recognition of Amdo dialect and speech synthesis of U-tsang dialect.Finally,the naturalness,intelligibility and similarity of converted sentences are evaluated.The experimental results show that the non-parallel corpus method is better than the parallel one.

Keywords/Search Tags:

Amdo dialect, U-tsang dialect, Voice conversion, Parallel corpus, Non-parallel corpus

Related items

1	Research On Speech Conversion From Tibetan Amdo Dialect To Tibetan U-tsang Dialect
2	Research On Tibetan Voice Conversion Based On Deep Learning
3	Parallel Processing On Parallel Corpus Of Chinese-English
4	Corresponding Units In Chinese-English Parallel Texts--Corpus-driven Approach
5	Exploring recurrent word combinations in a business English learner corpus: A parallel corpus analysis and its curricular implications
6	Stydy On VerbÂ·prepositions And Multi-Parts Based Onã€Šhongrumongã€‹ Chinese-korean Parallel Corpus
7	A Study Of Huarui Dialect Of Tibetan Amdo Dialect
8	A Study On Construction Principle And Application Of Chinese-English Parallel Translation Corpus
9	On Chinese To Russian Translation Of Beijing Dialect From The Perspective Of Functional Equivalence Theory
10	Parallel Corpus-based Study On The English Translation Of Chinese Verb Directional Constructions