Research On Tibetan Speech Recognition Technology Based On Recurrent Neural Network

Posted on:2020-12-18

Degree:Master

Type:Thesis

Country:China

Candidate:C J Nan

Full Text:PDF

GTID:2415330578464440

Subject:Computer application technology

Abstract/Summary:

PDF Full Text Request

As the most direct,natural and fast way of information interaction between people,voice plays an important role in people's daily life.Speech Recognition(SR)technology is one of the core technologies of human-computer interaction.Its goal is to let the computer convert the speech signal into the corresponding text or command through natural language understanding.The research of speech recognition has important theoretical significance and practical value in realizing voice dialing,voice navigation,indoor equipment control,voice document retrieval,and simple dictation data entry.With the development of neural network technology,cyclic neural networks that use context information to model the correlation of long-term speech have become the mainstream technology.As one of the ancient ethnic minorities in China,the Tibetans have a long history and splendid culture.Their language is an important part of Chinese culture and a bridge for cultural exchange between Tibetans and other brothers.In recent years,the speech recognition of Chinese and English has achieved fruitful results,and related methods and techniques are expected to become the pivot of Tibetan speech recognition.The recurrent neural network can mine the effective time series information in the input features,and enhance the distinguishing performance and expressive ability of the features.It has a greater advantage over traditional neural networks when dealing with continuous,context-sensitive tasks(such as speech recognition).Therefore,this paper studies the Tibetan speech recognition system based on recurrent neural network from the aspects of Tibetan text,acoustic characteristics and selection of modeling units.Firstly,by analyzing the characteristics of Tibetan characters,the Tibetan text preprocessing method,Tibetan character vector representation,and Tibetan language model are studied.Secondly,the speech data corresponding to the text is collected,and the Tibetan speech feature extraction method is studied.Combined with the characteristics of Tibetan speech,the phoneme is used as the modeling unit,and the acoustic model of Tibetan speech recognition is studied by establishing the probability mapping relationship between input speech and output sequence.Finally,the Tibetan speech recognition system was developed based on TensorFlow,and the experimental results of different super-parameters were evaluated by the label error rate and loss value.The experiment proves that the Tibetan speech recognition based on the cyclic neural network achieves good results under the test of closed corpus.

Keywords/Search Tags:

Tibetan, Speech recognition, Neural network, Acoustic modeling, RNN

PDF Full Text Request

Related items

1	Tibetan Lhasa Acoustic Model Based On LSTM-CTC Speech Recognition System
2	Research On Tibetan Speech Recognition Based On Deep Convolutional Neural Network
3	Research On Neural Network Based Tibetan Speech Synthesis Technique
4	Text Analysis Of Speech Synthesis Based On Statistical Parameters Of Tibetan Language In Specific Fields
5	Design And Implementation Of Printed Tibetan Recognition Software On Android Platform
6	Research On Tibetan Ujin Print Recognition Technology Based On Syllable Segmentation
7	An Empirical Study On The Impact Of Speech Recognition On The Quality Of Neural Network Machine Interpreting
8	Evaluation Of Lute's Acoustic Quality And Recognition Of Lute's Musical Note Based On Artificial Neural Network
9	Research On Word Segmentation And Part-of-speech Of Tibetan On Neural Network
10	Emotion Recognition Of Mouse Track Based On Neural Network