
Research And Implementation Of Tibetan Language Model Based On RNN

Posted on: 2020-03-12  |  Degree: Master  |  Type: Thesis
Country: China  |  Candidate: N Yang  |  Full Text: PDF
GTID: 2415330572993900  |  Subject: Computer technology
Abstract/Summary:
With the rapid spread of the Internet and the constant renewal of information, artificial intelligence (AI) has become an important direction for the future development of science and technology. Speech recognition is an important branch of AI research; its purpose is to let machines and people communicate with each other through voice, realizing human-computer interaction. At present, speech recognition has achieved high recognition rates for widely spoken languages such as English and Chinese, but research on low-resource languages such as Tibetan remains relatively scarce. The language model is a key module in speech recognition and the principal formal description of the statistical regularities of a language, so it strongly affects the final performance of a speech recognition system. Beyond speech recognition, language models are also widely used in machine translation, automatic word segmentation, and syntactic analysis.

This thesis studies the language model based on the Recurrent Neural Network (RNN) and the traditional N-gram statistical language model, constructs the corresponding Tibetan language models, and evaluates their performance. By varying model parameters and adding optimization methods, the experiments compare the perplexity of the two models. The goal is to obtain a Tibetan language model with better recognition performance, so that a subsequent Tibetan speech recognition system can combine it with an acoustic model to achieve a higher recognition rate.

The traditional N-gram language model is a shallow model: as the amount of data and the complexity of the data structure grow, problems such as data sparsity arise and its modeling ability declines. The RNN is a deeper model with better learning and modeling capability than the N-gram model. In this study, by changing the number of hidden-layer neurons in the RNN Tibetan language model, adding a class layer to the output layer to accelerate training, and using contextual word-vector features together with LSTM training, the study alleviates the problem that the standard RNN language model, owing to vanishing gradients, cannot effectively capture long-distance constraints. The experimental results show that the optimized Tibetan RNN language model outperforms the traditional N-gram language model, although its training time is relatively long and its training process more complex.
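The abstract compares the two models by perplexity. As a minimal illustrative sketch (not the thesis's actual code), an add-one-smoothed bigram model and its perplexity over a tokenized corpus can be computed as follows; the toy corpus and function names are assumptions for illustration:

```python
import math
from collections import Counter

def train_bigram(corpus):
    """Count unigrams and bigrams over a corpus given as lists of tokens."""
    unigrams, bigrams = Counter(), Counter()
    for sent in corpus:
        tokens = ["<s>"] + sent + ["</s>"]
        unigrams.update(tokens)
        bigrams.update(zip(tokens, tokens[1:]))
    return unigrams, bigrams

def perplexity(unigrams, bigrams, corpus):
    """Perplexity under an add-one-smoothed bigram model:
    P(w|prev) = (count(prev, w) + 1) / (count(prev) + |V|)."""
    vocab = len(unigrams)
    log_prob, n = 0.0, 0
    for sent in corpus:
        tokens = ["<s>"] + sent + ["</s>"]
        for prev, cur in zip(tokens, tokens[1:]):
            p = (bigrams[(prev, cur)] + 1) / (unigrams[prev] + vocab)
            log_prob += math.log2(p)
            n += 1
    return 2 ** (-log_prob / n)
```

Lower perplexity means the model assigns higher probability to the test text, which is the criterion the experiments use to rank the N-gram and RNN models.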
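The class layer mentioned above speeds up the output layer by factoring P(w | h) = P(class(w) | h) * P(w | class(w), h), so each step normalizes over the classes plus one class's words rather than the whole vocabulary. A minimal sketch of this factorization, with a hypothetical toy vocabulary and scores (not the thesis's setup):

```python
import math

def softmax(scores):
    """Numerically stable softmax over a dict of raw scores."""
    m = max(scores.values())
    exps = {k: math.exp(v - m) for k, v in scores.items()}
    z = sum(exps.values())
    return {k: e / z for k, e in exps.items()}

# Hypothetical vocabulary partitioned into two classes.
CLASSES = {"N": ["yak", "tea"], "V": ["runs", "drinks"]}

def factored_distribution(class_scores, word_scores):
    """Full P(w|h) via two small softmaxes (one over classes, one over
    the words inside each class) instead of one softmax over all of |V|."""
    p_class = softmax(class_scores)
    dist = {}
    for c, words in CLASSES.items():
        p_word = softmax({w: word_scores[w] for w in words})
        for w in words:
            dist[w] = p_class[c] * p_word[w]
    return dist
```

With |C| classes of roughly |V|/|C| words each, the per-step normalization cost drops from O(|V|) to O(|C| + |V|/|C|), which is the acceleration the class layer provides during training.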
Keywords/Search Tags: Tibetan, speech recognition, language model, N-gram language model, RNN language model