Study On The Key Technology Of The Speech Recognition And It's Improved Algorithm

Posted on:2015-12-21

Degree:Master

Type:Thesis

Country:China

Candidate:F Z Liu

Full Text:PDF

GTID:2298330422985380

Subject:Traffic Information Engineering & Control

Abstract/Summary:

As one of the important fields of computer technology research, speech recognition has great research value and wide application prospects. However, most speech recognition systems are still confined to the laboratory. Although some speech recognition products have reached the market, a large gap remains between users' expectations and actual performance.

This thesis focuses on two key aspects of a speech recognition system: efficiency and accuracy. Following the processing flow of speech recognition, it examines each key stage in depth.

First, the thesis introduces the basic principle and process of speech recognition. It then analyzes the digital model of the speech signal and the problems of its preprocessing. To address the shortcomings of traditional voice endpoint detection, a new endpoint detection method based on image edge detection is proposed. Experimental results show that the new algorithm improves the detection accuracy rate by 18.6% and is superior to the traditional algorithm.

Second, the characteristics and extraction processes of several common speech feature parameters are introduced and discussed. Based on an in-depth study of the MFCC parameter, the thesis proposes an improved MFCC parameter obtained by applying a series of processing steps to the MFCC feature: weighting, differencing, and PCA dimensionality reduction. Experimental results demonstrate that the new feature parameter makes the speech recognition system more robust. Averaged over three SNR environments, recognition accuracy increases by 9.2% and 4.3% compared with the traditional LPCC and MFCC feature parameters, respectively, and the average training time is shortened by 18.2% and 11.5%, respectively.

The thesis places particular emphasis on speech recognition based on the HMM. The application of the HMM has brought considerable progress to speech recognition, and the HMM is currently the most widely used modeling technique in the field. Because the initialization of the traditional HMM is too simple and crude, the thesis proposes an improved HMM initialization algorithm. Experimental results show that the new initialization algorithm reduces the average training time by 36.9% and improves the system recognition rate by 5.2%.

Finally, building on the theoretical research, the thesis uses the MATLAB simulation software and the VoiceBox speech signal processing toolbox to establish a small speech recognition simulation system, which is used to carry out the experiments. For isolated-word speech recognition, the recognition accuracy of the system reaches 95.5%.
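The MFCC post-processing chain named in the abstract (weighting, differencing, PCA dimensionality reduction) can be illustrated with a minimal Python sketch. The abstract does not give the thesis's actual weighting function, delta window, or number of retained components, so everything below is an assumption: the sinusoidal lifter, the regression delta over two frames, the 12 retained components, and the helper names `lifter`, `delta`, `pca_reduce`, and `improved_features` are all hypothetical. The input is a random matrix standing in for real MFCC frames.

```python
import numpy as np

def lifter(mfcc, L=22):
    """Weight cepstral coefficients with a sinusoidal lifter (assumed choice)."""
    n = np.arange(mfcc.shape[1])
    weights = 1.0 + (L / 2.0) * np.sin(np.pi * n / L)
    return mfcc * weights

def delta(feat, N=2):
    """First-order regression delta over +/-N frames, with edge padding."""
    T = feat.shape[0]
    padded = np.pad(feat, ((N, N), (0, 0)), mode="edge")
    denom = 2.0 * sum(k * k for k in range(1, N + 1))
    out = np.zeros_like(feat, dtype=float)
    for k in range(1, N + 1):
        # frame t+k minus frame t-k, for every t at once
        out += k * (padded[N + k : N + k + T] - padded[N - k : N - k + T])
    return out / denom

def pca_reduce(feat, n_components):
    """Project mean-centered features onto the top principal axes via SVD."""
    centered = feat - feat.mean(axis=0)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return centered @ vt[:n_components].T

def improved_features(mfcc, n_components=12):
    """Weight, append deltas, then reduce dimensionality (assumed ordering)."""
    weighted = lifter(mfcc)
    stacked = np.hstack([weighted, delta(weighted)])
    return pca_reduce(stacked, n_components)

# Synthetic stand-in for 200 frames of 13 MFCCs (not real speech data).
rng = np.random.default_rng(0)
mfcc = rng.standard_normal((200, 13))
features = improved_features(mfcc)
print(features.shape)
```

In this sketch the static and delta coefficients are stacked before PCA so that the projection can discard correlated dimensions from both; whether the thesis applies the steps in this order is not stated in the abstract.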

Keywords/Search Tags:

speech recognition, voice endpoint detection, extraction of speech feature parameters, HMM, mini speech recognition system

Related items

1	The Research Of Front-end Processing Technology Based On The Speaker-independent Speech Recognition
2	Research Of The Characteristics Parameters Extraction In The Personal Of Speech Recognition
3	Research On Key Technologies Of Speech Recognition In Tank Noise Environment
4	The Research On The Speech Recognition System In The Noisy Environment
5	Speech Recognition System Of Speaker-independent And Isolated Words Based On DSP
6	Airborne Voice Recognition Technology And Environment
7	Research And Implementation Of The Speech Recognition Technology Based On DSP
8	Research On Robust Speech Recognition
9	Research On Mandarin Connected Digit Speech Recognition
10	Research On The Speech Emotion Recognition Based On Voice Signal