A neural predictive HMM architecture for speech and speaker recognition

Posted on:1995-02-01

Degree:Ph.D

Type:Dissertation

University:University of Waterloo (Canada)

Candidate:Hassanein, Khaled Saad

Full Text:PDF

GTID:1478390014491074

Subject:Engineering

Abstract/Summary:

PDF Full Text Request

A speech recognition system is developed utilizing a neural predictive hidden Markov model architecture. In this framework multi-layered feed-forward neural networks are employed to implement accurate nonlinear speech frame prediction. A Markov chain is used to control changes in the weight parameters of these predictive networks. We show that speech recognition accuracy is closely linked to the capability of the predictive model in representing long-term temporal correlations in the speech signal.;Analytical expressions are derived for the correlation functions of various types of predictive models (linear, nonlinear, and jointly linear/nonlinear) in order to determine the faithfulness of these models to the actual speech data. This analysis, computer simulation and speech recognition experiments suggest that when both nonlinear and linear prediction are jointly performed within the hidden layer of the neural network predictor, the model is better at capturing long-term speech data correlations, and consequently, improving speech recognition accuracy. A convergence analysis was carried out indicating the advantages, in terms of better conditioning, of including explicit linear nodes within the hidden layer of neural predictive models.;Performance estimates are also obtained for the neural predictive HMM algorithm when mapped to a parallel systolic-based architecture, indicating its real time speech recognition performance capabilities.;A corrective training technique based on the Maximum Mutual Information criterion is then developed for training this class of models. Speech recognition systems trained using this approach were shown to significantly outperform their noncorrective Maximum Likelihood trained counterparts in several speech recognition tasks.;Finally, the Neural Predictive Hidden Markov Model (NP-HMM) is successfully used to implement an accurate and robust fixed-text speaker identification system.

Keywords/Search Tags:

Neural predictive, Speech, Recognition, Architecture, Hidden, Model, Markov

PDF Full Text Request

Related items

1	Speech Recognition Algorithm
2	Research Of Speech Recognition Based On Mixture Feature Extraction And Improved Continuous Hidden Markov Model
3	Speech Recognition Method Based On Hidden Markov Models
4	The Research On Speech Recognition Based On Hidden Markov Model In Noisy Environment
5	A Hybrid Approach For Speech Recognition Based On Combination Of HMM And RBF Neural Network
6	Research On And Implementation Of Continuous Speech Recognition System
7	Study Of Mandarin Digit Speech Recognition Algorithm Based On HMM Model
8	Speech Recognition System Based On HMM/ANN Hybrid Model
9	Study Of Speech Recognition System For Mandarin Digit Based On HMM
10	Voice Control Application