Automatic accent assessment based on phonetic mismatch and human perception | | Posted on:2012-05-27 | Degree:M.S.E.E | Type:Thesis | | University:The University of Texas at Dallas | Candidate:William, Freddy | Full Text:PDF | | GTID:2465390011960156 | Subject:Engineering | | Abstract/Summary: | PDF Full Text Request | | The variability in pronunciation brought on by accent of non-native speakers causes significant changes in the quality of speech produced. Recently, automatic accent assessment systems have been employed to improve the pronunciation proficiency for non-native speakers. In this study, a new algorithm for automatic accent evaluation of native and non-native speakers is presented. The proposed system consists of two main steps: alignment and scoring. In the alignment step, the speech utterance is processed using a Weighted Finite State Transducer (WFST) based technique to automatically estimate the pronunciation mismatches (substitutions, deletions, and insertions). Subsequently, in the scoring step, two scoring systems which utilize the pronunciation mismatches from the alignment phase are proposed: (i) a WFST-scoring system to measure the degree of "accentedness" on a scale from -1 (non-native like) to +1 (native like), and a (ii) Maximum Entropy (ME) based technique to assign perceptually motivated scores to pronunciation mismatches. The accent scores provided from the WFST-scoring system as well as the ME scoring system are termed as the WFST and P-WFST (perceptual WFST) accent scores, respectively. The proposed systems are evaluated on American English (AE) spoken by native and non-native (native speakers of Mandarin-Chinese) speakers from the CU-Accent corpus. A listener evaluation of 50 Native American English (N-AE) were employed to assist in validating the performance of the proposed accent assessment systems. The proposed P-WFST algorithm shows higher and more consistent correlation with human evaluated accent scores, when compared to the Goodness Of Pronunciation (GOP) measure. The proposed solution for accent classification and assessment based on WFST and P-WFST scores show that an effective advancement is possible which correlates well with human perception. | | Keywords/Search Tags: | Accent, Assessment, Human, WFST, Non-native speakers, Pronunciation, Scores | PDF Full Text Request | Related items |
| |
|