Research On Whisper-to-Normal Speech Conversion Based On Generative Adversarial Network

Posted on:2023-10-05

Degree:Master

Type:Thesis

Country:China

Candidate:T Gao

Full Text:PDF

GTID:2568307043988759

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

Whispered speech is a special pronunciation style of human-beings,which is produced with no vocal-cord vibration.Whispered speech is widely used for private speech communication in public places.In addition,aphonic individuals with laryngectomy as well as those with low vocal capability also adopt whispering as their primary pronunciation form for oral communication.Due to its low energy,whispered speech is often transformed to normal speech for improving its speech quality.This thesis focuses on the research of whisper-to-normal conversion methods based on generative adversarial networks.The major works are as follows:Firstly,a novel generative adversarial network based whisper-to-normal speech conversion method was proposed.An “encoding-decoding" structure was adopted in the generator,which makes whispered speech feature as the input,and outputs the converted normal speech.Experimental results show that the proposed method obtained better speech quality of the converted speech than traditional GMM and BLSTM based methods.In order to utilize the correlation between the acoustic features of successive speech frames,a novel attention-guided generative adversarial network was proposed for whisper-to-normal speech conversion.The experimental results show that compared with the previous "encoding-decoding" based GAN method,this attention-guided GAN method improves the whisper-to-normal conversion performance in aspect of speech quality.

Keywords/Search Tags:

Whisper-to-normal speech conversion, Generative adversarial network, Encoding-Decoding, Attention mechanism

PDF Full Text Request

Related items

1	Research On Whisper To Normal Speech Conversion Based On Convolutional Neural Network
2	Research On Whisper To Normal Speech Conversion Based On Deep Neural Networks
3	Whisper To Speech Conversion And Whisper Recognition Modeling Method
4	The Research Of Personalized Speech Synthesis Based On Generative Adversarial Network
5	Study On The Conversion Of Whispered Speech Into Normal Speech By Feature Mapping
6	Research On Robust Image Steganography Method Based On Generative Adversarial Network
7	Research And Application Of Image Captioning Algorithm Based On Generative Adversarial Network
8	Research On Speech Conversion Algorithm Based On Generative Countermeasure Network
9	Research On Image Style Conversion Method Based On Generative Adversarial Network
10	Speech Enhancement Of Deep Neural Networks Combined With Attention Mechanism