Research On Convolutional Network And Its Variants In Keyword Spotting

Posted on:2022-09-01

Degree:Master

Type:Thesis

Country:China

Candidate:C Y Mei

Full Text:PDF

GTID:2518306551453964

Subject:Master of Engineering

Abstract/Summary:

PDF Full Text Request

Voice wake-up is the entry point for human-machine voice interaction.High accuracy and low false wake-up rate are the basis of a good experience.At the same time,in order to adapt to the computing conditions of mobile devices,the memory and computing resource are also required to be as low as possible.In response to the two requirements of arousal performance and resource occupancy,the focus of field research has shifted from methods based on hidden Markov models to neural network methods that use simple post-processing.Deep neural networks(DNN)and convolutional neural networks(CNN)are widely used based on cross-entropy system trained with framewise alignment data.In view of the large amount of calculation in cross-channel operations of ordinary convolutional networks,Use depthwise separable convolution to separate the convolution operations of cross-channel correlation and spatial correlation.Based on the dilated convolution and depthwise separable convolution structure,this paper proposes an efficient model to improve the performance of the cross-entropy training system.This structure exhibits higher power efficiency and accuracy performance due to the expansion of the receptive field and the separation of depthwise convolution and pointwise convolution.At the same time,inspired by the application of WaveNet in the field of Text To Speech(TTS),a universal and optional keyword wake-up system based on CTC loss is proposed,which performs better than the previous structure based on cross-entropy loss function after feeded domain data.

Keywords/Search Tags:

Keyword Spotting, Voice Wake-up, Dilated Convolution, Depthwise Separable Convolution, WaveNet, CTC

PDF Full Text Request

Related items

1	Study On Keyword Recognition Based On Neural Network
2	Deep Convolution Neural Network And Its Application In Ground Image Target Recognition
3	Research On Website Fingerprinting Attack Technology Of Tor Based On Deep Learning
4	Research On Video Semantic Segmentation Algorithm Based On Deep Learning
5	Research And Application Of Lightweight Shadow Detection Algorithms Based On MobileNetV3
6	Study On Polyphonic Sound Event Detection Based On Deep Learning
7	The Design And FPGA Verification Of A CNN Accelerator With Depthwise Separable Convolutions
8	Research On Target Detection Based On Improved Convolutional Neural Network
9	Research On Text Sentiment Analysis Algorithm Based On Pre-Trained Language Model
10	Design Of Ultra Low Power Intelligent Keyword Spotting Chip Based On Quantitative Feature Extraction