Research On Classification Of Speaker's Properties Based On Multi-instance Learning

Posted on:2014-09-19

Degree:Master

Type:Thesis

Country:China

Candidate:N Zhang

Full Text:PDF

GTID:2298330422474531

Subject:Optics

Abstract/Summary:

The classification of speakers' properties is the process of estimating a speaker's geographical and gender information from speech. It has important application value in many fields, for example multilingual information processing, machine translation, criminal investigation for public security, and military intelligence gathering. Multi-instance learning is an effective machine learning approach to the ambiguity problem. It is commonly used in image retrieval, text classification, and other static pattern classification tasks, but rarely in speech signal processing. This thesis proposes a time-varying gender identification method based on multi-instance learning and also applies it to dialect identification. The main achievements are as follows:

1. The Chinese dialect database was extended and labeled, covering mainly the northern dialect as well as the Min, Xiang, Gan, Wu, Yue, and Kejia dialects and Mandarin. Each speech segment was labeled with the speaker's information, such as gender, age, recording time, and city.

2. A time-varying multi-instance model was proposed. Because the speech signal is continuous, each speech segment was manually cut into several pieces, acoustic features were extracted from the signal, and the K-means algorithm was then used to obtain instances from the bags.

3. A two-point model was proposed to replace the single-point model. Under different scale transformations, the maximum diversity density point of each category was calculated separately.

4. Bags-kNN classification was proposed. In the back-end stage, a distance measure between sets was used, replacing the traditional threshold judgement to improve the performance of the classifier.
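The bag-level kNN idea in item 4 can be illustrated with a minimal sketch. The abstract does not say which set-to-set distance the thesis uses, so the minimal Hausdorff distance below is an assumption (it is a common choice for bag-level kNN). The function names, the toy two-dimensional "features", and the labels "A" and "B" are all hypothetical and are not taken from the thesis.

```python
from collections import Counter

import numpy as np


def min_hausdorff(bag_a, bag_b):
    """Smallest Euclidean distance between any instance of bag_a and any of bag_b.

    Assumed set distance; the thesis may use a different measure.
    """
    diffs = bag_a[:, None, :] - bag_b[None, :, :]   # all instance pairs
    return float(np.sqrt((diffs ** 2).sum(axis=2)).min())


def bag_knn_predict(train_bags, train_labels, query_bag, k=3):
    """Label a query bag by majority vote among its k nearest training bags."""
    dists = [min_hausdorff(query_bag, bag) for bag in train_bags]
    nearest = np.argsort(dists)[:k]                 # indices of the k closest bags
    votes = Counter(train_labels[i] for i in nearest)
    return votes.most_common(1)[0][0]


# Toy bags: each row is one instance (e.g. a cluster centre of acoustic features).
train_bags = [
    np.array([[0.0, 0.1], [0.2, 0.0]]),
    np.array([[0.1, 0.2], [0.0, 0.3], [0.2, 0.1]]),
    np.array([[5.0, 5.1], [5.2, 4.9]]),
    np.array([[4.9, 5.0], [5.1, 5.2], [5.0, 4.8]]),
]
train_labels = ["A", "A", "B", "B"]

query_bag = np.array([[5.1, 5.0], [4.8, 5.1]])
predicted = bag_knn_predict(train_bags, train_labels, query_bag, k=3)
print(predicted)
```

Because the vote is taken over whole bags, no per-bag threshold has to be tuned, which is the property item 4 attributes to replacing the threshold judgement.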

Keywords/Search Tags:

Multi-Instance Learning, Gender Identification, Dialect Identification, EM-DD algorithm

Related items

1	Research On Speech Language Identification Based On Deep Learning Network
2	Studies On Gender Identification Based On Handwriting
3	Research Of Korean Dialect Identification Based On Prosody
4	Research On Dialect Identification Based On Deep Learning
5	Research On Person Re-identification Algorithm Based On Middle-Level Features And Visual Saliency
6	Chinese Dialect Identification Based On Manifold Learning
7	The Research Of Chinese Dialect Identification Based On Speech Feature Analysis
8	Study Of Pitch Detection Algorithm And The Application In Dialect Identification
9	Research On Pedestrian Re-identification In Intelligent Monitoring System
10	Gender Recognition Of Multispectral Face Image Based On Deep Learning