
Study On The Characteristics Of Tibetan Proverbs Based On Corpus

Posted on: 2018-09-03
Degree: Master
Type: Thesis
Country: China
Candidate: Z Z X Dang
Full Text: PDF
GTID: 2335330515486065
Subject: Chinese Ethnic Language and Literature
Abstract/Summary:
This thesis collects and collates Tibetan proverb texts from Amdo, Kham, Tibet and other regions to build a Tibetan proverb corpus. The corpus is word-segmented and part-of-speech tagged with the TibetsegementHMM Ver3.0 software and then proofread manually against a tagging set, so as to construct a thesaurus of Tibetan proverbs. Taking this corpus as the research sample, the thesis analyzes the characteristics of Tibetan proverbs.

The thesis is divided into six chapters. The first chapter introduces the concept and definition of Tibetan proverbs and the collection of proverb texts for the corpus, and explains the state of research at home and abroad and the purpose and significance of the study. The second chapter describes the genre features of Tibetan proverbs on the basis of sentence-type classification and statistics. The statistics show that two-sentence proverbs are the most numerous type and also the type people use most frequently in everyday language. The third chapter analyzes the sentence-structure features of Tibetan proverbs, including dual asymmetry, ultrashort and elliptical structures, and certain other sentence structures. The fourth chapter studies the lexical features of Tibetan proverbs. It analyzes word length, the internal structure of words, and the syllable composition of words of different parts of speech in order to construct a Tibetan proverb vocabulary. The results can provide some support for syntactic parsing and other areas of Tibetan information processing. The fifth chapter compiles word-frequency statistics for the Tibetan proverb vocabulary, identifies the high-frequency words, and analyzes them. The sixth chapter takes the Tibetan grammatical work "accent theory" as its standard and analyzes the classification of Tibetan grammatical words and the Tibetan segmentation rules. On the basis of Tibetan syllable segmentation and proverb constructions, it describes a rhythm model scheme for Tibetan proverbs. The research can offer new methods and tools for Tibetan information processing and Tibetan grammar research.
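
The word-frequency and word-length statistics mentioned for the fourth and fifth chapters can be illustrated with a minimal Python sketch. It is not the thesis's software or method. It assumes a hypothetical input format in which each line holds one segmented proverb, tokens are separated by spaces, and each token is written as `word/TAG`. The tag names, the function names and the toy tokens are illustrative assumptions. The only fact relied on is that Tibetan syllables are delimited by the tsheg mark (U+0F0B).

```python
from collections import Counter

TSHEG = "\u0f0b"  # Tibetan intersyllabic mark (tsheg), separates syllables


def parse_line(line):
    """Split a line of 'word/TAG' tokens into (word, tag) pairs.

    The 'word/TAG' layout is an assumed format, not the output format
    of any particular segmenter.
    """
    pairs = []
    for token in line.split():
        word, _, tag = token.rpartition("/")
        if word:  # skip tokens that carry no tag separator
            pairs.append((word, tag))
    return pairs


def syllable_count(word):
    """Count syllables by splitting on the tsheg, ignoring empty pieces."""
    return len([s for s in word.split(TSHEG) if s])


def corpus_stats(lines):
    """Return (word frequencies, syllable-length distribution per tag)."""
    word_freq = Counter()
    length_by_tag = {}
    for line in lines:
        for word, tag in parse_line(line):
            word_freq[word] += 1
            lengths = length_by_tag.setdefault(tag, Counter())
            lengths[syllable_count(word)] += 1
    return word_freq, length_by_tag


if __name__ == "__main__":
    # Toy placeholder tokens, not real proverbs; 'n' and 'v' are made-up tags.
    sample = ["ka\u0f0bkha/n ga/v", "ka\u0f0bkha/n"]
    freq, lengths = corpus_stats(sample)
    print(freq.most_common(5))
    print({tag: dict(c) for tag, c in lengths.items()})
```

Calling `freq.most_common(k)` lists the k highest-frequency words, and `lengths[tag]` gives, for each part of speech, how many word tokens have one, two or more syllables.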
Keywords/Search Tags:information processing, Tibetan proverb features, statistical analysis