Study On The Tagging Of Mongolian Corpus And Relative Methodology

Posted on: 2009-12-30  Degree: Master  Type: Thesis
Country: China  Candidate: R G W Mu
GTID: 2155360245986944  Subject: Linguistics and Applied Linguistics
Abstract/Summary:
At present, corpus building has become an indispensable component of natural language processing. As an important linguistic resource, a corpus provides knowledge support for natural language processing. The building of Mongolian corpora started in the 1980s and has achieved some results: researchers have built corpora covering different periods, different scripts, and different uses. However, past corpus research put its emphasis on the nature of the language, and many problems arise when the deeper levels of natural language are processed. In order to bring Mongolian corpus processing technology up to the annotation depth of the influential corpora at home and abroad as soon as possible, this paper compares Mongolian corpora with well-known corpora at home and abroad from different perspectives, analyzes the problems in the processing of Mongolian corpora, and carries out automatic lexical annotation and post-processing of a Mongolian corpus of more than 260,000 words. It also analyzes the problems encountered in this process and, on that basis, proposes some methods for solving them.
Keywords/Search Tags: Corpus, Mongolian Corpus, annotation