Shallow Parsing On Tibetan Textbook Of Junior Middle School

Posted on:2013-10-22

Degree:Master

Type:Thesis

Country:China

Candidate:W L Wang

Full Text:PDF

GTID:2247330395970960

Subject:Linguistics and Applied Linguistics

Abstract/Summary:

PDF Full Text Request

In the area of natural language processing, full parsing is a key point as well as a difficult point. Full parsing is not believed to be solved thoroughly based on the present study. But partial parsing, i.e. chunk parsing, can be used well to not only process natural language sentences but also lower the difficulty of analysis.Then on the basis of the previous definition of chunk, there are six types of chunk defined, which depends on syntactic marks in Tibetan textbook of junior middle school and grammar rules of modern Tibetan. Carried out the research of the following aspects:blocks of type, statistics of frequency and distribution and blocks the statistical co-occurrence relations, Intuitive evaluation of the structural composition and content of the junior high school of Tibetan language materials constitute.. And with the group block distribution reflects the value of the blocks.One part of the shallow parsing system is studied in this thesis, i.e. the chunk statistics and analysis of Tibetan textbook of junior middle school besides the difficulty. Two parts of chunk statistics are not only the general chunk type statistics but also the chunk frequency distribution and co-occurrence relations among chunk. As a result, the scientific difficulty of textbook is analyzed according to the statistics and some problems and shortage of textbook are found out. Furthermore, it can give some reasonable suggestions on designing textbook through data analysis.In this paper, the chunk simplifies sentence structure and improves the entire function of machine translation. It offers an idea for evaluating Tibetan textbook of junior middle school objectively, the foundation for improving the compiling quality of Tibetan textbook. Meanwhile, the result can be used in other areas of natural language processing, like information retrieval and text categorization.

Keywords/Search Tags:

natural language processing, chunk, corpus, statistics, Distribution, Tibetan language

PDF Full Text Request

Related items

1	The Construction Of Knowledge Graph In Basic Education Field Based On Natural Language Processing
2	Research On Chinese Web Forum Based On Natural Language Processing
3	Research On The Limited Domain Dynamic Geometry Natural Language Drawing Method
4	Strengthen The Study Of The Cognitive Mechanism And Effectiveness Of The Mathematics Language Conversion Of Tibetan Junior High School Students
5	The Relationship Between Spatial Ability,language Ability And Algebra Processing Of Tibetan Senior High School Students
6	Analysis For Students' Evaluation Of Teaching Information And Research On Scoring Prediction
7	Corpus-based Language Statistics And Analysis Of High School Chinese Textbooks
8	A Thorough Analysis On "the Tibetan Language Classes" Of Kindergarten In Ethnic Minority Areas In Northwest
9	The Research Of Effects Of Tibetan Students Learning Chinese Language Influenced By Parentsâ€™ Attitudes
10	Research And Analysis Of Mathematics Teaching Language In High School