Font Size: a A A

The Design And Application Of The Han Dynasty Bamboo Slips Corpus

Posted on:2017-03-07Degree:MasterType:Thesis
Country:ChinaCandidate:B Y WenFull Text:PDF
GTID:2295330485963393Subject:Chinese Philology
Abstract/Summary:PDF Full Text Request
With the development of computer science, the technology of information processing and information storage technology has been developed. It has brought new research methods, research methods and research ideas to the traditional Chinese language and characters. The Han Dynasty bamboo slips as precious materials and carrier of China’s history of Han Dynasty, have high research value. Because of its unique advantages such as convenient, fast and strong information processing ability, the digital technology has made the mainstream research institutions begin to work on the digital work of ancient books. By building a corpus of bamboo slips of Han Dynasty, from data collection, automatic annotation, network platform to build several aspects, the gradual completion of the bamboo slips of digital work.In this paper, under the computer database knowledge, knowledge of network programming, bamboo and wooden slips of knowledge and the automation of Chinese information processing knowledge guidance, build a corpus of bamboo slips and try to explore. During this period the bamboo slips of the database construction, Han Jian automatic word segmentation, part of speech automatic annotation work, Han corpus network platform design etc.. Some of the practical problems encountered in the construction of database, network platform construction is solved, and gives an exploratory attempt in the field of bamboo slips automatic word segmentation and part of speech tagging. Automatic identification technology and the use of corpus, rough statistics of bamboo slips of the Han Dynasty unearthed in northwest of names, in the name, and Zhangjiashan Bamboo Slips of Han Dynasty, Mawangdui Han Dynasty, Wuwei Han Dynasty, and other ancient bamboo slips of function words and content words.The thesis consists of five parts:1) introduction. Respectively introduce the background, research significance, construction and application status of bamboo slips of the Han Dynasty corpus, the present situation analysis and improvement goals, research methods and innovations.2) the second chapter introduces the process design of the corpus of Han dynasty. From two aspects of database and network platform, introduces the function and characteristics of the Han Dynasty and the corpus building steps.3) the third chapter is a corpus based study of Han Dynasty, the Han Dynasty text automatic word segmentation and POS tagging do. The key of the research in the northwest Han Dynasty name, name and topic, gerunds, words of ancient silk books inquiry class.4) the fourth chapter is the conclusion. In this paper, the technology used in the research process, research results have been summarized.5) references,appendix and postscript.
Keywords/Search Tags:Bamboo Slips, Corpus, Database, Network Platform, Automatic Word Segmentation, POS Tagging
PDF Full Text Request
Related items