Font Size: a A A

The Construction Of Variant Character Set And Development Of Retrieval System Based On Hanyu Da Zidian(2nd)

Posted on:2018-01-17Degree:MasterType:Thesis
Country:ChinaCandidate:M N WangFull Text:PDF
GTID:2405330569475597Subject:Linguistics and Applied Linguistics
Abstract/Summary:PDF Full Text Request
With the development of information technology,academic research has gradually changed from traditional paper research to computer-assisted research.The most fundamental requirement is the reading and retrieval of electronic texts,and the current retrieval system does not include Chinese characters which is not disposed by computer or replaced with the picture,and do not implement truly complete retrieval.The digital processing of variant characters is an important part of solving this problem.Currently,Hanyu Da Zidian(2nd)is the dictionary which is the largest,no one has made a systematic statistics on the variant relationship,For this reason,this paper take the identical variant characters table in the Hanyu Da Zidian as our research target,establish a variant characters font library for electronic text,and design and implement the text retrieval system in the full sense.The paper is on the basis of counting out the identical variant characters with“Tong X”exhaustively,through creating a relationship join which is similar with“Tongyong”and“Diyong”,and sorting out completely variant characters table.In the process of constructing the variant character set this paper create the Chinese characters which can not be identified by computer,and complete the input method settings.Through the statistics found that there are 4828 Chinese characters which is not disposed by computer,and finally implement digital process of the all identical variant characters through new font library;And learn from the thought of Zihai Liangfen Input,to achieve the input and display of new words,ultimately implementing digital process of the variant character table,and building the variant character set.On this basis,this paper design and implement functions:the inquiry of variant character group,the frequency of occurrence and the environment of appearance statistics in ancient books,and can be tested individually,and through input a word,to find out all variant characters which are related to it and the using environment in ancient books,the range of retrieval can choose flexibility,or retrieve on the specified ancient text of the library,or retrieve on the ancient book library.In the design process of retrieval system,resolve the recognition and dispose work of the four-byte Chinese characters and the new word,mainly to judge the searching characters with the AscW()function,and finally implement the retrieval of all the characters in the variant characters table.By the construction of variant character set and development of retrieval system,and finally realized the digital search of the full text.while improving the usage rate of electronic text at the same time,and providing convenience for the study of Chinese characters ontology and literature version research.
Keywords/Search Tags:Hanyu Da Zidian(2nd), variant character, electronic texts, set of variant character, Retrieval System
PDF Full Text Request
Related items