Font Size: a A A

The Entropy Of The Chinese And Its Application In The Ontology Research Of The Chinese

Posted on:2014-01-27Degree:DoctorType:Dissertation
Country:ChinaCandidate:X P XuFull Text:PDF
GTID:1225330398959122Subject:Linguistics and Applied Linguistics
Abstract/Summary:PDF Full Text Request
On the basis of word segmentation and statistical calculation of small text sample, this thesis summarizes the basic theory on the entropy of the Chinese in linguistics, lexicology, and informatics and analyzes the possibility of the entropy in ontology research of the Chinese with the research method of corpus linguistics and mathematical statistics. The thesis discusses the application method for ontology research of the entropy in Chinese with examples, which provides a feasible and quantized analysis method. The thesis consists of five chapters:Chapter One: Introduction; Chapter Two:The entropy of the Chinese and its application in the ontology research of the Chinese; Chapter Three:the entropy of the Chinese and its application in the ontology research of the Chinese; Chapter Four:the entropy of the Chinese and the Zipfs Law; Chapter Five:Conclusion.Starting from the informatization research of language, Chapter One points out the feasibility of research ontology language with the method of information from the aspect of the similarity between the rules of the language and information coding. The practical significance, history, status quo and problems of this research are then introduced briefly. The theory and research method are put forward and some problems existing in the research are explained.Chapter Two first expounds the basic theory of the information entropy in details, the theoretical foundation. On the basis of preview research result, this thesis discusses the determination method and history of Chinese entropy and gives clear definition on the position of word frequency statistics in entropy calculation, which holds the opinion that word frequency statistics is the initial form of Chinese entropy calculation. Then the comparison between the entropy of different language is discussed. At the end of the chapter, the application method of Chinese entropy in the ontology research of the language is put forward with the analysis of the chivalry novels of Ku Lung and Louis Cha. Chapter Three first gives a distinction between word and phrase, defines the position of word frequency in word entropy calculation and gives the determined value. On the basis of this, the redundancy of the Chinese is discussed. Then, the application of the word entropy in the ontology research of Chinese is expounded specially. Examples are given on the application of the word entropy in the field of the comparison of different style, grammaticalization researches, text duration and computational stylistics.Chapter Four introduces an important statistic distribution law—Zipf s Law in language. With the calculation result of the Chinese entropy, word entropy, it is proved that the distribution of the Chinese in small text corpus is in line with the Zipf’s Law. Meanwhile, it is found that the distribution of the text’s self-information possesses high consistency by the Zipf s Law method, which improves the academic value of this research.The Conclusion gives a summery to this thesis and points out its limitations. Sections for further research is provided.
Keywords/Search Tags:Chinese, entropy, ontology research, application, Zipf’s Law
PDF Full Text Request
Related items