Font Size: a A A

A Study On Frequency Statistics And Distribution Of Ancient Books In The Han,wei,Jin,Southern And Northern Dynasties

Posted on:2021-09-05Degree:MasterType:Thesis
Country:ChinaCandidate:J Z LiuFull Text:PDF
GTID:2505306104489614Subject:Linguistics and Applied Linguistics
Abstract/Summary:PDF Full Text Request
Chinese character frequency is the fourth element of Chinese characters in addition to the three elements of shape,sound and meaning.Investigating the frequency of ancient books in ancient dynasties helps to understand the development and evolution of Chinese characters,and is of great significance to the study of the appearance of ancient Chinese characters and the changes in social thoughts and cultural styles of ancient dynasties.After examining the current status of research on Chinese character frequency at home and abroad,this paper found that there were few previous research results.Among them,the main research was on the character frequency of special books,followed by the study of word frequency over time,and the study of dysfunctional character frequency was rare.This article takes the study of the frequency of dysfunctional dynasties as the starting point,selects the ancient books and literatures of the Han,Wei,Jin,and Southern and Northern Dynasties as the research object and the original corpus,and conducts a comprehensive review of the frequency of the ancient books,the Han,Wei,Jin,and Southern and Northern Dynasties from three perspectives: overall,diachronic,and synchronic Systematically collect statistics to obtain first-hand data,and conduct research on word frequency distribution based on these data.Based on the principles of database integrity,exhaustion and accuracy of Chinese ancient books,this paper first constructs a overall corpus of ancient books from the Han,Wei,Jin,and Southern and Northern Dynasties,a diachronic corpus of ancient books from the Han,Wei,Jin,and Northern and Southern Dynasties,and a synchronic corpus of ancient books from the Han,Wei,Jin,and Northern and Southern Dynasties.Then,the word frequency statistics and word frequency classification of the overall corpus were carried out,and the analysis found that the word frequency of the overall corpus roughly conformed to the Zipf distribution,and the frequency distribution of the core word area and the one-use word area were very uneven.The word frequency distribution of the word area has a significant influence.At the same time,the character frequency distribution of the Han,Wei,Jin,Southern and Northern Dynasties was more uniform than that of the pre-Qin period.Afterwards,this paper made word frequency statistics on the four sub-corpora of the diachronic corpus,and compared and analyzed the situation of using words in ancient books in the four historical stages of the Han Dynasty,the Three Kingdoms,the Two Jin Dynasty,and the Northern and Southern Dynasties from a diachronic perspective.By calculating the Levinstein distance of the extremely high-frequency characters of ancient books at various historical stages,this paper finds that the closer the eras are to each other,the closer the frequency distribution of the extremely high-frequency characters is;The distribution is the most uneven,the two Jin and one word are the most evenly distributed,and the difference is small.Finally,the article has conducted word frequency statistics on 18 sub-corpora under the synchronic corpus.From the synchronic perspective,it has compared and analyzed the situation of using words in different categories of ancient books.It is found that the total word volume,single word volume,average word frequency,discrete coefficients,Chinese characters In the first-order entropy,there are large differences among the 18 sub-libraries,and the first 4 data indicators show varying degrees of linear correlation.In addition,the 18 sub-libraries of extremely high-frequency words are widely distributed in the real-word class words with the characteristics of different types of ancient books.So far,through the use of corpus linguistics methods,computer programming methods,and statistical methods,and from a more comprehensive perspective,this paper has completed the research on the frequency statistics and distribution of ancient books in the Han,Wei,Jin,and Northern and Southern Dynasties,to a certain extent.The blank of the research on the word frequency of the dynasties in the Han,Wei,Jin,and Northern and Southern Dynasties.
Keywords/Search Tags:Ancient Chinese character frequency, Word frequency statistics, Word frequency grading, Word frequency distribution, Han,Wei,Jin,Southern and Northern Dynasties
PDF Full Text Request
Related items