| Based on the corpus, this paper mainly deals with the statistical study on words and expressions used in Chinese textbooks of secondary schools in Shanghai. There are five parts in all.In the first part, the thought and content of this paper is introduced. Then a brief introduction of corpus and the research situation of text books used in secondary school follow.Secondly, the constructing situation of corpus of textbooks of secondary schools is briefly introduced. This makes a good foundation for the later statistical study. Though the construction of corpus of secondary school textbooks takes up only a small part in the paper, it takes most of time and energy to do this part of research. Without this basic research, we cannot go on with this research.In the main part, the statistics and analysis of words and expressions used in Chinese textbooks of secondary schools are discussed in detail, including using frequency and coverage rate. Besides this data, the word-using data of classical Chinese, contemporary Chinese and translated literature are also put in statistics respectively. Words used of the textbook are only discussed in statistics without analysis in light of the length of the article.The fourth part is the results and discussions. In this part, two aspects are discussed: the frequency of words and expressions and the complexity of texts.The fifth part is conclusions and expectations.Followings are the conclusions:1. The using frequency of single word is 8 to 9 times larger than that of expressions.2. Three-fourths of the total are commonly used words while the rest are not. On the other side, 97.23% of the commonly used words can be seen in textbooks. So we come to a conclusion that textbooks of secondary schools aim to review the words learned in elementary stage as well as to learn more non-commonly used word.3. Translated literature is usually longer than contemporary Chinese text and the words used are statistically scattered. The coverage rate of the high-frequency words used in translated literature is relatively stable compared with that of the whole text book.4. Classical Chinese text is usually short and the words used are scattered. The high-frequency words used in classical Chinese text are relatively concentrated and the higher frequency, the more concentrated.5. The larger corpus it is, the lower proportion of low-frequency words is used. The proportion of low-frequency words used in translated literature is much higher than that of contemporary Chinese texts.Based on the analysis of research data, we also find some problems. The current edition of Chinese textbooks lacks sense of science based on the statistical data of corpus. There are two major drawbacks: the low reappearance rate of new words and the unscientific sequence of texts. |