| On the basis of linguistic theories, with the purpose of identifying the units of“yao X†in information processing, we focus on the 4 points:1).Researching large-scale text corpus, classifying the units of “yaoXâ€; 2).Abstracting the features of every class of “yao Xâ€,3).Giving the rules of computer processing; 4).Resolving the ambiguity problems. This paper is including 7 chapters:Chapter 0, introduction, which defines the objects and the significance of our studies, outlines the status of the related studies, and introduces the research thinking and methods of this paper.Chapter 1, the research and classification of “yao X†corpus. This chapter, basing the overall distribution of “yao X†in the corpus, classifies “yao X†into 3 forms.Chapter 2, analyzing the non-syntax-structure of “yao Xâ€. There are 2 types of no-syntax-structure “yao Xâ€:1).boundary group and morpheme group;2).word string group. We study and list out every left boundary, right boundary and morpheme group of “yao Xâ€.Chapter 3, analyzing the usage of the phrase structure of “yao Xâ€. We classifies the phrase structure of “yao X†into 2 types: “yao†is not auxiliary verb and “yao†is auxiliary verb, researches the features of “yao X†phrases.Chapter 4, analyzing the usage of the words “yao Xâ€. We Classify word “yao Xâ€by the part of speech, research the word frequencies, syntactic circumstances and the collocations in the texts.Chapter 5, establishing words list and rules.Basing on the information processing,we establishes the list of disambiguation, rules and illuminates the processing steps.Chapter 6, the epilogue. In this chapter, we summarizes the conclusions of this study, and point out the insufficient and future studies that we need to improve. |