Font Size: a A A

Modern Chinese Dynamic Auxiliary "" Automatically Generated

Posted on:2008-12-30Degree:MasterType:Thesis
Country:ChinaCandidate:X L HeFull Text:PDF
GTID:2205360215954451Subject:Linguistics and Applied Linguistics
Abstract/Summary:PDF Full Text Request
Natural Language Generation is a quite active field of the natural language processing which is based on computational linguistics and artificial intelligence now.It studies how to use computer to generate natural language text, and it has extremely important using value. The research can be regarded as a kind of technological means to test the particular language theory, and offer feedbacking for theoretical linguistics constantly, promote linguistics to develop in the depth direction.Dynamic auxiliary word "le" has been the focus and difficult point of the traditional language study.The characteristic and the rule of using of the dynamic auxiliary word "le" is one of the difficult point.Though the linguists have already paid close attention to it, but its research approaches are almost traditional semanteme and grammar describing, are mostly qualitative analysis, and lack of an overall investigations,and it still exists much differences, therefore,it lacks of the convincingness of the whole, and some need to reconsider.Moreover, it still has a lot of problem to solve,for example,how about the actual conditions of the dynamic auxiliary word "le" in modern Chinese? Is there regularity? If it is regular, what laws are there?This text mainly discussed how to generate the dynamic auxiliary word "le" after the verbs automaticly by using the method of the Natural Language Generation. This text is on the basis of utilizing the achievements of the traditional language reseach now, through studying and using the theory, method, experience and lesson of forefathers for reference, combine the extensive true language material, utilize the technology which takes rule as the core, and uses statistics in order to complement and combine the two together, to reconsider the using of the dynamic auxiliary word "le" in the angle of natural language generation.Observation and statistics of the language material is the starting point of generating the dynamic auxiliary word "le". We investigate a considerable amount of "le" with verbs in extensive corpus,in order to statistic the actual situation of the using of the dynamic auxiliary word "le" after verbs,and to summarize the regularity of the verbs which can add "le",etc. The rules are formalized and optimized from observing and statisticing the corpus and summarizing the forefathers' research results.Our main technology of the generating experiment is adopted on the basis of the regular generation strategy. On the basis of summarizing the research results of the traditional linguistics and promoting knowledge by statistics and observation of the corpus, we improve the create-rule storehouse, and form two main create-rules collection: "the rules which can't add 'le' " and " the rules which can add 'le'". In the rule-based generation system, we solve the problem of the conflict among the rules effectively through organizing the rule in order.To preserve the rule according to different levels and subdivide every type of the rules are the main tactics.We separately weigh the result of generating in two units,one is the verb which are marked "V " and the other is the big sentence,so the data have more levels, and they are more objective. Meanwhile,we also consider the complicated sitiatio n that it is proper for some verbs whether they add dynamic auxiliary words"le" or not. We adopt two bottom pieces to weigh the results of generating. The fist is a rigid copy which is totally faithful to the original text; The second is an elastic copywhich is joined manual intervention, but it has improved the correct rate greatly. Regards to the accurate rate, close test and open test both have good results.
Keywords/Search Tags:Natural language Generation, Dynamic auxiliary word, "le"
PDF Full Text Request
Related items