Font Size: a A A

An Automatic Extraction Study Of Chinese Basic Vocabulary Based On Genetic Algorithms

Posted on:2008-02-20Degree:MasterType:Thesis
Country:ChinaCandidate:Z P ZhangFull Text:PDF
GTID:2178360218451974Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
The basic word is the important part of the vocabulary and many languageresearches. Since the concept of basic vocabulary was proposed, the domesticscholars have raised a study upsurge of the basic vocabulary. After half century,research has made considerable achievements. Various identification methodsof the Chinese basic vocabulary were proposed. That provided the referencewhich we considered three characteristics of the basic vocabulary.GA is the new computational method which the life science and theengineering mutually jointly form. It is not only self-organizing, adaptive, andthe intelligent self-learning characteristics, but also the inherent nature ofparallel characteristics. The algorithm evaluates the fit and unfit quality of theindividual by fitness. GA has become an effective tool which the non-linearoptimization and the system recognition by more than 30 years research andthe application. It has been widely used in the application, including functionoptimization, combinatorial optimization, production scheduling and automaticcontrol, machine learning, image processing, artificial life, machine learningand other fields. In the natural language processing aspect, GA also receivedtakes to application in information extraction, the text Classification andClustering, the data mining, Knowledge library automatic production, thehandwritten character recognition and so on and it has obtained the very goodeffect. The practice proved that GA is an optimized method of the modern. It is appropriate to be used in large-scale and complex discrete space GlobalOptimization. It is better than the conventional method in the solution speedand the quality, is a high speed approximate method.Automatic extraction of Chinese basic vocabulary is the learning process.First, GA analyzes three characteristics of the basic vocabulary enumerated bylinguists listed, and studies and summarizes the rule of these words. Then, GAcompute in word table of' Engineering Definition of Contemporary ChineseCommon-using Words" base on that rule. The paper describes the operations ofthe genetic algorithm in detail.
Keywords/Search Tags:Genetic Algorithms, Chinese Basic Vocabulary, automatic extraction
PDF Full Text Request
Related items