Font Size: a A A

Study On The Processing Of Natural Language And Search Engine

Posted on:2008-10-01Degree:MasterType:Thesis
Country:ChinaCandidate:J ZhouFull Text:PDF
GTID:2178360242960090Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
The search engine which is the most important information ,searching tool on internet has been widely used at all aspects.However ,because of the fast increase ment of the network amount of informatior and disorder of network information ,the traditional search engine has not satified the personal and intelectual need for informationSearching the engine passes procedure of oneself to collect from the net with analysis web page, build up index, contribute efforts to the customer.This type of auto which matches a realization to check to seek through a keyword renews of search an engine advantage is the web page amount for covering is huge, because it owns according to the full text index of key word, it provided an entrance for all customers of on-line surfings, all customers can from search engine to set out to arrive the net that oneself wants up the whichever place.The natural language comprehension technique is an intelligence to search engine a key contents.Searching the engine wants to carry out intelligence to turn with characteristic, have to can induce and study the interest a fondness for of customer, comprehend the search claim of customer, compare to search result and customer interest the relativity of of a fondness for, recommend the most desirable text file to the customer, these all can not get away from a natural language comprehension technique.The natural language identifies and the processing is one of the most important topics that the artificial intelligence studies, is also the key of artificial intelligence research.Speak for the sake of making the artificial intelligence system obtain mankind's knowledge more availably for the research of artificial intelligence, there is stronger study function, have to have a very tall person machine dialogue an ability, so the system have to have stronger natural language to identify and handle an ability.Actually, all of the basic problems of natural language processing and other realms(if axioms certificate, problem answer, the mode identify, the machine and robot science etc.) of artificial intelligence are a knowledge expression and make use of a problem.Say of a little bit more overall be:how obtain various different knowledge, and can use by a kind of calculator and the method expression knowledge for handling.Once the appropriate knowledge structure built up with express theories well, so natural language processing of the bottleneck problem also removed.The textual research contents is mainly the method of natural language processing with currently the theories, technique of that realm.Elaborated to search the principle of engine in detail, search the basic structure of engine and search the inspectional function of engine.I am thorough to study related analysis to include the phrase method analysis and sentence construction analysis strategy.Did comparison research to various in common use automatic participle calculate way, improved a MM method, raised to slice a phrase efficiency.Pass distich legal theory various comparison of grammar characteristics in theory, find out the most suitable for describe natural language of grammar, gave it related calculate way.The automatic participle is the square one that the modern Chinese language carries on sentence construction analysis, is follow-up phrasing and the language righteousness analytical foundation.Because calculator's ising engaged in sentence construction analysis is the machine phrase and sentence construction rule database the phrasing knowledge of the basis.The machine phrase registered phrase method, sentence construction and language righteousness knowledge of each phrase.But sentence construction rule the database take knowledges, such as phrase, phrase and language righteousness...etc. as foundation to construct.The sentence that therefore a series of Chinese characterses constitute has to be advanced to go participle, then can make use of the machine phrase and rule database, also just probably carry on sentence construction analysis.The sentence construction analysis means the single phrase sequence of judgment importation can constitute sentence of conforming to the phrasing, sampling the sentence structure of the sentence that conforms to phrasing.Also namely applied sentence construction rule and other knowledge, will input sentence the line of the of single phrase order of sequence, become a not- line data structure, such as short language structure tree etc..The sentence construction system is an important stage that the natural language handles process, saying generally, the sentence construction system includes: form phrasing system, analysis the automatic and born system of control system and language sentence.The language righteousness analytical main mission is the vocabulary language righteousness unit form which produces a language text originally with them it the language righteousness of relation.Pass to handle theories and search the research of engine principle to the natural language, designed 1 to take natural language processing as core to search the engine model-intelligence to search engine model technically, gave function and detailed calculate way of each mold piece in the model.The main calculate way that body use now of its intelligence efficiently sex.The its more traditional engine technique included a very big improvement, it used concept to expand to raise system to recruit to return to rate first;It considered concept well in the text file place's position dissimilarity and its importance when the text file handle also different fact, improved engine to check a quasi- rate, put forward and used the dynamic state born technique of inspectional vector, avoid the data sparse problem to raise a movement efficiency.Make use of at the function aspect characteristic processing the way detect the search activity of customer over a long period of time, judging the interest of customer, donating an information automatically to the customer, in the meantime used an information feedback's technique to carry out person's machine to hand over with each other.Chapter 1 introduction natural language comprehends to say and study a dynamic state all.Introduced the research background, main contents of the thesis.Chapter 2 introduced to search engine principle.The data which includes to search engine collects a mark to lead a mechanism, mechanism and customer index of the data organization mechanism.Still introduced to search various inspectional function of engine.Give finally the intelligence search the frame structure of engine.Chapter 3 introduced related analysis.The in common use calculate way, sentence construction which includes automatic participle analyzes and the language righteousness analyze.Chapter 4 design and the realization which introduced to search engine model.
Keywords/Search Tags:Natural Language Processing, Search Engine, Segmentation, Syntactic analysis
PDF Full Text Request
Related items