Font Size: a A A

Research On Focused Search Engine For Forestry

Posted on:2006-10-08Degree:MasterType:Thesis
Country:ChinaCandidate:H CaoFull Text:PDF
GTID:2133360152488416Subject:Forest managers
Abstract/Summary:PDF Full Text Request
Search engines are the most important information query tools from the World Wide Web and the key to the internet research and utilization. Following the WWW information's blasted and multivariant growing up, Focused Search Engines are becoming researching focus. This dissertation focused on the topic specific Chinese web information accessing and its screening technology, designed and accomplished a Forestry Information Search (FIS) engine which kernel was the topic specific information gathering FRobot. We firstly introduced history and present condition of nowadays general purpose comprehensive search engines. Secondly, we analyzed their classification, working mechanism and defects. And what's more, Focused Search Engines were surveyed. Based on stressed investigating of information query models, focused retrieval strategies, fish algorithm, method of weighted index and retrieval technologies, a Focused Search Engine designing outline was suggested. According to the outline, we accomplished Forestry Information Search (FIS) engine, which was integrated html text analysis, homepage interrelate, content detection, database index, Vector Space Model (VSM), and improved fish algorithm. FIS engine is the specific forestry web information system, it has higher accuracy rate than general search engines, and quickly offer complete forest information. At last, much experience of FIS researching and developing was summed up and the system foreground was indicated.
Keywords/Search Tags:search engine, focused crawling, Vector Space Model, Focused Search engine, forestry
PDF Full Text Request
Related items