Design And Research Of Network Spider

Posted on:2014-07-20

Degree:Master

Type:Thesis

Country:China

Candidate:M L Zhao

Full Text:PDF

GTID:2268330401964607

Subject:Software engineering

Abstract/Summary:

PDF Full Text Request

Network resources are very rich, but how to effectively search for information is adifficult thing. Create a search engine is the best way to solve this problem.Multi-threaded web crawler program is the first algorithm in accordance with thewidth from the specified Web page to parse, search, and to crawl each URL to search,save and new entrance on the Internet constantly crawling the URL is automaticallyRunthe daemon.Web crawler application socket socket, regular expression, the HTTP protocol,windows network programming and other related technology, the Web crawler is a runin the background to the configuration file as the initial URL, down to crawl to thebreadth-first algorithm, save target URL of the network program in C++language asthe implementation language, and in VC6.0debugging by ordinary users be able toperform web search task.This thesis first details the system architecture of the Internet-based search engines,and then provides details on how to design and implement search engine search engine-Web crawler. Of the subject completed the following work:1. Complete analysis of Web crawler SPIDER architecture;2. To complete the design of the main function module;3. My SQL database;4. URL parsing queue management;5. To achieve the design of the individual function module;6. Carried out the testing of the system of network reptiles.In addition, in the Design and Implementation of Web crawler chapters in addition tothe detailed elaboration of the technical core combined with the realization of themulti-threaded web crawler code to illustrate, and easy to understand.

Keywords/Search Tags:

SPIDER, Breadth First Search, multi-threads, Internet searchengine, Network spider, URL Captures

PDF Full Text Request

Related items

1	Researc And Implementatio Of The Web Spider Of The Subject-Oriented Search Engine
2	Based On The Design And Implementation Of The Theme Of The Breadth-first Crawler
3	The Research And Implementation On The Spider Of The Vertical Search Engines Based On The Reinforcement Learning
4	Web Spider Design And Realization Of Intelligent Search Engine
5	Design And Implementation Of A Spider For Topic-Specific Search Engine
6	Research And Achievement Of The Search Strategic For The Topic Search Engine Spider
7	Research And Implementation For Web Spider Based On Web Data Mining
8	Theme Spider And Achieve
9	Improvement And Realization Of Vertical Search Arithmetics In Web Spider
10	Based On Web Spider Search Strategy To Consolidate Learning