| There are many kinds of resource in World Wide Web at present, and the amount of them is increasing rapidly everyday. It becomes a habit for internet users to search the useful information of their own. The apperance of search engine helps us to realize the searching in a convenient way, so it becomes a more and more important tool to surf the web.However, the results returned by search engine often cann't meet our needs well because the resource of internet is lack of a given criterion and many web pages have bad structure and regulation. The broad fields that web pages refers to also lead search engine to search out many information that has little relativity with our query. What we hope is that the information we need can place in the front of the search result, so we can find out the right information easily.This thesis aims at implementing an efficient ranking algorithm to give users a high precision of getting search results. This paper compares the ranking method based on traditional information retrieval with the algorithm based on link structure of webs and analyze their advantages and disadvantages. On foundation of these, bring forward a ranking algorithm combined PageRank with relativity between anchor text and web pages. Totally speaking, the thesis includes the following parts:(1) According to analyze the web content, distilling all URLs and anchor texts of the page. URL index is established based on these.(2) Inverted index of anchor text is established to solve the problem that some special pages cann't store in full-text index.(3) Web contents are distilled to compute similarity between web contents and anchor texts.(4) Based on link sructure of webs, link database is created to compute the offline PageRank and improved PageRank.(5) Simulating the input query by users and return the results ranked by PageRank. The results are estimated by query precision. The result returned by experiment reflects the importance of ranking method in practice. It can guide users to find out useful information easily and efficiently. The application of PageRank based on link structure makes authority pages place in front place of ranking results, and the improvement of the algorithm can get better query precision. |