Font Size: a A A

The Application Of The Web Classification Based On The Clustering Method

Posted on:2009-11-02Degree:MasterType:Thesis
Country:ChinaCandidate:C W WuFull Text:PDF
GTID:2178360272957289Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
With the fast development of Internet, the function of network is more and more important in people's daily life and work. The network has already become the new carrier of information. At the same time, the scale of Internet is expanding at the explosive pace too. The webpages which include a large number of information are still increasing at the surprising pace. So, how to fast and efficiently abstract the information existing in the webpages, help users to inquire is becoming the urgent problem to be solved.Around how to classify the documents fast and accurately in this article and how to show the results to the users clearly, make the users to inquire efficiently.The research results are used in the clustering of the webpages.Used the clustering algorithm to solve the problems that there are many repeat, useless webpages, and the degree correlated with searching is not very high. The classification of the webpage is a kind of effective method to inquire the Internet's informations. It is a kind of new developing direction of the information inquiry at the same time.Classified the webpages in Internet,can classfy the pages with the contents of the pages, facilitate users' inquiry, improve the efficiency of the inquiry. Because the linear structure time advantage of cluster's algorithm STC (Suffix Tree Clustering), cluster's result is suitable for practical application. Under the actual conditions,this article try to use STC to classify the webpages, regard real environment for use as the prerequisite, improve the execution efficiency in the course, improve the description of the categorised result at the same time, benefit the inquiry of the results, improve actual service efficiency.
Keywords/Search Tags:Clustering of the webpages, Information inquiry, Keywords, Weight, VSM, STC
PDF Full Text Request
Related items