Font Size: a A A

Research Of The Ill Text Information Retrieval And Monitoring System In Uighur Web

Posted on:2007-10-06Degree:MasterType:Thesis
Country:ChinaCandidate:L Z ChenFull Text:PDF
GTID:2178360185966263Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
As the World Wide Web continues to grow at an exponential rate, there is a large amount of Web contents that are inappropriate to access, such as erotic, violent information. Because of the shortage of effective surveillance, a great deal of ill information, which should originally be banned, overruns the Internet. Controlling such information prevalent is one of the most important research areas of network security.The paper mainly aiming at Uighur web information, by the research of the Uighur characteristic and Web information idea, discuss the ill text information retrieval and monitoring system in Uighur web.Firstly, the paper introduces the main theoretics and technologies of the Web information retrieval. Then it applies the Spider to realize the information gathering. According to characteristic of Uighur language, using Uighur stemming based on table searching regular and arithmetic of the combined mode, Uighur text segmentation is realized; using Vector Space Model, the paper switches Uighur text information into structured data; And appling clustering analytical method, these structured text is clustered. At last, it makes the clustering result applied in the ill text information retrieval and monitoring in the Uighur network to solve the problems that there are ill information exist in the network.The paper is a successful exploration and research. It gives a solid foundation to the further improve and study of the ill text information retrieval and monitoring in the Uighur network. And it has high quality application value.
Keywords/Search Tags:information retrieval, clustering, monitoring, information gathering
PDF Full Text Request
Related items