| With the development of network technology, the Internet has become one necessary source from which we obtain knowledge and information. The scale of net development is growing, the quantity of information we can get from Internet is elevating. However, there is a shortage of uniform management of it all, the proliferation of any kind and unrestricted information, the steady rise of criminal and pornographic use of it. Although we already have start against the nasty and porn website in our country, we also need technology filtrate the infective information such as information of erotic and violence as to guarantee a safe and healthy net environment.Currently, we can not forbid effectively the sensitive information to transmit if simply use one filter technique, because of the various types of the sensitive information and the forms of communication. So we take it as the background of the paper, and do research on the key technique based on text matching and image recognition. We combined the two techniques to establish the text-matching and image-recognition module, and design the sensitive information monitoring system.According to the large and covert characteristic of the current network text, in the sensitive text monitoring, we combined the WM algorithm and fuzzy matching to identify the text in which contain sensitive information. First of all, we pretreated the text aimed at the three hidden forms of text communication, and, it also converted fuzzy matching to another form of precise matching. Then, we searched the sensitive key words which appeared in the text used WM algorithm. If there were certain key words appeared in web, we used fuzzy matching and set a value of similarity in order to discriminate sensitive text information.According to the characteristic of more exposed skin, in the sensitive image monitoring, we use skin-color detection model and texture discrimination model to select skin area and build binary image. Complexion is the most direct and enrich information in images and there are many color space can be used in skin-color detection algorithm. We used the combination of YUV and YIQ color space to do skin-color detection. But the erotic images may be detected in error by the skin color model, because of the color's similarities. By comparison and analysis, we used the gray stat method to establish texture discrimination model, and judge whether a point and around part are smooth as skin area. Finally, we extract three characters as eigenvector to train SVM classifier and discriminate sensitive images.The sensitive information monitoring system is an intelligent agent-based system, it used the combination of text matching and image recognition to determine whether the web contain sensitive content, and then record and deal with it as feedback of information. Before the image recognition, the monitoring prototype will do text matching, which shortened the filter time and made the system more real-time.The innovation of the paper is that, we used the combination of skin-color detection, texture discrimination and SVM classification in the process, and improved the correctness of image recognition; and in application, the monitoring system is integrated with intelligent agent instead of browser, so it can search automatically in a certain rang.The paper is consisted by five chapter: We introduce the background, the significance and the content of this paper in chapter 1; analyze the technology currently about sensitive information monitoring in chapter 2; introduce detailed the technology about sensitive text monitoring and establish the text matching model in chapter 3;introduce detailed the technology about sensitive image monitoring and establish the image recognition model in chapter 4; finally, design and realize the prototype of the sensitive information monitoring system in chapter 5. |