| With the continuous development of network technology, and especially the rapid spread of the mobile Internet, the network has become an increasingly important part of people’s lives. People can obtain information from the Internet at any time and spread information across it. Users can easily express their feelings and ideas online, and the network public opinion formed by the convergence of this online speech has a growing influence on public opinion as a whole. In recent years, Internet public opinion emergencies have occurred frequently in China, and the speed and scale of their outbreaks are gradually increasing. Guiding network public opinion by releasing information manually has become less and less workable, because it cannot cope with the large number of Internet users. The government therefore needs a highly efficient and reliable method for dealing with modern network public opinion problems. In view of this situation, this thesis approaches the problem from the perspective of Internet information dissemination technology. It designs the architecture of a network public opinion guidance platform based on the B/S (browser/server) model, presents the system requirements analysis and design, and completes a system prototype that combines network information collection technology with automatic Web information release technology. 
Finally, the prototype realizes the main function of the system: effectively guiding public opinion on major news sites, microblogs, post bars, and BBS boards. The main contents of this thesis are as follows. First, we carry out the requirements analysis and general design of the network public opinion guidance platform, analyzing the foreground and background functions according to the layers of the B/S structure model. Second, we design the hardware and software architecture of the system and introduce its three main subsystems: the corpus management system, the information release system, and the task evaluation management system. Third, we implement the prototype system and build the network public opinion guidance platform in a Linux environment. Fourth, we study the key technologies of the system. For corpus acquisition, we analyze how to build a distributed Web crawler system, study the Scrapy crawler framework, design and implement a crawler scheduler on top of Scrapy, use a Bloom filter to remove duplicate URLs, and use regular expressions and the XPath path language to build a web page template library that extracts the corpus automatically, which together yield the distributed web crawling system. For Web information release, we primarily call the API interfaces of open platforms, supplemented by programs that simulate a browser’s HTTP communication behavior or drive a browser engine, and we use synonym replacement to paraphrase the corpus during release. Finally, we test the system in detail, including functional tests that simulate user behavior and performance tests of crawler-based corpus acquisition and of the success rate of Web information release. This thesis describes how we completed the development of the network public opinion guidance platform through the research above. Testing shows that the system achieves the project goals in both function and performance. 
This thesis can provide some reference value for researchers working on network public opinion guidance. |
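
The abstract mentions using a Bloom filter to remove duplicate URLs in the crawler. The following is a minimal sketch of that idea, not the implementation from the thesis. The class `BloomFilter`, the helper `dedupe_urls`, the bit-array size, and the number of hash functions are all illustrative assumptions, and the code uses only the Python standard library rather than Scrapy itself.

```python
import hashlib


class BloomFilter:
    """Fixed-size Bloom filter over strings (hypothetical, for illustration only)."""

    def __init__(self, num_bits: int = 1 << 20, num_hashes: int = 5) -> None:
        # Assumed parameters; a real crawler would size these from the
        # expected URL count and the tolerated false-positive rate.
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, item: str):
        # Double hashing: derive all bit positions from two 64-bit
        # slices of a single SHA-256 digest.
        digest = hashlib.sha256(item.encode("utf-8")).digest()
        h1 = int.from_bytes(digest[:8], "big")
        h2 = int.from_bytes(digest[8:16], "big") | 1
        for i in range(self.num_hashes):
            yield (h1 + i * h2) % self.num_bits

    def add(self, item: str) -> None:
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, item: str) -> bool:
        # True may be a false positive; False is always correct.
        return all(
            self.bits[pos // 8] & (1 << (pos % 8))
            for pos in self._positions(item)
        )


def dedupe_urls(urls, seen=None):
    """Return the URLs not yet recorded in the filter, in their original order."""
    seen = seen if seen is not None else BloomFilter()
    fresh = []
    for url in urls:
        if url not in seen:
            seen.add(url)
            fresh.append(url)
    return fresh
```

A Bloom filter never reports a previously added URL as new, but it can occasionally report a new URL as already seen, so a crawler that uses one trades a small chance of skipping a page for a fixed, compact memory footprint.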