Font Size: a A A

Design And Implementation Of Multi Information Web System Of Automotive Industry Based On Focused Crawler

Posted on:2016-06-23Degree:MasterType:Thesis
Country:ChinaCandidate:H C MaFull Text:PDF
GTID:2308330461472115Subject:Computer application technology
Abstract/Summary:PDF Full Text Request
With the high speed development of automobile industry, the industry competition is increasingly fierce. The intensifying mergers of domestic auto enterprises, the access in succession of large foreign multinational company, the rising price of raw materials, the continuous changing of consumer demand, the sharp decrease in profit of overall automobile industry, all those factors forced car companies to have a more comprehensive understanding of all related information of the market, like as:the dynamic of automobile industry, the policies and regulations of dynamic industry, the price of components, etc. Lots of the information exists in the Internet, but those mainstream and traditional search engines always return inaccurate/incomplete/out of date information to those professionals when they retrieve related information in their professional field. Meanwhile, the mainstream automotive industry related websites mainly serves the consumer of automotive product. So it is not so convenience for the business users to get relevant information through the Internet.Aiming at the existing flaws of general search engine, the focused crawler technology emerges in response to the needs of times. It can automatically collect all web pages that relevant about the theme from the Internet and avoid irrelevant pages. Therefore, using focused crawler to build a web system of automotive industry for multi information collection can solve the inconvenience for the automobile enterprise users to retrieve information and achieve the aim of better understanding of the market. Furthermore, it can assist the automobile enterprises to generate marketing strategy and improve the market competitiveness.The goal of this thesis is to establish a web system of automotive industry for multi information collection and use it to provide the news of auto industry, industry policies/regulations and the price of component the automobile enterprise users. This thesis can be divided into the following parts:First of all, under the background of increasing fierce competition of automobile industry and exist flaws of information collection ability of the general search engine, this thesis would like to draw forth the topic of using the focused crawler technology to collect information of automobile industry. On the basis of it, the thesis will separately discuss the real needs for the automotive industry news, industry policies and regulations, price of automobile parts.Secondly, on the basis of demands analysis, the thesis will design an overall solution by combing the focused crawler technology.Thirdly, for the focused crawler, the thesis will do the research on its structure and working principle. And then do intensive study on realization of each module, including the web content analysis and segmentation technology. Do research on the current mainstream subject benchmark model and combine the actual demand, then selected the vector space model as the theme of this benchmark model. Do research on the current mainstream keyword weighting method TF and TF-IDF. Design the search strategy of this thesis base on researching of the current mainstream search strategy.Finally, under all above researches, completed the development of the multi information system for automobile industry based on theme web crawler.
Keywords/Search Tags:focused crawler, vector space model, search strategy, automobile industry information
PDF Full Text Request
Related items