Font Size: a A A

Based Heritrix And Lucene To Design And Implementation The Domestic Air Tickets Price Comparison System

Posted on:2017-07-07Degree:MasterType:Thesis
Country:ChinaCandidate:X Y LiFull Text:PDF
GTID:2392330590968412Subject:Software engineering
Abstract/Summary:PDF Full Text Request
With the high-speed development of information industry,the Internet has gradually become the important carrier of many industries.The current e-commerce emerge in endlessly,from all walks of life value system is a very important application in the electronic commerce development,can facilitate users to compare to the product price,also can provide each big electricity pricing reference.Compare system gradually become major e-commerce sites to improve its competitiveness and increase the user viscosity.In the field of aviation,electronic ticket has been very widely used,online booking system has made great people travel.But is also very competitive in the field of ticket sales,the same flight ticket prices in different sales site difference is very big also,trouble brings to the user to buy the ticket price,also brings to the ticket sales site pricing chaos.Aiming at this situation,based on open source Heritrix and Lucene project,design and implement a system based on the domestic air tickets.To crawl the Internet ticket prices data,this system after treatment and comparison,to a certain route or the flight prices are sorted,and whether the lower than average price and price whether the right judgment,at the same time,will give the user the ticket jump links.Paper's main work includes the following several aspects:1.The demand analysis of the system.For this system,analyses the system target,function and performance requirements,etc.The paper has been clear about the composition of system and requirements.2.The key technology to comparison system are analyzed in detail.Discussed the topic crawler,Chinese word segmentation,and Hertrix and use of the Lucene.Combined with the theme of the ticket price analyzed,the technology of how to use these to solve subject correlation calculation,page parsing DOM and key technical problems such as filling.3.This thesis analyzes the overall design of the system,the composition of each module and the database of the system,and analyzes and discusses the key table of the database.This thesis also discusses the information of flight ticket information,data correlation calculation,through the page processing to achieve the target data extraction and storage.To deal with the information of the ticket price data to obtain the lowest fare and realize the price chart,and finally the users can inquiry information this through the terminals.4.Testing and validation.This system function and performance were tested and proven,the testing results show that using Hertrix and Lucene technology implements the ticket price comparison.This thesis explores the design and implementation of the Hertrix and Lucene technology in the ticket price comparison system,and has been applied in a certain range.
Keywords/Search Tags:Compare system, Chinese word segmentation, Ticket prices, Hertrix, Lucene
PDF Full Text Request
Related items