Analysis Of E-Commerce Information Based On Crawler And Data Mining

Posted on:2019-06-25

Degree:Master

Type:Thesis

Country:China

Candidate:B Q Lv

Full Text:PDF

GTID:2359330569489326

Subject:Applied statistics

Abstract/Summary:

PDF Full Text Request

With the rapid development of the Internet industry,the leap-like evolution of computer-related technologies and the continuous expansion of industry scale innovation.The impact of the e-commerce platform on people’s lives is self-evident.On the “double eleventh” day of 2017,Taobao’s transaction volume on one-day platforms exceeded RMB 168.2 billion.People’s daily lives can not be separated from e-commerce platform.In the e-commerce platform’s online transactions,consumers can obtain various types of information by browsing the web pages of goods to determine whether to purchase,and these information can bring about better opportunities for marketing.Therefore,This paper will combine with The Crawler Technology and the corresponding Data Mining Methods to collect and inquire into the e-commerce webpage information,expect to build a systematic method from data extraction to data mining,help the e-commerce and consumers to explore demand better,and provide a reference for enriching marketing planning.From view of e-commerce consumer groups,the proportion of users in the 80 s and 90 s is nearly 80%,which is the core user of e-commerce spending.Consumers in this age group must undergo interviews and employment.Therefore,choosing suits as a research object is very representative.This paper through build a web automation crawler framework obtained more than 8,000 e-commerce webpage information on suits from Taobao.And the data mining method which is suitable selected by checking the data information.The final decision was to combination of descriptive analysis,regression analysis,and text clustering analysis to explore data values,thus providing high-quality data mining technology support for the rapidly evolving e-commerce industry.The main work of this article is divided into six parts:The first part: Introduction,mainly elaborates the research background,research content and structure arrangement.The second part: Introduces the method of data extraction and storage.It briefly describes the tools and its advantages and disadvantages,and lays a foundation for thedata crawling.The third part :The processing method of webpage information,which respectively describes the multivariate regression analysis model about the sales volume of goods and the “unsupervised learning” text clustering method used for webpage information.The fourth part: The development of K-Means algorithm,from the data preprocessing,VSM algorithm,IDFTF-algorithm and the evaluation of K-Means algorithm and so on,recorded the algorithm implementation process in detail.The fifth part: The empirical part,from the environmental configuration,data acquisition and storage,descriptive analysis,regression analysis,text clustering to build a set of systematic data exploration methods to maximize the exploration of data values to achieve research purposes.The sixth part:Summary and Outlook.It provides reference suggestions and support for data mining and market development of e-commerce website information.

Keywords/Search Tags:

E-commerce, Crawler, Data Mining, Text Clustering Algorithm, The Multivariate Regression Analysis Model, K-Means Algorithm

PDF Full Text Request

Related items

1	Netcom Customers Based On Data Mining Research And Application Of Market Segmentation
2	Applied Research On Marketing Data Of X-large Supermarkets Based On Data Mining
3	Clustering And Classification Of Data And Text Using Such Technologies As Genetic Algorithm
4	Svm And K-means Algorithm In Ihrs In Research And Application
5	Application Of Improved K-means Algorithm In E-commerce Recommendation System
6	Research On Communication Behavior Of Telecom Users Based On Large Data Platform
7	Research On The Application Of Mobile Electronic Commerce Information Push Based On BP Algorithm
8	Research On E-commerce Recommendation Algorithm Based On Singular Value Decomposition And K-means Clustering
9	Research On K-means Clustering Markov Chain Method For Forecasting The Trend Of Individual Stock
10	The Empirical Research Of Shanghai Telecom Customer Churn Problem Based On Data Mining