Research On Consumer Purchasing Forecast Based On Data Mining

Posted on:2017-03-23

Degree:Master

Type:Thesis

Country:China

Candidate:S Ma

Full Text:PDF

GTID:2209330485950951

Subject:Applied Statistics

Abstract/Summary:

With the development of e-commerce, online shopping has become one of the major modes of consumption. Compared with offline consumption, online shopping offers consumers low costs and a wide variety of goods, with fewer restrictions on opening hours or shopping locations. However, it is precisely this abundance of information and choice that costs consumers more time and energy to find suitable merchandise. Meanwhile, fierce competition between e-commerce platforms forces merchants to refine their product offerings in order to meet customers' needs better, which also narrows the range of target users. Finding target consumers in the crowd quickly and efficiently, and drawing up more specific marketing programs, is therefore an important task for e-commerce companies facing future competition and development. The large quantity of customer behavior data on e-commerce platforms makes it possible to analyze customers' purchasing intentions and consumption habits, and thus to achieve precise one-to-one product recommendation.

This study uses real data provided by the Tianchi big data platform to predict whether a customer will purchase the products with which they have already interacted. The model is built in four steps.

The first step is data preprocessing: exploring the basic distribution of the data and cleaning it. This step provides a reference and basis for the feature extraction methods and the choice of algorithm.

The second step is sample selection. This step is needed because the sample data are imbalanced: negative samples far outnumber positive samples. The problem is addressed in three ways. First, the number of positive samples is increased with a moving window. Second, the time window of interaction samples before the prediction date is compressed, based on an analysis of the timeliness of interactions, to reduce the imbalance between positive and negative samples. Third, negative samples are randomly selected without replacement, while all positive samples are kept.

The third step is feature engineering. Features of the user, item, item category, and user-item pair are constructed along multiple dimensions. The feature set is then processed and expanded using different methods: second-level features, better suited to the prediction model, are obtained by applying transformations to simple features, and derived features, which better reflect the characteristics of the data and the operational requirements, are obtained by combining single features. Features are the independent variables of the prediction model and determine the upper limit of its predictive performance; that limit can be approached by trying different algorithms and tuning parameters.

The fourth step is model training and prediction. Logistic regression and GBDT (gradient boosting decision tree) are used to build the prediction models. A comparison on the test set shows that GBDT predicts better than logistic regression. To improve prediction further, the output of logistic regression is added as a new feature and the GBDT model is retrained. The resulting prediction performance is higher than that of either single model. The reason given is that GBDT is itself a strong classifier built on regression trees.
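
The sample-selection step described in the abstract can be illustrated with a minimal Python sketch. It is not the thesis's actual code: the function names `build_samples` and `downsample_negatives`, the tuple layout, the window length, and the toy interaction records are all assumptions made for illustration. The sketch slides the label day forward so that each day contributes its own positive samples, then keeps every positive and draws negatives at random without replacement.

```python
import random
from datetime import date, timedelta


def build_samples(interactions, purchases, label_days, window):
    """Build labeled (user, item) samples with a moving window.

    For each label day, every (user, item) pair seen in the `window` days
    before it becomes one sample, labeled 1 if that pair was purchased on
    the label day and 0 otherwise. Using several label days instead of one
    yields more positive samples.
    """
    samples = []
    for day in label_days:
        start = day - timedelta(days=window)
        pairs = {(u, i) for (u, i, d) in interactions if start <= d < day}
        for user, item in sorted(pairs):
            label = 1 if (user, item, day) in purchases else 0
            samples.append(((user, item), day, label))
    return samples


def downsample_negatives(samples, neg_per_pos, seed=0):
    """Keep all positives; draw negatives at random without replacement."""
    positives = [s for s in samples if s[2] == 1]
    negatives = [s for s in samples if s[2] == 0]
    k = min(len(negatives), neg_per_pos * len(positives))
    rng = random.Random(seed)  # fixed seed so the draw is repeatable
    return positives + rng.sample(negatives, k)


# Toy data, invented for this example: (user, item, day of interaction).
interactions = [
    ("u1", "a", date(2014, 12, 1)),
    ("u1", "b", date(2014, 12, 2)),
    ("u2", "a", date(2014, 12, 2)),
    ("u2", "c", date(2014, 12, 3)),
    ("u3", "b", date(2014, 12, 3)),
]
# Purchases, also invented: (user, item, day of purchase).
purchases = {
    ("u1", "a", date(2014, 12, 3)),
    ("u2", "c", date(2014, 12, 4)),
}

label_days = [date(2014, 12, 3), date(2014, 12, 4)]
samples = build_samples(interactions, purchases, label_days, window=2)
balanced = downsample_negatives(samples, neg_per_pos=1)
```

In this toy data a single label day would give only one positive sample, while two label days give two. A shorter `window` keeps fewer stale interaction pairs, most of which would be negatives, which corresponds to the window compression described in the abstract.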

Keywords/Search Tags:

Recommender system, Feature engineering, Logistic regression, GBDT, Fusion model

Related items

1	A Study On Collaborative Filtering Recommender Model Based On Trust
2	Quantitative Investment Model Based On Improved GBDT
3	Research On Multi-factor Stock Selection Based On Regression Method And GBDT
4	Research On Security Problem Of Personalized Collaborative Filtering Recommender Algorithm
5	Local Government Debt Risk Rating And Early Warning Research Based On GBDT
6	Study Of E-Commerce Recommender System Based On Customers' Interest And Collaboration
7	The Study On Feature Engineering Construction And Application Based On User Behavior
8	The Research Of E-Commerce Personalized Recommender Systems
9	Design Of FOF Combination Scheme Based On GBDT Algorithm
10	Research On The Recommender System Based On C2C E-Commercial Environment