Research On IPTV QOS Log Analysis Method

Posted on:2014-08-08

Degree:Master

Type:Thesis

Country:China

Candidate:M Y Li

Full Text:PDF

GTID:2308330464457981

Subject:Computer technology

Abstract/Summary:

PDF Full Text Request

In recent years, big data is one of the most popular IT technology, the amount of data have now reached the new heights in every field. Data is first reached saturation in IT industry. Then IT industry becomes the most advanced in the field of big data technology, the major Internet companies and database vendors have developed or launched their own big data product in order to solve the daily operation problems that they encountered.This paper describes the IPTVQos log analysis project. We need to find a suitable solution to fulfill the demand of data mining and proposed clustering problem of the customers. To find the right solution, the paper first describes the background of the project. Then it introduced early progress of the project and the coping measures. Then it point out the current projectâ€™s features expected:1, abnormal data removing; 2, find the similarity of abnormal records in massive data. In response to questions, the paper lists a number of big data solutions. Then it compare them, and ultimately choose Hadoop as the big data platform to complete the data analysis, the paper then describes the various components of Hadoop framework characteristics of each part, the use of methods and usage sitiations.In the next section, the paper lists the technical difficulties encountered during the development process.1, scan the large amount of data.2, the calculation of the big data cost too much time.3, dependencies exists between attributes.4, lack of the attribute to be sort.5, the intelligence analysis of ranking results.6, remove the abnormal data. To analyze the various technical difficulties, auther proposed a series of solutions and analyze the technical feasibility of the solution as well as time and space complexities. Then, based on the theory proposed the auther proves it by experiment. In the final section, the paper lists the key code used and explained it.

Keywords/Search Tags:

big data, distribute, Hadoop, MapReduce, IPTVQos, clustering, K-means, data mining, permutations and combinations

PDF Full Text Request

Related items

1	Research And Application Of Hadoop Distributed Clustering Mining Method Based On Virtual Machine
2	Research On Algorithm Of Data Mining Based On Hadoop
3	The Research Of Clustering Mining Based On Logistics History Data On The Hadoop
4	Research On Spatial Data Mining Based On Hadoop
5	The Research And Application Of Security Log Clustering Mining Algorithm Based On Hadoop Platform
6	Accelerating Clustering Algorithm On The Cuda Graphics Processor
7	Research And Implementation Of Data Mining Algorithms Based On Cloud Platform
8	Research On Parallel Clustering Algorithm For Large - Scale Data Set
9	Research Of Clustering Mining Algorithm Oriented Big Data
10	Research On Clustering Algorithms In Data Mining