A Distributed Parallel Computation For Weighted Tensor Approximation

Posted on:2019-07-21

Degree:Master

Type:Thesis

Country:China

Candidate:Q Ding

Full Text:PDF

GTID:2370330593951081

Subject:Software engineering

Abstract/Summary:

This thesis implements weighted tensor approximation (WTA) on Apache Spark with the goal of reducing processing time. Unlike traditional tensor approximation (TA), WTA takes the validity of the input data into account during compression. By assigning different weights to valid and invalid data, it eliminates the influence of invalid data and yields better approximation results. However, because the raw data is large, running the WTA compression process on a single node takes a long time and places heavy demands on hardware. We therefore investigate how to parallelize the compression process to shorten processing time. We implement WTA on Spark, which often outperforms other distributed computing platforms such as Apache Hadoop. WTA is made feasible on Spark by transforming the original multilinear problem into an ordinary linear one. The input tensor is also partitioned into small blocks to further reduce compression time, and we establish a set of fast operations suited to distributed computing on Spark. The experimental results show that WTA achieves better rendering quality than TA, and that distributed computation across multiple Spark nodes is much faster than computation on a single node.
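
The abstract does not say how the weights enter the fit or how the multilinear problem is made linear, so the following is only an illustrative sketch under stated assumptions, not the thesis's method. It assumes binary weights (1 for valid, 0 for invalid data), a rank-1 matrix in place of a tensor, and alternating least squares as the linearization: with one factor held fixed, each entry of the other factor is an independent weighted linear least-squares problem, which is the kind of per-row work that could be distributed. The function name `weighted_rank1` is hypothetical.

```python
def weighted_rank1(x, w, iters=50):
    """Fit x[i][j] ~ u[i] * v[j], minimizing sum w[i][j] * (x[i][j] - u[i]*v[j])**2.

    Illustrative assumption: alternating least squares on a matrix,
    not the algorithm used in the thesis.
    """
    rows, cols = len(x), len(x[0])
    u = [1.0] * rows
    v = [1.0] * cols
    for _ in range(iters):
        # With v fixed, each u[i] solves its own 1-D weighted least-squares problem.
        for i in range(rows):
            num = sum(w[i][j] * x[i][j] * v[j] for j in range(cols))
            den = sum(w[i][j] * v[j] ** 2 for j in range(cols))
            u[i] = num / den if den > 0 else 0.0
        # With u fixed, each v[j] is likewise an independent linear problem.
        for j in range(cols):
            num = sum(w[i][j] * x[i][j] * u[i] for i in range(rows))
            den = sum(w[i][j] * u[i] ** 2 for i in range(rows))
            v[j] = num / den if den > 0 else 0.0
    return u, v


# Exactly rank-1 data a[i] * b[j], with one corrupted (invalid) entry.
a = [1.0, 2.0, 3.0]
b = [1.0, 0.5, 2.0]
x = [[ai * bj for bj in b] for ai in a]
x[1][1] = 99.0                      # invalid sample
w = [[1.0] * 3 for _ in range(3)]
w[1][1] = 0.0                       # zero weight removes its influence

u, v = weighted_rank1(x, w)
approx = [[u[i] * v[j] for j in range(3)] for i in range(3)]
```

Because the invalid entry carries zero weight, the fit is determined by the valid entries alone, and the approximation at the corrupted position follows the underlying rank-1 structure instead of the value 99.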

Keywords/Search Tags:

Multilinear Models, Parallel Computing, Weighted Tensor Approximation, Apache Spark

Related items

1	MODIS SST Fast Retrieval Method Based On Apache Spark
2	A Research On Distributed Logistics Optimization Algorithm Based On Spark
3	Comparative Analysis And Visualization Of Scalable Gene Sequences Based On Apache Spark
4	Design And Implementation Of A Spark Autotuning System
5	Parallel Computing Of Spark-based Geospatial Analysis Algorithms
6	Research And Implementation Of Seismic Big Data Parallel Processing System Based On Spark
7	A Tensor Based Big Data Efficient Computation And Multimodal Analysis Approach
8	Weighted Norm Inequalities For Some Multilinear Operators
9	The Design And Implementation Of Community Detect Algorithm Based On Spark For Large Scale And Complex Network
10	The Boundedness Of Multilinear Singular Integrals On Weighted Lorentz Spaces