MapReduce-based Resource Scheduling Model And Algorithm Research In Cloud Environment

Posted on:2013-02-28

Degree:Master

Type:Thesis

Country:China

Candidate:T Tao

Full Text:PDF

GTID:2248330371970880

Subject:Computer technology

Abstract/Summary:

As an emerging cloud computing model, MapReduce has been widely used in large-scale data-intensive applications such as web crawling, scientific computing, and data mining. The MapReduce library gives developers a hardware-transparent development environment that is easy to store data in and use, and it hides the low-level operational difficulties of earlier parallel computing architectures. MapReduce-based systems offer independent storage, high scalability, and fault-tolerance mechanisms. Despite these advantages, the mechanism is not yet mature, and resource scheduling has remained one of the biggest limitations on its execution efficiency.

In this thesis, Hadoop, the well-known open-source implementation of MapReduce, is used with heterogeneous devices and applications to summarize the shortcomings of the resource scheduling mechanism in MapReduce and to propose several corresponding improvements. The main contents are as follows.

(1) Building on MapReduce's original resource scheduling approach for homogeneous environments, a Dynamic Proportional Resource Scheduling (DPRS) algorithm is proposed. It dynamically monitors the load status of nodes, allocates task resources accordingly, and mitigates the load imbalance of the original mechanism in heterogeneous environments.

(2) To keep data execution on local machines, a Local Computing Power Optimization (LCPO) model is proposed. It eliminates the original backup overhead, reduces network traffic, and improves the efficiency of Map tasks in heterogeneous environments.

(3) To improve the backup execution efficiency of straggling Reduce tasks and to resolve the misjudgment of straggler nodes in heterogeneous environments, the Fast Long Task Backup (FLTB) algorithm is proposed.

(4) To address the uneven distribution of input data among Map tasks, a heuristic data partition treatment is employed to reduce data imbalance in heterogeneous environments.
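
The abstract does not give the details of DPRS, so the following is only an illustrative sketch of the general idea of load-proportional allocation, not the algorithm from the thesis. It assumes each node reports a nominal capacity and a current load fraction, and it splits a batch of tasks in proportion to spare capacity using largest-remainder rounding. The function name `proportional_allocation` and its parameters are hypothetical.

```python
from typing import Dict


def proportional_allocation(
    capacity: Dict[str, float],
    load: Dict[str, float],
    total_tasks: int,
) -> Dict[str, int]:
    """Split total_tasks across nodes in proportion to spare capacity.

    capacity: nominal computing power per node (hypothetical unit).
    load:     current load per node as a fraction in [0, 1].
    This is an assumed scheme, not the DPRS algorithm itself.
    """
    # Spare capacity is what remains after the current load is subtracted.
    spare = {n: max(capacity[n] * (1.0 - load[n]), 0.0) for n in capacity}
    total_spare = sum(spare.values())
    if total_spare == 0:
        raise ValueError("no node has spare capacity")

    # Ideal (fractional) share of tasks for each node.
    exact = {n: total_tasks * s / total_spare for n, s in spare.items()}

    # Give every node the integer part of its share first.
    alloc = {n: int(share) for n, share in exact.items()}
    leftover = total_tasks - sum(alloc.values())

    # Hand the remaining tasks to the nodes with the largest fractional parts.
    by_fraction = sorted(exact, key=lambda n: exact[n] - alloc[n], reverse=True)
    for n in by_fraction[:leftover]:
        alloc[n] += 1
    return alloc


if __name__ == "__main__":
    capacity = {"a": 4.0, "b": 2.0, "c": 2.0}
    load = {"a": 0.0, "b": 0.0, "c": 0.5}
    print(proportional_allocation(capacity, load, 10))
```

In a heterogeneous cluster, a scheme of this kind would be re-run whenever the monitored load changes, so that faster or idler nodes receive more tasks than slower or busier ones.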

Keywords/Search Tags:

MapReduce, Dynamic Proportional Resource Scheduling (DPRS), Local Computing Power Optimization (LCPO), Fast Long Task Backup (FLTB), heuristic data partition treatment

Related items

1	Research On Task Scheduling Algorithm Based On Task Backup For Cloud Computing
2	Research On The Resource Deployment And Task Scheduling In Cloud Computing
3	Research Of Task Partition And Resource Allocation Algorithms For Load Balance In Spark Computing Environment
4	Research On Efficient Task Partition And Scheduling In MapReduce Data Processing System
5	Researches On Optimization Of Resource Allocation For MapReduce Scheduling
6	Research On MapReduce Performance Optimization Based On Hadoop
7	Research On Energy Optimization For Multiprocessor SoC With Task Scheduling And Cache Partitioning
8	The Research On High Performance Task Scheduling Technology Based On Mapreduce In Cloud Computing
9	Research On Performance Optimization Of MapReduce Model
10	MapReduce Job Oriented Collaborative Optimization On Cloud Data Center Network Resource