Research On Scheduling Strategy And Parallel Load Flow Solution In Spark

Posted on:2020-07-12

Degree:Master

Type:Thesis

Country:China

Candidate:Y Sun

Full Text:PDF

GTID:2392330599451237

Subject:Power system and its automation

Abstract/Summary:

PDF Full Text Request

Cloud computing is a key issue of power system research.The parallel computing framework for data transmission using Spark clusters in power flow analysis has becoming a hot topic.Compared to traditional serial systems,Spark Cloud can reduce the convergence time of operations.For data-intensive tasks,it can increase the running speed by dozens of times.While the parallel computing of power system is developing rapidly,how to improve the performance and memory utilization of Spark scheduling system has becoming an urgent problem to be solved.This thesis is based on the cloud computing engine Spark.Firstly,it studies the behavior of RDD,optimizes the top-level scheduling mechanism such as task filtering and partition caching.Secondly,it aims at the comprehensive performance and the underlying computing resources of the system.Finally,a parallel power flow algorithm for distributed computing is proposed.The parallel update of the Jacobian array and the parallel iteration of the modified equation are implemented in Spark cloud.(1)By analyzing the source code and introducing two characteristic parameters of the Spark operation stream,the dynamic priority screening of the task is realized;Based on the analysis and optimization of the task structure,combined with the distributed characteristics of the RDD,improving the operational efficiency of the task under limited resources.(2)By analyzing the communication mechanism between RDD nodes,the hierarchical scheduling strategy of Spark computing flow is established to achieve high performance computing,cost reduction and load balancing.A multi-objective optimization algorithm considering preference regions is proposed.Simulation tests shows that the overall energy efficiency of the algorithm is better than traditional algorithms such.(3)By Including: correction of the correction amount related to the sparse matrix,and Distributed multiplication of high dimensional matrices.Finally,a parallel computing cluster consisting of Spark and Hadoop is implemented.The feasibility and effectiveness of the algorithm are verified in IEEE synthesis system.

Keywords/Search Tags:

Cloud Computing on Spark, RDD Cache, Multi-Objective Scheduling, Power Flow Parallel Calculation

PDF Full Text Request

Related items

1	Parallel Implementation Of Power System Power Flow Algorithm Based On Spark Platform
2	Study On The Power Flow In Power Systems Based On Cloud Computing
3	Research On Cloud Computing Task Scheduling For Remote Sensing Big Data Applications
4	Research Of Multicore Runtime Scheduling System For Parallel Power Calculation
5	Distribution Power Flow Calculation Based On Cloud Computing Technology
6	Parallel Damage Identification Of Frame Structure Based On Cloud Computing
7	Research On Key Issues Of Cloud Computing For Performance Analysis Of Electrical Equipment
8	Research On Parallel Computing Of Electric Power System Based On Grid
9	Research Of Short-term Photovoltaic Power Forecasting Based On Cloud Computing And Machine Learning
10	Research On Storage Optimization And Parallel Processing Of Power Equipment Monitoring Big Data Based On Cloud Platform