Load Modeling Accurate Data Acquisition Based On Cloud Computing

Posted on:2018-05-20

Degree:Master

Type:Thesis

Country:China

Candidate:D P Wu

Full Text:PDF

GTID:2322330512491186

Subject:Power system and its automation

Abstract/Summary:

PDF Full Text Request

With the development of economy,increasing demand for electricity and scale-up of network,it meets high requirements for security and stability of power supply.In order to improve operation stability,essentiality for accurate load model has drawn much attention.High-accuracy load data collected by field devices is essential for load modeling.However,sampling rate of disturbance events,the number of monitoring devices,has raise up a question that much data need to be processed.Currently,electric monitoring data generally relies on centralized computing platform located in the dispatch center whose computing forces and storage capability is limited.The data management based on traditional relational database cannot meet requirements or need to pay high cost.How to process massive monitoring data reliably and quickly has become a significance issue in analysis of load modeling.This paper focus on Hadoop ecosystem,which is relied on computing fabric MapReduce and storage structure HDFS.The core of platform is Hive database and HBase is also implemented to realize fast query.Design and implementation of massive power load data cloud computing platform based on Hadoop has been carried out.In data storage,this paper analyze COMTRADE file format of different monitoring devices,design structure of Hive table to process data from different sources.HBase is a NoSQL database and has the characteristics of fault tolerance,scalability and reliability.This paper design HBase table to store COMTRADE files.The designs take advantage of database’s different properties and improve the integration degree of platform.In query optimization,Hive query language(HQL)will be converted to MapReduce job whose process time is long.The Hive tables are mapped to HBase that can take advantage of Hive and HBase respectively.The Hive is designed to store sampling data,which is stored in RDBMS commonly and NoSQL data is stored in HBase.The design reduces time delay and increases efficiency.In power load computation and data conversion,a ETL tool,Sqoop,is employed that can turn ETL into MapReduce job and it eases programming work.A parallel DFT algorithm is employed to reduce computing load in electric parameters extraction.In the experimental section,comparison between Hive and MySQL in query,comparison between Hive and HBase in database loading,comparison between API and MapReduce in database loading are studied.The experiments show that superiority of power load processing platform based on Hadoop is proved.

Keywords/Search Tags:

load modeling, massive data, Hadoop, Hive, HBase

PDF Full Text Request

Related items

1	Distributed Storage And Analysis Of Massive Urban Traffic Flow Data Based On Hadoop
2	Storage And Processing Of Massive Data In The Power Distribution Network Based On Hadoop
3	Researsh On Distributed Compression Storage And Fault Tolerance Of Monitoring Information Flow Of Power Supply System Of Electrified Railway Based On Hadoop
4	Key Technology Of QAR Data Organization And Analysis Based On Hadoop
5	Research Of Massive Data Distributed Storage And Processing System In Internet Of Vehicles
6	Research On Traffic Big Data Query Optimization Based On HBase
7	Massive Vehicle Point Cloud Management Based On Distributed File System
8	Research On Construction And Application Of Pavement Image Database Based On HBase
9	Real Time Data Analysis System Of ISCS Based On Hadoop
10	Design And Implementation Of Traffic Information Paraller Processing Cloud Platform Based On Hadoop