Research On Optimal Scheduling Of Regional Integrated Energy System Based On Deep Reinforcement Learnin

Posted on:2024-06-08

Degree:Master

Type:Thesis

Country:China

Candidate:W J Luo

Full Text:PDF

GTID:2532307130472074

Subject:Electrical engineering

Abstract/Summary:

PDF Full Text Request

Regional integrated energy system is one of the important measures to comprehensively promote sustainable development,which aims to achieve efficient application of multiple energy sources by using advanced communication and control technologies.However,in the regional integrated energy system,the instability of clean energy and the coupling of multiple energy sources make the physical model very complex,while the access of various intelligent communication devices will generate a large amount of high-dimensional data,making data processing more difficult,and the commonly used traditional optimization algorithms will be difficult to apply due to the increase in computational cost resulting in too slow solution rates.Deep reinforcement learning is a more popular research area in machine learning,which has an efficient ability to solve problems with data and complex models.Therefore,in order to improve clean energy utilization and reduce carbon emissions,and to achieve more generalizability,robustness and efficiency of regional integrated energy system optimal scheduling,an optimal scheduling method of regional integrated energy system based on advantage learning soft actor-critic(SAC)algorithm and transfer learning is proposed.The study was conducted on:1)This paper investigates the theory of reinforcement learning and deep reinforcement learning,outlines the basic elements of deep reinforcement learning model building,and the learning principles of different types of deep reinforcement learning algorithms.2)The operation mechanism and related characteristics of the internal units of the regional integrated energy system are analyzed,and the corresponding mathematical model is constructed,as well as the objective function containing the carbon emission and profit of the regional integrated energy system.3)To establish a model of the state and action space and reward function of the regional integrated energy system based on deep reinforcement learning of policy gradient,and to demonstrate by arithmetic examples that among several deep reinforcement learning algorithms based on policy gradient,the maximum entropy mechanism of Soft Actor-critic makes the optimal scheduling of the regional integrated energy system more robust.4)The SAC algorithm has a complex neural network structure and iterative mechanism,so it will take more time in training.The advantage learning theory is adopted to build a regional integrated energy system model based on the advantage learning SAC algorithm to speed up the convergence speed when the algorithm is trained.The effectiveness of the algorithm to incorporate advantage learning is demonstrated by comparing the performance of several policy gradient-based deep reinforcement learning algorithms,traditional particle swarm algorithms,and mixed integer programming in specific arithmetic cases.5)To improve the learning efficiency of the intelligences and the generalization ability to cope with new scenarios,transfer learning is introduced in this paper.The algorithms show that the optimal scheduling strategy based on advantage learning soft actor-critic algorithm and transfer learning has better robustness,generalization ability and efficient learning efficiency,and can realize flexible and efficient scheduling of regional integrated energy systems.

Keywords/Search Tags:

Regional integrated energy system, Deep reinforcement learning, Soft actor-critic, Transfer learning, Advantage learning

PDF Full Text Request

Related items

1	Research On Bidding Strategy Of Generators In Electricity Market Based On Asynchronous Advantage Actor-Critic Reinforcement Learning
2	Asynchronous Generalized Advantage Actor-critic And Application In Automatic Driving
3	Base On Deep Reinforcement Learning For Traffic Signals Timing
4	Research On Energy Management Strategy Of Fuel Cell Hybrid Vehicle Based On Improved Soft Actor-Critic Algorithm
5	Research On Automatic Power Generation Control Based On Multi-agent Transfer Reinforcement Learnin
6	Energy-efficient Control Methods For Metro Trains Based On Reinforcement Learning Approach
7	Study On Adaptive Pid Control Strategy Based On Actor-critic Learning
8	Research On Flexible Job Shop Scheduling Based On Reinforcement Learning Algorithm
9	Research On Autonomous Navigation Algorithm For Cellular-connected UAV Based On Deep Reinforcement Learning
10	Application Research Of Deep Reinforcement Learning SAC Algorithm In Motion Control