Research On Deep Reinforcement Learning Methods For Solving Flowshop Scheduling Problem

Posted on:2021-05-30

Degree:Master

Type:Thesis

Country:China

Candidate:R Y Pan

Full Text:PDF

GTID:2392330614471932

Subject:Computer Science and Technology

Abstract/Summary:

PDF Full Text Request

As one of the most widely studied combinatorial optimization problems,job scheduling problem is an abstract model of practical problems from the fields of transportation,logistics,and factories.In combinatorial optimization problems,the complex constraints and the huge solution space increase the difficulty of theirs solving.Traditional solution methods have some limitations.Reinforcement learning,as the most concerned field of machine learning recently,has successful applications in many fields.Therefore,in order to explore the application of artificial intelligence algorithms in practical production,this paper focuses on deep reinforcement learning algorithms to solve the permutation flowshop scheduling problem,a typical combinatorial optimization problem.The main work of the paper is as follows:(1)The actor-critic model is established to solve the permutation flow shop scheduling problem.Firstly,the mathematical model of the permutation flow shop scheduling problem is established according to the constraints and objective function.The essence of the problem is to determine the scheduling order of the jobs,which can be abstracted as the sequential decision problem.In order to solve the scheduling problem,this paper introduces two neural network models to help make the sequence decision.One is a pointer network,which is a special encoder-decoder model.The other is an attention network which combines the Transformer and pointer network.In order to solve the discussed problem,a deep reinforcement learning framework of actor-critic is designed based on these two sequential decision models.(2)The heterogeneous actor-critic model is proposed.According to the characteristics of the permutation flow shop scheduling problem and the task characteristics of different modules in the model,this paper analyzes the functional characteristics of the existing network,and designs a heterogeneous actor-critic algorithm.In the part of model training,the ?-greedy strategy is used to improve the exploration ability of the model while keeping the training effect of the model.Finally,this paper combines the 2-opt algorithm into the model to further improve the solution ability of the model.Experiments show that the actor-critic algorithm proposed in this paper is efficient and superior in solving the permutation flow shop scheduling problem.Compared with the traditional methods,especially the metaheuristic algorithm,the solution of this proposed model is very close and even exceeds the metaheuristic algorithm in some cases.

Keywords/Search Tags:

Actor-critic, Multi-head attention, Flowshop scheduling problem, Deep reinforcement learning

PDF Full Text Request

Related items

1	Research On Electric Vehicle Routing Problem Based On Reinforcement Learning
2	Actor-Critic Reinforcement Learning And Applications To Automatic Ship Berthing
3	A Resequencing Method For Automobile Painting With Rework Based On Reinforcement Learning
4	Study On Adaptive Pid Control Strategy Based On Actor-critic Learning
5	Research And Implementation Of Actor-Critic Algorithm Model For Aircraft Autonomous Landing
6	Research On Bidding Strategy Of Generators In Electricity Market Based On Asynchronous Advantage Actor-Critic Reinforcement Learning
7	Asynchronous Generalized Advantage Actor-critic And Application In Automatic Driving
8	Study On Deep Reinforcement Learning Algorithms Solving Open Shop Scheduling Problem
9	Research On Reinforcement Learning Control Method Of Micro Air Vehicle
10	Research On Algorithms Of Gradient Temporal Difference Evaluation Network For Deep Reinforcement Learning