Sequential Decision Models Based On Q-Learning Algorithm

Posted on:2020-07-25

Degree:Master

Type:Thesis

Country:China

Candidate:L Y Liu

Full Text:PDF

GTID:2370330596485563

Subject:Mathematics

Abstract/Summary:

PDF Full Text Request

Sequential decision problem has been widely used in various fields,while reinforcement learning algorithms provide a powerful tool for solving sequential decision problems.This thesis mainly studies the improvement of Q-learning algorithm,which is a typical algorithm of reinforcement learning,and presents its application into incremental classification problems and group decision making problems.In classification problems,if the classification information used to train the classifier is not obtained at one time but given in the form of sequence,incremental learning provides an important solution.However,the order of samples which are used to train the classifier will seriously affect its performance of the classifier when incremental learning method is employed,especially for the weak classifier.The reason is that the incremental learning method is prone to prematurely add noise data into the training set,reduce the accuracy of the classifier consequently.To solve this problem,this paper proposes an incremental classification model based on Q-learning algorithm.The model uses the classical Q-learning algorithm to select the incremental sample sequence reasonably,therefore reduce the negative impact of noise data,and be able to label the samples automatically during the learning process.At the same time,in order to overcome the computational complexity of the algorithm when the scale of non-labeled training data becomes large,this thesis further proposes a batch incremental classification model.This model reduces the computational complexity of the model and save the storage space effectively as well.Experimental results show that the incremental classification model based on Q-learning algorithm has the advantages of high classification accuracy and strong real-time performance by integrating the incremental learning method and reinforcement learning technology.Multistage group decision-making problem is a kind of typical sequence decision problems.In most applications of the real world,due to the uncertainty of the state space(for example,the state transition probability matrix is completely unknown)faced by the decisionmakers,they need to obtain further information by interacting with the environment dynamically to seek an optimal strategy with higher degree of consensus.Therefore,we establish the basic algorithm model of the multi-stage group decision Q-learning algorithm to learn the optimal group strategy by improving the Q-learning algorithm and improving the iterative process of the algorithm.At the same time,it is proved that the multi-stage group optimal strategy based on Q-learning algorithm is also the strategy with the highest group consensus degree.Finally,an example shows the rationality and feasibility of the algorithm.

Keywords/Search Tags:

Sequential decision making problems, Q-learning algorithm, Incremental learning, Multi-stage group decision making

PDF Full Text Request

Related items

1	Research On Multi-attribute Group Decision Making Problems Based On Genetic Algorithm
2	Research On The Complex Large Group Decision-making Problem Under Interval 2-tuple Linguistic Environment
3	Design And Implementation Of Game Decision-making Mechanism Based On Deep Reignfrocement Learning Algorithm
4	The Application Of Multi-attribute Group Decision-making Based On Uncertain Information In Investment Problems
5	The Study On Optimization Method For Sequential Decision-making In Supply Chain
6	Research And Application Of Multi-attribute Decision Making Method For Heterogeneous Information
7	Research On Methods Of Two Types Of Uncertain Multi-attribute Decision Making
8	Research On Methods And Applications For Interactive Multiple Attribue Decision Making Under Uncertainty
9	Some Theories And Methods In Group Decision-Making And Multiobjective Decision-Making
10	Grey Multi-attribute Decision-making Method Under Mixed Information And Its Applications