Automatic basis function construction for reinforcement learning and approximate dynamic programming

Posted on:2009-02-16

Degree:M.Sc

Type:Thesis

University:McGill University (Canada)

Candidate:Keller, Philipp W

GTID:2448390002997953

Subject:Operations Research

Abstract/Summary:

We address the problem of automatically constructing basis functions for linear approximation of the value function of a Markov decision process (MDP). Our work builds on results by Bertsekas and Castanon (1989), who proposed a method for automatically aggregating states to speed up value iteration. We propose to use neighbourhood component analysis, a dimensionality reduction technique created for supervised learning, to map a high-dimensional state space to a low-dimensional space based on the Bellman error or on the temporal difference (TD) error. We then place basis functions in the lower-dimensional space and add them as new features for the linear function approximator. This approach is applied to a high-dimensional inventory control problem and to a number of benchmark reinforcement learning problems.
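To make the role of the Bellman error concrete, here is a minimal sketch of error-driven feature construction on a toy problem. It is not the method of the thesis: it skips the neighbourhood component analysis step entirely and simply adds the normalised Bellman error vector itself as the next feature. The six-state random-walk chain, the reward placement, the discount factor, the least-squares Bellman residual fit, and all function names are assumptions chosen for illustration.

```python
import math

def matvec(M, v):
    """Multiply matrix M (list of rows) by vector v."""
    return [sum(m * x for m, x in zip(row, v)) for row in M]

def norm(v):
    return math.sqrt(sum(x * x for x in v))

def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda i: abs(aug[i][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for i in range(col + 1, n):
            factor = aug[i][col] / aug[col][col]
            for j in range(col, n + 1):
                aug[i][j] -= factor * aug[col][j]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(aug[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (aug[i][n] - s) / aug[i][i]
    return x

def bellman_error(P, r, gamma, features):
    """Fit weights w by least-squares Bellman residual minimisation and
    return the per-state Bellman error r + gamma * P V - V, where
    V = sum_k w[k] * features[k]."""
    # Column k of the design matrix is phi_k - gamma * P phi_k.
    cols = [[f - gamma * pf for f, pf in zip(phi, matvec(P, phi))]
            for phi in features]
    # Normal equations for min_w || r - sum_k w[k] * cols[k] ||^2.
    G = [[sum(a * b for a, b in zip(ci, cj)) for cj in cols] for ci in cols]
    h = [sum(a * b for a, b in zip(ci, r)) for ci in cols]
    w = solve(G, h)
    return [r[s] - sum(w[k] * cols[k][s] for k in range(len(cols)))
            for s in range(len(r))]

def build_chain(n):
    """Hypothetical toy MDP under a fixed policy: a symmetric random walk
    on n states that stays in place when it would step past either end."""
    P = [[0.0] * n for _ in range(n)]
    for s in range(n):
        P[s][max(s - 1, 0)] += 0.5
        P[s][min(s + 1, n - 1)] += 0.5
    return P

n, gamma = 6, 0.9                  # assumed problem size and discount
P = build_chain(n)
r = [0.0] * (n - 1) + [1.0]        # assumed reward: 1 in the last state only
features = [[1.0] * n]             # start from a single constant feature
error_norms = []
for _ in range(4):
    err = bellman_error(P, r, gamma, features)
    error_norms.append(norm(err))
    # Add the normalised Bellman error as the next basis function.
    features.append([e / error_norms[-1] for e in err])

print(error_norms)
```

The printed norms should shrink as features are added, because each fit is over a larger feature space. In a large or continuous state space the error vector cannot be stored per state, which is where a low-dimensional mapping of the kind described in the abstract would come in.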

Keywords/Search Tags:

Basis, Function

Related items

1	Research On Establishing An Improved Method To Determine The Radial Basis Function Center Of Radial Basis Function Neural Network
2	Higher-order Method Of Moments Based On Modified Legendre Polynomial Basis Functions
3	Research On Application Of Radial Basis Function In Reverse Engineering
4	Research On Polynomial Curves And Surfaces With Two Shape Parameters
5	Research On Reinforcement Learning Network Algorithm With Self-adaptive Basis Function
6	Analysis and synthesis of matched basis function repetitive control
7	Electromagnetic Parameter Extraction Of RFICs
8	Automatic basis function construction for reinforcement learning and approximate dynamic programming
9	The Application Of The Multilevel Characteristic Basis Function And Its Improved Technology In Electromagnetic Scattering Problems
10	General Robot Motor Skill Learning With Basis Function Self-reconstruction