Font Size: a A A

Adaptive cost-based policy mapping for imitation

Posted on:2004-01-04Degree:M.SType:Thesis
University:The University of Texas at ArlingtonCandidate:Gudla, Srichandan VenkatFull Text:PDF
GTID:2465390011460642Subject:Computer Science
Abstract/Summary:PDF Full Text Request
Imitation represents a powerful approach for programming and autonomous learning in robot and computer systems. An important aspect of imitation is the mapping of observations to an executable control strategy. This is particularly important if the behavioral capabilities of the observed and imitating agent differ significantly. In this thesis this problem has been addressed by locally optimizing a cost function representing the deviation from the observed state sequence and the cost of the actions required to perform the imitation. The result are imitation strategies that can be performed by the imitating agent and that as closely as possible resemble the observations of the demonstrating agent. A reinforcement learning component is introduced to learn an optimal weight vector in the cost function over several trials and then improve the quality of imitation. The performance of this approach is illustrated within the context of a simulated multi-agent environment.
Keywords/Search Tags:Imitation
PDF Full Text Request
Related items