
Learning procedural planning knowledge in complex environment

Posted on: 1997-10-19
Degree: Ph.D
Type: Dissertation
University: University of Michigan
Candidate: Pearson, Douglas John
Full Text: PDF
GTID: 1467390014982257
Subject: Computer Science
Abstract/Summary:
In complex, dynamic environments, an agent's knowledge of the environment (its domain knowledge) will rarely be complete and correct. Existing approaches to learning and correcting domain knowledge have focused either on learning procedural knowledge that directly guides execution (e.g., reinforcement learners) or on learning declarative planning knowledge (e.g., theory revision systems). Systems that learn only execution knowledge are generally applicable only to small domains, where it is possible to learn an execution policy that covers the entire state space, making planning unnecessary. Conversely, existing approaches to learning declarative planning knowledge apply to large domains, but they are limited to simple agents, whose actions produce immediate, deterministic effects in fully sensed, noise-free environments with no exogenous events.

This research investigates the use of procedural knowledge to support the learning of planning knowledge in large, complex environments. We describe a series of environmental properties that constrain learning and are violated by existing approaches to learning planning knowledge. We then present an operator-based representation for planning knowledge that is sufficiently expressive to model complex, conditional actions that produce sequential effects over time. We then present IMPROV, a system for learning and correcting errors in this planning knowledge that requires only procedural access to that knowledge. This procedural restriction keeps learning tractable, even over these large, expressive representations. We first explain how IMPROV incrementally learns operator precondition knowledge. We then demonstrate how a hierarchical, operator-based representation can reduce the problem of learning operator effects to the problem of learning operator preconditions.
This result allows IMPROV to use a single learning method to learn both operator preconditions and effects. This also allows IMPROV to learn complex models of actions that produce conditional or sequential effects. Finally, we test the system in two sample domains and empirically demonstrate that it satisfies many of the constraints faced by learning agents in complex and challenging environments.
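To make the operator-based framing concrete, the sketch below shows one simple way precondition knowledge can be narrowed incrementally from execution traces. This is an illustrative toy (intersecting the propositions that held whenever the operator succeeded), not IMPROV's actual algorithm; the operator and proposition names are hypothetical.

```python
# Illustrative sketch, NOT IMPROV's algorithm: incrementally narrow an
# operator's learned preconditions from execution traces. States are
# modeled as sets of propositions; all names here are hypothetical.

class Operator:
    def __init__(self, name):
        self.name = name
        self.preconditions = None  # unknown until the first observation

    def observe(self, state, succeeded):
        """Update the precondition hypothesis from one execution."""
        if succeeded:
            # The preconditions can include at most the propositions that
            # held when the operator worked, so intersect with this state.
            if self.preconditions is None:
                self.preconditions = set(state)
            else:
                self.preconditions &= state
        # (A fuller learner would also exploit failed executions to
        # prune over-general hypotheses; omitted here for brevity.)

    def applicable(self, state):
        """True once every learned precondition holds in `state`."""
        return self.preconditions is not None and self.preconditions <= state

op = Operator("pick-up")
op.observe({"clear(A)", "on-table(A)", "hand-empty"}, succeeded=True)
op.observe({"clear(A)", "on(A,B)", "hand-empty"}, succeeded=True)
# Learned preconditions: clear(A) and hand-empty
print(op.applicable({"clear(A)", "hand-empty", "on(A,B)"}))  # True
```

The intersection step is the simplest instance of the incremental, example-driven correction the abstract describes; the dissertation's contribution is doing this kind of refinement through purely procedural access to large, expressive operator representations.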
Keywords/Search Tags: Complex, Planning knowledge, Learn, Environments, Procedural, Domains, IMPROV