Multirobot learning enables robots to adapt to their environment using real-world experience. Because multirobot learning is a young research area, no standards yet define its systematic implementation. Researchers have proposed several methods for implementing learning in decentralized multirobot systems. The most common approach is to place a separate learning entity on each robot. Each learning entity runs a single-robot algorithm but uses a specially designed reward system, crafted with human insight to achieve the best performance. These special reward systems usually take the form of subgoals, heuristics, shaped reinforcement, and progress estimators, whose details vary with the task.

This dissertation focuses on the use of a traditional reward system, which rewards robots only when they reach desired goals. Our research question is whether decentralized multirobot systems with traditional reward systems can achieve optimal performance. Our experiments indicate that traditional learning methods can be used effectively in decentralized multirobot systems, but only under certain conditions. The success and effectiveness of this approach are potentially affected by various factors, which we classify into two groups: the nature of the robots and the nature of the learning entities. We methodically test the effect of varying five common factors (reward scope, value function of the learning algorithm, diversity of robots, number of robots, and delay of global information), first in simulation and then on real robots. The results show that three of these factors (reward scope, value function of the learning algorithm, and delay of global information), if set up incorrectly, can prevent optimal, cooperative solutions.

At the end of this dissertation, we propose dynamic task selection, a multirobot group architecture that allows task sharing and promotes robustness. In the last chapter, we propose the use of heuristics to speed up learning by biasing the robots' exploration without disturbing the original goal.
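To make the contrast between the two reward styles concrete, the following is a minimal Python sketch, not taken from the dissertation's implementation: a traditional reward that fires only at the goal, next to a shaped reward with a progress-estimator term. The state fields `at_goal` and `distance_to_goal` are assumed purely for illustration.

```python
# Hypothetical illustration of the two reward styles discussed above.
# The state fields (at_goal, distance_to_goal) are assumptions for
# this sketch, not names from the dissertation.

def traditional_reward(state) -> float:
    """Traditional reward: nonzero only when the robot reaches the goal."""
    return 1.0 if state.at_goal else 0.0

def shaped_reward(state, prev_state) -> float:
    """Shaped reinforcement: adds a progress-estimator bonus that
    rewards moving closer to the goal, guiding exploration."""
    reward = 1.0 if state.at_goal else 0.0
    progress = prev_state.distance_to_goal - state.distance_to_goal
    return reward + 0.1 * progress  # small shaping term
```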
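One way to picture a per-robot learning entity and the "reward scope" factor is the hedged sketch below: each robot carries its own tabular Q-learner (a common single-robot algorithm, assumed here rather than stated in the abstract), and the scope setting decides whether a learner is trained on its own reward or on the team's pooled reward. The learning-rate, discount, and exploration values are illustrative only.

```python
import random
from collections import defaultdict

class RobotLearner:
    """One independent learning entity per robot (tabular Q-learning).
    States must be hashable; parameter values are illustrative."""

    def __init__(self, n_actions, alpha=0.1, gamma=0.9, epsilon=0.1):
        self.q = defaultdict(float)   # (state, action) -> estimated value
        self.n_actions = n_actions
        self.alpha, self.gamma, self.epsilon = alpha, gamma, epsilon

    def act(self, state):
        if random.random() < self.epsilon:                 # explore
            return random.randrange(self.n_actions)
        return max(range(self.n_actions),                  # exploit
                   key=lambda a: self.q[(state, a)])

    def update(self, state, action, reward, next_state):
        best_next = max(self.q[(next_state, a)] for a in range(self.n_actions))
        td_target = reward + self.gamma * best_next
        self.q[(state, action)] += self.alpha * (td_target - self.q[(state, action)])

def scoped_rewards(individual_rewards, scope="local"):
    """Reward scope: feed each learner its own reward (local scope)
    or the team's pooled reward (global scope)."""
    if scope == "global":
        total = sum(individual_rewards)
        return [total] * len(individual_rewards)
    return individual_rewards
```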
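The closing proposal, biasing exploration with heuristics while leaving the reward untouched, might look roughly like the sketch below. The `heuristic` scoring function and the bias weight are assumptions made for illustration; because the reward signal is unchanged, the goal being learned is undisturbed.

```python
import random

def biased_explore(state, actions, q_values, heuristic, epsilon=0.2, bias=2.0):
    """Epsilon-greedy action selection whose exploration step is biased
    toward heuristically promising actions. Only the order in which
    actions are tried changes; the reward, and hence the original goal,
    is untouched. `heuristic(state, a)` is a hypothetical function
    assumed to return a non-negative score."""
    if random.random() >= epsilon:
        return max(actions, key=lambda a: q_values[(state, a)])  # exploit
    # Explore: sample actions in proportion to their heuristic score.
    weights = [1.0 + bias * heuristic(state, a) for a in actions]
    return random.choices(actions, weights=weights, k=1)[0]
```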