Solving the iterated prisoner's dilemma using learning automation

Posted on:2007-10-17

Degree:M.Sc

Type:Thesis

University:Carleton University (Canada)

Candidate:Wu, Yaojun

Full Text:PDF

GTID:2446390005979077

Subject:Computer Science

Abstract/Summary:

The Prisoner's Dilemma (PD) has been discussed extensively to model the conflict between competition and cooperation, or between individual and collective rationality [1][8][9][10][11]. The machine learning community has an interest in the iterated PD (IPD) game, but has special interest in the behavior of the individual in the IPD game.;A family of estimate-based strategies, which are based on the pursuit scheme and interconnected Learning Automaton (LA) structure, is developed in this Thesis. These achieve a high performance in playing the IPD game in Nonstationary Environments. Simulation results show that, our proposed scheme, the IPPP (Interconnected Learning Automata with the Preceding Penalty Limit Pursuit), achieves 4.86% lower costs than the SLASH (Stochastic Learning Automata with States of History) strategy, which is the most efficient learning strategy reported in the literature. The advantages of IPPP are its quick detection of strategy switches, fast convergence, and its good generalization capability.

Keywords/Search Tags:

Prisoner's dilemma, IPD game

Related items

1	Game And Equilibrium Under Uncertainty
2	Study On The Rural Governance Interests Subject Game Strategy Analysis And Countermeasure
3	The Research Of Game Thtory's Application On The Execution Of Construction Engineering
4	Social Conflict Management Game Perspective
5	The Evolution Of International Norms
6	The First Nuclear Crisis In The U.s. And The Dprk, "repeated Prisoner's Dilemma Game Analysis
7	Swn Model Is Applied To The Basic Research For Economic And Management Fields
8	Media Supervision On Jurisdiction
9	Study On Affairs Of Prisoner Of War In Chinese Army
10	BEHAVIOR OF LOW AND HIGH TRUST JUVENILE DELINQUENTS WHEN PLAYING A MODIFIED PRISONER'S DILEMMA GAME