State clustering in Markov decision processes with an application in information sharing

Posted on:2006-06-06

Degree:Ph.D

Type:Dissertation

University:North Carolina State University

Candidate:Berrings, Lauren Marie

Full Text:PDF

GTID:1458390008973912

Subject:Engineering

Abstract/Summary:

This research examines state clustering in Markov Decision processes, specifically addressing the problem referred to as Markov Decision process with restricted observations. The general problem is a special case of a Partially Observable Markov Decision process where the state space is partitioned into mutually exclusive sets representing the observable portion of the process. The goal is to find an optimal policy defined over the partition of the state space that minimizes (maximizes) some performance objective. Algorithms presented to solve this problem for the infinite horizon undiscounted average cost case have largely been based on enumerative procedures. A heuristic solution procedure based on Howard's (1960) policy iteration method is presented.; Applications of Markov decision processes with restricted observations exist in networks of queues, retrial queues, maintenance problems and queuing networks with server control. A new application area is proposed in the field of information sharing to measure the value of information sharing in a supply chain under optimal control. This is achieved by representing a model of full information sharing as a completely observable Markov Decision process (MDP), while no information sharing is represented as an MDP with restricted observations. Solution procedures are presented for the general Markov Decision process with restricted observations. Heuristic solutions are evaluated against the optimal solution obtained via total enumeration. Both random Markov Decision processes and information sharing problems are studied. The value of sharing information in a two-stage supply chain system is studied. The influence of capacity, demand, cost and retailer policy on the value of information sharing is considered. Insight on the structure of the optimal policy with and without information sharing is provided.

Keywords/Search Tags:

Markov decision, Information sharing, State, Restricted observations, Policy, Optimal

Related items

1	Performance guarantee of a sub-optimal policy for a discrete Markov decision process and its application to a robotic surveillance problem
2	Risk -sensitive control of discrete -time partially observed Markov decision processes
3	Theories, Algortihms And Applications Of Policy Gradient Reinforcement Learning
4	Research Of Policy Algorithms Applied To Perceptual Decision-Making Tasks
5	The Reinforcement Learning Research Based On Internal State In Partially Observable Markov Decision Processes
6	Data Compression Rearch Based On The Markov Decision Process
7	Optimal Transmission Strategy Of P2P Real-time Communication In Fading Channel
8	Research On Attack Schedule Against State Estimation In Cyber-physical Systems
9	Optimal learning: Computational procedures for Bayes-adaptive Markov decision processes
10	Spectrum Sharing Strategy And Optimal Cognitive Transmission In Cognitive Radio Networks Based On ARQ Feedback