Show simple item record

A linear programming approach to constrained nonstationary infinite-horizon Markov decision processes

dc.contributor.authorLee, Ilbin
dc.contributor.authorEpelman, Marina A
dc.contributor.authorRomeijn, H. Edwin
dc.contributor.authorSmith, Robert L
dc.date.accessioned2014-12-20T22:26:36Z
dc.date.available2014-12-20T22:26:36Z
dc.date.issued2013-03-06
dc.identifier.urihttps://hdl.handle.net/2027.42/109729
dc.description.abstractConstrained Markov decision processes (MDPs) are MDPs optimizing an objective function while satisfying additional constraints. We study a class of infinite-horizon constrained MDPs with nonstationary problem data, finite state space, and discounted cost criterion. This problem can equivalently be formulated as a countably infinite linear program (CILP), i.e., a linear program (LP) with a countably infinite number of variables and constraints. Unlike finite LPs, CILPs can fail to satisfy useful theoretical properties such as duality, and to date there does not exist a general solution method for such problems. Specifically, the characterization of extreme points as basic feasible solutions in finite LPs does not extend to general CILPs. In this paper, we provide duality results and a complete characterization of extreme points of the CILP formulation of constrained nonstationary MDPs with finite state space, and illustrate the characterization for special cases. As a corollary, we obtain the existence of a K-randomized optimal policy, where K is the number of constraints.en_US
dc.description.sponsorshipNational Science Foundation grant CMMI-1333260 and CMMI-0926508en_US
dc.language.isoen_USen_US
dc.relation.ispartofseriesIndustrial and Operation Engineering Technical Report 13-01en_US
dc.titleA linear programming approach to constrained nonstationary infinite-horizon Markov decision processesen_US
dc.typeTechnical Reporten_US
dc.subject.hlbsecondlevelIndustrial and Operations Engineering
dc.subject.hlbtoplevelEngineering
dc.contributor.affiliationumcampusAnn Arboren_US
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/109729/1/TR13-01.pdf
dc.identifier.sourceTechnical Reporten_US
dc.owningcollnameIndustrial and Operations Engineering, Department of (IOE)


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.