Show simple item record

On Predictive Linear Gaussian Models.

dc.contributor.authorRudary, Matthew R.en_US
dc.date.accessioned2009-05-15T15:15:53Z
dc.date.availableNO_RESTRICTIONen_US
dc.date.available2009-05-15T15:15:53Z
dc.date.issued2009en_US
dc.date.submitteden_US
dc.identifier.urihttps://hdl.handle.net/2027.42/62307
dc.description.abstractModels are used by artificial agents to make predictions about the future; agents then use these predictions to modify their behavior. In many cases, these models are not known a priori and so the agent must learn a model through experience with a system. At the core of most models is the concept of state--an estimate of the current situation of the world from which a model's predictions are derived. A recent development in the study of models is the predictive state} model. Predictive state models use predictions about potential future events as their state, as opposed to unobserved, unobservable variables, as in most traditional models. For example, a traditional model may represent a robot's location using latitude and longitude, which is unobservable without a GPS unit. A predictive state model of the same robot might represent its location with two events like "If I traveled forward 4 feet I would hit a wall" and "If I turned right and traveled forward 6 feet I would move into a hallway." This dissertation presents two models that expand the limits of predictive state models, which had mostly modeled dynamical systems with discrete, scalar-valued observations, with linear predictions of future events. The first model, the e-test predictive state representation (EPSR), is the first nonlinear predictive state model that can be used to model a large class of dynamical systems. The EPSR models deterministic systems with discrete actions and observations, and is sometimes exponentially smaller than the equivalent model with linear predictions. The second model is the predictive linear Gaussian model (PLG), which models dynamical systems with continuous vector-valued actions and observations. I present theoretical results that show the PLG is representationally equivalent to the linear dynamical system (LDS), a popular traditional model, and that the parameter estimation algorithm I present is consistent--that is, in the limit of infinite data, it produces a correct model. I also apply this algorithm to a) a number of artificial, randomly generated systems and b) a real-world traffic prediction problem, and show that it performs well compared to expectation maximization, a parameter estimation algorithm for the LDS.en_US
dc.format.extent808024 bytes
dc.format.extent1373 bytes
dc.format.mimetypeapplication/octet-stream
dc.format.mimetypetext/plain
dc.language.isoen_USen_US
dc.subjectReinforcement Learningen_US
dc.subjectPredictive State Representationsen_US
dc.subjectArtificial Intelligenceen_US
dc.subjectStatistical Modelsen_US
dc.subjectLinear Dynamical Systemsen_US
dc.titleOn Predictive Linear Gaussian Models.en_US
dc.typeThesisen_US
dc.description.thesisdegreenamePhDen_US
dc.description.thesisdegreedisciplineComputer Science & Engineeringen_US
dc.description.thesisdegreegrantorUniversity of Michigan, Horace H. Rackham School of Graduate Studiesen_US
dc.contributor.committeememberBaveja, Satinder Singhen_US
dc.contributor.committeememberHero III, Alfred O.en_US
dc.contributor.committeememberMurphy, Susanen_US
dc.contributor.committeememberPollack, Martha E.en_US
dc.subject.hlbsecondlevelComputer Scienceen_US
dc.subject.hlbtoplevelEngineeringen_US
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/62307/1/mrudary_1.pdf
dc.owningcollnameDissertations and Theses (Ph.D. and Master's)


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.