Simple Partial Models for Complex Dynamical Systems.

Talvitie, Erik N.

Simple Partial Models for Complex Dynamical Systems.

dc.contributor.author	Talvitie, Erik N.	en_US
dc.date.accessioned	2011-01-18T16:17:08Z
dc.date.available	NO_RESTRICTION	en_US
dc.date.available	2011-01-18T16:17:08Z
dc.date.issued	2010	en_US
dc.date.submitted	2010	en_US
dc.identifier.uri	https://hdl.handle.net/2027.42/78893
dc.description.abstract	An agent in an unknown environment may wish to learn a model that allows it to make predictions about future events and anticipate the consequences of its actions. Such a model can greatly enhance the agent's ability to make good decisions. However, in environments like the one in which we live, which is stochastic, partially observable, and high dimensional, learning a model is a challenge. One approach when faced with a difficult model learning problem is not to model the entire system. Instead, one might focus on the most important aspects of the environment and give up on modeling complicated, irrelevant phenomena. This intuition can be formalized using partial models, which are models that make only a restricted set of predictions in only a restricted set of circumstances. Because a partial model has limited prediction responsibilities, it may be significantly simpler than a complete model. Partial models have been studied in many contexts, mostly under the Markov assumption, where the agent is assumed to have access to the full state of the world. In this setting, predictions can be learned directly as functions of state and the process of learning a partial model is often as simple as estimating only the desired predictions and omitting the rest from the model. As such, much of the relevant work has focused on the challenging question of which partial models should be learned (rather than how to learn them). In the partially observable case, however, where state is assumed to be hidden from the agent, the basic problem of how to learn a partial model poses significant challenges. The goal of this thesis is to provide general results and methods for learning partial models in partially observable systems. The main challenges posed by partial observability are formalized and learning methods are developed to address these issues. The methods presented are demonstrated empirically to learn partial models in systems that are too complex for standard, complete model learning methods. Finally, many partial models are learned and composed to form complete models that are used for model-based planning in high dimensional arcade game examples.	en_US
dc.format.extent	1325814 bytes
dc.format.extent	1373 bytes
dc.format.mimetype	application/octet-stream
dc.format.mimetype	text/plain
dc.language.iso	en_US	en_US
dc.subject	Artificial Intelligence	en_US
dc.subject	Machine Learning	en_US
dc.title	Simple Partial Models for Complex Dynamical Systems.	en_US
dc.type	Thesis	en_US
dc.description.thesisdegreename	PhD	en_US
dc.description.thesisdegreediscipline	Computer Science & Engineering	en_US
dc.description.thesisdegreegrantor	University of Michigan, Horace H. Rackham School of Graduate Studies	en_US
dc.contributor.committeemember	Baveja, Satinder Singh	en_US
dc.contributor.committeemember	Eustice, Ryan M.	en_US
dc.contributor.committeemember	Kuipers, Benjamin	en_US
dc.contributor.committeemember	Laird, John E.	en_US
dc.subject.hlbsecondlevel	Computer Science	en_US
dc.subject.hlbtoplevel	Engineering	en_US
dc.description.bitstreamurl	http://deepblue.lib.umich.edu/bitstream/2027.42/78893/1/etalviti_1.pdf
dc.owningcollname	Dissertations and Theses (Ph.D. and Master's)

Files in this item

Name:: etalviti_1.pdf
Size:: 1.264MB
Format:: PDF

View/Open

Dissertations and Theses (Ph.D. and Master's)

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.