Simple Partial Models for Complex Dynamical Systems.
dc.contributor.author | Talvitie, Erik N. | en_US |
dc.date.accessioned | 2011-01-18T16:17:08Z | |
dc.date.available | NO_RESTRICTION | en_US |
dc.date.available | 2011-01-18T16:17:08Z | |
dc.date.issued | 2010 | en_US |
dc.date.submitted | 2010 | en_US |
dc.identifier.uri | https://hdl.handle.net/2027.42/78893 | |
dc.description.abstract | An agent in an unknown environment may wish to learn a model that allows it to make predictions about future events and anticipate the consequences of its actions. Such a model can greatly enhance the agent's ability to make good decisions. However, in environments like the one in which we live, which is stochastic, partially observable, and high dimensional, learning a model is a challenge. One approach when faced with a difficult model learning problem is not to model the entire system. Instead, one might focus on the most important aspects of the environment and give up on modeling complicated, irrelevant phenomena. This intuition can be formalized using partial models, which are models that make only a restricted set of predictions in only a restricted set of circumstances. Because a partial model has limited prediction responsibilities, it may be significantly simpler than a complete model. Partial models have been studied in many contexts, mostly under the Markov assumption, where the agent is assumed to have access to the full state of the world. In this setting, predictions can be learned directly as functions of state and the process of learning a partial model is often as simple as estimating only the desired predictions and omitting the rest from the model. As such, much of the relevant work has focused on the challenging question of which partial models should be learned (rather than how to learn them). In the partially observable case, however, where state is assumed to be hidden from the agent, the basic problem of how to learn a partial model poses significant challenges. The goal of this thesis is to provide general results and methods for learning partial models in partially observable systems. The main challenges posed by partial observability are formalized and learning methods are developed to address these issues. The methods presented are demonstrated empirically to learn partial models in systems that are too complex for standard, complete model learning methods. Finally, many partial models are learned and composed to form complete models that are used for model-based planning in high dimensional arcade game examples. | en_US |
dc.format.extent | 1325814 bytes | |
dc.format.extent | 1373 bytes | |
dc.format.mimetype | application/octet-stream | |
dc.format.mimetype | text/plain | |
dc.language.iso | en_US | en_US |
dc.subject | Artificial Intelligence | en_US |
dc.subject | Machine Learning | en_US |
dc.title | Simple Partial Models for Complex Dynamical Systems. | en_US |
dc.type | Thesis | en_US |
dc.description.thesisdegreename | PhD | en_US |
dc.description.thesisdegreediscipline | Computer Science & Engineering | en_US |
dc.description.thesisdegreegrantor | University of Michigan, Horace H. Rackham School of Graduate Studies | en_US |
dc.contributor.committeemember | Baveja, Satinder Singh | en_US |
dc.contributor.committeemember | Eustice, Ryan M. | en_US |
dc.contributor.committeemember | Kuipers, Benjamin | en_US |
dc.contributor.committeemember | Laird, John E. | en_US |
dc.subject.hlbsecondlevel | Computer Science | en_US |
dc.subject.hlbtoplevel | Engineering | en_US |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/78893/1/etalviti_1.pdf | |
dc.owningcollname | Dissertations and Theses (Ph.D. and Master's) |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.