Learning Integrated Relational and Continuous Action Models for Continuous Domains.

Xu, Joseph Zhen Ying

Learning Integrated Relational and Continuous Action Models for Continuous Domains.

dc.contributor.author	Xu, Joseph Zhen Ying	en_US
dc.date.accessioned	2014-01-16T20:41:25Z
dc.date.available	NO_RESTRICTION	en_US
dc.date.available	2014-01-16T20:41:25Z
dc.date.issued	2013	en_US
dc.date.submitted	2013	en_US
dc.identifier.uri	https://hdl.handle.net/2027.42/102383
dc.description.abstract	Long-living autonomous agents must be able to learn to perform competently in novel environments. One important aspect of competence is the ability to plan, which entails the ability to learn models of the agent's own actions and their effects on the environment. This thesis describes an approach to learn action models of environments with continuous-valued spatial states and realistic physics consisting of multiple interacting rigid objects. In such environments, we hypothesize that objects exhibit multiple qualitatively distinct behaviors based on their relationships to each other and how they interact. We call these qualitatively distinct behaviors modes. Our approach models individual modes with linear functions. We extend the standard propositional function representation with learned knowledge about the roles of objects in determining the outcomes of functions. Roles are learned as first-order relations using the FOIL algorithm. This allows the functions modeling individual modes to be "instantiated" with different sets of objects, similar to relational rules such as STRIPS operators. We also use FOIL to learn preconditions for each mode consisting of clauses that test spatial relationships between objects. These relational preconditions naturally capture the interaction dynamics of spatial domains and allow faster learning and generalization of the model. The combination of continuous linear functions, relational roles, and relational mode preconditions effectively capture both continuous and relational regularities prominent in spatial domains. This results in faster and more general action modeling in these domains. We evaluate the algorithm on two domains, one involving pushing stacks of boxes against frictional resistance, and one in which a ball interacts with obstacles in a physics simulator. We show that our algorithm learns more accurate models than locally weighted regression in the physics simulator domain. We also show that relational mode preconditions learned with FOIL are more accurate than continuous classifiers learned with support vector machines and k-nearest-neighbor.	en_US
dc.language.iso	en_US	en_US
dc.subject	Action Modeling	en_US
dc.subject	Learning	en_US
dc.subject	Combined Symbolic/Continuous Representation	en_US
dc.title	Learning Integrated Relational and Continuous Action Models for Continuous Domains.	en_US
dc.type	Thesis	en_US
dc.description.thesisdegreename	PhD	en_US
dc.description.thesisdegreediscipline	Computer Science & Engineering	en_US
dc.description.thesisdegreegrantor	University of Michigan, Horace H. Rackham School of Graduate Studies	en_US
dc.contributor.committeemember	Laird, John E.	en_US
dc.contributor.committeemember	Lewis, Richard L.	en_US
dc.contributor.committeemember	Kuipers, Benjamin	en_US
dc.contributor.committeemember	Lee, Honglak	en_US
dc.contributor.committeemember	Baveja, Satinder Singh	en_US
dc.subject.hlbsecondlevel	Computer Science	en_US
dc.subject.hlbtoplevel	Engineering	en_US
dc.description.bitstreamurl	http://deepblue.lib.umich.edu/bitstream/2027.42/102383/1/jzxu_1.pdf
dc.owningcollname	Dissertations and Theses (Ph.D. and Master's)

Files in this item

Name:: jzxu_1.pdf
Size:: 1.455MB
Format:: PDF

View/Open

Dissertations and Theses (Ph.D. and Master's)

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.