Show simple item record

A Generalization Error for Q-Learning

dc.contributor.authorMurphy, Susan A.en_US
dc.date.accessioned2007-12-06T19:26:00Z
dc.date.available2007-12-06T19:26:00Z
dc.date.issued2005-07en_US
dc.identifier.citationJournal of Machine Learning Research 2005; 6(Jul):1073-1097 <http://hdl.handle.net/2027.42/57425>en_US
dc.identifier.urihttps://hdl.handle.net/2027.42/57425
dc.identifier.urihttp://www.ncbi.nlm.nih.gov/sites/entrez?cmd=retrieve&db=pubmed&list_uids=16763665&dopt=citationen_US
dc.description.abstractPlanning problems that involve learning a policy from a single training set of ?nite horizon trajectories arise in both social science and medical ?elds. We consider Q-learning with function approximation for this setting and derive an upper bound on the generalization error. This upper bound is in terms of quantities minimized by a Q-learning algorithm, the complexity of the approximation space and an approximation term due to the mismatch between Q-learning and the goal of learning a policy that maximizes the value function.en_US
dc.description.sponsorshipNational Institutes of Health (NIDA grants K02 DA15674 and P50 DA 10075 to the Methodology Center)en_US
dc.format.extent1343 bytes
dc.format.extent196538 bytes
dc.format.extent73733 bytes
dc.format.mimetypetext/plain
dc.format.mimetypeapplication/pdf
dc.format.mimetypetext/plain
dc.language.isoen_USen_US
dc.subjectMultistage Decisionsen_US
dc.subjectDynamic Programmingen_US
dc.subjectReinforcement Learningen_US
dc.subjectBatch Dataen_US
dc.subject.classificationISR - Institute for Social Researchen_US
dc.titleA Generalization Error for Q-Learningen_US
dc.typeArticleen_US
dc.subject.hlbsecondlevelSocial Sciences (General)en_US
dc.subject.hlbtoplevelSocial Sciencesen_US
dc.description.peerreviewedPeer Revieweden_US
dc.contributor.affiliationumInstitute for Social Researchen_US
dc.contributor.affiliationumDepartment of Statisticsen_US
dc.contributor.affiliationumDepartmentof Psychiatryen_US
dc.contributor.affiliationumcampusAnn Arboren_US
dc.identifier.pmid1475741en_US
dc.identifier.pmid16763665en_US
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/57425/2/murphy05a.pdfen_US
dc.owningcollnameInstitute for Social Research (ISR)


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe its collections in a way that respects the people and communities who create, use, and are represented in them. We encourage you to Contact Us anonymously if you encounter harmful or problematic language in catalog records or finding aids. More information about our policies and practices is available at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.