Kernel Methods for Learning with Limited Labeled Data
dc.contributor.author | Deshmukh, Aniket Anand | |
dc.date.accessioned | 2019-07-08T19:41:58Z | |
dc.date.available | NO_RESTRICTION | |
dc.date.available | 2019-07-08T19:41:58Z | |
dc.date.issued | 2019 | |
dc.date.submitted | ||
dc.identifier.uri | https://hdl.handle.net/2027.42/149810 | |
dc.description.abstract | Machine learning is a rapidly developing technology that enables a system to automatically learn and improve from experience. Modern machine learning algorithms have achieved state-of-the-art performances on a variety of tasks such as speech recognition, image classification, machine translation, playing games like Go, Dota 2, etc. However, one of the biggest challenges in applying these machine learning algorithms in the real world is that they require huge amount of labeled data for the training. In the real world, the amount of labeled training data is often limited. In this thesis, we address three challenges in learning with limited labeled data using kernel methods. In our first contribution, we provide an efficient way to solve an existing domain generalization algorithm and extend the theoretical analysis to multiclass classification. As a second contribution, we propose a multi-task learning framework for contextual bandit problems. We propose an upper confidence bound-based multi-task learning algorithm for contextual bandits, establish a corresponding regret bound, and interpret this bound to quantify the advantages of learning in the presence of high task (arm) similarity. Our third contribution is to provide a simple regret guarantee (best policy identification) in a contextual bandits setup. Our experiments examine a novel application to adaptive sensor selection for magnetic field estimation in interplanetary spacecraft and demonstrate considerable improvements of our algorithm over algorithms designed to minimize the cumulative regret. | |
dc.language.iso | en_US | |
dc.subject | Machine Learning | |
dc.subject | Limited Data | |
dc.subject | Contextual Bandits | |
dc.subject | Domain Generalization | |
dc.title | Kernel Methods for Learning with Limited Labeled Data | |
dc.type | Thesis | |
dc.description.thesisdegreename | PhD | en_US |
dc.description.thesisdegreediscipline | Electrical and Computer Engineering | |
dc.description.thesisdegreegrantor | University of Michigan, Horace H. Rackham School of Graduate Studies | |
dc.contributor.committeemember | Scott, Clayton D | |
dc.contributor.committeemember | Tewari, Ambuj | |
dc.contributor.committeemember | Hero III, Alfred O | |
dc.contributor.committeemember | Schwartz, Eric Michael | |
dc.subject.hlbsecondlevel | Electrical Engineering | |
dc.subject.hlbtoplevel | Engineering | |
dc.description.bitstreamurl | https://deepblue.lib.umich.edu/bitstream/2027.42/149810/1/aniketde_1.pdf | |
dc.identifier.orcid | 0000-0002-7292-8436 | |
dc.identifier.name-orcid | Deshmukh, Aniket Anand; 0000-0002-7292-8436 | en_US |
dc.owningcollname | Dissertations and Theses (Ph.D. and Master's) |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.