The Power of Adaptivity for Decision-Making under Uncertainty

Ghuge, Rohan

The Power of Adaptivity for Decision-Making under Uncertainty

dc.contributor.author	Ghuge, Rohan
dc.date.accessioned	2023-05-25T14:48:10Z
dc.date.available	2023-05-25T14:48:10Z
dc.date.issued	2023
dc.date.submitted	2023
dc.identifier.uri	https://hdl.handle.net/2027.42/176657
dc.description.abstract	In this thesis, we study the role of adaptivity in decision-making problems under uncertainty. The first part of the thesis focuses on combinatorial problems, while the second part of the thesis deals with the $K$-armed dueling bandits problem. Combinatorial optimization captures many natural decision-making problems such as matching, load balancing, assortment optimization, network design, and submodular optimization. In many practical settings, we have to solve such combinatorial problems under uncertainty; specifically when we only have partial knowledge about the input. Solutions to such problems are sequential decision processes that make decisions one by one “adaptively” (depending on prior observations). While such adaptive solutions achieve the best objective, the inherently sequential nature makes them undesirable in many applications. Specifically, we ask: how well can solutions with only a few adaptive rounds approximate fully adaptive solutions? (1) We study (and answer) the above question for the stochastic submodular cover and scenario submodular cover problems. These models capture many problems such as sensor placement with unreliable sensors, optimal decision tree, stochastic set cover, and correlated knapsack cover. We show how to obtain solutions that approximate fully-adaptive solutions using only a few “rounds” of adaptivity. (2) We also study the stochastic score classification problem. We provide the first constant-factor approximation algorithm for this problem, which improves over the previously-known logarithmic approximation ratio. Moreover, our algorithm is non adaptive: it just involves performing tests in a fixed order until the class is identified. (3) For both problems, we present experimental results demonstrating the practical efficacy of our algorithms. We find that a few rounds of adaptivity suffice to obtain high-quality solutions in practice. In the second part of the thesis, we study the K-armed dueling bandits problem, which has applications in a wide-variety of domains like search ranking, recommendation systems and sports ranking where eliciting qualitative feedback is easy while real-valued feedback is not easily interpretable; thus, it has been a popular topic of research in the machine learning community. Previous works have only focused on the sequential setting where the policy adapts after every comparison. However, in many applications such as search ranking and recommendation systems, it is preferable to perform comparisons in a limited number of {parallel} batches. We {introduce} and study the {batched dueling bandits} problem under two standard settings: (i) existence of a Condorcet winner, and (ii) strong stochastic transitivity and stochastic triangle inequality. For both settings, we obtain algorithms with a smooth trade-off between the number of batches and regret. We complement our regret analysis with a nearly-matching lower bound. Finally, in experiments over a variety of real-world datasets, we observe that our algorithm using only a logarithmic number of rounds achieves almost the same performance as fully sequential algorithms (that use T rounds).
dc.language.iso	en_US
dc.subject	stochastic optimization
dc.subject	submodularity
dc.subject	approximation algorithms
dc.subject	adaptivity gaps
dc.subject	dueling bandits
dc.title	The Power of Adaptivity for Decision-Making under Uncertainty
dc.type	Thesis
dc.description.thesisdegreename	PhD	en_US
dc.description.thesisdegreediscipline	Industrial & Operations Engineering
dc.description.thesisdegreegrantor	University of Michigan, Horace H. Rackham School of Graduate Studies
dc.contributor.committeemember	Nagarajan, Viswanath
dc.contributor.committeemember	Bansal, Nikhil
dc.contributor.committeemember	Chao, Xiuli
dc.contributor.committeemember	Lee, Jon
dc.contributor.committeemember	Shi, Cong
dc.subject.hlbsecondlevel	Industrial and Operations Engineering
dc.subject.hlbtoplevel	Engineering
dc.description.bitstreamurl	http://deepblue.lib.umich.edu/bitstream/2027.42/176657/1/rghuge_1.pdf
dc.identifier.doi	https://dx.doi.org/10.7302/7506
dc.identifier.orcid	0000-0002-4681-9121
dc.identifier.name-orcid	Ghuge, Rohan; 0000-0002-4681-9121	en_US
dc.working.doi	10.7302/7506	en
dc.owningcollname	Dissertations and Theses (Ph.D. and Master's)

Files in this item

Name:: rghuge_1.pdf
Size:: 12.36MB
Format:: PDF

View/Open

Dissertations and Theses (Ph.D. and Master's)

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.