Show simple item record

Classification via Multiple Hyperplanes: Loss functions, Overparametrization, and Interpolation

dc.contributor.authorWang, Yutong
dc.date.accessioned2022-09-06T16:06:03Z
dc.date.available2022-09-06T16:06:03Z
dc.date.issued2022
dc.date.submitted2022
dc.identifier.urihttps://hdl.handle.net/2027.42/174328
dc.description.abstractMany well-established classification algorithms such as support vector machines (SVM) are originally proposed as large-margin classifiers from a single hyperplane. This dissertation is divided into two halves, each half studying classification from the perspective of using multiple hyperplanes. The first half introduces a new framework for multiclass loss functions called the permutation-equivariant and relative margin-based (PERM) losses, inspired by multiclass classification with multiple hyperplanes. Using our framework, we establish statistical and optimization results on Weston-Watkins multiclass SVMs. Furthermore, we provide sufficient conditions for the classification-calibration of a general family of PERM losses. These sufficient conditions subsume all previously known and establish new classification-calibration results. The second half focuses on hyperplane arrangement classifiers (HACs). When implemented as neural networks, we show that the HACs can be overparameterized yet still have small VC dimensions and further achieve minimax optimality (assuming the empirical risk minimization can be solved to optimality). By using an ensemble of randomly initialized HACs, we demonstrate for the first time an interpolating ensemble method that is consistent for a broad class of distributions in arbitrary dimensions. We discuss the significance of these results in the context of recent advances in the theory of overparameterized learning.
dc.language.isoen_US
dc.subjectmachine learning
dc.subjectstatistical learning theory
dc.subjectsupport vector machines
dc.subjectneural networks
dc.subjectensemble methods
dc.titleClassification via Multiple Hyperplanes: Loss functions, Overparametrization, and Interpolation
dc.typeThesis
dc.description.thesisdegreenamePhDen_US
dc.description.thesisdegreedisciplineElectrical and Computer Engineering
dc.description.thesisdegreegrantorUniversity of Michigan, Horace H. Rackham School of Graduate Studies
dc.contributor.committeememberScott, Clayton D
dc.contributor.committeememberTewari, Ambuj
dc.contributor.committeememberBalzano, Laura
dc.contributor.committeememberQu, Qing
dc.subject.hlbsecondlevelElectrical Engineering
dc.subject.hlbtoplevelEngineering
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/174328/1/yutongw_1.pdf
dc.identifier.doihttps://dx.doi.org/10.7302/6059
dc.identifier.orcid0000-0001-7472-6750
dc.identifier.name-orcidWang, Yutong; 0000-0001-7472-6750en_US
dc.working.doi10.7302/6059en
dc.owningcollnameDissertations and Theses (Ph.D. and Master's)


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.