Show simple item record

Visual Query Optimization: Algorithms and Software Systems

dc.contributor.authorZekany, Stephen
dc.date.accessioned2023-09-22T15:46:12Z
dc.date.available2023-09-22T15:46:12Z
dc.date.issued2023
dc.date.submitted2023
dc.identifier.urihttps://hdl.handle.net/2027.42/178124
dc.description.abstractThe development of inexpensive digital cameras and availability of local and cloud data storage has lead to incredible amounts of digital video production. Meanwhile, the amount of compute used for frontier machine learning workloads is growing even faster than Moore's Law. In this thesis, I focus on the problem of visual analytics for dashboard camera video. Novel algorithms and systems which make large-scale video search tractable are highly important for autonomous vehicle development, which requires curating a library of video of the vehicle performing in real-world conditions. I first show and evaluate techniques to reconstruct, classify, and index vehicle maneuvers for dashboard camera video, and then evaluate search techniques for visual queries which are independent of these algorithms. First, I present our work describing a novel approach for detecting vehicle maneuvers from monocular dash-cam video building by leveraging deep learning techniques to estimate frame-accurate ego-vehicle movement and techniques to search video efficiently for disjoint computer vision operations. We show a technique to process these motion estimates into a vehicle trajectory, and compare trajectory segments to the reference maneuvers. We then extend the idea of motion estimation for maneuver detection to lane changes, highway events, and acceleration. We discuss and evaluate a ``min pool'' approach to allow each classifier to select the closest match between several reference maneuvers, which we find outperforms a single reference maneuver classification approach. We then build a classifier pipeline to operate the individual classifiers and prioritize detected maneuvers, and show statistical results. Then, I present work on an end-to-end software system to process human-semantic queries for dashboard camera video among arbitrary visual processing techniques and models. We adapt computer vision models for use in an existing distributed video processing framework by writing API functionality for object detection, semantic segmentation, depth, and optical flow. We design and demonstrate a query parser to dynamically parse, plan, and execute visual queries. This infrastructure then allows us to focus on how to do cost planning for these visual analytics queries by adapting database cost planning metrics to the use of computer vision techniques.
dc.language.isoen_US
dc.subjectVisual query optimization
dc.subjectVideo search
dc.titleVisual Query Optimization: Algorithms and Software Systems
dc.typeThesis
dc.description.thesisdegreenamePhDen_US
dc.description.thesisdegreedisciplineComputer Science & Engineering
dc.description.thesisdegreegrantorUniversity of Michigan, Horace H. Rackham School of Graduate Studies
dc.contributor.committeememberDreslinski Jr, Ronald
dc.contributor.committeememberWenisch, Thomas F
dc.contributor.committeememberWentzloff, David D
dc.contributor.committeememberMudge, Trevor N
dc.subject.hlbsecondlevelComputer Science
dc.subject.hlbtoplevelEngineering
dc.description.bitstreamurlhttp://deepblue.lib.umich.edu/bitstream/2027.42/178124/1/szekany_1.pdf
dc.identifier.doihttps://dx.doi.org/10.7302/8581
dc.identifier.orcid0000-0003-1914-9318
dc.identifier.name-orcidZekany, Stephen; 0000-0003-1914-9318en_US
dc.working.doi10.7302/8581en
dc.owningcollnameDissertations and Theses (Ph.D. and Master's)


Files in this item

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.