Spatial-Semantic 3D Robot Perception with Computational Symmetries

Zhang, Ray

Spatial-Semantic 3D Robot Perception with Computational Symmetries

dc.contributor.author	Zhang, Ray
dc.date.accessioned	2024-05-22T17:24:07Z
dc.date.available	2024-05-22T17:24:07Z
dc.date.issued	2024
dc.date.submitted	2024
dc.identifier.uri	https://hdl.handle.net/2027.42/193315
dc.description.abstract	Robot localization and mapping is a process where a robot constructs a global spatial model of a scene based on multiple local observations. It is a fundamental problem supporting other components of robot autonomy, such as planning and navigation. In this thesis, we are concerned with building a robust and semantic-aware framework of robot perception. With the help of modern vision sensors, recent perception techniques are capable of producing fast and accurate estimations for robot positions and 3D geometric structures under feature-rich environments. In perceptually degraded situations, mainstream algorithms' dependencies on feature quality and scene structure are no longer valid. Besides, the advances of deep neural networks provide ways of learning point-wise semantic information. However, these data-driven methods require an enormous amount of labeled data to capture the underlying distribution of the practical real world. Furthermore, effective strategies for fusing the available predictions from upstream machine learning algorithms into a consistent and efficient spatial-semantic environment model remain an ongoing challenge. To address these problems, we propose leveraging the tools from the reproducing kernel Hilbert space and the equivariance learning theory based on the symmetry of input data to construct a continuous and functional representation of the sensor observations. This representation enables a novel formulation of the localization problem that is robust towards sensor noise and outliers. Furthermore, this representation can tightly couple the spatial-semantic relations for different perception tasks. Specifically, in Chapter 4, we detail the functional and equivariant formulation that integrates geometric and semantic measurements and then introduce a frame-to-frame sensor registration framework. It outperforms the existing frame-to-frame registration methods in accuracy and robustness. Chapter 5 extends the two-frame registration to a multi-frame bundle adjustment formulation. It demonstrates the potential in local and large-scale robot trajectory estimations, with robust performances in feature-less and outlier-rich scenarios. Finally, Chapter 6 presents an unsupervised equivariance learning framework for learning the semantic features of the above sensor representation, which excels in environments with limited ground truth data. It offers robust adaptation and improved resilience against outliers or mismatches in simulated and unseen real-world datasets. The aforementioned contributions enhance the data efficiency, generalization, semantic understanding, and robustness of robot perception techniques within the demanding context of perceptually degraded applications.
dc.language.iso	en_US
dc.subject	SLAM
dc.subject	Sensor Registration
dc.subject	Deep Learning
dc.subject	Robot Perception
dc.subject	Semantic Perception
dc.title	Spatial-Semantic 3D Robot Perception with Computational Symmetries
dc.type	Thesis
dc.description.thesisdegreename	PhD
dc.description.thesisdegreediscipline	Robotics
dc.description.thesisdegreegrantor	University of Michigan, Horace H. Rackham School of Graduate Studies
dc.contributor.committeemember	Eustice, Ryan M
dc.contributor.committeemember	Ghaffari Jadidi, Maani
dc.contributor.committeemember	Grizzle, Jessy W
dc.contributor.committeemember	Fazeli, Nima
dc.contributor.committeemember	Vasudevan, Ram
dc.subject.hlbsecondlevel	Computer Science
dc.subject.hlbtoplevel	Engineering
dc.contributor.affiliationumcampus	Ann Arbor
dc.description.bitstreamurl	http://deepblue.lib.umich.edu/bitstream/2027.42/193315/1/rzh_1.pdf
dc.identifier.doi	https://dx.doi.org/10.7302/22960
dc.identifier.orcid	0000-0001-9599-931X
dc.identifier.name-orcid	Zhang, Ray; 0000-0001-9599-931X	en_US
dc.working.doi	10.7302/22960	en
dc.owningcollname	Dissertations and Theses (Ph.D. and Master's)

Files in this item

Name:: rzh_1.pdf
Size:: 26.67MB
Format:: PDF

View/Open

Dissertations and Theses (Ph.D. and Master's)

Show simple item record

Remediation of Harmful Language

The University of Michigan Library aims to describe its collections in a way that respects the people and communities who create, use, and are represented in them. We encourage you to Contact Us anonymously if you encounter harmful or problematic language in catalog records or finding aids. More information about our policies and practices is available at Remediation of Harmful Language.

Accessibility

If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.