Transformers for Object Detection: A Comparative Study on a Large-Scale Autonomous Driving Dataset
dc.contributor.author | Jian, Yu-Min | |
dc.contributor.author | Shrivastava, Shubham | |
dc.contributor.author | Parchami, Armin | |
dc.contributor.author | Chakravarty, Punarjay | |
dc.contributor.author | Skinner, Katherine A. | |
dc.contributor.author | Du, Xiaoxiao | |
dc.date.accessioned | 2024-10-21T20:46:51Z | |
dc.date.available | 2024-10-21T20:46:51Z | |
dc.date.issued | 2023 | |
dc.identifier.uri | https://hdl.handle.net/2027.42/195299 | en |
dc.description.abstract | Recently, the transformer networks have emerged as a powerful tool for 2-D object detection from RGB camera images. This paper provides a comparative study on the performance of representative, state-of-the-art transformer-based object detection methods, including the Detection Transformer (DETR) and its variants, and demonstrates their performance and benchmark results on a new large-scale, real-world autonomous driving dataset. This paper also provides ablation studies on the effects of pre-training, post-processing, and cross-validation schemes on 2D object detection performance of transformer networks, which can be used to guide the selection and construction of transformer-based object detection algorithms based on their detection effectiveness and efficiency. | en_US |
dc.description.sponsorship | This work was supported by a grant from Ford Motor Company via the Ford-UM Alliance under award N028603. | en_US |
dc.language.iso | en_US | en_US |
dc.subject | transformer | en_US |
dc.subject | neural network | en_US |
dc.subject | object detection | en_US |
dc.subject | autonomous driving | en_US |
dc.subject | deep learning | en_US |
dc.subject | robot perception | en_US |
dc.title | Transformers for Object Detection: A Comparative Study on a Large-Scale Autonomous Driving Dataset | en_US |
dc.type | Technical Report | en_US |
dc.subject.hlbsecondlevel | Robotics | |
dc.subject.hlbtoplevel | Engineering | |
dc.contributor.affiliationother | Ford Motor Company | en_US |
dc.contributor.affiliationumcampus | Ann Arbor | en_US |
dc.description.bitstreamurl | http://deepblue.lib.umich.edu/bitstream/2027.42/195299/1/2023_iros_DETR__Copy_DeepBlue_Version_2024_ (1).pdf | |
dc.identifier.doi | https://dx.doi.org/10.7302/24495 | |
dc.identifier.orcid | https://orcid.org/0000-0003-1504-966X | en_US |
dc.description.depositor | SELF | en_US |
dc.identifier.name-orcid | Du, Xiaoxiao; 0000-0003-1504-966X | en_US |
dc.working.doi | 10.7302/24495 | en_US |
dc.owningcollname | Robotics, Department of |
Files in this item
Remediation of Harmful Language
The University of Michigan Library aims to describe its collections in a way that respects the people and communities who create, use, and are represented in them. We encourage you to Contact Us anonymously if you encounter harmful or problematic language in catalog records or finding aids. More information about our policies and practices is available at Remediation of Harmful Language.
Accessibility
If you are unable to use this file in its current format, please select the Contact Us link and we can modify it to make it more accessible to you.