Work Description

Title: SID: Stereo Image Dataset for Autonomous Driving in Adverse Conditions Open Access Deposited

h
Attribute Value
Methodology
  • The Stereo Image Dataset (SID) was recorded using the ZED stereo camera mounted on a vehicle, capturing images at a 720p resolution with a frame rate of 20Hz under varying weather conditions and times of day. The camera was occasionally positioned below the windshield during heavy rain to protect it from direct exposure to raindrops. A fixed route within the University of Michigan–Dearborn campus and other urban and residential areas provided a diverse range of environments. The camera system produced stereo image pairs with factory calibration for depth estimation. Custom data logging software was created using the ZED camera API, initially storing images in BMP format and later converting them to PNG for efficient data storage. Metadata annotations, including weather conditions, time of day, and road status, were manually verified and documented post-capture, offering detailed context for each image sequence.
Description
  • The SID dataset was curated to support advanced research in autonomous driving systems, particularly focusing on perception under adverse weather and lighting conditions. This dataset encompasses over 178k high-resolution stereo image pairs organized into 27 sequences, reflecting a rich variety of conditions such as snow, rain, fog, and low light. It covers dynamic changes in driving scenarios and environmental backgrounds, including university campuses, residential streets, and urban settings. The dataset is designed to challenge perception algorithms with scenarios such as partially obscured camera lenses and varying visibility, promoting the development of robust computer vision models. No specialized software or scripts are necessary for accessing the image data, as the files are provided in standard PNG format. However, researchers and developers may require their image processing and computer vision toolkits to utilize the dataset effectively in their work.
Creator
Creator ORCID
  • 0000-0001-9518-2828; 0009-0004-8323-7501; N/A; 0000-0002-0576-2720; 0000-0002-4031-3584; 0000-0002-0795-1429; , 0000-0002-3473-6978
Depositor
  • zelshair@umich.edu
Contact information
Discipline
Keyword
Citations to related material
  • El-Shair, Z.A., Abu-raddaha, A., Cofield, A., Alawneh, H., Aladem, M., Hamzeh, Y., and Rawashdeh, S.A., 2024. SID: Stereo Image Dataset for Autonomous Driving in Adverse Conditions. In 2024 IEEE National Aerospace and Electronics Conference (NAECON). IEEE. In press.
Resource type
Last modified
  • 07/03/2024
Published
  • 07/03/2024
DOI
  • https://doi.org/10.7302/esz6-nv83
License
To Cite this Work:
El Shair, Z. A., Abu-raddaha, A., Cofield, A., Alawneh, H., Aladem, M., Hamzeh, Y., Rawashdeh, S. A. (2024). SID: Stereo Image Dataset for Autonomous Driving in Adverse Conditions [Data set], University of Michigan - Deep Blue Data. https://doi.org/10.7302/esz6-nv83

Relationships

This work is not a member of any user collections.

Files (Count: 7; Size: 194 GB)

Date: 3 July, 2024

Dataset Title: SID: Stereo Image Dataset for Autonomous Driving in Adverse Conditions

Dataset Creators: Z.A. El-Shair, A. Abu-raddaha, A. Cofield, H. Alawneh, M. Aladem, Y. Hamzeh, & S.A. Rawashdeh

Dataset Contact: Zaid A. El-Shair zelshair@umich.edu

Funding: This project was supported by a research initiation and development grant from the office of research and sponsored programs at the University of Michigan-Dearborn.

Key Points:
- Large-scale stereo image dataset tailored for autonomous driving perception algorithms.
- Captures a wide range of adverse weather and complex lighting conditions.
- Offers 27 sequences totaling over 178k stereo image pairs, each accompanied by comprehensive sequence-level that include weather conditions, time of day, road conditions, camera soiling, and lens obstructions.

Research Overview:
SID was created to facilitate the research and development of robust perception algorithms for autonomous driving under challenging conditions. The ZED stereo camera mounted on a vehicle captured diverse real-world scenarios, including snow, rain, fog, and nighttime driving across different urban and residential locations. Proper annotations provide metadata, such as weather conditions, time of day, and road conditions, to accompany the high-resolution stereo images. This dataset addresses the need for advanced models to operate consistently and safely across varied geographic and environmental settings.

Methodology:
Data were collected using a ZED stereo camera at a 20 Hz frame rate with 720p resolution. Various routing environments, such as university campuses, residential areas, and city streets, were included in the collection. The camera system produced stereo pairs with factory calibration for depth estimation purposes. Images were initially stored as BMP files before being converted to PNG format for improved storage efficiency. Collected during differing weather conditions and times of day, the dataset includes sequence-level annotations to augment model training and testing effectiveness.

Files contained here:
- The dataset is organized into 5 directories: Clear/, Cloudy/, Overcast/, Rain/, Snow/. These correspond to unique weather conditions under which the data were captured.

- Within these directories, subdirectories reflect individual sequences following the naming convention:
Location_Weather_TimeOfDay_SequenceNo/

- Each sequence folder includes two subfolders for the respective camera images:
- Left/: Contains images captured by the left camera.
- Right/: Contains images captured by the right camera.

Image Naming Format:
- Each image in the dataset has a unique filename that allows users to easily differentiate between left and right camera images and their associated sequences, formatted as follows:
SequenceNo_Camera{L/R}_FrameNumber.png

Data Categorization:
- Weather Conditions: Clear, Cloudy, Overcast, Rain, Snow.
- Times of Day: Day, Night, Dusk.
- Road Conditions: Dry, Wet, Snowy, and others.
- Locations: University campus drives (clockwise and counter-clockwise), residential areas, and city settings.

Training/Testing Categorization:
- Sequences are designated as 'Train' or 'Test' to facilitate standardized machine learning development and evaluation. The dataset is divided as follows:
- Train (20 sequences): 1, 3, 4, 6, 7, 8, 9, 10, 13, 14, 15, 16, 17, 18, 19, 21, 23, 25, 26, 27
- Test (7 sequences): 2, 5, 11, 12, 20, 22, 24

Note: Detailed statistical information for each sequence of the Stereo Image Dataset (SID), such as weather conditions, time of day, sequence location, road conditions, number of image pairs, and lens soiling annotations, is compiled in the "SID_Sequence_Metadata.csv" metadata file. Additionally, the file includes the categorization of sequences into 'Training' and 'Testing' sets, aiding in the dataset's practical application for developing and benchmarking perception algorithms. For comprehensive context and an in-depth understanding of the dataset, please refer to the related publication manuscript outlined below.

Related publication(s):
- El-Shair, Z.A., Abu-raddaha, A., Cofield, A., Alawneh, H., Aladem, M., Hamzeh, Y., and Rawashdeh, S.A., 2024. SID: Stereo Image Dataset for Autonomous Driving in Adverse Conditions. In 2024 IEEE National Aerospace and Electronics Conference (NAECON). IEEE. In press.

Use and Access:
This dataset is made available under Creative Commons Attribution 4.0 International (CC BY 4.0) and is freely available for research usage.

To Cite Data:
- El-Shair, Z.A., Abu-raddaha, A., Cofield, A., Alawneh, H., Aladem, M., Hamzeh, Y., and Rawashdeh, S.A. (2024). SID: Stereo Image Dataset for Autonomous Driving in Adverse Conditions [Data set]. University of Michigan - Deep Blue.
- El-Shair, Z.A., Abu-raddaha, A., Cofield, A., Alawneh, H., Aladem, M., Hamzeh, Y., and Rawashdeh, S.A., 2024. SID: Stereo Image Dataset for Autonomous Driving in Adverse Conditions. In 2024 IEEE National Aerospace and Electronics Conference (NAECON). IEEE. In press.

Download All Files (To download individual files, select them in the “Files” panel above)

Total work file size of 194 GB is too large to download directly. Consider using Globus (see below).



Best for data sets > 3 GB. Globus is the platform Deep Blue Data uses to make large data sets available.   More about Globus

Remediation of Harmful Language

The University of Michigan Library aims to describe library materials in a way that respects the people and communities who create, use, and are represented in our collections. Report harmful or offensive language in catalog records, finding aids, or elsewhere in our collections anonymously through our metadata feedback form. More information at Remediation of Harmful Language.