Work Description

Title: Novel Sensors for Autonomous Vehicle Perception - R0_FA0_sample (Open Access, Deposited)

Methodology
  • This sequence, R0_FA0_sample (Route 0, Forward Afternoon Sequence 0), was collected by driving the Ford Fusion platform along a mostly straight 7.4 km route in a moderate-density urban environment. The route was traveled from southeast to northwest in the afternoon on a mostly sunny day.
Description
  • This dataset is part of a collection released in support of an IROS 2023 workshop publication, with a supporting website (https://sites.google.com/umich.edu/novelsensors2023). To enable new research in the area of novel sensors for autonomous vehicles, these datasets are designed for the task of place recognition with novel sensors. To our knowledge, this dataset is the first to include stereo thermal cameras together with stereo event cameras and stereo monochrome cameras, which perform better in low light than RGB cameras.

  • The dataset collection platform is a Ford Fusion vehicle with a roof-mounted sensing suite, which consists of forward-facing stereo uncooled thermal cameras (FLIR Boson 640+ ADK), event cameras (iniVation DVXplorer), monochrome cameras (FLIR BFS-PGE-16S2M), and RGB cameras (FLIR BFS-PGE-50S5C), aligned with ground truth position from a high-precision navigation system. Sequences cover routes of roughly 10 km, which may be driven repeatedly under varying lighting conditions, and feature instances of direct sunlight and low light that challenge conventional cameras.

  • A software toolkit is provided to facilitate efficient use of the dataset, including dataset download, application of calibration parameters, and evaluation of place recognition results with standard metrics (e.g., maximum recall at 100% precision). These software tools for converting, managing, and viewing data files can be found in the associated GitHub repository (https://github.com/umautobots/nsavp_tools).
Creator
Depositor
  • adbuchan@umich.edu
Contact information
Discipline
Funding agency
  • Other Funding Agency
Other Funding agency
  • Ford Motor Company
ORSP grant number
  • 19-PAF06749
Keyword
Date coverage
  • 2023-05-11
Citations to related material
Resource type
Last modified
  • 06/09/2023
Published
  • 06/09/2023
DOI
  • https://doi.org/10.7302/4az9-pb50
License
To Cite this Work:
Skinner, K. A., Vasudevan, R., Ramanagopal, M. S., Ravi, R., Carmichael, S., Buchan, A. D. (2023). Novel Sensors for Autonomous Vehicle Perception - R0_FA0_sample [Data set], University of Michigan - Deep Blue Data. https://doi.org/10.7302/4az9-pb50

Files (Count: 12; Size: 253 GB)

# Novel Sensors for Autonomous Vehicle Perception - R0_FA0_sample

This is the ReadMe document for the R0_FA0_sample dataset, which is part of the Novel Sensors for Autonomous Vehicle Perception (NSAVP) collection.

This dataset and its supporting documentation were published on June 12, 2023.

# Authors

Katherine A. (Katie) Skinner (Point of Contact) {kskin@umich.edu}

Ram Vasudevan {ramv@umich.edu}

Manikandasriram Srinivasan Ramanagopal {srmani@umich.edu}

Radhika Ravi {rradhika@umich.edu}

Spencer Carmichael {specarmi@umich.edu}

Austin Buchan {adbuchan@umich.edu}

# Sequence Description

This sequence, R0_FA0_sample (Route 0, Forward Afternoon Sequence 0), was collected by driving the Ford Fusion platform along a mostly straight 7.4 km route in a moderate-density urban environment. The route was traveled from southeast to northwest in the afternoon on a mostly sunny day.

# Resources

- [IROS 2023 Workshop Website](https://sites.google.com/umich.edu/novelsensors2023)
- [GitHub Repository for NSAVP software tools](https://github.com/umautobots/nsavp_tools)
- [Deep Blue Data NSAVP Collection](https://deepblue.lib.umich.edu/data/collections/v118rf157)

# Sensors

The sensors utilized in the NSAVP dataset (with the exception of the Applanix POS LV 420 secondary antenna, IMU, and wheel encoder) are pairs of identical-modality sensors (monochrome, RGB, thermal, event) in a left-right symmetric mirrored arrangement on the roof of the vehicle. All imagers are front-facing. The `_measured_extrinsics.yaml` file provides a hand-measured estimate of their locations relative to the vehicle body frame, accurate to within 1 cm and ~5 degrees.

An overview of the stereo cameras included in the NSAVP dataset is given in the table below:

| Type | Camera Model | Lens Model | Stereo Baseline (meters) | FOV* (H x V) (degrees) | Resolution (W x H) | Rate (Hz) | Bit Depth (Bits) | Frame Readout |
|------------------|---------------------|-------------------------|--------------------------|------------------------|--------------------|-----------|------------------|---------------|
| Monochrome | FLIR BFS-PGE-16S2M | Computar A4Z2812CS-MPIR | 0.90 | 70 x 55 | 1440 x 1080 | 20.14 | 8 | Global |
| RGB | FLIR BFS-PGE-50S5C | Fujinon HF6XA-5M | 1.00 | 70 x 60 | 2448 x 2048 | 20.14 | 8 | Global |
| Uncooled Thermal | FLIR ADK USB-C | NA (Integrated) | 0.64 | 50 x 40 | 640 x 512 | 60.42 | 16 | Rolling |
| Event (DVS) | Inivation DVXplorer | NA (Included) | 1.00 | 70 x 50 | 640 x 480 | NA | NA | NA |

The monochrome and event camera lenses are varifocal, and may therefore be adjusted differently between sequences. Due to these adjustments, variations in the effective FOV across rectification methods, and differences in intrinsic parameters within stereo pairs, the listed FOVs are only approximate.

# Dataset Files

All sensor data is provided in the [HDF5](https://www.hdfgroup.org/solutions/hdf5/) file format (abbreviated H5). For an introduction to the H5 file format, we recommend [this video series](https://youtube.com/playlist?list=PLPyhR4PdEeGYWHRhzmCP5stzfIha8bqVg).

Extrinsic and intrinsic calibration information is provided as [YAML](https://yaml.org/) files.

A summary of the files provided for each sequence is given in the table below:

| Filename | Description | Sensor Model |
|-------------------------------------------------------------------|------------------------------|---------------------|
| `{sequence_prefix}_adk_left.h5` | Left Uncooled Thermal | FLIR ADK USB-C |
| `{sequence_prefix}_adk_right.h5` | Right Uncooled Thermal | FLIR ADK USB-C |
| `{sequence_prefix}_mono_left.h5` | Left Monochrome | FLIR BFS-PGE-16S2M |
| `{sequence_prefix}_mono_right.h5` | Right Monochrome | FLIR BFS-PGE-16S2M |
| `{sequence_prefix}_rgb_left.h5` | Left RGB | FLIR BFS-PGE-50S5C |
| `{sequence_prefix}_rgb_right.h5` | Right RGB | FLIR BFS-PGE-50S5C |
| `{sequence_prefix}_dvxplorer_left.h5` | Left Event (DVS) | Inivation DVXplorer |
| `{sequence_prefix}_dvxplorer_right.h5` | Right Event (DVS) | Inivation DVXplorer |
| `{sequence_prefix}_applanix.h5`                                    | Ground Truth Pose            | Applanix POS LV 420 |
| `{sequence_prefix}_{calibration_prefix}_calibration_results.yaml` | Camera Calibration Results | NA |
| `{sequence_prefix}_{measured_prefix}_measured_extrinsics.yaml` | Manually Measured Extrinsics | NA |

The `{sequence_prefix}` is a unique identifier for the sequence, e.g. `R0_FA0`, which contains the following components:
- `R{#}`: Route index, denoting the route driven, e.g. `R0`
- `{direction_symbol}{lighting_condition_symbol}{subcategory_index}`: Descriptor, e.g. `FA0`, with the following subcomponents:
  - `{direction_symbol}`: Direction symbol, denoting the direction driven along the route, which takes the following values:
    - `F`: Forward direction
    - `R`: Reverse direction
    - `D`: Forward direction with divergences from the route
  - `{lighting_condition_symbol}`: Lighting condition symbol, denoting the lighting condition the route was driven under, which takes the following values:
    - `A`: Afternoon
    - `S`: Sunset
    - `N`: Night
  - `{subcategory_index}`: Subcategory index, used to denote multiple sequences driven on the same route, in the same direction, and under the same lighting condition, i.e. an index that counts within the subcategory defined by the route index, direction symbol, and lighting condition symbol

The initially uploaded sample sequences are additionally denoted with a `_sample` suffix in the `{sequence_prefix}`.

The `{calibration_prefix}`, e.g. `C0`, and `{measured_prefix}`, e.g. `M0`, contain indices denoting which calibration sequence / measurement session the values were derived from.
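
As an illustration of this naming convention, the following Python sketch parses a sequence prefix into its components. The helper and its regular expression are hypothetical and are not part of the released software tools.

```python
import re

# Hypothetical parser mirroring the naming convention described above,
# e.g. "R0_FA0" or "R0_FA0_sample".
PREFIX_PATTERN = re.compile(
    r"^R(?P<route>\d+)_"
    r"(?P<direction>[FRD])(?P<lighting>[ASN])(?P<subcategory>\d+)"
    r"(?P<sample>_sample)?$"
)

def parse_sequence_prefix(prefix):
    """Return the components of a sequence prefix as a dictionary."""
    match = PREFIX_PATTERN.match(prefix)
    if match is None:
        raise ValueError(f"Unrecognized sequence prefix: {prefix}")
    return {
        "route_index": int(match.group("route")),
        "direction": {"F": "forward", "R": "reverse",
                      "D": "forward with divergences"}[match.group("direction")],
        "lighting": {"A": "afternoon", "S": "sunset",
                     "N": "night"}[match.group("lighting")],
        "subcategory_index": int(match.group("subcategory")),
        "is_sample": match.group("sample") is not None,
    }

print(parse_sequence_prefix("R0_FA0_sample"))
```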

## 1. General Sensor Data H5 Format

All sensor data was originally collected as a ROS1 rosbag and was converted to H5 files using the following strategy:
- An H5 file is created for each topic's namespace, with forward slashes replaced by underscores in nested namespaces (e.g. `/mono_left/image_raw` is written to `mono_left.h5` and `/adk/left/image_raw` is written to `adk_left.h5`).
- If available, sensor configuration information is attached to the H5 root group `/` as H5 scalar attributes.
- H5 groups are created for each topic's name (e.g. the topic `/mono_left/image_raw` is written to the H5 group `/image_raw` in the `mono_left.h5` file) and the message type of the topic is attached as an H5 scalar attribute to this group (e.g. the H5 group `/image_raw` has an H5 attribute named `ros_message_type` with value `sensor_msgs/Image`).
- Constant message fields are written as H5 scalar attributes attached to their corresponding H5 group (e.g. the `height` field of a `sensor_msgs/Image` message on the topic `/mono_left/image_raw` is written as an H5 attribute named `height` attached to the `/image_raw` H5 group).
- Variable message fields are written as separate H5 datasets in their corresponding H5 groups using descriptive names that do not necessarily match the message field name (e.g. the `data` field of a `sensor_msgs/Image` message on the topic `/mono_left/image_raw` is written to the H5 dataset `/image_raw/images`).
- Timestamps are written in nanoseconds as unsigned 64-bit integer H5 datasets.
- Where applicable, units or descriptors are written as H5 scalar attributes attached to H5 datasets (e.g. the H5 dataset `/image_raw/timestamps` will have an attribute named `units` with value `nanoseconds`).
- The first dimension of each H5 dataset indexes across messages and chunking is performed along the first dimension only. Additionally, any two chunked datasets from the same message have the same chunk size along the first dimension such that data from all message fields can be read in as full chunks.

We recommend using the [HDFView GUI](https://www.hdfgroup.org/downloads/hdfview/) to view the contents of the H5 files. The command-line tools `h5ls` and `h5dump` can also be used to print the contents of the H5 files.
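
As a brief illustration of this layout, the following Python sketch walks one of the H5 files with [h5py](https://www.h5py.org/). The filename is built from the file summary table above, and the group and dataset names follow the per-sensor descriptions in Section 2; this is an illustrative sketch, not part of the released tools.

```python
import h5py

# Minimal sketch of inspecting one of the H5 files with h5py.
with h5py.File("R0_FA0_sample_mono_left.h5", "r") as f:
    # Sensor configuration is attached to the root group as scalar attributes.
    for name, value in f["/"].attrs.items():
        print(f"config: {name} = {value}")

    # Each topic becomes a group; its ROS message type is stored as an attribute.
    image_raw = f["/image_raw"]
    print("message type:", image_raw.attrs["ros_message_type"])

    # Variable message fields are datasets; the first dimension indexes messages.
    images = image_raw["images"]          # (messages x rows x columns)
    timestamps = image_raw["timestamps"]  # unsigned 64-bit integers
    print("number of images:", images.shape[0])
    print("timestamp units:", timestamps.attrs["units"])

    # Reading a single image only loads the chunks it touches.
    first_image = images[0]
```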

## 2. Sensor Specific Data Format and Characteristics

### 2.1. Uncooled Thermal

The uncooled thermal camera data files, `*_adk_left.h5` and `*_adk_right.h5`, contain the following:
- `image_raw`: A group, derived from [sensor_msgs/Image](http://docs.ros.org/en/noetic/api/sensor_msgs/html/msg/Image.html) messages, including the following:
  - `height`, `width`, `encoding`, `is_bigendian`, `step`: Attributes for constant message fields.
  - `images`: A three dimensional dataset (messages x rows x columns) of raw (distorted) thermal images. Note that the firmware version we use with our ADK cameras embeds telemetry information in the first four pixels of the first row of each image. After extracting this information we overwrite the first four pixels with the value of the fifth pixel.
  - `timestamps`: A one dimensional dataset of TAI timestamps in nanoseconds.
- `image_meta`: A group, derived from `image_meta_msgs/ImageMeta` messages (a custom message type), including multiple one dimensional datasets of various image metadata parameters. In particular, this includes:
  - `driver_timestamps`: A one dimensional dataset of TAI timestamps in nanoseconds. Note that each timestamp matches exactly the timestamp of the image the metadata corresponds to, but matches may not always exist due to some corresponding messages being dropped on one topic but not the other.
  - `ffc_flags`: A one dimensional dataset of hexadecimal codes in which the least significant digit indicates whether a flat field correction (FFC) is not being performed (0x0000), imminent (0x0001), in progress (0x0002), or complete (0x0003). Note that the thermal images are not valid during an FFC.
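
For example, the `ffc_flags` dataset can be used to drop thermal frames captured during an FFC. The following Python sketch matches metadata to images by timestamp and keeps only frames for which no FFC is reported; it assumes the flags are stored as integer codes, and the filename follows the file summary table.

```python
import h5py
import numpy as np

with h5py.File("R0_FA0_sample_adk_left.h5", "r") as f:
    image_timestamps = f["/image_raw/timestamps"][:]
    meta_timestamps = f["/image_meta/driver_timestamps"][:]
    ffc_flags = f["/image_meta/ffc_flags"][:]  # assumed integer codes

# Keep only metadata entries whose timestamps exactly match an image,
# since a message may occasionally be dropped on one topic but not the other.
_, image_indices, meta_indices = np.intersect1d(
    image_timestamps, meta_timestamps, return_indices=True
)

# The least significant hexadecimal digit is 0x0 when no FFC is reported.
ffc_inactive = (ffc_flags[meta_indices] & 0xF) == 0x0
valid_image_indices = image_indices[ffc_inactive]
print(f"{valid_image_indices.size} of {image_timestamps.size} images are outside an FFC")
```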

### 2.2. Monochrome

The monochrome camera data files, `*_mono_left.h5` and `*_mono_right.h5`, contain the following:
- `image_raw`: A group, derived from [sensor_msgs/Image](http://docs.ros.org/en/noetic/api/sensor_msgs/html/msg/Image.html) messages, including the following:
  - `height`, `width`, `encoding`, `is_bigendian`, `step`: Attributes for constant message fields.
  - `images`: A three dimensional dataset (messages x rows x columns) of raw (distorted) monochrome images.
  - `timestamps`: A one dimensional dataset of TAI timestamps in nanoseconds.
- `image_meta`: A group, derived from `image_meta_msgs/ImageMeta` messages (a custom message type), including multiple one dimensional datasets of various image metadata parameters. In particular, this includes:
  - `driver_timestamps`: A one dimensional dataset of TAI timestamps in nanoseconds. Note that each timestamp matches exactly the timestamp of the image the metadata corresponds to, but matches may not always exist due to some corresponding messages being dropped on one topic but not the other.
  - `exposure_time_us`: A one dimensional dataset of exposure times in microseconds.
  - `gain`: A one dimensional dataset of gain values in dB.
- `cam_diags`: A group, derived from `image_meta_msgs/CameraDiagnostics` messages (a custom message type), including multiple one dimensional datasets of various camera diagnostic parameters polled in between image acquisitions.
- Configuration information written as attributes to the root group `/`. This includes nearly all camera parameters readable through the Spinnaker SDK.
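
As a small usage example, the exposure and gain metadata can be read directly with h5py to inspect the auto-exposure behavior over a sequence (filename assumed from the file summary table):

```python
import h5py
import numpy as np

# Minimal sketch: summarize the left monochrome camera's exposure and gain.
with h5py.File("R0_FA0_sample_mono_left.h5", "r") as f:
    exposure_us = f["/image_meta/exposure_time_us"][:]
    gain_db = f["/image_meta/gain"][:]

print(f"exposure time (us): min {exposure_us.min():.1f}, "
      f"median {np.median(exposure_us):.1f}, max {exposure_us.max():.1f}")
print(f"gain (dB): min {gain_db.min():.1f}, max {gain_db.max():.1f}")
```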

### 2.3. RGB

The RGB camera data files, `*_rgb_left.h5` and `*_rgb_right.h5`, contain the following:
- `image_raw`: A group, derived from [sensor_msgs/Image](http://docs.ros.org/en/noetic/api/sensor_msgs/html/msg/Image.html) messages, including the following:
  - `height`, `width`, `encoding`, `is_bigendian`, `step`: Attributes for constant message fields.
  - `images`: A three dimensional dataset (messages x rows x columns) of raw (distorted) Bayer-encoded RGB images.
  - `timestamps`: A one dimensional dataset of TAI timestamps in nanoseconds.
- `image_meta`: A group, derived from `image_meta_msgs/ImageMeta` messages (a custom message type), including multiple one dimensional datasets of various image metadata parameters. In particular, this includes:
  - `driver_timestamps`: A one dimensional dataset of TAI timestamps in nanoseconds. Note that each timestamp matches exactly the timestamp of the image the metadata corresponds to, but matches may not always exist due to some corresponding messages being dropped on one topic but not the other.
  - `exposure_time_us`: A one dimensional dataset of exposure times in microseconds.
  - `gain`: A one dimensional dataset of gain values in dB.
- `cam_diags`: A group, derived from `image_meta_msgs/CameraDiagnostics` messages (a custom message type), including multiple one dimensional datasets of various camera diagnostic parameters polled in between image acquisitions.
- Configuration information written as attributes to the root group `/`. This includes nearly all camera parameters readable through the Spinnaker SDK.
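
Since the RGB images are stored Bayer encoded, they must be demosaicked before use. The following sketch does this with OpenCV; the conversion code shown is only a placeholder and should be chosen to match the `encoding` attribute reported in the file, and the filename is assumed from the file summary table.

```python
import cv2
import h5py

# Hedged sketch: demosaic a single raw RGB frame with OpenCV.
with h5py.File("R0_FA0_sample_rgb_left.h5", "r") as f:
    encoding = f["/image_raw"].attrs["encoding"]
    raw = f["/image_raw/images"][0]  # single raw Bayer frame

print("Bayer encoding reported by the driver:", encoding)
# COLOR_BayerRG2BGR is a placeholder; pick the code matching the encoding above.
color = cv2.cvtColor(raw, cv2.COLOR_BayerRG2BGR)
cv2.imwrite("rgb_left_frame0.png", color)
```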

### 2.4. Event (DVS)

The event camera data files, `*_dvxplorer_left.h5` and `*_dvxplorer_right.h5`, contain the following:
- `events`: A group, derived from [dvs_msgs/EventArray](https://github.com/uzh-rpg/rpg_dvs_ros/blob/master/dvs_msgs/msg/EventArray.msg) and [dvs_msgs/Event](https://github.com/uzh-rpg/rpg_dvs_ros/blob/master/dvs_msgs/msg/Event.msg) messages, including the following:
  - `height`, `width`: Attributes for constant message fields.
  - `x_coordinates`: A one dimensional dataset of event x pixel coordinates.
  - `y_coordinates`: A one dimensional dataset of event y pixel coordinates.
  - `timestamps`: A one dimensional dataset of event TAI timestamps in nanoseconds.
  - `polarities`: A one dimensional dataset of event polarities (0 denotes off/negative and 1 denotes on/positive).
- Configuration information written as attributes to the root group `/`. This includes a subset of the camera parameters readable through libcaer (with attribute names given as `[libcaer_module_address_symbolic_constant] libcaer_parameter_address_symbolic_constant`). Note that the bias preset `[DVX_DVS_CHIP_BIAS] DVX_DVS_CHIP_BIAS_SIMPLE` can be set but not read, so instead, the configuration includes the underlying parameters affected by the preset (the `[DVX_DVS_CHIP_BIAS] *` attributes).
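
As a usage example, the event streams can be sliced by timestamp and accumulated into a simple polarity image. The following Python sketch is illustrative only; the filename is assumed from the file summary table.

```python
import h5py
import numpy as np

with h5py.File("R0_FA0_sample_dvxplorer_left.h5", "r") as f:
    events = f["/events"]
    height, width = events.attrs["height"], events.attrs["width"]

    # Event timestamps are monotonically increasing; for long sequences,
    # consider reading them in chunks rather than all at once.
    timestamps = events["timestamps"][:]

    # Select the events in the first 10 ms of the recording.
    window_ns = 10_000_000
    end_index = np.searchsorted(timestamps, timestamps[0] + window_ns)
    x = events["x_coordinates"][:end_index]
    y = events["y_coordinates"][:end_index]
    polarity = events["polarities"][:end_index]

# Accumulate +1 for on/positive events and -1 for off/negative events.
frame = np.zeros((height, width), dtype=np.int32)
np.add.at(frame, (y, x), np.where(polarity > 0, 1, -1))
print("accumulated", end_index, "events")
```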

### 2.5. Ground Truth Pose

The Applanix data file, `*_applanix.h5`, contains poses of the `base_link` frame in the Earth Centered Earth Fixed (ECEF) frame. The `base_link` frame is defined with its origin on the ground at the midpoint of the vehicle's rear wheels, with the x-axis pointing forward, the y-axis pointing left, and the z-axis pointing up. See the [Measured Extrinsics](#4-measured-extrinsics) section for information relating the `base_link` frame and the camera frames.

Specifically, the file contains the following:
- `pose_base_link`: A group, derived from [geometry_msgs/PoseStamped](http://docs.ros.org/en/noetic/api/geometry_msgs/html/msg/PoseStamped.html) messages, including the following:
  - `positions`: A two dimensional dataset (messages x coordinate axes) of `base_link` positions in the ECEF frame in meters. The order of the coordinates across the columns is x, y, z.
  - `quaternions`: A two dimensional dataset (messages x quaternion components) of quaternion rotations from the `base_link` frame to the ECEF frame. The order of the components across the columns is x, y, z, w.
  - `timestamps`: A one dimensional dataset of TAI timestamps in nanoseconds.
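
For example, the ground truth poses can be read with h5py to estimate the route length from consecutive ECEF positions (filename assumed from the file summary table):

```python
import h5py
import numpy as np

# Minimal sketch: read the ground truth poses and estimate the route length.
with h5py.File("R0_FA0_sample_applanix.h5", "r") as f:
    positions = f["/pose_base_link/positions"][:]      # (messages x 3), ECEF x, y, z in meters
    quaternions = f["/pose_base_link/quaternions"][:]  # (messages x 4), x, y, z, w
    timestamps = f["/pose_base_link/timestamps"][:]    # nanoseconds

route_length_m = np.linalg.norm(np.diff(positions, axis=0), axis=1).sum()
duration_s = (timestamps[-1] - timestamps[0]) / 1e9
print(f"route length: {route_length_m / 1000:.2f} km over {duration_s / 60:.1f} minutes")
```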

## 3. Calibration Results

The calibration result file, `*_calibration_results.yaml`, contains the results of [Kalibr](https://github.com/ethz-asl/kalibr) multi-camera calibration run with all eight cameras. The format matches that of the [camchain.yaml](https://github.com/ethz-asl/kalibr/wiki/yaml-formats) file output by Kalibr, with the one exception that the `rostopic` field, e.g. `/mono_left/image_raw`, has been changed to `frame_id`, e.g. `mono_left_optical`.
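
A minimal sketch for loading this file with PyYAML is shown below. The field names follow the Kalibr camchain format, with `frame_id` in place of `rostopic` as noted above; the filename is only an example built from the prefixes described earlier.

```python
import numpy as np
import yaml

with open("R0_FA0_sample_C0_calibration_results.yaml", "r") as f:
    camchain = yaml.safe_load(f)

for camera_name, camera in camchain.items():
    print(camera_name, camera["frame_id"],
          camera["camera_model"], camera["intrinsics"])
    # In the camchain format, T_cn_cnm1 (when present) is the 4x4 transform
    # from the previous camera in the chain to this camera.
    if "T_cn_cnm1" in camera:
        T = np.array(camera["T_cn_cnm1"])
        print("  transform from previous camera:\n", T)
```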

## 4. Measured Extrinsics

The measured extrinsics file, `*_measured_extrinsics.yaml`, contains manually measured extrinsics between the `base_link` frame and the camera housing frames, e.g. `mono_left_housing`. Note that the camera housing frames are not identical to the camera optical frames, e.g. `mono_left_optical`, for which the Kalibr extrinsics results are given. The camera housing frames have the same orientation as the `base_link` frame (x-axis pointing forward, y-axis pointing left, z-axis pointing up), while the camera optical frames have the x-axis pointing right, the y-axis pointing down, and the z-axis pointing forward. Additionally, there is an unmeasured, assumed small, translation between the camera housing and camera optical frames.
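
The fixed rotation implied by these frame conventions can be written down directly. The following sketch is illustrative; the columns of the matrix are the camera optical frame axes expressed in the corresponding camera housing frame.

```python
import numpy as np

# Columns: optical x (right) = housing -y, optical y (down) = housing -z,
# optical z (forward) = housing +x.
R_housing_optical = np.array([
    [ 0.0,  0.0, 1.0],
    [-1.0,  0.0, 0.0],
    [ 0.0, -1.0, 0.0],
])

# A point 5 m in front of the camera, expressed in the optical frame...
point_optical = np.array([0.0, 0.0, 5.0])
# ...lies 5 m along the housing frame's forward (x) axis, as expected.
print(R_housing_optical @ point_optical)  # -> [5. 0. 0.]
```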

The total size of this work's files is 253 GB, which is too large to download directly. Deep Blue Data makes large data sets (greater than 3 GB) available through Globus.
