dog/2016-01-30--11-24-51 (7.7G)
dog/2016-01-30--13-46-00 (8.5G)
dog/2016-01-31--19-19-25 (3.0G)
dog/2016-02-02--10-16-58 (8.1G)
dog/2016-02-08--14-56-28 (3.9G)
dog/2016-02-11--21-32-47 (13G)
dog/2016-03-29--10-50-20 (12G)
emily/2016-04-21--14-48-08 (4.4G)
emily/2016-05-12--22-20-00 (7.5G)
frodo/2016-06-02--21-39-29 (6.5G)
frodo/2016-06-08--11-46-01 (2.7G)

Dataset referenced on this page is copyrighted by comma.ai and published under the Creative Commons Attribution-NonCommercial-ShareAlike 3.0 License. This means that you must attribute the work in the manner specified by the authors, you may not use this work for commercial purposes and if you alter, transform, or build upon this work, you may distribute the resulting work only under the same license.

Dataset structure

The dataset consists of 10 videos clips of variable size recorded at 20 Hz with a camera mounted on the windshield of an Acura ILX 2016. In parallel to the videos we also recorded some measurements such as car's speed, acceleration, steering angle, GPS coordinates, gyroscope angles. See the full log list here. These measurements are transformed into a uniform 100 Hz time base.

The dataset folder structure is the following:

+-- dataset
|   +-- camera
|   |   +-- 2016-04-21--14-48-08
|   |   ...
|   +-- log
|   |   +-- 2016-04-21--14-48-08
|   |   ...

All the files come in hdf5 format and are named with the time they were recorded. The camera dataset has shape number_frames x 3 x 160 x 320 and uint8 type. One of the log hdf5-datasets is called cam1_ptr and addresses the alignment between camera frames and the other measurements.

Requirements

anaconda
tensorflow-0.9
keras-1.0.6
cv2

Hiring

Want a job at comma.ai?

Show us amazing stuff on this dataset

Credits

Riccardo Biasini, George Hotz, Sam Khalandovsky, Eder Santana, and Niel van der Westhuizen

Research - dataset and code for 2016 paper Learning a Driving Simulator

Related tags

Overview

the people's comma

the paper

the comma.ai driving dataset

Examples

Downloading the dataset

Dataset structure

Requirements

Hiring

Credits

Owner

comma.ai

K-FACE Analysis Project on Pytorch

Code for the ECIR'22 paper "Evaluating the Robustness of Retrieval Pipelines with Query Variation Generators"

A JAX implementation of Broaden Your Views for Self-Supervised Video Learning, or BraVe for short.

Official PyTorch Implementation of Learning Self-Similarity in Space and Time as Generalized Motion for Video Action Recognition, ICCV 2021

Block-wisely Supervised Neural Architecture Search with Knowledge Distillation (CVPR 2020)

Collection of common code that's shared among different research projects in FAIR computer vision team.

Depth-Aware Video Frame Interpolation (CVPR 2019)

FEMDA: Robust classification with Flexible Discriminant Analysis in heterogeneous data

This repository contains code released by Google Research.

[BMVC2021] "TransFusion: Cross-view Fusion with Transformer for 3D Human Pose Estimation"

Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. It can use GPUs and perform efficient symbolic differentiation.

Implementation of SiameseXML (ICML 2021)

Code for ACL2021 paper Consistency Regularization for Cross-Lingual Fine-Tuning.

Semantic Scholar's Author Disambiguation Algorithm & Evaluation Suite

Experiment about Deep Person Re-identification with EfficientNet-v2

The Hailo Model Zoo includes pre-trained models and a full building and evaluation environment

Self-Supervised Pre-Training for Transformer-Based Person Re-Identification

Source code for "Understanding Knowledge Integration in Language Models with Graph Convolutions"

Position detection system of mobile robot in the warehouse enviroment

[ICML 2021] A fast algorithm for fitting robust decision trees.