Welcome to the comma.ai Calibration Challenge!

Your goal is to predict the direction of travel (in camera frame) from provided dashcam video.

This repo provides 10 videos. Every video is 1min long and 20 fps.
5 videos are labeled with a 2D array describing the direction of travel at every frame of the video with a pitch and yaw angle in radians.
5 videos are unlabeled. It is your task to generate the labels for them.
The example labels are generated using a Neural Network, and the labels were confirmed with a SLAM algorithm.
You can estimate the focal length to be 910 pixels.

Context

The devices that run openpilot are not mounted perfectly. The camera is not exactly aligned to the vehicle. There is some pitch and yaw angle between the camera of the device and the vehicle, which can vary between installations. Estimating these angles is essential for accurate control of the vehicle. The best way to start estimating these values is to predict the direction of motion in camera frame. More info can be found in this readme.

Deliverable

Your deliverable is the 5 labels called 5.txt to 9.txt. These labels should be a 2D array that contains the pitch and yaw angles of the direction of travel (in camera frame) of every frame of the respective videos. Zip them up and e-mail it to [email protected].

Evaluation

We will evaluate your mean squared error against our ground truth labels. Errors for frames where the car speed is less than 4m/s will be ignored. Those are also labeled as NaN in the example labels.

This repo includes an eval script that will give an error score (lower is better). You can use it to test your solutions against the labeled examples. We will use this script to evaluate your solution.

Hints

Keep the goal and evaluation script in mind, creative solutions are allowed.
Look at plots of your solutions before submitting.

$500 Prize CLAIMED

The first submission that scores an error under 25% on the unlabeled set, will receive a $500 prize.

The comma.ai Calibration Challenge!

Related tags

Overview

Welcome to the comma.ai Calibration Challenge!

Context

Deliverable

Evaluation

Hints

$500 Prize CLAIMED

Owner

comma.ai

3rd Place Solution of the Traffic4Cast Core Challenge @ NeurIPS 2021

This is an official implementation for "PlaneRecNet".

LightSeq is a high performance training and inference library for sequence processing and generation implemented in CUDA

A large-scale benchmark for co-optimizing the design and control of soft robots, as seen in NeurIPS 2021.

Deep Learning segmentation suite designed for 2D microscopy image segmentation

AEI: Actors-Environment Interaction with Adaptive Attention for Temporal Action Proposals Generation

Multi-modal Text Recognition Networks: Interactive Enhancements between Visual and Semantic Features

Code repository for our paper regarding the L3D dataset.

Official repository of the paper "GPR1200: A Benchmark for General-PurposeContent-Based Image Retrieval"

Code image classification of MNIST dataset using different architectures: simple linear NN, autoencoder, and highway network

This is the official implementation for "Do Transformers Really Perform Bad for Graph Representation?".

PyTorch implementation DRO: Deep Recurrent Optimizer for Structure-from-Motion

Official code repository for Continual Learning In Environments With Polynomial Mixing Times

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Efficient Lottery Ticket Finding: Less Data is More

Complex-Valued Neural Networks (CVNN)Complex-Valued Neural Networks (CVNN)

Your interactive network visualizing dashboard

[AI6101] Introduction to AI & AI Ethics is a core course of MSAI, SCSE, NTU, Singapore

Evolutionary Population Curriculum for Scaling Multi-Agent Reinforcement Learning

Volumetric parameterization of the placenta to a flattened template