CAT-Net: Learning Canonical Appearance Transformations

Code to accompany our paper "How to Train a CAT: Learning Canonical Appearance Transformations for Direct Visual Localization Under Illumination Change".

Dependencies

numpy
matpotlib
pytorch + torchvision (1.2)
Pillow
progress (for progress bars in train/val/test loops)
tensorboard + tensorboardX (for visualization)
pyslam + liegroups (optional, for running odometry/localization experiments)
OpenCV (optional, for running odometry/localization experiments)

Training the CAT

Download the ETHL dataset from here or the Virtual KITTI dataset from here
1. ETHL only: rename ethl1/2 to ethl1/2_static.
2. ETHL only: Update the local paths in tools/make_ethl_real_sync.py and run python3 tools/make_ethl_real_sync.py to generate a synchronized copy of the real sequences.
Update the local paths in run_cat_ethl/vkitti.py and run python3 run_cat_ethl/vkitti.py to start training.
In another terminal run tensorboard --port [port] --logdir [path] to start the visualization server, where [port] should be replaced by a numeric value (e.g., 60006) and [path] should be replaced by your local results directory.
Tune in to localhost:[port] and watch the action.

Running the localization experiments

Ensure the pyslam and liegroups packages are installed.
Update the local paths in make_localization_data.py and run python3 make_localization_data.py [dataset] to compile the model outputs into a localization_data directory.
Update the local paths in run_localization_[dataset].py and run python3 run_localization_[dataset].py [rgb,cat] to compute VO and localization results using either the original RGB or CAT-transformed images.
You can compute localization errors against ground truth using the compute_localization_errors.py script, which generates CSV files and several plots. Update the local paths and run python3 compute_localization_errors.py [dataset].

Citation

If you use this code in your research, please cite:

@article{2018_Clement_Learning,
  author = {Lee Clement and Jonathan Kelly},
  journal = {{IEEE} Robotics and Automation Letters},
  link = {https://arxiv.org/abs/1709.03009},
  title = {How to Train a {CAT}: Learning Canonical Appearance Transformations for Direct Visual Localization Under Illumination Change},
  year = {2018}
}

Canonical Appearance Transformations

Related tags

Overview

CAT-Net: Learning Canonical Appearance Transformations

Dependencies

Training the CAT

Running the localization experiments

Citation

Owner

STARS Laboratory

Generative Adversarial Networks for High Energy Physics extended to a multi-layer calorimeter simulation

The project is an official implementation of our CVPR2019 paper "Deep High-Resolution Representation Learning for Human Pose Estimation"

Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official implementation of the Soft Actor-Critic algorithm.

Developed an optimized algorithm which finds the most optimal path between 2 points in a 3D Maze using various AI search techniques like BFS, DFS, UCS, Greedy BFS and A*

Multi-task Self-supervised Object Detection via Recycling of Bounding Box Annotations (CVPR, 2019)

Official PyTorch Implementation of Embedding Transfer with Label Relaxation for Improved Metric Learning, CVPR 2021

You Only 👀 One Sequence

A NSFW content filter.

Pretrained language model and its related optimization techniques developed by Huawei Noah's Ark Lab.

Pytorch implementation of our paper under review — Lottery Jackpots Exist in Pre-trained Models

Distributing reference energies for SMIRNOFF implementations

🗺 General purpose U-Network implemented in Keras for image segmentation

TyXe: Pyro-based BNNs for Pytorch users

This repo contains the code and data used in the paper "Wizard of Search Engine: Access to Information Through Conversations with Search Engines"

Code for the paper Learning the Predictability of the Future

Open-source codebase for EfficientZero, from "Mastering Atari Games with Limited Data" at NeurIPS 2021.

Train CPPNs as a Generative Model, using Generative Adversarial Networks and Variational Autoencoder techniques to produce high resolution images.

YOLOv5 detection interface - PyQt5 implementation

Code for our WACV 2022 paper "Hyper-Convolution Networks for Biomedical Image Segmentation"

A Pytorch implementation of "Splitter: Learning Node Representations that Capture Multiple Social Contexts" (WWW 2019).