HybVIO visual-inertial odometry and SLAM system

Last update: Jan 03, 2023

Overview

HybVIO

A visual-inertial odometry system with an optional SLAM module.

This is a research-oriented codebase, which has been published for the purposes of verifiability and reproducibility of the results in the paper:

Otto Seiskari, Pekka Rantalankila, Juho Kannala, Jerry Ylilammi, Esa Rahtu, and Arno Solin (2022). HybVIO: Pushing the limits of real-time visual-inertial odometry. In IEEE Winter Conference on Applications of Computer Vision (WACV).
[arXiv pre-print] | [video]

It can also serve as a baseline in VIO and VISLAM benchmarks. The code is not intended for production use and does not represent a particularly clean or simple way of implementing the methods described in the above paper. The code contains numerous feature flags and parameters (see codegen/parameter_definitions.c) that are not used in the HybVIO but may (or may not) be relevant in other scenarios and use cases.

Setup

Here are basic instructions for setting up the project, there is some more detailed help included in the later sections (e.g., for Linux).

Install CMake, glfw and ffmpeg, e.g., by brew install cmake glfw ffmpeg.
Clone this repository with the --recursive option (this will take a while)
Build dependencies by running cd 3rdparty/mobile-cv-suite; ./scripts/build.sh
Make sure you are using clang to compile the C++ sources (it's the default on Macs). If not default, like on many Linux Distros, you can control this with environment variables, e.g., CC=clang CXX=clang++ ./scripts/build.sh
(optional) In order to be able to use the SLAM module, run ./slam/src/download_orb_vocab.sh

Then, to build the main and test binaries, perform the standard CMake routine:

mkdir target
cd target
cmake -DBUILD_VISUALIZATIONS=ON -DUSE_SLAM=ON ..
# or if not using clang by default:
# CC=clang CXX=clang++ cmake ..
make

Now the target folder should contain the binaries main and run-tests. After making changes to code, only run make. Tests can be run with the binary run-tests.

To compile faster, pass -j argument to make, or use a program like ccache. To run faster, check CMakeLists.txt for some options.

Arch Linux

List of packages needed: blas, cblas, clang, cmake, ffmpeg, glfw, gtk3, lapack, python-numpy, python-matplotlib.

Debian

On Debian Stretch, had to install (some might be optional): clang, libc++-dev, libgtk2.0-dev, libgstreamer1.0-dev, libvtk6-dev, libavresample-dev.

Raspberry Pi/Raspbian

On Raspbian (Pi 4, 8 GiB), had to install at least: libglfw3-dev and libglfw3 (for accelerated arrays) and libglew-dev and libxkbcommon-dev (for Pangolin, still had problems). Also started off with the Debian setup above.

Benchmarking

EuroC

To run benchmarks on EuroC dataset and reproduce numbers published in https://arxiv.org/abs/2106.11857, follow the instructions in https://github.com/AaltoML/vio_benchmark/tree/main/hybvio_runner.

If you want to test the software on individual EuRoC datasets, you can follow this subset of instructions

In vio_benchmark root folder, run python convert/euroc_to_benchmark.py to download and convert to data
Symlink that data here: mkdir -p data && cd data && ln -s /path/to/vio_benchmark/data/benchmark .

Then you can run inividual EuRoC sequences as, e.g.,

./main -i=../data/benchmark/euroc-v1-02-medium -p -useStereo

ADVIO

Download the ADVIO dataset as instructed in https://github.com/AaltoVision/ADVIO#downloading-the-data and extract all the .zip files somewhere ("/path/to/advio").
Run ./scripts/convert/advio_to_generic_benchmark.sh /path/to/advio
Then you can run ADVIO sequences either using their full path (like in EuRoC) or using the -j shorthand, e.g., ./main -j=2 for ADVIO-02.

The `main` binary

To run the algorithm on recorded data, use ./main -i=path/to/datafolder, where datafolder/ must at the very least contain a data.{jsonl|csv} and data.{mp4|mov|avi}. Such recordings can be created with

Some common arguments to main are:

-p: show pose visualization.
-c: show video output.
-useSlam: Enable SLAM module.
-useStereo: Enable stereo.
-s: show 3d visualization. Requires -useSlam.
-gpu: Enable GPU acceleration

You can get full list of command line options with ./main -help.

Key controls

These keys can be used when any of the graphical windows are focused (see commandline/command_queue.cpp for full list).

A to pause and toggle step mode, where a key press (e.g., SPACE) processes the next frame.
Q or Escape to quit
R to rotate camera window
The horizontal number keys 1,2,… toggle methods drawn in the pose visualization.

When the command line is focused, Ctrl-C aborts the program.

Copyright

Licensed under GPLv3. For different (commercial) licensing options, contact us at https://www.spectacularai.com/

HybVIO visual-inertial odometry and SLAM system

Related tags

Overview

HybVIO

Setup

Arch Linux

Debian

Raspberry Pi/Raspbian

Benchmarking

EuroC

ADVIO

The `main` binary

Key controls

Copyright

Owner

Spectacular AI

High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.

SASM - simple crossplatform IDE for NASM, MASM, GAS and FASM assembly languages

Pytorch implementation of our paper under review — Lottery Jackpots Exist in Pre-trained Models

Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.

Implement the Pareto Optimizer and pcgrad to make a self-adaptive loss for multi-task

Technical experimentations to beat the stock market using deep learning :chart_with_upwards_trend:

Keras Image Embeddings using Contrastive Loss

Using fully convolutional networks for semantic segmentation with caffe for the cityscapes dataset

Perform Linear Classification with Multi-way Data

[TIP 2021] SADRNet: Self-Aligned Dual Face Regression Networks for Robust 3D Dense Face Alignment and Reconstruction

Reduce end to end training time from days to hours (or hours to minutes), and energy requirements/costs by an order of magnitude using coresets and data selection.

SLAMP: Stochastic Latent Appearance and Motion Prediction

OpenFed: A Comprehensive and Versatile Open-Source Federated Learning Framework

Lecture materials for Cornell CS5785 Applied Machine Learning (Fall 2021)

PyTorch implementation of DeepDream algorithm

Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

GestureSSD CBAM - A gesture recognition web system based on SSD and CBAM, using pytorch, flask and node.js

Automated Attendance Project Using Face Recognition

Supporting code for the paper "Dangers of Bayesian Model Averaging under Covariate Shift"

HybVIO visual-inertial odometry and SLAM system

Related tags

Overview

HybVIO

Setup

Arch Linux

Debian

Raspberry Pi/Raspbian

Benchmarking

EuroC

ADVIO

The main binary

Key controls

Copyright

Owner

Spectacular AI

High-level library to help with training and evaluating neural networks in PyTorch flexibly and transparently.

SASM - simple crossplatform IDE for NASM, MASM, GAS and FASM assembly languages

Pytorch implementation of our paper under review — Lottery Jackpots Exist in Pre-trained Models

Unofficial implementation of HiFi-GAN+ from the paper "Bandwidth Extension is All You Need" by Su, et al.

Implement the Pareto Optimizer and pcgrad to make a self-adaptive loss for multi-task

Technical experimentations to beat the stock market using deep learning :chart_with_upwards_trend:

Keras Image Embeddings using Contrastive Loss

Using fully convolutional networks for semantic segmentation with caffe for the cityscapes dataset

Perform Linear Classification with Multi-way Data

[TIP 2021] SADRNet: Self-Aligned Dual Face Regression Networks for Robust 3D Dense Face Alignment and Reconstruction

Reduce end to end training time from days to hours (or hours to minutes), and energy requirements/costs by an order of magnitude using coresets and data selection.

SLAMP: Stochastic Latent Appearance and Motion Prediction

OpenFed: A Comprehensive and Versatile Open-Source Federated Learning Framework

Lecture materials for Cornell CS5785 Applied Machine Learning (Fall 2021)

PyTorch implementation of DeepDream algorithm

Conservative and Adaptive Penalty for Model-Based Safe Reinforcement Learning

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

GestureSSD CBAM - A gesture recognition web system based on SSD and CBAM, using pytorch, flask and node.js

Automated Attendance Project Using Face Recognition

Supporting code for the paper "Dangers of Bayesian Model Averaging under Covariate Shift"

The `main` binary