Implementation of Monocular Direct Sparse Localization in a Prior 3D Surfel Map (DSL)

Last update: Nov 30, 2022

Overview

DSL

Project page: https://sites.google.com/view/dsl-ram-lab/

Monocular Direct Sparse Localization in a Prior 3D Surfel Map

Authors: Haoyang Ye, Huaiyang Huang, and Ming Liu from RAM-LAB.

Paper and Video

Related publications:

@inproceedings{ye2020monocular,
  title={Monocular direct sparse localization in a prior 3d surfel map},
  author={Ye, Haoyang and Huang, Huaiyang and Liu, Ming},
  booktitle={2020 IEEE International Conference on Robotics and Automation (ICRA)},
  pages={8892--8898},
  year={2020},
  organization={IEEE}
}
@inproceedings{ye20213d,
  title={3D Surfel Map-Aided Visual Relocalization with Learned Descriptors},
  author={Ye, Haoyang and Huang, Huaiyang and Hutter, Marco and Sandy, Timothy and Liu, Ming},
  booktitle={2021 International Conference on Robotics and Automation (ICRA)},
  pages={5574--5581},
  year={2021},
  organization={IEEE}
}

Video: https://www.youtube.com/watch?v=LTihCBGcURo

Dependency

Pangolin.
CUDA.
Ceres-solver.
PCL, the default version that ships with ROS.
OpenCV, the default version that ships with ROS.

Build

git submodule update --init --recursive
mkdir build && cd build
cmake .. -DCMAKE_BUILD_TYPE=RelWithDebInfo
make -j8

Example

The sample config file can be downloaded from this link.

To run the example:

[path_to_build]/src/dsl_main --path "[path_to_dataset]/left_pinhole"

Preparing Your Own Data

Collect LiDAR and camera data.
Build the LiDAR map and obtain the LiDAR poses (the poses are optional).
Pre-process the LiDAR map (downsampling and normal estimation) so that the [path_to_dataset]/*.pcd map file contains the normal_x, normal_y, and normal_z fields.
Extract and undistort the images into [path_to_dataset]/images.
Set initial_pose to the first camera pose, and set the other camera parameters, in [path_to_dataset]/config.yaml.
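
Before running DSL on your own data, it can help to check that the dataset directory matches the layout described in the steps above. The following is a minimal sketch in Python (standard library only), not part of the DSL code base. The helper names read_pcd_fields and check_dataset are hypothetical. The script assumes the standard PCD header layout, in which a FIELDS line lists the per-point field names and a DATA line ends the header. It does not inspect the contents of config.yaml.

```python
from pathlib import Path

# Field names the map file is expected to contain, per the steps above.
REQUIRED_NORMAL_FIELDS = ("normal_x", "normal_y", "normal_z")


def read_pcd_fields(pcd_path):
    """Return the field names listed on the FIELDS line of a PCD header."""
    # Open in binary mode: the header is ASCII, but the point data may be binary.
    with open(pcd_path, "rb") as f:
        for raw in f:
            line = raw.decode("ascii", errors="replace").strip()
            if line.startswith("FIELDS"):
                return line.split()[1:]
            if line.startswith("DATA"):
                break  # the header ends here; point data follows
    return []


def check_dataset(dataset_dir):
    """Return a list of problems found; an empty list means the layout looks fine."""
    root = Path(dataset_dir)
    problems = []
    if not (root / "config.yaml").is_file():
        problems.append("missing config.yaml")
    if not (root / "images").is_dir():
        problems.append("missing images/ directory")
    pcd_files = sorted(root.glob("*.pcd"))
    if not pcd_files:
        problems.append("no *.pcd map file found")
    for pcd in pcd_files:
        fields = read_pcd_fields(pcd)
        missing = [name for name in REQUIRED_NORMAL_FIELDS if name not in fields]
        if missing:
            problems.append(f"{pcd.name}: missing fields {missing}")
    return problems


if __name__ == "__main__":
    import sys

    # Usage (hypothetical script name): python check_dataset.py [path_to_dataset]
    for problem in check_dataset(sys.argv[1] if len(sys.argv) > 1 else "."):
        print(problem)
```

Pass whichever directory holds config.yaml, images, and the .pcd map. If the script prints nothing, the files named in the steps above are in place and every map file lists the three normal fields.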

Note

This implementation of DSL uses Ceres Solver as its backend, whereas the implementation in the original paper used a DSO backend. As a result, its performance (speed and accuracy) differs from the reported results.

Credits

This work is inspired by several open-source projects, including DSO, DSM, Elastic-Fusion, SuperPoint, DBoW2, NetVlad, and LIO-mapping.

Licence

The source code is released under GPL-3.0.

Owner

Haoyang Ye
