DFM: A Performance Baseline for Deep Feature Matching

Last update: Jan 02, 2023

Related tags

Overview

DFM: A Performance Baseline for Deep Feature Matching

Python (Pytorch) and Matlab (MatConvNet) implementations of our paper DFM: A Performance Baseline for Deep Feature Matching at CVPR 2021 Image Matching Workshop.

Paper (CVF) | Paper (arXiv)
Presentation (live) | Presentation (recording)

Setup Environment

We strongly recommend using Anaconda. Open a terminal in ./python folder, and simply run the following lines to create the environment:

conda env create -f environment.yml
conda activte dfm

Dependencies
If you do not use conda, DFM needs the following dependencies:
(Versions are not strict; however, we have tried DFM with these specific versions.)

python=3.7.1
pytorch=1.7.1
torchvision=0.8.2
cudatoolkit=11.0
matplotlib=3.3.4
pillow=8.2.0
opencv=3.4.2
ipykernel=5.3.4
pyyaml=5.4.1

Enjoy with DFM!

Now you are ready to test DFM by the following command:

python dfm.py --input_pairs image_pairs.txt

You should make the image_pairs.txt file as following:

1A> 1B>
2A> 2B>
.
.
.
nA> nB>

If you want to run DFM with a specific configuration, you can make changes to the following arguments in config.yml:

Use enable_two_stage to enable or disable two stage approach (default: True)
(Note: Make it enable for planar scenes with significant viewpoint changes, otherwise disable.)
Use model to change the pre-trained model (default: VGG19)
(Note: DFM only supports VGG19 and VGG19_BN right now, we plan to add other backbones.)
Use ratio_th to change ratio test thresholds (default: [0.9, 0.9, 0.9, 0.9, 0.95, 1.0])
(Note: These ratio test thresholds are for 1st to 5th layer, the last threshold (6th) are for Stage-0 and only usable when --enable_two_stage=True)
Use bidirectional to enable or disable bidirectional ratio test. (default: True)
(Note: Make it enable to find more robust matches. Naturally, it should be enabled, make it False is only for similar results with our Matlab implementation since Matlab's matchFeatures function does not execute ratio test in a bidirectional way.)
Use display_results to enable or disable displaying results (default: True)
(Note: If True, DFM saves matched image pairs to output_directory.)
Use output_directory to define output directory. (default: 'results')
(Note: imageA_imageB_matches.npz will be created in output_directory for each image pair.)

Evaluation

Currently, we do not have support evaluation for our Python implementation. You can use our Image Matching Evaluation repository (coming soon), in which we have support to evaluate SuperPoint, SuperGlue, Patch2Pix, and DFM algorithms on HPatches. Also, you can use our Matlab implementation (see For Matlab Users section) to reproduce the results presented in the paper.

Notice

To reproduce our results given in the paper, use our Matlab implementation.
You can get more accurate results (but with fewer features) using Python implementation. It is mainly because MATLAB’s matchFeatures function does not execute ratio test in a bidirectional way, where our Python implementation performs bidirectional ratio test. Nevertheless, we made bidirectionality adjustable in our Python implementation as well.

For Matlab Users

We have implemented and tested DFM on MATLAB R2017b.

Prerequisites

You need to install MatConvNet (we have support for matconvnet-1.0-beta24). Follow the instructions on the official website.

Once you finished the installation of MatConvNet, you should download pretratined VGG-19 network to the ./matlab/models folder.

Running DFM

Now, you are ready to try DFM!

Just open and run main_DFM.m with your own images.

Evaluation on HPatches

Download HPatches sequences and extract it to ./matlab/data folder.

Run main_hpatches.m which is in ./matlab/HPatches Evaluation folder.

A results.txt file will be generetad in ./matlab/results/HPatches folder.

In the first column you can find the pair names.
In the 2-11 column you can find the Mean Matching Accuracy (MMA) results for 1-10 pixel thresholds.
In 12th column you can find number of matched features.
Columns 13-17 are for best homography estimation results (denoted as boe in the paper)
Columns 18-22 are for worst homography estimation results (denoted as woe in the paper)
Columns 22-71 are for 10 different homography estimation tests.

BibTeX Citation

Please cite our paper if you use the code:

@InProceedings{Efe_2021_CVPR,
    author    = {Efe, Ufuk and Ince, Kutalmis Gokalp and Alatan, Aydin},
    title     = {DFM: A Performance Baseline for Deep Feature Matching},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
    month     = {June},
    year      = {2021},
    pages     = {4284-4293}
}

DFM: A Performance Baseline for Deep Feature Matching

Related tags

Overview

DFM: A Performance Baseline for Deep Feature Matching

Setup Environment

Enjoy with DFM!

Evaluation

Notice

For Matlab Users

Prerequisites

Running DFM

Evaluation on HPatches

BibTeX Citation

Owner

Finetuning Pipeline

MemStream: Memory-Based Anomaly Detection in Multi-Aspect Streams with Concept Drift

RADIal is available now! Check the download section

A PyTorch implementation of Implicit Q-Learning

OpenMMLab Detection Toolbox and Benchmark

HTSeq is a Python library to facilitate processing and analysis of data from high-throughput sequencing (HTS) experiments.

Multi-View Consistent Generative Adversarial Networks for 3D-aware Image Synthesis (CVPR2022)

A PyTorch Implementation of Neural IMage Assessment

Very deep VAEs in JAX/Flax

A Pythonic library for Nvidia Codec.

Improving Transferability of Representations via Augmentation-Aware Self-Supervision

Pose Detection and Machine Learning for real-time body posture analysis during exercise to provide audiovisual feedback on improvement of form.

A deep learning based semantic search platform that computes similarity scores between provided query and documents

DeepSpeed is a deep learning optimization library that makes distributed training easy, efficient, and effective.

AdaMML: Adaptive Multi-Modal Learning for Efficient Video Recognition

Analysing poker data from home games with friends

CLIPort: What and Where Pathways for Robotic Manipulation

FFTNet vocoder implementation

Python scripts for performing object detection with the 1000 labels of the ImageNet dataset in ONNX.

SSL_SLAM2: Lightweight 3-D Localization and Mapping for Solid-State LiDAR (mapping and localization separated) ICRA 2021