Official repository of ICCV21 paper "Viewpoint Invariant Dense Matching for Visual Geolocalization"

Last update: Jan 03, 2023

Overview

Viewpoint Invariant Dense Matching for Visual Geolocalization: PyTorch implementation

This is the implementation of the ICCV21 paper:

G. Berton, C. Masone, V. Paolicelli and B. Caputo, Viewpoint Invariant Dense Matching for Visual Geolocalization, ICCV 2021

Setup

First, download the baseline models, which were trained following the training procedure of the NetVLAD paper. We provide a script that downloads the six models used, one for each combination of three backbone encoders (AlexNet, VGG-16 and ResNet-50) and two pooling/aggregation layers (GeM and NetVLAD):

python download_pretrained_baselines.py
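
To confirm that all six checkpoints are in place after the download, a check along these lines may help. This is only a sketch: the file name pretrained_baselines/alexnet_gem.pth appears in the training example, but the identifiers vgg16 and resnet50, the <arch>_<pooling>.pth naming of the other five files, and the helper names are assumptions, not something the repository documents here.

```python
from itertools import product
from pathlib import Path

# Assumed identifiers: only "alexnet" and "gem" are confirmed by the
# training example; the others are guesses following the same pattern.
ARCHS = ("alexnet", "vgg16", "resnet50")
POOLINGS = ("gem", "netvlad")


def expected_baselines(folder="pretrained_baselines"):
    """List the six checkpoint paths implied by 3 backbones x 2 poolings."""
    return [Path(folder) / f"{arch}_{pooling}.pth"
            for arch, pooling in product(ARCHS, POOLINGS)]


def missing_baselines(folder="pretrained_baselines"):
    """Return the expected checkpoint paths that do not exist on disk."""
    return [path for path in expected_baselines(folder) if not path.is_file()]
```

Calling missing_baselines() from the repository root returns an empty list when every expected file is present; adjust the names if the download script uses a different scheme.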

Then prepare your geo-localization dataset so that the directory tree looks like this:

dataset_name
└── images
    ├── train
    │   ├── gallery
    │   └── queries
    ├── val
    │   ├── gallery
    │   └── queries
    └── test
        ├── gallery
        └── queries

and the images are named as @UTM [email protected] [email protected]@.jpg
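
Before training, it can be useful to verify that a dataset folder matches the tree above. The following is a minimal sketch that only checks the folder structure, not the image names; the function name missing_dataset_dirs is hypothetical and not part of the repository.

```python
from pathlib import Path

SPLITS = ("train", "val", "test")
SUBSETS = ("gallery", "queries")


def missing_dataset_dirs(dataset_root):
    """Return the expected sub-folders that are missing under dataset_root.

    The expected layout is <dataset_root>/images/<split>/<subset>.
    """
    root = Path(dataset_root)
    expected = [root / "images" / split / subset
                for split in SPLITS
                for subset in SUBSETS]
    return [folder for folder in expected if not folder.is_dir()]
```

For example, missing_dataset_dirs("dataset_name") returns an empty list when all six gallery and queries folders exist, and otherwise lists the ones that still need to be created.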

Dependencies

See requirements.txt

Training

You can train the model with train.py. Here is an example using the lightest and fastest model (AlexNet + GeM):

python train.py --arch alexnet --pooling gem --resume_fe pretrained_baselines/alexnet_gem.pth

For the full set of options, run python train.py -h. The script creates a folder ./runs/default/YYYY-MM-DD_HH-mm-ss, where logs and checkpoints are saved.
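
Because each run gets its own timestamped folder, locating the most recent one is a matter of parsing the folder names. The sketch below assumes the YYYY-MM-DD_HH-mm-ss pattern maps to the strptime format shown in the code; the helper latest_run_dir is hypothetical and not part of the repository.

```python
from datetime import datetime
from pathlib import Path

# Assumed strptime equivalent of YYYY-MM-DD_HH-mm-ss.
RUN_NAME_FORMAT = "%Y-%m-%d_%H-%M-%S"


def latest_run_dir(runs_root="runs/default"):
    """Return the most recent timestamped run folder, or None if there is none."""
    root = Path(runs_root)
    if not root.is_dir():
        return None
    runs = []
    for child in root.iterdir():
        if not child.is_dir():
            continue
        try:
            stamp = datetime.strptime(child.name, RUN_NAME_FORMAT)
        except ValueError:
            # Skip folders whose names do not follow the timestamp pattern.
            continue
        runs.append((stamp, child))
    if not runs:
        return None
    return max(runs, key=lambda run: run[0])[1]
```

Folders that do not match the timestamp pattern are ignored rather than treated as errors, so manually added sub-folders under ./runs/default do not break the lookup.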

Evaluation

Coming soon.

BibTeX

If you use this code in your project, please cite us using:

@InProceedings{Berton_ICCV_2021,
    author    = {Berton, Gabriele and Masone, Carlo and Paolicelli, Valerio and Caputo, Barbara},
    title     = {Viewpoint Invariant Dense Matching for Visual Geolocalization},
    booktitle = {ICCV},
    month     = {October},
    year      = {2021},
    pages     = {12169--12178}
}

Owner

Gabriele Berton
