Code release for "COTR: Correspondence Transformer for Matching Across Images"

Last update: Jan 06, 2023

Related tags

Overview

COTR: Correspondence Transformer for Matching Across Images

This repository contains the inference code for COTR. We plan to release the training code in the future. COTR establishes correspondence in a functional and end-to-end fashion. It solves dense and sparse correspondence problem in the same framework.

Demos

Check out our demo video at here.

1. Install environment

Our implementation is based on PyTorch. Install the conda environment by: conda env create -f environment.yml.

Activate the environment by: conda activate cotr_env.

Notice that we use scipy=1.2.1 .

2. Download the pretrained weights

Down load the pretrained weights at here. Extract in to ./out, such that the weights file is at /out/default/checkpoint.pth.tar.

3. Single image pair demo

python demo_single_pair.py --load_weights="default"

Example sparse output:

Example dense output with triangulation:

Note: This example uses 10K valid sparse correspondences to densify.

4. Facial landmarks demo

python demo_face.py --load_weights="default"

Example:

5. Homography demo

python demo_homography.py --load_weights="default"

Citation

If you use this code in your research, cite the paper:

@article{jiang2021cotr,
  title={{COTR: Correspondence Transformer for Matching Across Images}},
  author={Wei Jiang and Eduard Trulls and Jan Hosang and Andrea Tagliasacchi and Kwang Moo Yi},
  booktitle={arXiv preprint},
  publisher_page={https://arxiv.org/abs/2103.14167},
  year={2021}
}

Code release for "COTR: Correspondence Transformer for Matching Across Images"

Related tags

Overview

COTR: Correspondence Transformer for Matching Across Images

Demos

1. Install environment

2. Download the pretrained weights

3. Single image pair demo

4. Facial landmarks demo

5. Homography demo

Citation

Owner

UBC Computer Vision Group

Face Library is an open source package for accurate and real-time face detection and recognition

Synthesizing and manipulating 2048x1024 images with conditional GANs

The audio-video synchronization of MKV Container Format is exploited to achieve data hiding

Reproduces ResNet-V3 with pytorch

[CVPR 2021] Unsupervised Degradation Representation Learning for Blind Super-Resolution

The source code of "SIDE: Center-based Stereo 3D Detector with Structure-aware Instance Depth Estimation", accepted to WACV 2022.

HMLLDB is a collection of LLDB commands to assist in the debugging of iOS apps.

Bringing Computer Vision and Flutter together , to build an awesome app !!

An inofficial PyTorch implementation of PREDATOR based on KPConv.

FedCV: A Federated Learning Framework for Diverse Computer Vision Tasks

PhysCap: Physically Plausible Monocular 3D Motion Capture in Real Time

Lightweight plotting to the terminal. 4x resolution via Unicode.

Stochastic Extragradient: General Analysis and Improved Rates

Implementation of our NeurIPS 2021 paper "A Bi-Level Framework for Learning to Solve Combinatorial Optimization on Graphs".

Semi-SDP Semi-supervised parser for semantic dependency parsing.

Deep Learning: Architectures & Methods Project: Deep Learning for Audio Super-Resolution

Tensorflow-seq2seq-tutorials - Dynamic seq2seq in TensorFlow, step by step

本项目是一个带有前端界面的垃圾分类项目，加载了训练好的模型参数，模型为efficientnetb4，暂时为40分类问题。

PyTorch implementation of CVPR 2020 paper (Reference-Based Sketch Image Colorization using Augmented-Self Reference and Dense Semantic Correspondence) and pre-trained model on ImageNet dataset

Revealing and Protecting Labels in Distributed Training