[BMVC 2021] Official PyTorch Implementation of Self-supervised learning of Image Scale and Orientation Estimation

Last update: Nov 10, 2022

Related tags

Overview

Self-Supervised Learning of Image Scale and Orientation Estimation (BMVC 2021)

This is the official implementation of the paper "Self-Supervised Learning of Image Scale and Orientation Estimation" by Jongmin Lee [Google Scholar], Yoonwoo Jeong [Google Scholar], and Minsh Cho [Google Scholar]. We introduce a self-supervised framework for learning patch pose. Given a rescaled/rotated pair of image patches, we feed them to the patch pose estimation networks that output scale/orientation histograms for each. We compare the output histogram vectors by the histogram alignment technique and compute the loss.

Requirements

Ubuntu 18.04
python 3.8
pytorch 1.8.1
torchvision 0.9.1
wandb 0.10.28

Environment

Clone the Git repository

git clone https://github.com/bluedream1121/SelfScaOri.git

Install dependency

Run the script to install all the dependencies. You need to provide the conda install path (e.g. ~/anaconda3) and the name for the created conda environment.

bash install.sh conda_install_path self-sca-ori

Dataset preparation

You can download the training/test dataset using the following scripts:

cd datasets
bash download.sh

If you want to regenerate the patchPose datasets, please run the following script:

cd datasets/patchpose_dataset_generation
bash generation_script.sh

Trained models

cd trained_models
bash download_ori_model.sh
bash download_sca_model.sh

Test on the patchPose and the HPatches

After download the datasets and the pre-trained models, you can evaluate the patch pose estimation results using the following scripts:

python test.py --load trained_models/_*branchori/best_model.pt  --dataset_type ppa_ppb
python test.py --load trained_models/_*branchsca/best_model.pt  --dataset_type ppa_ppb

python test.py --load trained_models/_*branchori/best_model.pt  --dataset_type hpa
python test.py --load trained_models/_*branchsca/best_model.pt  --dataset_type hpa

Training

You can train the networks for patch scale estimation and orientation estimation using the proposed histogram alignment loss as follows:

python train.py --branch ori --output_ori 36

python train.py --branch sca --output_sca 13

Citation

If you find our code or paper useful to your research work, please consider citing our work using the following bibtex:

@inproceedings{lee2021self,
    author   = {},
    title    = {},
    booktitle= {},
    year     = {2021}
}

Contact

Jongmin Lee ([email protected])

Questions can also be left as issues in the repository.

[BMVC 2021] Official PyTorch Implementation of Self-supervised learning of Image Scale and Orientation Estimation

Related tags

Overview

Self-Supervised Learning of Image Scale and Orientation Estimation (BMVC 2021)

Requirements

Environment

Clone the Git repository

Install dependency

Dataset preparation

Trained models

Test on the patchPose and the HPatches

Training

Citation

Contact

Owner

Jongmin Lee

the official implementation of the paper "Isometric Multi-Shape Matching" (CVPR 2021)

STBP is a way to train SNN with datasets by Backward propagation.

A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way :chestnut:

Command-line tool for downloading and extending the RedCaps dataset.

Deploy optimized transformer based models on Nvidia Triton server

VarCLR: Variable Semantic Representation Pre-training via Contrastive Learning

The VeriNet toolkit for verification of neural networks

The official repo for CVPR2021——ViPNAS: Efficient Video Pose Estimation via Neural Architecture Search.

This project is the PyTorch implementation of our CVPR 2022 paper:

Learning Facial Representations from the Cycle-consistency of Face (ICCV 2021)

Official PyTorch implementation of RIO

SOTR: Segmenting Objects with Transformers [ICCV 2021]

MMGeneration is a powerful toolkit for generative models, based on PyTorch and MMCV.

Instance-conditional Knowledge Distillation for Object Detection

Human pose estimation from video plays a critical role in various applications such as quantifying physical exercises, sign language recognition, and full-body gesture control.

This is an example of object detection on Micro bacterium tuberculosis using Mask-RCNN

A Kaggle competition: discriminate gender based on handwriting

tensorflow code for inverse face rendering

Code for ICMI2020 and ICMI2021 papers: "Studying Person-Specific Pointing and Gaze Behavior for Multimodal Referencing of Outside Objects from a Moving Vehicle" and "ML-PersRef: A Machine Learning-based Personalized Multimodal Fusion Approach for Referencing Outside Objects From a Moving Vehicle"

Natural Posterior Network: Deep Bayesian Predictive Uncertainty for Exponential Family Distributions