SCNet: Learning Semantic Correspondence

Related tags

Deep LearningSCNet
Overview

SCNet Code

Region matching code is contributed by Kai Han ([email protected]).

Dense matching code is contributed by Rafael S. Rezende ([email protected]).

This code is written in MATLAB, and implements the SCNet[1]. For the dataset, see our project page: http://www.di.ens.fr/willow/research/scnet.

Install Dependencies

Codes

SCNet_Matconvnet

Additional Matconvnet modules implemented for SCNet. These code should be copied into matconvnet/matlab/ folder.

SCNet

This is the primary net work training and testing code.

  • SCNet_A_init.m, SCNet_AG_init.m, SCNet_AGplus_init.m: initialize the SCNet_A, SCNet_AG, SCNet_AG+.

  • SCNet_A.m, SCNet_AG.m, SCNet_AGplus.m: train SCNet_A, SCNet_AG, SCNet_AG+.

  • eva_PCR_mIoU_SCNet_A.m, eva_PCR_mIoU_SCNet_AG.m, eva_PCR_mIoU_SCNet_AGplus.m: evaluate the trained nets.

  • eva_PCR_mIoU_ImageNet_SCNet_A.m, eva_PCR_mIoU_ImageNet_SCNet_AG.m, eva_PCR_mIoU_ImageNet_SCNet_AGplus.m: evaluate SCNets with ImageNet pretrained parameters, i.e., SCNets without training.

SCNet_Baselines

Comparison code for our SCNet features and HOG features with NAM, PHM and LOM in Proposal Flow [2, 3].

  • NAM_HOG_eva.m, PHM_HOG_eva.m, LOM_HOG_eva.m: evaluate NAM, PHM, and LOM with HOG features.

  • NAM_SCNet_eva.m, PHM_SCNet_eva.m, LOM_SCNet_eva.m: evaluate NAM, PHM, and LOM with learned SCNet features.

  • HOG_SCNet_AG_eva.m: replace the learned SCNet feature by HOG feature in SCNet_AG model.

Data

We used PF-PASCAL, PF-WILLOW, PASCAL Parts and CUB data sets and follows Proposal Flow[2, 3] to generate our trainging data.

Triaining data preparation code is put in PF-PASCAL-code folder.

Notes

  • The code is provided for academic use only. Use of the code in any commercial or industrial related activities is prohibited.
  • If you use our code or dataset, please cite the paper.
@InProceedings{han2017scnet,
author = {Kai Han and Rafael S. Rezende and Bumsub Ham and Kwan-Yee K. Wong and Minsu Cho and Cordelia Schmid and Jean Ponce},
title = {SCNet: Learning Semantic Correspondence},
booktitle = {International Conference on Computer Vision (ICCV)},
year = {2017}
}

References

[1] Kai Han, Rafael S. Rezende, Bumsub Ham, Kwan-Yee K. Wong, Minsu Cho, Cordelia Schmid, Jean Ponce, "SCNet: Learning Semantic Correspondence", International Conference on Computer Vision (ICCV), 2017.

[2] Bumsub Ham, Minsu Cho, Cordelia Schmid, Jean Ponce, "Proposal Flow: Semantic Correspondences from Object Proposals", IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), 2017

[3] Bumsub Ham, Minsu Cho, Cordelia Schmid, Jean Ponce, "Proposal Flow", Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2016

Owner
Kai Han
Visual Geometry Group (VGG)
Kai Han
Session-aware Item-combination Recommendation with Transformer Network

Session-aware Item-combination Recommendation with Transformer Network 2nd place (0.39224) code and report for IEEE BigData Cup 2021 Track1 Report EDA

Tzu-Heng Lin 6 Mar 10, 2022
UniFormer - official implementation of UniFormer

UniFormer This repo is the official implementation of "Uniformer: Unified Transformer for Efficient Spatiotemporal Representation Learning". It curren

SenseTime X-Lab 573 Jan 04, 2023
Official re-implementation of the Calibrated Adversarial Refinement model described in the paper Calibrated Adversarial Refinement for Stochastic Semantic Segmentation

Official re-implementation of the Calibrated Adversarial Refinement model described in the paper Calibrated Adversarial Refinement for Stochastic Semantic Segmentation

Elias Kassapis 31 Nov 22, 2022
scAR (single-cell Ambient Remover) is a package for data denoising in single-cell omics.

scAR scAR (single cell Ambient Remover) is a package for denoising multiple single cell omics data. It can be used for multiple tasks, such as, sgRNA

19 Nov 28, 2022
MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments from a Single Moving Camera

MonoRec: Semi-Supervised Dense Reconstruction in Dynamic Environments from a Single Moving Camera

Felix Wimbauer 494 Jan 06, 2023
Tgbox-bench - Simple TGBOX upload speed benchmark

TGBOX Benchmark This script will benchmark upload speed to TGBOX storage. Build

Non 1 Jan 09, 2022
Learnable Motion Coherence for Correspondence Pruning

Learnable Motion Coherence for Correspondence Pruning Yuan Liu, Lingjie Liu, Cheng Lin, Zhen Dong, Wenping Wang Project Page Any questions or discussi

liuyuan 41 Nov 30, 2022
A pytorch implementation of Paper "Improved Training of Wasserstein GANs"

WGAN-GP An pytorch implementation of Paper "Improved Training of Wasserstein GANs". Prerequisites Python, NumPy, SciPy, Matplotlib A recent NVIDIA GPU

Marvin Cao 1.4k Dec 14, 2022
A pyparsing-based library for parsing SOQL statements

CONTRIBUTORS WANTED!! Installation pip install python-soql-parser or, with poetry poetry add python-soql-parser Usage from python_soql_parser import p

Kicksaw 0 Jun 07, 2022
Curating a dataset for bioimage transfer learning

CytoImageNet A large-scale pretraining dataset for bioimage transfer learning. Motivation In past few decades, the increase in speed of data collectio

Stanley Z. Hua 9 Jun 20, 2022
PyTorch wrappers for using your model in audacity!

audacitorch This package contains utilities for prepping PyTorch audio models for use in Audacity. More specifically, it provides abstract classes for

Hugo Flores García 130 Dec 14, 2022
Semantic Segmentation with Pytorch-Lightning

This is a simple demo for performing semantic segmentation on the Kitti dataset using Pytorch-Lightning and optimizing the neural network by monitoring and comparing runs with Weights & Biases.

Boris Dayma 58 Nov 18, 2022
Code and datasets for the paper "Combining Events and Frames using Recurrent Asynchronous Multimodal Networks for Monocular Depth Prediction" (RA-L, 2021)

Combining Events and Frames using Recurrent Asynchronous Multimodal Networks for Monocular Depth Prediction This is the code for the paper Combining E

Robotics and Perception Group 69 Dec 26, 2022
A best practice for tensorflow project template architecture.

A best practice for tensorflow project template architecture.

Mahmoud Gamal Salem 3.6k Dec 22, 2022
Learning Features with Parameter-Free Layers (ICLR 2022)

Learning Features with Parameter-Free Layers (ICLR 2022) Dongyoon Han, YoungJoon Yoo, Beomyoung Kim, Byeongho Heo | Paper NAVER AI Lab, NAVER CLOVA Up

NAVER AI 65 Dec 07, 2022
PaddleRobotics is an open-source algorithm library for robots based on Paddle, including open-source parts such as human-robot interaction, complex motion control, environment perception, SLAM positioning, and navigation.

简体中文 | English PaddleRobotics paddleRobotics是基于paddle的机器人开源算法库集,包括人机交互、复杂运动控制、环境感知、slam定位导航等开源算法部分。 人机交互 主动多模交互技术TFVT-HRI 主动多模交互技术是通过视觉、语音、触摸传感器等输入机器人

185 Dec 26, 2022
Library for machine learning stacking generalization.

stacked_generalization Implemented machine learning *stacking technic[1]* as handy library in Python. Feature weighted linear stacking is also availab

114 Jul 19, 2022
Source code for our Paper "Learning in High-Dimensional Feature Spaces Using ANOVA-Based Matrix-Vector Multiplication"

NFFT4ANOVA Source code for our Paper "Learning in High-Dimensional Feature Spaces Using ANOVA-Based Matrix-Vector Multiplication" This package uses th

Theresa Wagner 1 Aug 10, 2022
For medical image segmentation

LeViT_UNet For medical image segmentation Our model is based on LeViT (https://github.com/facebookresearch/LeViT). You'd better gitclone its codes. Th

13 Dec 24, 2022
Run object detection model on the Raspberry Pi

Using TensorFlow Lite with Python is great for embedded devices based on Linux, such as Raspberry Pi.

Dimitri Yanovsky 6 Oct 08, 2022