Official PyTorch implementation of "Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning" (AAAI 2021)

Last update: Dec 06, 2022

Overview

Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning

Official PyTorch implementation of "Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning" (AAAI 2021)

Geonmo Gu*¹, Byungsoo Ko*¹, Han-Gyu Kim² (* Authors contributed equally.)

¹@NAVER/LINE Vision, ²@NAVER Clova Speech

Paper | Presentation Video | PPT | Poster

Overview

Proxy Synthesis

Proxy Synthesis (PS) is a novel regularizer for any softmax variants and proxy-based losses in deep metric learning.

How it works?

Proxy Synthesis exploits synthetic classes and improves generalization by considering class relations and obtaining smooth decision boundaries.
Synthetic classes mimic unseen classes during training phase as described in below Figure.

Experimental results

Proxy Synthesis improves performance for every loss and benchmark dataset.

Getting Started

Installation

Clone the repository locally

$ git clone https://github.com/navervision/proxy-synthesis

Create conda virtual environment

$ conda create -n proxy_synthesis python=3.7 anaconda
$ conda activate proxy_synthesis

Install pytorch

Install pytorch according to your CUDA version

$ conda install pytorch torchvision cudatoolkit=<YOUR_CUDA_VERSION> -c pytorch

Install faiss

We use faiss library for faster evaluation
Install faiss according to your CUDA version

$ conda install faiss-gpu cudatoolkit=<YOUR_CUDA_VERSION> -c pytorch

Install requirements

$ pip install -r requirements.txt

Prepare Data

Download CARS196 dataset and unzip

$ wget http://imagenet.stanford.edu/internal/car196/car_ims.tgz
$ tar zxvf car_ims.tgz -C ./dataset

Rearrange CARS196 directory by following structure

# Dataset structure
/dataset/carDB/
  train/
    class1/
      img1.jpeg
    class2/
      img2.jpeg
  test/
    class1/
      img3.jpeg
    class2/
      img4.jpeg

# Rearrange dataset structure
$ python dataset/prepare_cars.py

Train models

Norm-SoftMax loss with CARS196

# Norm-SoftMax
$ python main.py --gpu=0 \
--save_path=./logs/CARS196_norm_softmax \
--data=./dataset/carDB --data_name=cars196 \
--dim=512 --batch_size=128 --epochs=130 \
--freeze_BN --loss=Norm_SoftMax \
--decay_step=50 --decay_stop=50 --n_instance=1 \
--scale=23.0 --check_epoch=5

PS + Norm-SoftMax loss with CARS196

# PS + Norm-SoftMax
$ python main.py --gpu=0 \
--save_path=./logs/CARS196_PS_norm_softmax \
--data=./dataset/carDB --data_name=cars196 \
 --dim=512 --batch_size=128 --epochs=130 \
--freeze_BN --loss=Norm_SoftMax \
--decay_step=50 --decay_stop=50 --n_instance=1 \
--scale=23.0 --check_epoch=5 \
--ps_alpha=0.40 --ps_mu=1.0

Proxy-NCA loss with CARS196

# Proxy-NCA
$ python main.py --gpu=0 \
--save_path=./logs/CARS196_proxy_nca \
--data=./dataset/carDB --data_name=cars196 \
--dim=512 --batch_size=128 --epochs=130 \
--freeze_BN --loss=Proxy_NCA \
--decay_step=50 --decay_stop=50 --n_instance=1 \
--scale=12.0 --check_epoch=5

PS + Proxy-NCA loss with CARS196

# PS + Proxy-NCA
$ python main.py --gpu=0 \
--save_path=./logs/CARS196_PS_proxy_nca \
--data=./dataset/carDB --data_name=cars196 \
--dim=512 --batch_size=128 --epochs=130 \
--freeze_BN --loss=Proxy_NCA \
--decay_step=50 --decay_stop=50 --n_instance=1 \
--scale=12.0 --check_epoch=5 \
--ps_alpha=0.40 --ps_mu=1.0

Check Test Results

$ tensorboard --logdir=logs --port=10000

Experimental results

We report [email protected], RP and MAP performances of each loss, which are trained with CARS196 dataset for 8 runs.

[email protected]

Loss	1	2	3	4	5	6	7	8	Mean ± std
Norm-SoftMax	83.38	83.25	83.25	83.18	83.05	82.90	82.83	82.79	83.08 ± 0.21
PS + Norm-SoftMax	84.69	84.58	84.45	84.35	84.22	83.95	83.91	83.89	84.25 ± 0.31
Proxy-NCA	83.74	83.69	83.62	83.32	83.06	83.00	82.97	82.84	83.28 ± 0.36
PS + Proxy-NCA	84.52	84.39	84.32	84.29	84.22	84.12	83.94	83.88	84.21 ± 0.21

RP

Loss	1	2	3	4	5	6	7	8	Mean ± std
Norm-SoftMax	35.85	35.51	35.28	35.28	35.24	34.95	34.87	34.84	35.23 ± 0.34
PS + Norm-SoftMax	37.01	36.98	36.92	36.74	36.74	36.73	36.54	36.45	36.76 ± 0.20
Proxy-NCA	36.08	35.85	35.79	35.66	35.66	35.63	35.47	35.43	35.70 ± 0.21
PS + Proxy-NCA	36.97	36.84	36.72	36.64	36.63	36.60	36.43	36.41	36.66 ± 0.18

MAP

Loss	1	2	3	4	5	6	7	8	Mean ± std
Norm-SoftMax	25.56	25.56	25.00	24.93	24.90	24.59	24.57	24.56	24.92 ± 0.35
PS + Norm-SoftMax	26.71	26.67	26.65	26.56	26.53	26.52	26.30	26.17	26.51 ± 0.18
Proxy-NCA	25.66	25.52	25.37	25.36	25.33	25.26	25.22	25.04	25.35 ± 0.18
PS + Proxy-NCA	26.77	26.63	26.50	26.42	26.37	26.31	26.25	26.12	26.42 ± 0.20

Performance Graph

Below figure shows performance graph of test set during training.

Reference

Our code is based on SoftTriple repository (Arxiv, Github)

Citation

If you find Proxy Synthesis useful in your research, please consider to cite the following paper.

@inproceedings{gu2020proxy,
    title={Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning},
    author={Geonmo Gu, Byungsoo Ko, and Han-Gyu Kim},
    booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
    year={2021}
}

License

Copyright 2021-present NAVER Corp.

Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at

    http://www.apache.org/licenses/LICENSE-2.0

Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.

Official PyTorch implementation of "Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning" (AAAI 2021)

Related tags

Overview

Proxy Synthesis: Learning with Synthetic Classes for Deep Metric Learning

Overview

Proxy Synthesis

How it works?

Experimental results

Getting Started

Installation

Prepare Data

Train models

Norm-SoftMax loss with CARS196

PS + Norm-SoftMax loss with CARS196

Proxy-NCA loss with CARS196

PS + Proxy-NCA loss with CARS196

Check Test Results

Experimental results

[email protected]

RP

MAP

Performance Graph

Reference

Citation

License

Owner

NAVER/LINE Vision

Additional code for Stable-baselines3 to load and upload models from the Hub.

ICS 4u HD project, start before-wards. A curtain shooting game using python.

An implementation of the BADGE batch active learning algorithm.

A Free and Open Source Python Library for Multiobjective Optimization

Code repo for realtime multi-person pose estimation in CVPR'17 (Oral)

This repo contains the pytorch implementation for Dynamic Concept Learner (accepted by ICLR 2021).

Ray tracing of a Schwarzschild black hole written entirely in TensorFlow.

Python TFLite scripts for detecting objects of any class in an image without knowing their label.

The official PyTorch implementation of recent paper - SAINT: Improved Neural Networks for Tabular Data via Row Attention and Contrastive Pre-Training

Learning recognition/segmentation models without end-to-end training. 40%-60% less GPU memory footprint. Same training time. Better performance.

Image Fusion Transformer

Leveraging Instance-, Image- and Dataset-Level Information for Weakly Supervised Instance Segmentation

Data and code for ICCV 2021 paper Distant Supervision for Scene Graph Generation.

Tooling for GANs in TensorFlow

PyKaldi GOP-DNN on Epa-DB

Unsupervised Learning of Multi-Frame Optical Flow with Occlusions

[ICCV2021] Official Pytorch implementation for SDGZSL (Semantics Disentangling for Generalized Zero-Shot Learning)

This repo is a C++ version of yolov5_deepsort_tensorrt. Packing all C++ programs into .so files, using Python script to call C++ programs further.

🦙 LaMa Image Inpainting, Resolution-robust Large Mask Inpainting with Fourier Convolutions, WACV 2022

CLIP+FFT text-to-image