The codes and related files to reproduce the results for Image Similarity Challenge Track 1.

Last update: Jan 02, 2023

Related tags

Overview

ISC-Track1-Submission

The codes and related files to reproduce the results for Image Similarity Challenge Track 1.

Required dependencies

To begin with, you should install the following packages with the specified versions in Python, Anaconda. Other versions may work but please do NOT try. For instance, cuda 11.0 has some bugs which bring very bad results. The hardware chosen is Nvidia Tesla V100 and Intel CPU. Other hardware, such as A100, may work but please do NOT try. The stability is not guaranteed, for instance, the Ampere architecture is not suitable and some instability is observed. Please do NOT use AMD CPU, such as EPYC, we observe some instability on DGX server.

python 3.7.10
pytorch 1.7.1 with cuda 10.1
faiss-gpu 1.7.1 with cuda 10.1
h5py 3.4.0
pandas 1.3.3
sklearn 1.0
skimage 0.18.3
PIL 8.3.2
cv2 4.5.3.56
numpy 1.16.0
torchvision 0.8.2 with cuda 10.1
augly 0.1.4
selectivesearch 0.4
face-recognition 1.3.0 (with dlib of gpu-version)
tqdm 4.62.3
requests 2.26.0
seaborn 0.11.2
mkl 2.4.0
loguru 0.5.3

Note: Some unimportant packages may be missing, please install them using pip directly when an error occurs.

Pre-trained models

We use three pre-trained models. They are all pre-trained on ImageNet unsupervisedly. To be convenient, we first directly give the pre-trained models as follows, then also the training codes are given.

The first backbone: ResNet-50; The second backbone: ResNet-152; The third backbone: ResNet-50-IBN.

For ResNet-50, we do not pre-train it by ourselves. It is directly downloaded from here. It is supplied by Facebook Research, and the project is Barlow Twins. You should rename it to resnet50_bar.pth.

For ResNet-152 and ResNet-50-IBN, we use the official codes of Momentum2-teacher. We only change the backbone to ResNet-152 and ResNet-50-IBN. It takes about 2 weeks to pre-train the ResNet-152, and 1 week to pre-train the ResNet-50-IBN on 8 V100 GPUs. To be convenient, we supply the whole pre-training codes in the Pretrain folder. The related readme file is also given in that folder.

It should be noted that pre-training processing plays a very important role in our algorithm. Therefore, if you want to reproduce the pre-trained results, please do NOT change the number of GPUs, the batch size, and other related hyper-parameters.

Training

For training, we generate 11 datasets. For each dataset, 3 models with different backbones are trained. Each training takes about/less than 1 day on 4 V100 GPUs (bigger backbone takes longer and smaller backbone takes shorter). The whole training codes, including how to generate training datasets and the link to the generated datasets, are given in the Training folder. For more details, please refer to the readme file in that folder.

Test

To test the performance of the trained model, we perform multi-scale, multi-model, and multi-part testing and ensemble all the scores to get the final score. To be efficient, 33 V100 GPUs are suggested to use. The time for extracting all query images' features using 33 V100 GPUs is about 3 hours. Also extracting and storing training and reference images' features take a lot of time. Please be patient and prepare enough storage to reproduce the testing process. We give all the information to generate our final results in the Test folder. Please reproduce the results according to the readme file in that folder.

The codes and related files to reproduce the results for Image Similarity Challenge Track 1.

Related tags

Overview

ISC-Track1-Submission

Required dependencies

Pre-trained models

Training

Test

Owner

Wenhao Wang

Pytorch implementation of Rosca, Mihaela, et al. "Variational Approaches for Auto-Encoding Generative Adversarial Networks."

Generate images from texts. In Russian. In PaddlePaddle

STARCH compuets regional extreme storm physical characteristics and moisture balance based on spatiotemporal precipitation data from reanalysis or climate model data.

Code for paper Adaptively Aligned Image Captioning via Adaptive Attention Time

Use your Philips Hue lights as Racing Flags. Works with Assetto Corsa, Assetto Corsa Competizione and iRacing.

Generative Handwriting using LSTM Mixture Density Network with TensorFlow

Code release for NeX: Real-time View Synthesis with Neural Basis Expansion

Efficient neural networks for analog audio effect modeling

CoaT: Co-Scale Conv-Attentional Image Transformers

Repo for Photon-Starved Scene Inference using Single Photon Cameras, ICCV 2021

Fully Automatic Page Turning on Real Scores

Data Engineering ZoomCamp

Weakly Supervised Learning of Rigid 3D Scene Flow

OpenLT: An open-source project for long-tail classification

LSTMs (Long Short Term Memory) RNN for prediction of price trends

THIS IS THE OLD PYMC PROJECT. PLEASE USE PYMC3 INSTEAD:

Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implicit Bayesian Inference"

Just Go with the Flow: Self-Supervised Scene Flow Estimation

RADIal is available now! Check the download section

Official Pytorch Implementation of: "Semantic Diversity Learning for Zero-Shot Multi-label Classification"(2021) paper

The codes and related files to reproduce the results for Image Similarity Challenge Track 1.

Related tags

Overview

ISC-Track1-Submission

Required dependencies

Pre-trained models

Training

Test

Owner

Wenhao Wang

Pytorch implementation of Rosca, Mihaela, et al. "Variational Approaches for Auto-Encoding Generative Adversarial Networks."

Generate images from texts. In Russian. In PaddlePaddle

STARCH compuets regional extreme storm physical characteristics and moisture balance based on spatiotemporal precipitation data from reanalysis or climate model data.

Code for paper Adaptively Aligned Image Captioning via Adaptive Attention Time

Use your Philips Hue lights as Racing Flags. Works with Assetto Corsa, Assetto Corsa Competizione and iRacing.

Generative Handwriting using LSTM Mixture Density Network with TensorFlow

Code release for NeX: Real-time View Synthesis with Neural Basis Expansion

Efficient neural networks for analog audio effect modeling

CoaT: Co-Scale Conv-Attentional Image Transformers

Repo for Photon-Starved Scene Inference using Single Photon Cameras, ICCV 2021

Fully Automatic Page Turning on Real Scores

Data Engineering ZoomCamp

Weakly Supervised Learning of Rigid 3D Scene Flow

OpenLT: An open-source project for long-tail classification

LSTMs (Long Short Term Memory) RNN for prediction of price trends

THIS IS THE **OLD** PYMC PROJECT. PLEASE USE PYMC3 INSTEAD:

Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implicit Bayesian Inference"

Just Go with the Flow: Self-Supervised Scene Flow Estimation

RADIal is available now! Check the download section

Official Pytorch Implementation of: "Semantic Diversity Learning for Zero-Shot Multi-label Classification"(2021) paper

THIS IS THE OLD PYMC PROJECT. PLEASE USE PYMC3 INSTEAD: