Rethinking Nearest Neighbors for Visual Classification

Last update: Oct 11, 2022

Related tags

Deep Learning nn-revisit

Overview

Rethinking Nearest Neighbors for Visual Classification

arXiv

Environment settings

Check out scripts/env_setup.sh

Setup data

Download the following fine-grained datasets and ImageNet.

In current version, you need to modify each data file under knn/data/finetune/*.py

Experiments

The numerical experiment results with corresponding hyper-parameters can be found here:

Natural world binary classification: linear-eval
Fine-grained object classification: linear-eval, fine-tune
ImageNet classification: linear-eval

To use the code in this repo, here are some key configs:

DATA.FEATURE: specify which representation to use. FEATURES.md includes more details
DATA.BATCH_SIZE: ViT-based backbone requires a smaller batchsize
RUN_N_TIMES: ensure only run once in case duplicated submision
MODEL.TYPE: base or joint training
OUTPUT_DIR: output dir of the final model and logs
SOLVER.BASE_LR: learning rate for the experiment
SOLVER.WEIGHT_DECAY: weight decay value for the experiment
MODEL.KNN_LAMBDA: alpha in Eq 4

Linear evaluation

See script/run_linear.sh and script/run_newt.sh

End-to-end finetuning

See script/run_finetune.sh

License

This repo are released under the CC-BY-NC 4.0 license. See LICENSE for additional details.

Acknowledgement

We thank the researchers who propose NEWT for providing the features for the datasets.

Rethinking Nearest Neighbors for Visual Classification

Related tags

Overview

Rethinking Nearest Neighbors for Visual Classification

Environment settings

Setup data

Experiments

Linear evaluation

End-to-end finetuning

License

Acknowledgement

Owner

Menglin Jia

Winners of the Facebook Image Similarity Challenge

Notebooks em Python para Métodos Eletromagnéticos

QuakeLabeler is a Python package to create and manage your seismic training data, processes, and visualization in a single place — so you can focus on building the next big thing.

TeST: Temporal-Stable Thresholding for Semi-supervised Learning

Supporting code for the Neograd algorithm

The code for paper "Learning Implicit Fields for Generative Shape Modeling".

Face Recognition Attendance Project

FAIR's research platform for object detection research, implementing popular algorithms like Mask R-CNN and RetinaNet.

A PyTorch Implementation of Gated Graph Sequence Neural Networks (GGNN)

SwinIR: Image Restoration Using Swin Transformer

Object-aware Contrastive Learning for Debiased Scene Representation

A unified 3D Transformer Pipeline for visual synthesis

Official PyTorch implementation of Less is More: Pay Less Attention in Vision Transformers.

Code to reproduce the experiments from our NeurIPS 2021 paper " The Limitations of Large Width in Neural Networks: A Deep Gaussian Process Perspective"

Learning View Priors for Single-view 3D Reconstruction (CVPR 2019)

PyTorch implementation of PNASNet-5 on ImageNet

Deep functional residue identification

PiCIE: Unsupervised Semantic Segmentation using Invariance and Equivariance in clustering (CVPR2021)

Cours d'Algorithmique Appliquée avec Python pour BTS SIO SISR

SpanNER: Named EntityRe-/Recognition as Span Prediction