vit for few-shot classification

Last update: Nov 30, 2022

Related tags

Deep Learning few-shot-vit

Overview

Few-Shot ViT

Requirements

PyTorch (>= 1.9)
TorchVision
timm (latest)
einops
tqdm
numpy
scikit-learn
scipy
argparse
tensorboardx

Pretrained Checkpoints

Currently we provide SUN-M (Visformer) trained on miniImageNet (5-way 1-shot and 5-way 5-shot), see Google Drive for details.

More pretrained checkpoints coming soon.

Evaluate the Pretrained Checkpoints

Prepare data

For example, miniImageNet:

cd test_phase

Download miniImageNet dataset from miniImageNet (courtesy of Spyros Gidaris)

unzip the package to materials/mini-imagenet, then obtain materials/mini-imagenet with pickle files.

Prapare pretrained checkpoints

Download corresponding checkpoints from Google Drive and store the checkpoints in test_phase/ directory.

Evaluation

cd test_phase
python test_few_shot.py --config configs/test_1_shot.yaml --shot 1 --gpu 1 # for 1-shot
python test_few_shot.py --config configs/test_5_shot.yaml --shot 5 --gpu 1 # for 5-shot

For 1-shot, you can obtain: test epoch 1: acc=67.80 +- 0.45 (%)

For 5-shot, you can obtain: test epoch 1: acc=83.25 +- 0.28 (%)

Test accuracy may slightly vary with different pytorch/cuda versions or different hardwares

TODO

more checkpoints
training code

You might also like...

So-ViT: Mind Visual Tokens for Vision Transformer

So-ViT: Mind Visual Tokens for Vision Transformer Introduction This repository contains the source code under PyTorch framework and models trai

44 Nov 24, 2022

A PyTorch Implementation of ViT (Vision Transformer)

ViT - Vision Transformer This is an implementation of ViT - Vision Transformer by Google Research Team through the paper "An Image is Worth 16x16 Word

7 May 11, 2022

PyTorch implementation of MoCo v3 for self-supervised ResNet and ViT.

MoCo v3 for Self-supervised ResNet and ViT Introduction This is a PyTorch implementation of MoCo v3 for self-supervised ResNet and ViT. The original M

887 Jan 8, 2023

Official implement of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer

Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer This repository contains the PyTorch code for Evo-ViT. This work proposes a slow-fas

53 Dec 5, 2022

This repository contains an overview of important follow-up works based on the original Vision Transformer (ViT) by Google.

75 Dec 2, 2022

A simple approach to emable dense segmentation with ViT.

Comments

timm version

hello, I met a question when run your code as follow? Traceback (most recent call last): File "train_classifier.py", line 296, in <module> main(config) File "train_classifier.py", line 133, in main lr_scheduler = CosineLRScheduler(optimizer, warmup_lr_init=float(config['optimizer_args']['warmup_lr']), t_initial=config['max_epoch'], cycle_decay=0.1, warmup_t=int(config['optimizer_args']['warmup'])) TypeError: __init__() got an unexpected keyword argument 'cycle_decay' I think it's the version of timm package is not right, and the requirement in your code just say that is the latest version. can your provide the version of timm package??

opened by JIAOJIAYUASD 2
The variant of visformer

Hi Bowen

Thanks for opensource the inference code. I am just curious which variant of the visformer achieves the best results in Table 5 on mini-ImageNet? Is it visformer_80_small?

opened by RongKaiWeskerMA 1

vit for few-shot classification

Related tags

Overview

Few-Shot ViT

Requirements

Pretrained Checkpoints

Evaluate the Pretrained Checkpoints

Prepare data

Prapare pretrained checkpoints

Evaluation

TODO

You might also like...

So-ViT: Mind Visual Tokens for Vision Transformer

A PyTorch Implementation of ViT (Vision Transformer)

PyTorch implementation of MoCo v3 for self-supervised ResNet and ViT.

Official implement of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer

This repository contains an overview of important follow-up works based on the original Vision Transformer (ViT) by Google.

A simple approach to emable dense segmentation with ViT.

PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners for self-supervised ViT.

A simple program for training and testing vit

Implementing Vision Transformer (ViT) in PyTorch

Comments

timm version

The variant of visformer

Releases(SUN)

SUN(Jun 5, 2022)

Owner

Martin Dong

Starter kit for getting started in the Music Demixing Challenge.

Region-aware Contrastive Learning for Semantic Segmentation, ICCV 2021

Official code repository for "Exploring Neural Models for Query-Focused Summarization"

Research Artifact of USENIX Security 2022 Paper: Automated Side Channel Analysis of Media Software with Manifold Learning

Bayesian Meta-Learning Through Variational Gaussian Processes

C3DPO - Canonical 3D Pose Networks for Non-rigid Structure From Motion.

Emotion classification of online comments based on RNN

Pipeline code for Sequential-GAM(Genome Architecture Mapping).

Repo for "TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets" at [email protected]

Learning cell communication from spatial graphs of cells

Connecting Java/ImgLib2 + Python/NumPy

[SIGGRAPH Asia 2021] DeepVecFont: Synthesizing High-quality Vector Fonts via Dual-modality Learning.

Dataloader tools for language modelling

PyTorch Implementation of the paper Learning to Reweight Examples for Robust Deep Learning

A computational optimization project towards the goal of gerrymandering the results of a hypothetical election in the UK.

Official Repository for the paper "Improving Baselines in the Wild".

Official Implementation of SWAD (NeurIPS 2021)

[CVPR 2022] TransEditor: Transformer-Based Dual-Space GAN for Highly Controllable Facial Editing

UnsupervisedR&R: Unsupervised Pointcloud Registration via Differentiable Rendering

OHLC Average Prediction of Apple Inc. Using LSTM Recurrent Neural Network