vit for few-shot classification

Last update: Nov 30, 2022

Related tags

Deep Learning few-shot-vit

Overview

Few-Shot ViT

Requirements

PyTorch (>= 1.9)
TorchVision
timm (latest)
einops
tqdm
numpy
scikit-learn
scipy
argparse
tensorboardx

Pretrained Checkpoints

Currently we provide SUN-M (Visformer) trained on miniImageNet (5-way 1-shot and 5-way 5-shot), see Google Drive for details.

More pretrained checkpoints coming soon.

Evaluate the Pretrained Checkpoints

Prepare data

For example, miniImageNet:

cd test_phase

Download miniImageNet dataset from miniImageNet (courtesy of Spyros Gidaris)

unzip the package to materials/mini-imagenet, then obtain materials/mini-imagenet with pickle files.

Prapare pretrained checkpoints

Download corresponding checkpoints from Google Drive and store the checkpoints in test_phase/ directory.

Evaluation

cd test_phase
python test_few_shot.py --config configs/test_1_shot.yaml --shot 1 --gpu 1 # for 1-shot
python test_few_shot.py --config configs/test_5_shot.yaml --shot 5 --gpu 1 # for 5-shot

For 1-shot, you can obtain: test epoch 1: acc=67.80 +- 0.45 (%)

For 5-shot, you can obtain: test epoch 1: acc=83.25 +- 0.28 (%)

Test accuracy may slightly vary with different pytorch/cuda versions or different hardwares

TODO

more checkpoints
training code

You might also like...

So-ViT: Mind Visual Tokens for Vision Transformer

So-ViT: Mind Visual Tokens for Vision Transformer Introduction This repository contains the source code under PyTorch framework and models trai

44 Nov 24, 2022

A PyTorch Implementation of ViT (Vision Transformer)

ViT - Vision Transformer This is an implementation of ViT - Vision Transformer by Google Research Team through the paper "An Image is Worth 16x16 Word

7 May 11, 2022

PyTorch implementation of MoCo v3 for self-supervised ResNet and ViT.

MoCo v3 for Self-supervised ResNet and ViT Introduction This is a PyTorch implementation of MoCo v3 for self-supervised ResNet and ViT. The original M

887 Jan 8, 2023

Official implement of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer

Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer This repository contains the PyTorch code for Evo-ViT. This work proposes a slow-fas

53 Dec 5, 2022

This repository contains an overview of important follow-up works based on the original Vision Transformer (ViT) by Google.

75 Dec 2, 2022

A simple approach to emable dense segmentation with ViT.

Comments

timm version

hello, I met a question when run your code as follow? Traceback (most recent call last): File "train_classifier.py", line 296, in <module> main(config) File "train_classifier.py", line 133, in main lr_scheduler = CosineLRScheduler(optimizer, warmup_lr_init=float(config['optimizer_args']['warmup_lr']), t_initial=config['max_epoch'], cycle_decay=0.1, warmup_t=int(config['optimizer_args']['warmup'])) TypeError: __init__() got an unexpected keyword argument 'cycle_decay' I think it's the version of timm package is not right, and the requirement in your code just say that is the latest version. can your provide the version of timm package??

opened by JIAOJIAYUASD 2
The variant of visformer

Hi Bowen

Thanks for opensource the inference code. I am just curious which variant of the visformer achieves the best results in Table 5 on mini-ImageNet? Is it visformer_80_small?

opened by RongKaiWeskerMA 1

vit for few-shot classification

Related tags

Overview

Few-Shot ViT

Requirements

Pretrained Checkpoints

Evaluate the Pretrained Checkpoints

Prepare data

Prapare pretrained checkpoints

Evaluation

TODO

You might also like...

So-ViT: Mind Visual Tokens for Vision Transformer

A PyTorch Implementation of ViT (Vision Transformer)

PyTorch implementation of MoCo v3 for self-supervised ResNet and ViT.

Official implement of Evo-ViT: Slow-Fast Token Evolution for Dynamic Vision Transformer

This repository contains an overview of important follow-up works based on the original Vision Transformer (ViT) by Google.

A simple approach to emable dense segmentation with ViT.

PyTorch implementation of Masked Autoencoders Are Scalable Vision Learners for self-supervised ViT.

A simple program for training and testing vit

Implementing Vision Transformer (ViT) in PyTorch

Comments

timm version

The variant of visformer

Releases(SUN)

SUN(Jun 5, 2022)

Owner

Martin Dong

Research - dataset and code for 2016 paper Learning a Driving Simulator

Benchmark for the generalization of 3D machine learning models across different remeshing/samplings of a surface.

SIMULEVAL A General Evaluation Toolkit for Simultaneous Translation

Bayesian Generative Adversarial Networks in Tensorflow

A framework for joint super-resolution and image synthesis, without requiring real training data

From the basics to slightly more interesting applications of Tensorflow

A Moonraker plug-in for real-time compensation of frame thermal expansion

Towards Representation Learning for Atmospheric Dynamics (AtmoDist)

This repo contains code to reproduce all experiments in Equivariant Neural Rendering

Learning Skeletal Articulations with Neural Blend Shapes

SymPy-powered, Wolfram|Alpha-like answer engine totally in your browser, without backend computation

Tensorflow 2 Object Detection API kurulumu, GPU desteği, custom model hazırlama

The implementation of PEMP in paper "Prior-Enhanced Few-Shot Segmentation with Meta-Prototypes"

Unofficial Implementation of Oboe (SIGCOMM'18').

Official page of Patchwork (RA-L'21 w/ IROS'21)

The MATH Dataset

Pretrained Pytorch face detection (MTCNN) and recognition (InceptionResnet) models

Symbolic Parallel Adaptive Importance Sampling for Probabilistic Program Analysis in JAX

Use unsupervised and supervised learning to predict stocks

Detection of drones using their thermal signatures from thermal camera through YOLO-V3 based CNN with modifications to encapsulate drone motion