RVT: Robust Vision Transformers

This repository contains PyTorch code for Robust Vision Transformers.

For details see Rethinking the Design Principles of Robust Vision Transformer by Xiaofeng Mao, Gege Qi, Yuefeng Chen, Yuan He and Hui Xue.

Usage

First, clone the repository locally:

git clone https://github.com/vtddggg/Robust-Vision-Transformer.git

Then, install PyTorch 1.7.0+ and torchvision 0.8.1+ and pytorch-image-models 0.3.2:

conda install -c pytorch pytorch torchvision
pip install timm==0.3.2

We use 4 nodes with 8 gpus to train RVT-Ti, RVT-S and RVT-B:

Training RVT-Ti

python -m torch.distributed.launch --nproc_per_node=8 --nnodes=4 main.py --model rvt_tiny --data-path /path/to/imagenet --output_dir output --dist-eval

Training RVT-S

python -m torch.distributed.launch --nproc_per_node=8 --nnodes=4 main.py --model rvt_small --data-path /path/to/imagenet --output_dir output --dist-eval

Training RVT-B

python -m torch.distributed.launch --nproc_per_node=8 --nnodes=4 main.py --model rvt_base --data-path /path/to/imagenet --output_dir output --batch-size 32 --dist-eval

If you want to train RVT-Ti*, RVT-S* or RVT-B*, simply add --use_mask and --use_patch_aug to enable positon-aware attention scaling and patch-wise augmentation.

This repository contains PyTorch code for Robust Vision Transformers.

Related tags

Overview

RVT: Robust Vision Transformers

Usage

Training RVT-Ti

Training RVT-S

Training RVT-B

Owner

PyTorch implementation of MICCAI 2018 paper "Liver Lesion Detection from Weakly-labeled Multi-phase CT Volumes with a Grouped Single Shot MultiBox Detector"

A series of Jupyter notebooks with Chinese comment that walk you through the fundamentals of Machine Learning and Deep Learning in python using Scikit-Learn and TensorFlow.

This repository contains the source codes for the paper AtlasNet V2 - Learning Elementary Structures.

Official PyTorch implementation of "Evolving Search Space for Neural Architecture Search"

A Python Package for Convex Regression and Frontier Estimation

Official repository of ICCV21 paper "Viewpoint Invariant Dense Matching for Visual Geolocalization"

Deep Learning to Create StepMania SM FIles

PyTorch implementation of "Learning to Discover Cross-Domain Relations with Generative Adversarial Networks"

DRLib：A concise deep reinforcement learning library, integrating HER and PER for almost off policy RL algos.

FEDn is an open-source, modular and ML-framework agnostic framework for Federated Machine Learning

Groceries ARL: Association Rules (Birliktelik Kuralı)

Similarity-based Gray-box Adversarial Attack Against Deep Face Recognition

Lazy, a tool for running things in idle time

Bayesian regularization for functional graphical models.

Thermal Control of Laser Powder Bed Fusion using Deep Reinforcement Learning

Repository for the COLING 2020 paper "Explainable Automated Fact-Checking: A Survey."

Decompose to Adapt: Cross-domain Object Detection via Feature Disentanglement

Sharing of contents on mitochondrial encounter networks

Vit-ImageClassification - Pytorch ViT for Image classification on the CIFAR10 dataset

Official Pytorch Implementation for Splicing ViT Features for Semantic Appearance Transfer presenting Splice