This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".

Last update: Dec 13, 2022

Related tags

Overview

ResT

By Qing-Long Zhang and Yu-Bin Yang

[State Key Laboratory for Novel Software Technology at Nanjing University]

This repo is the official implementation of "ResT: An Efficient Transformer for Visual Recognition". It currently includes code and models for the following tasks:

Image Classification: Included in this repo. See get_started.md for a quick start.

Object Detection and Instance Segmentation: Based on detectron2, coming soon.

ResT is initially described in arxiv, which capably serves as a general-purpose backbone for computer vision. It can tackle input images with arbitrary size. Besides, ResT compressed the memory of standard MSA and model the interaction between multi-heads while keeping the diversity ability.

Main Results on ImageNet with Pretrained Models

ImageNet-1K Pretrained Models

name	resolution	[email protected]	[email protected]	#params	FLOPs	FPS	1K model
ResT-Lite	224x224	77.2	93.7	10.5M	1.4G	1246	baidu
ResT-Small	224x224	79.6	94.9	13.7M	1.9G	1043	baidu
ResT-Base	224x224	81.6	95.7	30.3M	4.3G	673	baidu
ResT-Large	224x224	83.6	96.3	51.6M	7.9G	429	baidu

Note: access code for baidu is rest.

Citing ResT

@article{zhql2021ResT,
  title={ResT: An Efficient Transformer for Visual Recognition},
  author={Zhang, Qinglong and Yang, Yubin},
  journal={arXiv preprint arXiv:2105.13677v2},
  year={2021}
}

This is an official implementation for "ResT: An Efficient Transformer for Visual Recognition".

Related tags

Overview

ResT

Main Results on ImageNet with Pretrained Models

Citing ResT

Owner

zhql

Generating Images with Recurrent Adversarial Networks

LogAvgExp - Pytorch Implementation of LogAvgExp

Individual Tree Crown classification on WorldView-2 Images using Autoencoder -- Group 9 Weak learners - Final Project (Machine Learning 2020 Course)

We present a regularized self-labeling approach to improve the generalization and robustness properties of fine-tuning.

Dynamical Wasserstein Barycenters for Time Series Modeling

Balancing Principle for Unsupervised Domain Adaptation

CS550 Machine Learning course project on CNN Detection.

BirdCLEF 2021 - Birdcall Identification 4th place solution

Deep Latent Force Models

Prefix-Tuning: Optimizing Continuous Prompts for Generation

Code for "CloudAAE: Learning 6D Object Pose Regression with On-line Data Synthesis on Point Clouds" @ICRA2021

Progressive Growing of GANs for Improved Quality, Stability, and Variation

[WWW 2022] Zero-Shot Stance Detection via Contrastive Learning

[CVPR 2022] Unsupervised Image-to-Image Translation with Generative Prior

Style transfer, deep learning, feature transform

Real-Time Multi-Contact Model Predictive Control via ADMM

A collection of inference modules for fastai2

An official PyTorch implementation of the TKDE paper "Self-Supervised Graph Representation Learning via Topology Transformations".

Towards Understanding Quality Challenges of the Federated Learning: A First Look from the Lens of Robustness

Unofficial implementation of "Coordinate Attention for Efficient Mobile Network Design"