Bottleneck Transformers for Visual Recognition

Last update: Jan 03, 2023

Overview

Bottleneck Transformers for Visual Recognition

Experiments

Model	Params (M)	Acc (%)
ResNet50 baseline (ref)	23.5M	93.62
BoTNet-50	18.8M	95.11%
BoTNet-S1-50	18.8M	95.67%
BoTNet-S1-59	27.5M	95.98%
BoTNet-S1-77	44.9M	wip

Summary

Usage (example)

Model

from model import Model

model = ResNet50(num_classes=1000, resolution=(224, 224))
x = torch.randn([2, 3, 224, 224])
print(model(x).size())

Module

from model import MHSA

resolution = 14
mhsa = MHSA(planes, width=resolution, height=resolution)

Reference

Paper link
Author: Aravind Srinivas, Tsung-Yi Lin, Niki Parmar, Jonathon Shlens, Pieter Abbeel, Ashish Vaswani
Organization: UC Berkeley, Google Research

Owner

Myeongjun Kim

Computer Vision Research using Deep Learning

GitHub Repository

High accurate tool for automatic faces detection with landmarks

faces_detanator High accurate tool for automatic faces detection with landmarks. The library is based on public detectors with high accuracy (TinaFace

7 May 10, 2022

Implementation of the paper All Labels Are Not Created Equal: Enhancing Semi-supervision via Label Grouping and Co-training

SemCo The official pytorch implementation of the paper All Labels Are Not Created Equal: Enhancing Semi-supervision via Label Grouping and Co-training

42 Nov 14, 2022

Code accompanying "Adaptive Methods for Aggregated Domain Generalization"

Adaptive Methods for Aggregated Domain Generalization (AdaClust) Official Pytorch Implementation of Adaptive Methods for Aggregated Domain Generalizat

15 Sep 20, 2022

Repositório da disciplina de APC, no segundo semestre de 2021

NOTAS FINAIS: https://github.com/fabiommendes/apc2018/blob/master/nota-final.pdf Algoritmos e Programação de Computadores Este é o Git da disciplina A

16 Dec 16, 2022

PowerGridworld: A Framework for Multi-Agent Reinforcement Learning in Power Systems

PowerGridworld provides users with a lightweight, modular, and customizable framework for creating power-systems-focused, multi-agent Gym environments that readily integrate with existing training fr

37 Dec 17, 2022

A minimal solution to hand motion capture from a single color camera at over 100fps. Easy to use, plug to run.

Minimal Hand A minimal solution to hand motion capture from a single color camera at over 100fps. Easy to use, plug to run. This project provides the

824 Jan 07, 2023

Time Series Cross-Validation -- an extension for scikit-learn

TSCV: Time Series Cross-Validation This repository is a scikit-learn extension for time series cross-validation. It introduces gaps between the traini

222 Jan 01, 2023

GE2340 project source code without credentials.

GE2340-Project-Public GE2340 project source code without credentials. Run the bot.py to start the bot Telegram: @jasperwong_ge2340_bot If the bot does

0 Feb 10, 2022

Cross View SLAM

Cross View SLAM This is the associated code and dataset repository for our paper I. D. Miller et al., "Any Way You Look at It: Semantic Crossview Loca

99 Dec 09, 2022

Speech Enhancement Generative Adversarial Network Based on Asymmetric AutoEncoder

ASEGAN: Speech Enhancement Generative Adversarial Network Based on Asymmetric AutoEncoder 中文版简介 Readme with English Version 介绍基于SEGAN模型的改进版本，使用自主设计的非

53 Nov 17, 2022

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting This is the origin Pytorch implementation of Informer in the followin

3.1k Dec 29, 2022

Non-Homogeneous Poisson Process Intensity Modeling and Estimation using Measure Transport

Non-Homogeneous Poisson Process Intensity Modeling and Estimation using Measure Transport This GitHub page provides code for reproducing the results i

1 Nov 08, 2021

VD-BERT: A Unified Vision and Dialog Transformer with BERT

VD-BERT: A Unified Vision and Dialog Transformer with BERT PyTorch Code for the following paper at EMNLP2020: Title: VD-BERT: A Unified Vision and Dia

44 Nov 01, 2022

tmm_fast is a lightweight package to speed up optical planar multilayer thin-film device computation.

tmm_fast tmm_fast or transfer-matrix-method_fast is a lightweight package to speed up optical planar multilayer thin-film device computation. It is es

26 Dec 11, 2022

[ICCV 2021 (oral)] Planar Surface Reconstruction from Sparse Views

Planar Surface Reconstruction From Sparse Views Linyi Jin, Shengyi Qian, Andrew Owens, David F. Fouhey University of Michigan ICCV 2021 (Oral) This re

89 Jan 05, 2023

Hyper-parameter optimization for sklearn

hyperopt-sklearn Hyperopt-sklearn is Hyperopt-based model selection among machine learning algorithms in scikit-learn. See how to use hyperopt-sklearn

1.4k Jan 01, 2023

Convert dog pictures into various painting styles. Try LimnPet

LimnPet Cartoon stylization service project Try our service » Home page · Team notion · Members 목차 프로젝트 소개 프로젝트 목표 사용한 기술스택과 수행도구 팀원 구현 기능 주요 기능 추가 기능

7 Jul 14, 2022

MLOps will help you to understand how to build a Continuous Integration and Continuous Delivery pipeline for an ML/AI project.

page_type languages products description sample python azure azure-machine-learning-service azure-devops Code which demonstrates how to set up and ope

1 Nov 01, 2021

Keeping it safe - AI Based COVID-19 Tracker using Deep Learning and facial recognition

15 Jun 17, 2021

Project of 'TBEFN: A Two-branch Exposure-fusion Network for Low-light Image Enhancement '

TBEFN: A Two-branch Exposure-fusion Network for Low-light Image Enhancement Codes for TMM20 paper "TBEFN: A Two-branch Exposure-fusion Network for Low

31 Nov 06, 2022

Bottleneck Transformers for Visual Recognition

Related tags

Overview

Bottleneck Transformers for Visual Recognition

Experiments

Summary

Usage (example)

Reference

Owner

Myeongjun Kim

High accurate tool for automatic faces detection with landmarks

Implementation of the paper All Labels Are Not Created Equal: Enhancing Semi-supervision via Label Grouping and Co-training

Code accompanying "Adaptive Methods for Aggregated Domain Generalization"

Repositório da disciplina de APC, no segundo semestre de 2021

PowerGridworld: A Framework for Multi-Agent Reinforcement Learning in Power Systems

A minimal solution to hand motion capture from a single color camera at over 100fps. Easy to use, plug to run.

Time Series Cross-Validation -- an extension for scikit-learn

GE2340 project source code without credentials.

Cross View SLAM

Speech Enhancement Generative Adversarial Network Based on Asymmetric AutoEncoder

Informer: Beyond Efficient Transformer for Long Sequence Time-Series Forecasting

Non-Homogeneous Poisson Process Intensity Modeling and Estimation using Measure Transport

VD-BERT: A Unified Vision and Dialog Transformer with BERT

tmm_fast is a lightweight package to speed up optical planar multilayer thin-film device computation.

[ICCV 2021 (oral)] Planar Surface Reconstruction from Sparse Views

Hyper-parameter optimization for sklearn

Convert dog pictures into various painting styles. Try LimnPet

MLOps will help you to understand how to build a Continuous Integration and Continuous Delivery pipeline for an ML/AI project.

Keeping it safe - AI Based COVID-19 Tracker using Deep Learning and facial recognition

Project of 'TBEFN: A Two-branch Exposure-fusion Network for Low-light Image Enhancement '