MobileFormer

An implementation of MobileFormer proposed by Yinpeng Chen, Xiyang Dai et al.

Including

[1] Mobile-Former proposed in: 
                        Yinpeng Chen, Xiyang Dai et al., Mobile-Former: Bridging MobileNet and Transformer. 
                        arxiv.org/abs/2108.05895
[2] Dynamtic ReLU proposed in: 
                        Yinpeng Chen, Xiyang Dai et al., Dynamtic ReLU. 
                        arxiv.org/abs/2003.10027v2
[3] Lite-BottleNeck proposed in: 
                        Yunsheng Li, Yinpeng Chen et al., MicroNet: Improving Image Recognition with Extremely Low FLOPs. 
                        arxiv.org/abs/2108.05894v1
[4] Adam-W proposed in:
                        Ilya Loshchilov & Frank Hutter, Decoupled Weight Decay Regularization.
                        arxiv.org/abs/1711.05101v3
[5] Mixup proposed in:
                        Hongyi Zhang, Moustapha Cisse et al., Mixup: Beyond Empircal Risk Minimization.
                        arxiv.org/abs/1710.09412
[6] Multi-FocalLoss (not used), focal loss is proposed in:
                        Tsung-Yi Lin, Priya Goyal, Ross Girshick, Kaiming He, Piotr Dollár, Focal Loss for Dense Object Detection.
                        arxiv.org/abs/1708.02002

Note

(1) Due to the expanded DW conv used in strided Mobile-Former blocks, 
    the out_channel should be divisible by expand_size of the next block.
(2) Adam-W and Mixup is embedded in train.py.
(3) Use run() in train.py to train('run') or search('search'). There is an example in the train.py.

'###### The '#'s #######'

'##### are aligned #####'

No pre-train parameters for now.

An implementation of MobileFormer

Related tags

Overview

MobileFormer

Including

Note

'###### The '#'s #######'

'##### are aligned #####'

Owner

slwang9353

Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Dask, Flink and DataFlow

Hyperbolic Hierarchical Clustering.

Deploy a ML inference service on a budget in less than 10 lines of code.

PyGAD, a Python 3 library for building the genetic algorithm and training machine learning algorithms (Keras & PyTorch).

To provide 100 JAX exercises over different sections structured as a course or tutorials to teach and learn for beginners, intermediates as well as experts

Official implementation for TTT++: When Does Self-supervised Test-time Training Fail or Thrive

Tensorflow Repo for "DeepGCNs: Can GCNs Go as Deep as CNNs?"

Official Pytorch implementation of the paper: "Locally Shifted Attention With Early Global Integration"

This is an official pytorch implementation of Fast Fourier Convolution.

GradAttack is a Python library for easy evaluation of privacy risks in public gradients in Federated Learning

Collision risk estimation using stochastic motion models

Compositional Sketch Search

Keras + Hyperopt: A very simple wrapper for convenient hyperparameter optimization

Official code for CVPR2022 paper: Depth-Aware Generative Adversarial Network for Talking Head Video Generation

PyTorch implementation of TSception V2 using DEAP dataset

This repository contains the re-implementation of our paper deSpeckNet: Generalizing Deep Learning Based SAR Image Despeckling

FactSeg: Foreground Activation Driven Small Object Semantic Segmentation in Large-Scale Remote Sensing Imagery (TGRS)

Neural Style and MSG-Net

LoFTR:Detector-Free Local Feature Matching with Transformers CVPR 2021

A PyTorch implementation of EventProp [https://arxiv.org/abs/2009.08378], a method to train Spiking Neural Networks