Fastformer

Notes from the authors

Pytorch/Keras implementation of Fastformer. The keras version only includes the core fastformer attention part. The pytorch version is written in a huggingface transformers style. The jupyter notebooks contain the quickstart codes for text classification on AG's News (without pretrained word embeddings for simplicity), which can be directly run. We noticed that in our experiments, NOT all tasks need FFNN, residual connection, layer normalization and even position embedding. For example, we find that in news recommendation, it is better to directly use Fastformer without layer normalization and position embedding. However, in Ad CVR prediction, both position embedding and layer normalization are needed.

Keras version: 2.2.4 (may not be compatible with higher versions)

TF version: from 1.12 to 1.15 (may be compatible with lower versions)

Pytorch version: 1.6.0 (may be compatible with higher/lower versions)

Citation

@article{wu2021fastformer,
  title={Fastformer: Additive Attention Can Be All You Need},
  author={Wu, Chuhan and Wu, Fangzhao and Qi, Tao and Huang, Yongfeng},
  journal={arXiv preprint arXiv:2108.09084},
  year={2021}
}

A pytorch &keras implementation and demo of Fastformer.

Related tags

Overview

Fastformer

Notes from the authors

Citation

Owner

Code for the paper Hybrid Spectrogram and Waveform Source Separation

Athena is the only tool that you will ever need to optimize your portfolio.

Contrastive Learning for Metagenomic Binning

Most popular metrics used to evaluate object detection algorithms.

Tutel MoE: An Optimized Mixture-of-Experts Implementation

Boundary-aware Transformers for Skin Lesion Segmentation

Use your Philips Hue lights as Racing Flags. Works with Assetto Corsa, Assetto Corsa Competizione and iRacing.

Python implementation of "Single Image Haze Removal Using Dark Channel Prior"

The world's simplest facial recognition api for Python and the command line

deep_image_prior_extension

CvT2DistilGPT2 is an encoder-to-decoder model that was developed for chest X-ray report generation.

Parallel and High-Fidelity Text-to-Lip Generation; AAAI 2022 ; Official code

Knowledge Management for Humans using Machine Learning & Tags

HyperLib: Deep learning in the Hyperbolic space

PyTorch implementation of "VRT: A Video Restoration Transformer"

Torch-ngp - A pytorch implementation of the hash encoder proposed in instant-ngp

An interpreter for RASP as described in the ICML 2021 paper "Thinking Like Transformers"

Equipped customers with insights about their EVs Hourly energy consumption and helped predict future charging behavior using LSTM model

A PyTorch implementation of QANet.

Deep Unsupervised 3D SfM Face Reconstruction Based on Massive Landmark Bundle Adjustment.