Adaptive Fourier Neural Operators: Efficient Token Mixers for Transformers

This repository contains PyTorch implementation of the Adaptive Fourier Neural Operator token mixer. Classification code is also provided in the classification folder.

The Adaptive Fourier Neural Operator is a token mixer that learns to mix in the Fourier domain. AFNO is based on a principled foundation of operator learning which allows us to frame token mixing as a continuous global convolution without any dependence on the input resolution. This principle was previously used to design FNO, which solves global convolution efficiently in the Fourier domain and has shown promise in learning challenging PDEs. To handle challenges in visual representation learning such as discontinuities in images and high resolution inputs, we propose principled architectural modifications to FNO which results in memory and computational efficiency. This includes imposing a block-diagonal structure on the channel mixing weights, adaptively sharing weights across tokens, and sparsifying the frequency modes via soft-thresholding and shrinkage. The resulting model is highly parallel with a quasi-linear complexity and has linear memory in the sequence size.

[arXiv]

Usage

Requirements

torch>=1.8.0
torchvision
timm

Note: To use the rfft2 and irfft2 functions in PyTorch, you need to install PyTorch>=1.8.0. Complex numbers are supported after PyTorch 1.6.0, but the fft API is slightly different from the current version.

Installation

pip install -e .

Example

from afno import AFNO1D, AFNO2D

mixer = AFNO1D()
mixer = AFNO2D()

Citation

If you find our work useful in your research, please consider citing:

@inproceedings{guibas2021efficient,
  title={Efficient Token Mixing for Transformers via Adaptive Fourier Neural Operators},
  author={Guibas, John and Mardani, Morteza and Li, Zongyi and Tao, Andrew and Anandkumar, Anima and Catanzaro, Bryan},
  booktitle={International Conference on Learning Representations},
  year={2021}
}

Adaptive FNO transformer - official Pytorch implementation

Related tags

Overview

Adaptive Fourier Neural Operators: Efficient Token Mixers for Transformers

Usage

Requirements

Installation

Example

Citation

Owner

NVIDIA Research Projects

Keras implementation of AdaBound

Code base for reproducing results of I.Schubert, D.Driess, O.Oguz, and M.Toussaint: Learning to Execute: Efficient Learning of Universal Plan-Conditioned Policies in Robotics. NeurIPS (2021)

Code for the paper Progressive Pose Attention for Person Image Generation in CVPR19 (Oral).

Affine / perspective transformation in Pose Estimation with Tensorflow 2

CSD: Consistency-based Semi-supervised learning for object Detection

Project looking into use of autoencoder for semi-supervised learning and comparing data requirements compared to supervised learning.

CS506-Spring2022 - Code and Slides for Boston University CS 506

4th place solution for the SIGIR 2021 challenge.

A general-purpose encoder-decoder framework for Tensorflow

PyTorch implementation of "ContextNet: Improving Convolutional Neural Networks for Automatic Speech Recognition with Global Context" (INTERSPEECH 2020)

Text to Image Generation with Semantic-Spatial Aware GAN

A framework for using LSTMs to detect anomalies in multivariate time series data. Includes spacecraft anomaly data and experiments from the Mars Science Laboratory and SMAP missions.

ICNet and PSPNet-50 in Tensorflow for real-time semantic segmentation

Implementation of Restricted Boltzmann Machine (RBM) and its variants in Tensorflow

Multi Agent Path Finding Algorithms

scalingscattering

An Implicit Function Theorem (IFT) optimizer for bi-level optimizations

Code for the TPAMI paper: "Syntax Customized Video Captioning by Imitating Exemplar Sentences"

Official implementation of the paper ``Unifying Nonlocal Blocks for Neural Networks'' (ICCV'21)