Adaptive Fourier Neural Operators: Efficient Token Mixers for Transformers

This repository contains PyTorch implementation of the Adaptive Fourier Neural Operator token mixer. Classification code is also provided in the classification folder.

The Adaptive Fourier Neural Operator is a token mixer that learns to mix in the Fourier domain. AFNO is based on a principled foundation of operator learning which allows us to frame token mixing as a continuous global convolution without any dependence on the input resolution. This principle was previously used to design FNO, which solves global convolution efficiently in the Fourier domain and has shown promise in learning challenging PDEs. To handle challenges in visual representation learning such as discontinuities in images and high resolution inputs, we propose principled architectural modifications to FNO which results in memory and computational efficiency. This includes imposing a block-diagonal structure on the channel mixing weights, adaptively sharing weights across tokens, and sparsifying the frequency modes via soft-thresholding and shrinkage. The resulting model is highly parallel with a quasi-linear complexity and has linear memory in the sequence size.

[arXiv]

Usage

Requirements

torch>=1.8.0
torchvision
timm

Note: To use the rfft2 and irfft2 functions in PyTorch, you need to install PyTorch>=1.8.0. Complex numbers are supported after PyTorch 1.6.0, but the fft API is slightly different from the current version.

Installation

pip install -e .

Example

from afno import AFNO1D, AFNO2D

mixer = AFNO1D()
mixer = AFNO2D()

Citation

If you find our work useful in your research, please consider citing:

@inproceedings{guibas2021efficient,
  title={Efficient Token Mixing for Transformers via Adaptive Fourier Neural Operators},
  author={Guibas, John and Mardani, Morteza and Li, Zongyi and Tao, Andrew and Anandkumar, Anima and Catanzaro, Bryan},
  booktitle={International Conference on Learning Representations},
  year={2021}
}

Adaptive FNO transformer - official Pytorch implementation

Related tags

Overview

Adaptive Fourier Neural Operators: Efficient Token Mixers for Transformers

Usage

Requirements

Installation

Example

Citation

Owner

NVIDIA Research Projects

Large-Scale Unsupervised Object Discovery

Really awesome semantic segmentation

State of the Art Neural Networks for Deep Learning

YOLOv5 Series Multi-backbone, Pruning and quantization Compression Tool Box.

Ivy is a templated deep learning framework which maximizes the portability of deep learning codebases.

A Low Complexity Speech Enhancement Framework for Full-Band Audio (48kHz) based on Deep Filtering.

Text Summarization - WCN — Weighted Contextual N-gram method for evaluation of Text Summarization

StarGAN - Official PyTorch Implementation (CVPR 2018)

Repo for WWW 2022 paper: Progressively Optimized Bi-Granular Document Representation for Scalable Embedding Based Retrieval

Code for our WACV 2022 paper "Hyper-Convolution Networks for Biomedical Image Segmentation"

A GPU-optional modular synthesizer in pytorch, 16200x faster than realtime, for audio ML researchers.

HINet: Half Instance Normalization Network for Image Restoration

MAVE: : A Product Dataset for Multi-source Attribute Value Extraction

NER for Indian languages

A curated list of neural network pruning resources.

Real-CUGAN - Real Cascade U-Nets for Anime Image Super Resolution

chainladder - Property and Casualty Loss Reserving in Python

Official implementation of "StyleCariGAN: Caricature Generation via StyleGAN Feature Map Modulation" (SIGGRAPH 2021)

Simple ONNX operation generator. Simple Operation Generator for ONNX.

EMNLP'2021: Simple Entity-centric Questions Challenge Dense Retrievers