This is an official PyTorch implementation of Fast Fourier Convolution.

Last update: Jan 03, 2023

Overview

Fast Fourier Convolution (FFC) for Image Classification

This is the official code of Fast Fourier Convolution for image classification on ImageNet.

Main Results

Results on ImageNet

Method	GFLOPs	#Params (M)	Top-1 Acc (%)
ResNet-50	4.1	25.6	76.3
FFC-ResNet-50	4.2	26.1	77.6
FFC-ResNet-50 (+LFU)	4.3	26.7	77.8
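
To make the cost/accuracy trade-off in the table easier to read, the short sketch below computes each FFC variant's difference from the ResNet-50 baseline. The numbers are copied from the table; reading #Params as millions and Top-1 Acc as percent is an assumption, and the helper name `delta_vs_baseline` is hypothetical.

```python
# (GFLOPs, #Params, Top-1 Acc) copied from the results table.
# Assumption: #Params is in millions and Top-1 Acc is in percent.
RESULTS = {
    "ResNet-50": (4.1, 25.6, 76.3),
    "FFC-ResNet-50": (4.2, 26.1, 77.6),
    "FFC-ResNet-50 (+LFU)": (4.3, 26.7, 77.8),
}


def delta_vs_baseline(name, baseline="ResNet-50"):
    """Return the per-column difference between a model and the baseline."""
    return tuple(
        round(value - base, 1)
        for value, base in zip(RESULTS[name], RESULTS[baseline])
    )


if __name__ == "__main__":
    for model in ("FFC-ResNet-50", "FFC-ResNet-50 (+LFU)"):
        print(model, delta_vs_baseline(model))
```

Under these assumptions, the (+LFU) variant gains 1.5 points of top-1 accuracy for 0.2 additional GFLOPs and 1.1 additional units of #Params.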

Quick start

Requirements

pip install -r requirements.txt

Data preparation

Prepare the ImageNet dataset as described in the PyTorch ImageNet example: https://github.com/pytorch/examples/tree/master/imagenet
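
The training and testing commands expect an ImageNet folder that contains train and val folders. The sketch below checks that layout before a long run starts. It assumes one subdirectory per class inside each split, as in the linked PyTorch example; the helper `check_imagenet_layout` is hypothetical and not part of this repository.

```python
import os


def check_imagenet_layout(root):
    """Return {split: number_of_class_dirs} for the train/ and val/ splits.

    Assumes each split holds one subdirectory per class.
    Raises FileNotFoundError if a split folder is missing.
    """
    counts = {}
    for split in ("train", "val"):
        split_dir = os.path.join(root, split)
        if not os.path.isdir(split_dir):
            raise FileNotFoundError(f"missing split folder: {split_dir}")
        # Count only directories; stray files in the split folder are ignored.
        counts[split] = sum(
            1
            for name in os.listdir(split_dir)
            if os.path.isdir(os.path.join(split_dir, name))
        )
    return counts
```

For a complete ImageNet-1k copy, both counts would normally be 1000; a mismatch between train and val usually means the validation images were not sorted into class folders.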

Training

To train a model, run main.py with the desired model architecture and other hyperparameters:

python main.py -a ffc_resnet50 --lfu [imagenet-folder with train and val folders]

The --lfu flag controls whether the Local Fourier Unit (LFU) is used. Default: False.
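
A flag that defaults to False and turns on when passed is typically declared with argparse's store_true action. The sketch below is a reduced, hypothetical parser that reproduces that behavior. It is not main.py's actual parser, and the long option name `--arch` is borrowed from the PyTorch ImageNet example as an assumption.

```python
import argparse


def build_parser():
    """Build a reduced, illustrative parser (not the repository's real one)."""
    parser = argparse.ArgumentParser(description="FFC training sketch")
    parser.add_argument("data", metavar="DIR",
                        help="ImageNet folder with train and val folders")
    # Assumption: -a maps to an 'arch' attribute, as in the PyTorch example.
    parser.add_argument("-a", "--arch", metavar="ARCH",
                        help="model architecture, e.g. ffc_resnet50")
    # store_true leaves args.lfu False unless --lfu appears on the command line.
    parser.add_argument("--lfu", action="store_true",
                        help="use the Local Fourier Unit")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args.arch, args.lfu, args.data)
```

With this pattern, omitting --lfu from the command line leaves the option at False, and passing it sets the option to True.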

Testing

python main.py -a ffc_resnet50 --lfu --resume PATH/TO/CHECKPOINT [imagenet-folder with train and val folders]

Citation

If you find this work or code helpful in your research, please cite:

@InProceedings{Chi_2020_FFC,
  author = {Chi, Lu and Jiang, Borui and Mu, Yadong},
  title = {Fast Fourier Convolution},
  booktitle = {Advances in Neural Information Processing Systems},
  year = {2020}
}

Owner

pkumi
