Revisiting Dynamic Convolution via Matrix Decomposition (ICLR 2021)

The official PyTorch implementation of DCD (dynamic convolution decomposition). If you use this code in your research, please consider citing:

@article{li2021revisiting,
  title={Revisiting Dynamic Convolution via Matrix Decomposition},
  author={Li, Yunsheng and Chen, Yinpeng and Dai, Xiyang and Liu, Mengchen and Chen, Dongdong and Yu, Ye and Yuan, Lu and Liu, Zicheng and Chen, Mei and Vasconcelos, Nuno},
  journal={arXiv preprint arXiv:2103.08756},
  year={2021}
}

Requirements

Hardware: PC with an NVIDIA Titan GPU.
Software: Ubuntu 16.04, CUDA 10.0, Anaconda3, PyTorch 1.0.0.
Python packages:
- conda install --quiet --yes pytorch==1.0.0 torchvision==0.2.1 cuda100 -c pytorch
- pip install tensorboard tensorboardX pillow==6.1
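
To confirm that an environment matches the versions pinned above, the installed versions can be compared against the pins. The following is a minimal sketch and not part of the repository: `PINNED` and `check_versions` are hypothetical names, and the caller is assumed to collect the installed versions (for example from `torch.__version__`, `torchvision.__version__`, and `PIL.__version__`).

```python
# Versions pinned in the Requirements section (keys are illustrative names).
PINNED = {"torch": "1.0.0", "torchvision": "0.2.1", "pillow": "6.1"}


def check_versions(installed, pinned=PINNED):
    """Return a list of (name, installed_version, pinned_version) mismatches.

    A pin such as "6.1" matches any installed version whose leading
    dot-separated components are equal, e.g. "6.1.0".
    """
    mismatches = []
    for name, want in pinned.items():
        have = installed.get(name)
        want_parts = want.split(".")
        have_parts = have.split(".") if have is not None else []
        if have_parts[:len(want_parts)] != want_parts:
            mismatches.append((name, have, want))
    return mismatches


if __name__ == "__main__":
    # In the real environment these values would come from the imported
    # modules; they are hard-coded here so the sketch runs anywhere.
    example = {"torch": "1.0.0", "torchvision": "0.2.1", "pillow": "6.1.0"}
    print(check_versions(example))
```

An empty list means every pinned package was found at a matching version.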

Evaluate DCD on ImageNet

Pre-trained models are available for download: ResNet-50 and MobileNetV2x1.0.

DCD for ResNet-50

python main.py -a resnet50_dcd -d /path/to/imagenet/ -b 256 -c /path/to/output -j 48 --input-size 224 --dropout 0.1 --weight /path/to/resnet50_dcd.pth.tar --evaluate

DCD for MobileNetV2x1.0

python main.py -a mobilenetv2_dcd -d /path/to/imagenet/ -b 512 -c /path/to/output --width-mult 1.0 -j 48 --input-size 224 --dropout 0.1 --fc-squeeze 16 --weight /path/to/mv2x1.0_dcd.pth.tar --evaluate
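
When both models are evaluated from a script, the two commands above can be assembled from their shared flags. This is a sketch under stated assumptions: `build_eval_command` is a hypothetical helper, not something the repository provides, and it only reuses the flags that appear in the commands above.

```python
import shlex


def build_eval_command(arch, data_dir, weight, batch_size, out_dir, extra=()):
    """Assemble the argv list for an evaluation run of main.py.

    `extra` carries model-specific flags, e.g. the MobileNetV2 options
    ("--width-mult", "1.0", "--fc-squeeze", "16").
    """
    cmd = [
        "python", "main.py",
        "-a", arch,
        "-d", data_dir,
        "-b", str(batch_size),
        "-c", out_dir,
        "-j", "48",
        "--input-size", "224",
        "--dropout", "0.1",
    ]
    cmd.extend(extra)
    cmd.extend(["--weight", weight, "--evaluate"])
    return cmd


if __name__ == "__main__":
    resnet = build_eval_command(
        "resnet50_dcd", "/path/to/imagenet/",
        "/path/to/resnet50_dcd.pth.tar", 256, "/path/to/output",
    )
    # Quote each argument so the printed line can be pasted into a shell.
    print(" ".join(shlex.quote(part) for part in resnet))
```

The returned list can be passed directly to `subprocess.run`, which avoids shell quoting issues when paths contain spaces.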

Train DCD on ImageNet

DCD for ResNet-50

CUDA_VISIBLE_DEVICES=0,1,2,3 python main.py -a resnet50_dcd -d /path/to/imagenet/ -b 256 --epochs 120 --lr-decay schedule --lr 0.1 --wd 1e-4 -c /path/to/output -j 48 --input-size 224 --label-smoothing 0.1 --dropout 0.1 --mixup 0.2
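
The `--label-smoothing 0.1` and `--mixup 0.2` flags presumably select the standard label smoothing and mixup regularizers. The sketch below illustrates those standard formulations on plain Python lists. It assumes that 0.1 is the smoothing factor and 0.2 is the Beta distribution parameter; the function names are hypothetical, and the repository's `main.py` may implement these differently.

```python
import random


def smooth_one_hot(label, num_classes, eps):
    """Label smoothing: spread eps uniformly over all classes."""
    target = [eps / num_classes] * num_classes
    target[label] += 1.0 - eps
    return target


def mixup(x_a, y_a, x_b, y_b, alpha, rng=random):
    """Mixup: blend two samples and their targets with lam ~ Beta(alpha, alpha)."""
    lam = rng.betavariate(alpha, alpha)
    x = [lam * a + (1.0 - lam) * b for a, b in zip(x_a, x_b)]
    y = [lam * a + (1.0 - lam) * b for a, b in zip(y_a, y_b)]
    return x, y, lam


if __name__ == "__main__":
    rng = random.Random(0)  # fixed seed so repeated runs give the same blend
    y_a = smooth_one_hot(1, num_classes=4, eps=0.1)
    y_b = smooth_one_hot(3, num_classes=4, eps=0.1)
    x, y, lam = mixup([1.0, 0.0], y_a, [0.0, 1.0], y_b, alpha=0.2, rng=rng)
    print(lam, x, y)
```

With a small `alpha` such as 0.2, the Beta distribution concentrates near 0 and 1, so most blended samples stay close to one of the two originals.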

DCD for MobileNetV2x1.0

CUDA_VISIBLE_DEVICES=0,1,2,3 python main.py -a mobilenetv2_dcd -d /path/to/imagenet/ --epochs 300 --lr-decay cos --lr 0.1 --wd 2e-5 -c /path/to/output --width-mult 1.0 -j 48 --input-size 224 --label-smoothing 0.1 --dropout 0.2 -b 512 --mixup 0.2 --fc-squeeze 16
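
The MobileNetV2 command selects `--lr-decay cos` with `--lr 0.1` over 300 epochs. Assuming this refers to the usual cosine annealing schedule (an assumption; the exact form used by `main.py` is not stated here), the per-epoch learning rate can be sketched as follows. `cosine_lr` and its `min_lr` parameter are hypothetical names.

```python
import math


def cosine_lr(epoch, total_epochs, base_lr, min_lr=0.0):
    """Cosine annealing: decay from base_lr at epoch 0 to min_lr at the end."""
    progress = epoch / total_epochs
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    # Values taken from the MobileNetV2 training command above.
    for epoch in (0, 75, 150, 225, 300):
        print(epoch, cosine_lr(epoch, total_epochs=300, base_lr=0.1))
```

The schedule starts at the base learning rate, reaches half of it at the midpoint, and approaches `min_lr` at the final epoch.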
