Reproduces ResNet-V3 with pytorch

Last update: Dec 23, 2022

Overview

ResNeXt.pytorch

Reproduces ResNeXt, also referred to here as ResNet-V3 (Aggregated Residual Transformations for Deep Neural Networks), in PyTorch.

Download

git clone https://github.com/prlz77/resnext.pytorch
cd resnext.pytorch
# git checkout R4.0 or R3.0 for backwards compatibility (not recommended).

Usage

To train on CIFAR-10 using 2 GPUs:

python train.py ~/DATASETS/cifar.python cifar10 -s ./snapshots --log ./logs --ngpu 2 --learning_rate 0.05 -b 128

It should reach ~3.65% test error on CIFAR-10 and ~17.77% on CIFAR-100.

After the training phase, you can check the saved model.

Thanks to @AppleHolic, we now have a test script.

To test on CIFAR-10 using 2 GPUs:

python test.py ~/DATASETS/cifar.python cifar10 --ngpu 2 --load ./snapshots/model.pytorch --test_bs 128

Configurations

From the original paper:

cardinality	base_width	parameters	error CIFAR-10 (%)	error CIFAR-100 (%)	default
8	64	34.4M	3.65	17.77	x
16	64	68.1M	3.58	17.31

Update: widen_factor has been disentangled from base_width because coupling them was confusing. widen_factor is now fixed at 4, and base_width has the same meaning as in the original paper.
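To make the relationship between cardinality, base_width, and widen_factor concrete, here is a minimal sketch that derives the grouped-convolution width of each bottleneck block and counts convolution weights. It is not the repository's code: the function names are hypothetical, and it assumes the ResNeXt-29 layout from the paper (a 3x3 stem with 64 channels, then 3 stages of 3 bottleneck blocks), bias-free convolutions, and a 1x1 projection shortcut wherever the channel count changes. Batch-norm and classifier parameters are left out, so the totals should land slightly below the figures in the table.

```python
def group_conv_width(out_channels, cardinality, base_width, widen_factor=4):
    """Channels of the grouped 3x3 convolution inside one bottleneck block."""
    # Stage outputs are 64 * widen_factor * {1, 2, 4}, so the ratio is 1, 2, 4.
    width_ratio = out_channels / (widen_factor * 64)
    return cardinality * int(base_width * width_ratio)


def bottleneck_conv_params(in_channels, out_channels, cardinality, base_width,
                           widen_factor=4):
    """Convolution weights in one block (assumed layout, no biases)."""
    d = group_conv_width(out_channels, cardinality, base_width, widen_factor)
    reduce = in_channels * d                # 1x1 conv: in_channels -> d
    grouped = 3 * 3 * d * d // cardinality  # 3x3 conv split into `cardinality` groups
    expand = d * out_channels               # 1x1 conv: d -> out_channels
    # Assumption: a 1x1 projection shortcut only when the channel count changes.
    shortcut = in_channels * out_channels if in_channels != out_channels else 0
    return reduce + grouped + expand + shortcut


def resnext29_conv_params(cardinality, base_width, widen_factor=4,
                          blocks_per_stage=3):
    """Total convolution weights for the assumed ResNeXt-29 layout."""
    total = 3 * 3 * 3 * 64  # 3x3 stem convolution: RGB -> 64 channels
    in_channels = 64
    for stage in range(3):
        out_channels = 64 * widen_factor * 2 ** stage
        for _ in range(blocks_per_stage):
            total += bottleneck_conv_params(in_channels, out_channels,
                                            cardinality, base_width, widen_factor)
            in_channels = out_channels
    return total


if __name__ == "__main__":
    for cardinality, base_width in [(8, 64), (16, 64)]:
        millions = resnext29_conv_params(cardinality, base_width) / 1e6
        print(f"{cardinality}x{base_width}d: {millions:.2f}M convolution weights")
```

Because the 3x3 convolution is split into `cardinality` groups, its weight count is divided by the cardinality, which is why doubling the cardinality at a fixed base_width roughly doubles the parameter count instead of quadrupling it.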

Trained models and curves

Link to trained models corresponding to the following curves:

Update: several commits have been pushed since the models hosted on Mega were trained, so reverting to commit e10c37d8cf7a958048bc0f58cd86c3e8ac4e707d is recommended when using them.

Other frameworks

torch (@facebookresearch). Original implementation; CIFAR and ImageNet
caffe (@terrychenism). ImageNet
MXNet (@dmlc). ImageNet

Cite

@article{xie2016aggregated,
  title={Aggregated residual transformations for deep neural networks},
  author={Xie, Saining and Girshick, Ross and Doll{\'a}r, Piotr and Tu, Zhuowen and He, Kaiming},
  journal={arXiv preprint arXiv:1611.05431},
  year={2016}
}

Owner

Pau Rodriguez
