Deep Residual Networks with 1K Layers

By Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun.

Microsoft Research Asia (MSRA).

Introduction
Notes
Usage

Introduction

This repository contains re-implemented code for the paper "Identity Mappings in Deep Residual Networks" (http://arxiv.org/abs/1603.05027). This work enables training quality 1k-layer neural networks in a super simple way.

Acknowledgement: This code is re-implemented by Xiang Ming from Xi'an Jiaotong Univeristy for the ease of release.

Seel Also: Re-implementations of ResNet-200 [a] on ImageNet from Facebook AI Research (FAIR): https://github.com/facebook/fb.resnet.torch/tree/master/pretrained

Notes

This code is based on the implementation of Torch ResNets (https://github.com/facebook/fb.resnet.torch).
The experiments in the paper were conducted in Caffe, whereas this code is re-implemented in Torch. We observed similar results within reasonable statistical variations.
To fit the 1k-layer models into memory without modifying much code, we simply reduced the mini-batch size to 64, noting that results in the paper were obtained with a mini-batch size of 128. Less expectedly, the results with the mini-batch size of 64 are slightly better:

mini-batch CIFAR-10 test error (%): (median (mean+/-std))

128 (as in [a]) 4.92 (4.89+/-0.14)

64 (as in this code) 4.62 (4.69+/-0.20)
Curves obtained by running this code with a mini-batch size of 64 (training loss: y-axis on the left; test error: y-axis on the right):

mini-batch	CIFAR-10 test error (%): (median (mean+/-std))
128 (as in [a])	4.92 (4.89+/-0.14)
64 (as in this code)	4.62 (4.69+/-0.20)

Usage

Install Torch ResNets (https://github.com/facebook/fb.resnet.torch) following instructions therein.
Add the file resnet-pre-act.lua from this repository to ./models.
To train ResNet-1001 as of the form in [a]:

th main.lua -netType resnet-pre-act -depth 1001 -batchSize 64 -nGPU 2 -nThreads 4 -dataset cifar10 -nEpochs 200 -shareGradInput false

Note: ``shareGradInput=true'' is not valid for this model yet.

Deep Residual Networks with 1K Layers

Related tags

Overview

Deep Residual Networks with 1K Layers

Table of Contents

Introduction

Notes

Usage

Owner

Kaiming He

This folder contains the implementation of the multi-relational attribute propagation algorithm.

[CVPR 2021] Region-aware Adaptive Instance Normalization for Image Harmonization

MinkLoc++: Lidar and Monocular Image Fusion for Place Recognition

DCSAU-Net: A Deeper and More Compact Split-Attention U-Net for Medical Image Segmentation

Single-Stage Instance Shadow Detection with Bidirectional Relation Learning (CVPR 2021 Oral)

An implementation of MobileFormer

pytorch, hand(object) detect ,yolo v5，手检测

API for RL algorithm design & testing of BCA (Building Control Agent) HVAC on EnergyPlus building energy simulator by wrapping their EMS Python API

Invariant Causal Prediction for Block MDPs

Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms

The spiritual successor to knockknock for PyTorch Lightning, get notified when your training ends

Official Implementation of "Third Time's the Charm? Image and Video Editing with StyleGAN3" https://arxiv.org/abs/2201.13433

Spectrum Surveying: Active Radio Map Estimation with Autonomous UAVs

Semantic code search implementation using Tensorflow framework and the source code data from the CodeSearchNet project

Trading Strategies for Freqtrade

This is the official implement of paper "ActionCLIP: A New Paradigm for Action Recognition"

Like ThreeJS but for Python and based on wgpu

A minimal solution to hand motion capture from a single color camera at over 100fps. Easy to use, plug to run.

This is an official implementation for "PlaneRecNet".

Official code for our EMNLP2021 Outstanding Paper MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks