Deep Residual Networks with 1K Layers

By Kaiming He, Xiangyu Zhang, Shaoqing Ren, Jian Sun.

Microsoft Research Asia (MSRA).

Introduction
Notes
Usage

Introduction

This repository contains re-implemented code for the paper "Identity Mappings in Deep Residual Networks" (http://arxiv.org/abs/1603.05027). This work enables training quality 1k-layer neural networks in a super simple way.

Acknowledgement: This code is re-implemented by Xiang Ming from Xi'an Jiaotong Univeristy for the ease of release.

Seel Also: Re-implementations of ResNet-200 [a] on ImageNet from Facebook AI Research (FAIR): https://github.com/facebook/fb.resnet.torch/tree/master/pretrained

Notes

This code is based on the implementation of Torch ResNets (https://github.com/facebook/fb.resnet.torch).
The experiments in the paper were conducted in Caffe, whereas this code is re-implemented in Torch. We observed similar results within reasonable statistical variations.
To fit the 1k-layer models into memory without modifying much code, we simply reduced the mini-batch size to 64, noting that results in the paper were obtained with a mini-batch size of 128. Less expectedly, the results with the mini-batch size of 64 are slightly better:

mini-batch CIFAR-10 test error (%): (median (mean+/-std))

128 (as in [a]) 4.92 (4.89+/-0.14)

64 (as in this code) 4.62 (4.69+/-0.20)
Curves obtained by running this code with a mini-batch size of 64 (training loss: y-axis on the left; test error: y-axis on the right):

mini-batch	CIFAR-10 test error (%): (median (mean+/-std))
128 (as in [a])	4.92 (4.89+/-0.14)
64 (as in this code)	4.62 (4.69+/-0.20)

Usage

Install Torch ResNets (https://github.com/facebook/fb.resnet.torch) following instructions therein.
Add the file resnet-pre-act.lua from this repository to ./models.
To train ResNet-1001 as of the form in [a]:

th main.lua -netType resnet-pre-act -depth 1001 -batchSize 64 -nGPU 2 -nThreads 4 -dataset cifar10 -nEpochs 200 -shareGradInput false

Note: ``shareGradInput=true'' is not valid for this model yet.

Deep Residual Networks with 1K Layers

Related tags

Overview

Deep Residual Networks with 1K Layers

Table of Contents

Introduction

Notes

Usage

Owner

Kaiming He

Generates all variables from your .tf files into a variables.tf file.

Unofficial implementation of One-Shot Free-View Neural Talking Head Synthesis

A Python multilingual toolkit for Sentiment Analysis and Social NLP tasks

以孤立语假设和宽度优先搜索为基础，构建了一种多通道堆叠注意力Transformer结构的斗地主ai

ML-based medical imaging using Azure

MvtecAD unsupervised Anomaly Detection

The official code of "SCROLLS: Standardized CompaRison Over Long Language Sequences".

Patch2Pix: Epipolar-Guided Pixel-Level Correspondences [CVPR2021]

[NeurIPS2021] Code Release of K-Net: Towards Unified Image Segmentation

Official implementation of Deep Convolutional Dictionary Learning for Image Denoising.

TensorFlow Implementation of "Show, Attend and Tell"

This Deep Learning Model Predicts that from which disease you are suffering.

This is a simple backtesting framework to help you test your crypto currency trading. It includes a way to download and store historical crypto data and to execute a trading strategy.

Companion repo of the UCC 2021 paper "Predictive Auto-scaling with OpenStack Monasca"

Code for "Layered Neural Rendering for Retiming People in Video."

A high-performance distributed deep learning system targeting large-scale and automated distributed training.

Neural network chess engine trained on Gary Kasparov's games.

This is the pytorch implementation for the paper: Generalizable Mixed-Precision Quantization via Attribution Rank Preservation, which is accepted to ICCV2021.

Simple-Image-Classification - Simple Image Classification Code (PyTorch)

Advbox is a toolbox to generate adversarial examples that fool neural networks in PaddlePaddle、PyTorch、Caffe2、MxNet、Keras、TensorFlow and Advbox can benchmark the robustness of machine learning models.