[ICLR 2021] "Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective" by Wuyang Chen, Xinyu Gong, Zhangyang Wang

Last update: Nov 28, 2022

Overview

Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective [PDF]

Wuyang Chen, Xinyu Gong, Zhangyang Wang

In ICLR 2021.

Overview

We present TE-NAS, the first published training-free neural architecture search method with extremely fast search speed (no gradient descent at all!) and high-quality performance.

Highlights:

Trainig-free and label-free NAS: we achieved extreme fast neural architecture search without a single gradient descent.
Bridging the theory-application gap: We identified two training-free indicators to rank the quality of deep networks: the condition number of their NTKs, and the number of linear regions in their input space.
SOTA: TE-NAS achieved extremely fast search speed (one 1080Ti, 20 minutes on NAS-Bench-201 space / four hours on DARTS space on ImageNet) and maintains competitive accuracy.

Prerequisites

Ubuntu 16.04
Python 3.6.9
CUDA 10.1 (lower versions may work but were not tested)
NVIDIA GPU + CuDNN v7.3

This repository has been tested on GTX 1080Ti. Configurations may need to be changed on different platforms.

Installation

Clone this repo:

git clone https://github.com/chenwydj/TENAS.git
cd TENAS

Install dependencies:

pip install -r requirements.txt

Usage

0. Prepare the dataset

Please follow the guideline here to prepare the CIFAR-10/100 and ImageNet dataset, and also the NAS-Bench-201 database.
Remember to properly set the TORCH_HOME and data_paths in the prune_launch.py.

1. Search

NAS-Bench-201 Space

python prune_launch.py --space nas-bench-201 --dataset cifar10 --gpu 0
python prune_launch.py --space nas-bench-201 --dataset cifar100 --gpu 0
python prune_launch.py --space nas-bench-201 --dataset ImageNet16-120 --gpu 0

DARTS Space (NASNET)

python prune_launch.py --space darts --dataset cifar10 --gpu 0
python prune_launch.py --space darts --dataset imagenet-1k --gpu 0

2. Evaluation

For architectures searched on nas-bench-201, the accuracies are immediately available at the end of search (from the console output).
For architectures searched on darts, please use DARTS_evaluation for training the searched architecture from scratch and evaluation.

Citation

@inproceedings{chen2020tenas,
  title={Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective},
  author={Chen, Wuyang and Gong, Xinyu and Wang, Zhangyang},
  booktitle={International Conference on Learning Representations},
  year={2021}
}

Acknowledgement

Code base from NAS-Bench-201.

[ICLR 2021] "Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective" by Wuyang Chen, Xinyu Gong, Zhangyang Wang

Related tags

Overview

Neural Architecture Search on ImageNet in Four GPU Hours: A Theoretically Inspired Perspective [PDF]

Overview

Prerequisites

Installation

Usage

0. Prepare the dataset

1. Search

NAS-Bench-201 Space

DARTS Space (NASNET)

2. Evaluation

Citation

Acknowledgement

Owner

VITA

Official page of Struct-MDC (RA-L'22 with IROS'22 option); Depth completion from Visual-SLAM using point & line features

Supervised Contrastive Learning for Downstream Optimized Sequence Representations

SVG Icon processing tool for C++

Boosted CVaR Classification (NeurIPS 2021)

An end-to-end implementation of intent prediction with Metaflow and other cool tools

Revitalizing CNN Attention via Transformers in Self-Supervised Visual Representation Learning

Federated Learning - Including common test models for federated learning, like CNN, Resnet18 and lstm, controlled by different parser

The implementation for "Comprehensive Knowledge Distillation with Causal Intervention".

Open-source implementation of Google Vizier for hyper parameters tuning

Self-Learned Video Rain Streak Removal: When Cyclic Consistency Meets Temporal Correspondence

Empirical Study of Transformers for Source Code & A Simple Approach for Handling Out-of-Vocabulary Identifiers in Deep Learning for Source Code

Combine Tacotron2 and Hifi GAN to generate speech from text

Fre-GAN: Adversarial Frequency-consistent Audio Synthesis

A toy compiler that can convert Python scripts to pickle bytecode 🥒

Code for the ECCV2020 paper "A Differentiable Recurrent Surface for Asynchronous Event-Based Data"

Code to reproduce the results in "Visually Grounded Reasoning across Languages and Cultures", EMNLP 2021.

PyTorch implementation of TSception V2 using DEAP dataset

CrossMLP - The repository offers the official implementation of our BMVC 2021 paper (oral) in PyTorch.

Official PyTorch Implementation of Unsupervised Learning of Scene Flow Estimation Fusing with Local Rigidity

Junction Tree Variational Autoencoder for Molecular Graph Generation (ICML 2018)