Multi-Glimpse Network With Python

Last update: May 10, 2022

Related tags

Deep Learning MGNet

Overview

Multi-Glimpse Network

Our code requires Python ≥ 3.8

Installation

For example, venv + pip:

$ python3 -m venv env
$ source env/bin/activate
(env) $ python3 -m pip install -r requirements.txt

Evaluation

Accuracy on clean images

Create ImageNet100 from ImageNet (using symbolic links).

$ python3 tools/create_imagenet100.py tools/imagenet100.txt \
    /path/to/ImageNet /path/to/ImageNet100

Download checkpoints from Google Drive.
Test accuracy.

$ export dataset="--train_dir /path/to/ImageNet100/train \
    --val_dir /path/to/ImageNet100/val \
    --dataset imagenet --num_class 100"
# Baseline
$ python3 main.py $dataset --test --n_iter 1 --scale 1.0  --model resnet18 \
    --checkpoint resnet18_baseline
# Ours
$ python3 main.py $dataset --test --n_iter 4 --scale 2.33 --model resnet18 \
    --checkpoint resnet18_ours --alpha 0.6 --s 0.02

Add the flag --flop_count to count the approximate FLOPs for the inference of an image. (using fvcore)

Accuracy on adversarial attacks (PGD)

Test adversarial accuracy.

# Baseline
$ python3 main.py $dataset --test --n_iter 1 --scale 1.0  --adv --step_k 10 \
    --model resnet18 --checkpoint resnet18_baseline
# Ours
$ python3 main.py $dataset --test --n_iter 4 --scale 2.33 --adv --step_k 10 \
    --model resnet18 --checkpoint resnet18_ours --alpha 0.6 --s 0.02

Accuracy on common corruptions

Create ImageNet100-C from ImageNet-C (using symbolic links).

$ python3 tools/create_imagenet100c.py  \
    tools/imagenet100.txt  /path/to/ImageNet-C/ /path/to/ImageNet100-C/

Test for a single corruption.

$ export dataset="--train_dir /path/to/ImageNet100/train \
    --val_dir /path/to/ImageNet100-C/pixelate/5 \
    --dataset imagenet --num_class 100"
# Baseline
$ python3 main.py $dataset --test --n_iter 1 --scale 1.0  --model resnet18 \
    --checkpoint resnet18_baseline
# Ours
$ python3 main.py $dataset --test --n_iter 4 --scale 2.33 --model resnet18 \
    --checkpoint resnet18_ours --alpha 0.6 --s 0.02

A simple script to test all corruptions and collect results.

# Modify tools/eval_imagenet100c.py and run it to generate script
$ python3 tools/eval_imagenet100c.py /home2/ImageNet100-C/ > run.sh
# Evaluate
$ bash run.sh
# Collect results
$ python3 tools/collect_imagenet100c.py

Training

$ export dataset="--train_dir /path/to/ImageNet100/train \
    --val_dir /path/to/ImageNet100/val \
    --dataset imagenet --num_class 100"
# Baseline
$ python3 main.py $dataset --epochs 400 --n_iter 1 --scale 1.0 \
    --model resnet18 --gpu 0,1,2,3
# Ours
$ python3 main.py $dataset --epochs 400 --n_iter 4 --scale 2.33 \
    --model resnet18 --alpha 0.6 --s 0.02  --gpu 0,1,2,3

Check tensorboard for the logs. (When training with multiple gpus, the log value may be scaled by the number of gpus except for the validation accuracy)

tensorboard  --logdir=logs

Note that we left our exploration in the code for further study, e.g., self-supervised spatial guidance, dynamic gradient re-scaling operation.

Multi-Glimpse Network With Python

Related tags

Overview

Multi-Glimpse Network

Installation

Evaluation

Accuracy on clean images

Accuracy on adversarial attacks (PGD)

Accuracy on common corruptions

Training

Owner

Research Artifact of USENIX Security 2022 Paper: Automated Side Channel Analysis of Media Software with Manifold Learning

Lucid library adapted for PyTorch

Repository relating to the CVPR21 paper TimeLens: Event-based Video Frame Interpolation

AQP is a modular pipeline built to enable the comparison and testing of different quality metric configurations.

An atmospheric growth and evolution model based on the EVo degassing model and FastChem 2.0

Food Drinks and groceries Images Multi Lingual (FooDI-ML) dataset.

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling

In this project we use both Resnet and Self-attention layer for cat, dog and flower classification.

A Python library that enables ML teams to share, load, and transform data in a collaborative, flexible, and efficient way :chestnut:

Myia prototyping

A toy compiler that can convert Python scripts to pickle bytecode 🥒

An efficient PyTorch implementation of the winning entry of the 2017 VQA Challenge.

Reinforcement Learning for Portfolio Management

Deep Residual Learning for Image Recognition

一个免费开源一键搭建的通用验证码识别平台，大部分常见的中英数验证码识别都没啥问题。

The official implementation of Variable-Length Piano Infilling (VLI).

ICS 4u HD project, start before-wards. A curtain shooting game using python.

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

MDETR: Modulated Detection for End-to-End Multi-Modal Understanding

Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers