Implementation of "RaScaNet: Learning Tiny Models by Raster-Scanning Image" from CVPR 2021.

Last update: Dec 26, 2022

Related tags

Overview

RaScaNet: Learning Tiny Models by Raster-Scanning Images

Deploying deep convolutional neural networks on ultra-low power systems is challenging, because the systems put a hard limit on the size of on-chip memory. To overcome this drawback, we propose a novel Raster-Scanning Network, named RaScaNet, inspired by raster-scanning in image sensors.

RaScaNet reads only a few rows of pixels at a time using a convolutional neural network and then sequentially learns the representation of the whole image using a recurrent neural network. The proposed method requires 15.9-24.3x smaller peak memory and 5.3-12.9x smaller weight memory than the state-of-the-art tiny models. The total memory usage of RaScaNet does not exceed 60 KB, in the VWW dataset with competitive accuracy.

Conference: CVPR 2021
Paper | Video | Citation

Requirements

python 3.6
torch 1.7.0
torchvision 0.8.1
pycocotools 2.0.1
numpy 0.19.0
VWW dataset

Usage

For running the model, (only support vww dataset)

python test.py --dataset='vww' --dataset_path={dataset_path} --rsz_w=240 --model_path=checkpoint/rascanet_210x240.pth.tar
python test.py --dataset='vww' --dataset_path={dataset_path} --rsz_w=120 --model_path=checkpoint/rascanet_105x120.pth.tar

With early termination,

python test.py --dataset='vww' --dataset_path={dataset_path} --rsz_w=240 --model_path=checkpoint/rascanet_210x240.pth.tar --early_terminate=1
python test.py --dataset='vww' --dataset_path={dataset_path} --rsz_w=120 --model_path=checkpoint/rascanet_105x120.pth.tar --early_terminate=1

Currently, we do not provide the code for training.

Result

Model	Weight Memory	Peak Memory	OPs Cnt.	Accuracy
rascanet(210x240)	47.03 KB	7.92 KB	56.34 M	91.835%
rascanet(105x120)	31.77 KB	3.60 KB	9.71 M	88.100%

Citation

@InProceedings{Yoo_2021_CVPR,
    author    = {Yoo, Jaehyoung and Lee, Dongwook and Son, Changyong and Jung, Sangil and Yoo, ByungIn and Choi, Changkyu and Han, Jae-Joon and Han, Bohyung},
    title     = {RaScaNet: Learning Tiny Models by Raster-Scanning Images},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
    month     = {June},
    year      = {2021},
    pages     = {13673-13682}
}

License

Copyright (C) 2021 Samsung Electronics Co. LTD

This software is a property of Samsung Electronics.
No part of this software, either material or conceptual may be copied or distributed, transmitted,
transcribed, stored in a retrieval system or translated into any human or computer language in any form by any means,
electronic, mechanical, manual or otherwise, or disclosed
to third parties without the express written permission of Samsung Electronics.
(Use of the Software is restricted to non-commercial, personal or academic, research purpose only)

Implementation of "RaScaNet: Learning Tiny Models by Raster-Scanning Image" from CVPR 2021.

Related tags

Overview

RaScaNet: Learning Tiny Models by Raster-Scanning Images

Requirements

Usage

Result

Citation

License

Owner

SAIT (Samsung Advanced Institute of Technology)

Normalizing Flows with a resampled base distribution

Style-based Point Generator with Adversarial Rendering for Point Cloud Completion (CVPR 2021)

PyTorch implementation of the cross-modality generative model that synthesizes dance from music.

Code for "R-GCN: The R Could Stand for Random"

A very impractical 3D rendering engine that runs in the python terminal.

Trajectory Prediction with Graph-based Dual-scale Context Fusion

Official implementation of Monocular Quasi-Dense 3D Object Tracking

Automatic Image Background Subtraction

Code for "ATISS: Autoregressive Transformers for Indoor Scene Synthesis", NeurIPS 2021

Simple reference implementation of GraphSAGE.

Official code for our EMNLP2021 Outstanding Paper MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks

the official code for ICRA 2021 Paper: "Multimodal Scale Consistency and Awareness for Monocular Self-Supervised Depth Estimation"

TensorFlow implementation of PHM (Parameterization of Hypercomplex Multiplication)

Pytorch code for "DPFM: Deep Partial Functional Maps" - 3DV 2021 (Oral)

[ICCV2021] Learning to Track Objects from Unlabeled Videos

LTR_CrossEncoder: Legal Text Retrieval Zalo AI Challenge 2021

A simple python module to generate anchor (aka default/prior) boxes for object detection tasks.

Python scripts for performing stereo depth estimation using the MobileStereoNet model in Tensorflow Lite.

Python Classes: Medical Insurance Project using Object Oriented Programming Concepts

Generative Art Using Neural Visual Grammars and Dual Encoders