An efficient and easy-to-use deep learning model compression framework

Last update: Dec 25, 2022

Related tags

Overview

TinyNeuralNetwork

TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework, which contains features like neural architecture search, pruning, quantization, model conversion and etc. It has been utilized for the deployment on devices such as Tmall Genie, Haier TV, Youku video, face recognition check-in machine, and etc, which equips over 10 million IoT devices with AI capability.

Installation

Python >= 3.6, PyTorch >= 1.4（ PyTorch >= 1.6 if quantization-aware training is involved ）

# Install the TinyNeuralNetwork framework
git clone https://github.com/alibaba/TinyNeuralNetwork.git
cd TinyNeuralNetwork
python setup.py install

# Alternatively, you may try the one-liner
pip install git+https://github.com/alibaba/TinyNeuralNetwork.git

Basic modules

Computational graph capture: The Graph Tracer in TinyNeuralNetwork captures connectivity of PyTorch operators, which automates pruning and model quantization. It also supports code generation from PyTorch models to equivalent model description files (e.g. models.py).
Dependency resolving: Modifying an operator often causes mismatch in subgraph, i.e. mismatch with other dependent operators. The Graph Modifier in TinyNeuralNetwork handles the mismatchs automatically within and between subgraphs to automate the computational graph modification.
Pruner: OneShot (L1, L2, FPGM), ADMM, NetAdapt, Gradual, End2End and other pruning algorithms have been implemented and will be opened gradually.
Quantization-aware training: TinyNeuralNetwork uses PyTorch's QAT as the backend (we also support simulated bfloat16 training) and optimizes its usability with automating the fusion of operators and quantization of computational graphs (the official implementation requires manual implementation by the user, which is a huge workload).
Model conversion: TinyNeuralNetwork supports conversion of floating-point and quantized PyTorch models to TFLite models for end-to-end deployment.

Project architecture

examples: Provides examples of each module
models: Provides pre-trained models for getting quickstart
tests: Unit tests
tinynn: Code for model compression
- graph : Foundation for computational graph capture, resolving, quantization, code generation, mask management, and etc
- prune : Pruning algorithms
- converter : Model converter
- util: Utility classes

RoadMap

Nov. 2021: A new pruner with adaptive sparsity
Dec. 2021: Model compression for Transformers

Frequently Asked Questions

Because of the high complexity and frequent updates of PyTorch, we cannot ensure that all cases are covered through automated testing. When you encounter problems You can check out the FAQ, or join the Q&A group in DingTalk via the QR Code below.

An efficient and easy-to-use deep learning model compression framework

Related tags

Overview

TinyNeuralNetwork

Installation

Basic modules

Project architecture

RoadMap

Frequently Asked Questions

Owner

Alibaba

Code for "On the Effects of Batch and Weight Normalization in Generative Adversarial Networks"

Tensorflow Implementation of ECCV'18 paper: Multimodal Human Motion Synthesis

ObjDetApp deploys a pytorch model for object detection

Time Delayed NN implemented in pytorch

Code for DisCo: Remedy Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning

Computationally Efficient Optimization of Plackett-Luce Ranking Models for Relevance and Fairness

Official implementation of NeurIPS'21: Implicit SVD for Graph Representation Learning

TensorFlow Implementation of Unsupervised Cross-Domain Image Generation

Voxel-based Network for Shape Completion by Leveraging Edge Generation (ICCV 2021, oral)

MVSDF - Learning Signed Distance Field for Multi-view Surface Reconstruction

Implementation of the paper Scalable Intervention Target Estimation in Linear Models (NeurIPS 2021), and the code to generate simulation results.

Official Pytorch implementation for AAAI2021 paper (RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning)

An automated algorithm to extract the linear blend skinning (LBS) from a set of example poses

Official code release for "Learned Spatial Representations for Few-shot Talking-Head Synthesis" ICCV 2021

This repository contains several jupyter notebooks to help users learn to use neon, our deep learning framework

O-CNN: Octree-based Convolutional Neural Networks for 3D Shape Analysis

[CVPR2021] DoDNet: Learning to segment multi-organ and tumors from multiple partially labeled datasets

Minimal diffusion models - Minimal code and simple experiments to play with Denoising Diffusion Probabilistic Models (DDPMs)

Marine debris detection with commercial satellite imagery and deep learning.

Vanilla and Prototypical Networks with Random Weights for image classification on Omniglot and mini-ImageNet. Made with Python3.

An efficient and easy-to-use deep learning model compression framework

Related tags

Overview

TinyNeuralNetwork

Installation

Basic modules

Project architecture

RoadMap

Frequently Asked Questions

Owner

Alibaba

Code for "On the Effects of Batch and Weight Normalization in Generative Adversarial Networks"

Tensorflow Implementation of ECCV'18 paper: Multimodal Human Motion Synthesis

*ObjDetApp* deploys a pytorch model for object detection

Time Delayed NN implemented in pytorch

Code for DisCo: Remedy Self-supervised Learning on Lightweight Models with Distilled Contrastive Learning

Computationally Efficient Optimization of Plackett-Luce Ranking Models for Relevance and Fairness

Official implementation of NeurIPS'21: Implicit SVD for Graph Representation Learning

TensorFlow Implementation of Unsupervised Cross-Domain Image Generation

Voxel-based Network for Shape Completion by Leveraging Edge Generation (ICCV 2021, oral)

MVSDF - Learning Signed Distance Field for Multi-view Surface Reconstruction

Implementation of the paper Scalable Intervention Target Estimation in Linear Models (NeurIPS 2021), and the code to generate simulation results.

Official Pytorch implementation for AAAI2021 paper (RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning)

An automated algorithm to extract the linear blend skinning (LBS) from a set of example poses

Official code release for "Learned Spatial Representations for Few-shot Talking-Head Synthesis" ICCV 2021

This repository contains several jupyter notebooks to help users learn to use neon, our deep learning framework

O-CNN: Octree-based Convolutional Neural Networks for 3D Shape Analysis

[CVPR2021] DoDNet: Learning to segment multi-organ and tumors from multiple partially labeled datasets

Minimal diffusion models - Minimal code and simple experiments to play with Denoising Diffusion Probabilistic Models (DDPMs)

Marine debris detection with commercial satellite imagery and deep learning.

Vanilla and Prototypical Networks with Random Weights for image classification on Omniglot and mini-ImageNet. Made with Python3.

ObjDetApp deploys a pytorch model for object detection