A Pytorch Implementation of a continuously rate adjustable learned image compression framework.

Last update: Dec 24, 2022

Related tags

Overview

GainedVAE

A Pytorch Implementation of a continuously rate adjustable learned image compression framework, Gained Variational Autoencoder(GainedVAE).

Note that This Is Not An Official Implementation Code.

More details can be found in the following paper:

Asymmetric Gained Deep Image Compression With Continuous Rate Adaptation.
Huawei Technologies, CVPR 2021
Ze Cui, Jing Wang, Shangyin Gao, Tiansheng Guo, Yihui Feng, Bo Bai

Todo: Reproduce Implementation of the following paper:

INTERPOLATION VARIABLE RATE IMAGE COMPRESSION
Alibaba Group, arxiv 2021.9.20
Zhenhong Sun, Zhiyu Tan, Xiuyu Sun, Fangyi Zhang, Yichen Qian, Dongyang Li, Hao Li

Environment

Python == 3.7.10
Pytorch == 1.7.1
CompressAI

Dataset

Training set

I use a part of the OpenImages Dataset to train the models (train06, train07, train08, about 54w images). You can download from here. Download OpenImages Maybe train08 (14w images) is enough.

Test set

Download Kodak dataset

Train Your Own Model

python3 trainGain.py -d /path/to/your/image/dataset/ --epochs 200 -lr 1e-4 --batch-size 16 --model-save /path/to/your/model/save/dir --cuda

Result

I try to train the Gained Mean-Scale Hyperprior model and here is the result.

Acknowledgement

The framework is based on CompressAI, I add the model in compressai.models.gain, compressai.models.gain_utils.
And trainGain/trainGain.py is modified with reference to compressai_examples/train.py.

More Variable Rate Image Compression Repositories

"Variable-Rate Deep Image Compression through Spatially-Adaptive Feature Transform" (ICCV 2021).
code

"Variable Bitrate Image Compression with Quality Scaling Factors" (ICASSP 2020).
code

"Variable Rate Deep Image Compression with Modulated Autoencoders" (IEEE SPL 2020)
code

"Slimmable Compressive Autoencoders for Practical Neural Image Compression" (CVPR 2021)
code

Contact

Feel free to contact me if there is any question about the code or to discuss any problems with image and video compression. ([email protected])

A Pytorch Implementation of a continuously rate adjustable learned image compression framework.

Related tags

Overview

GainedVAE

Environment

Dataset

Training set

Test set

Train Your Own Model

Result

Acknowledgement

More Variable Rate Image Compression Repositories

Contact

Owner

基于Pytorch实现优秀的自然图像分割框架！(包括FCN、U-Net和Deeplab)

Active learning for Mask R-CNN in Detectron2

Volumetric parameterization of the placenta to a flattened template

Official code release for "GRAF: Generative Radiance Fields for 3D-Aware Image Synthesis"

Tooling for the Common Objects In 3D dataset.

Learning Intents behind Interactions with Knowledge Graph for Recommendation, WWW2021

Credo AI Lens is a comprehensive assessment framework for AI systems. Lens standardizes model and data assessment, and acts as a central gateway to assessments created in the open source community.

Identify the emotion of multiple speakers in an Audio Segment

Implementation of Kronecker Attention in Pytorch

NU-Wave: A Diffusion Probabilistic Model for Neural Audio Upsampling @ INTERSPEECH 2021 Accepted

《K-Adapter: Infusing Knowledge into Pre-Trained Models with Adapters》(2020)

We will release the code of "ConTNet: Why not use convolution and transformer at the same time?" in this repo

source code the paper Fast and Robust Iterative Closet Point.

Neural Caption Generator with Attention

Implementation of several Bayesian multi-target tracking algorithms, including Poisson multi-Bernoulli mixture filters for sets of targets and sets of trajectories. The repository also includes the GOSPA metric and a metric for sets of trajectories to evaluate performance.

Official code for "Maximum Likelihood Training of Score-Based Diffusion Models", NeurIPS 2021 (spotlight)

Turning SymPy expressions into JAX functions

《Fst Lerning of Temporl Action Proposl vi Dense Boundry Genertor》(AAAI 2020)

Library for machine learning stacking generalization.

Lightweight stereo matching network based on MobileNetV1 and MobileNetV2