Learned image compression

Last update: Dec 04, 2022

Overview

Pytorch code of our recent work A Unified End-to-End Framework for Efficient Deep Image Compression.

We first release the code for Variational image compression with a scale hyperprior, we will update our code to our full implementaion of our paper.

Prerequisites

You should install the libraries of this repo.

pip install -r requirements.txt

Data Preparation

We need to first prepare the training and validation data. The trainging data is from flicker.com. You can obtain the training data according to description of CompressionData.

The validation data is the popular kodak dataset.

bash data/download_kodak.sh

Training

For high bitrate (4096, 6144, 8192), the out_channel_N is 192 and the out_channel_M is 320 in 'config_high.json'. For low bitrate (256, 512, 1024, 2048), the out_channel_N is 128 and the out_channel_M is 192 in 'config_low.json'.

Details

PSNR experiments.

For high bitrate of 8192, we first train from scratch as follows.

CUDA_VISIBLE_DEVICES=0 python train.py --config examples/example/config_high.json -n baseline_8192 --train flicker_path --val kodak_path

For other high bitrate (4096, 6144), we use the converged model of 8192 as pretrain model and set the learning rate as 1e-5. The training iterations are set as 500000.

The low bitrate (256, 512, 1024, 2048) training process follows the same strategy.

MS-SSIM experiments

You should change the distorsion loss to (1-MS_SSIM), and fine-tune the pretrained model optimized by PSNR to accelerate the training process. You can find more details in our released paper. The training strategy is similar.

If your find our code is helpful for your research, please cite our paper. Besides, this code is only for research.

@article{liu2020unified,
  title={A Unified End-to-End Framework for Efficient Deep Image Compression},
  author={Liu, Jiaheng and Lu, Guo and Hu, Zhihao and Xu, Dong},
  journal={arXiv preprint arXiv:2002.03370},
  year={2020}
}

Learned image compression

Related tags

Overview

Overview

Content

Prerequisites

Data Preparation

Training

Details

PSNR experiments.

MS-SSIM experiments

Owner

Jiaheng Liu

[CVPRW 2021] Code for Region-Adaptive Deformable Network for Image Quality Assessment

The official codes for the ICCV2021 Oral presentation "Rethinking Counting and Localization in Crowds: A Purely Point-Based Framework"

SOTA easy to use PyTorch-based DL training library

PIXIE: Collaborative Regression of Expressive Bodies

TensorFlow Tutorial and Examples for Beginners (support TF v1 & v2)

OOD Generalization and Detection (ACL 2020)

Funnels: Exact maximum likelihood with dimensionality reduction.

Code for paper " AdderNet: Do We Really Need Multiplications in Deep Learning?"

Code for "Localization with Sampling-Argmax", NeurIPS 2021

Python implementation of the multistate Bennett acceptance ratio (MBAR)

My implementation of transformers related papers for computer vision in pytorch

A denoising autoencoder + adversarial losses and attention mechanisms for face swapping.

SE-MSCNN: A Lightweight Multi-scaled Fusion Network for Sleep Apnea Detection Using Single-Lead ECG Signals

Example-custom-ml-block-keras - Custom Keras ML block example for Edge Impulse

Implementation of Perceiver, General Perception with Iterative Attention in TensorFlow

Exadel CompreFace is a free and open-source face recognition GitHub project

Official implementation of "Variable-Rate Deep Image Compression through Spatially-Adaptive Feature Transform", ICCV 2021

Double pendulum simulator using a symplectic Euler's method and Hamiltonian mechanics

Gated-Shape CNN for Semantic Segmentation (ICCV 2019)

SVG Icon processing tool for C++