Official Pytorch implementation for Deep Contextual Video Compression, NeurIPS 2021

Last update: Dec 03, 2022

Related tags

Deep Learning DCVC

Overview

Introduction

Official Pytorch implementation for Deep Contextual Video Compression, NeurIPS 2021

Prerequisites

Python 3.8 and conda, get Conda
CUDA 11.0

Environment

conda create -n $YOUR_PY38_ENV_NAME python=3.8
conda activate $YOUR_PY38_ENV_NAME

pip install torch==1.7.1+cu110 torchvision==0.8.2+cu110 torchaudio==0.7.2 -f https://download.pytorch.org/whl/torch_stable.html
python -m pip install -r requirements.txt

Test dataset

Currenlty the spatial resolution of video needs to be cropped into the integral times of 64.

The dataset format can be seen in dataset_config_example.json.

For example, one video of HEVC Class B can be prepared as:

Crop the original YUV via ffmpeg:

ffmpeg -pix_fmt yuv420p  -s 1920x1080 -i  BasketballDrive_1920x1080_50.yuv -vf crop=1920:1024:0:0 BasketballDrive_1920x1024_50.yuv

Make the video path:
```
mkdir BasketballDrive_1920x1024_50
```

Convert YUV to PNG:

ffmpeg -pix_fmt yuv420p -s 1920x1024 -i BasketballDrive_1920x1024_50.yuv   -f image2 BasketballDrive_1920x1024_50/im%05d.png

At last, the folder structure of dataset is like:

/media/data/HEVC_B/
    * BQTerrace_1920x1024_60/
        - im00001.png
        - im00002.png
        - im00003.png
        - ...
    * BasketballDrive_1920x1024_50/
        - im00001.png
        - im00002.png
        - im00003.png
        - ...
    * ...
/media/data/HEVC_D
/media/data/HEVC_C/
...

Pretrained models

Download CompressAI models

cd checkpoints/
python download_compressai_models.py
cd ..

Download DCVC models and put them into /checkpoints folder.

Test DCVC

Example of test the PSNR model:

python test_video.py --i_frame_model_name cheng2020-anchor  --i_frame_model_path  checkpoints/cheng2020-anchor-3-e49be189.pth.tar  checkpoints/cheng2020-anchor-4-98b0b468.pth.tar   checkpoints/cheng2020-anchor-5-23852949.pth.tar   checkpoints/cheng2020-anchor-6-4c052b1a.pth.tar  --test_config     dataset_config_example.json  --cuda true --cuda_device 0,1,2,3   --worker 4   --output_json_result_path  DCVC_result_psnr.json    --model_type psnr  --recon_bin_path recon_bin_folder_psnr --model_path checkpoints/model_dcvc_quality_0_psnr.pth  checkpoints/model_dcvc_quality_1_psnr.pth checkpoints/model_dcvc_quality_2_psnr.pth checkpoints/model_dcvc_quality_3_psnr.pth

Example of test the MSSSIM model:

python test_video.py --i_frame_model_name bmshj2018-hyperprior  --i_frame_model_path  checkpoints/bmshj2018-hyperprior-ms-ssim-3-92dd7878.pth.tar checkpoints/bmshj2018-hyperprior-ms-ssim-4-4377354e.pth.tar    checkpoints/bmshj2018-hyperprior-ms-ssim-5-c34afc8d.pth.tar    checkpoints/bmshj2018-hyperprior-ms-ssim-6-3a6d8229.pth.tar   --test_config   dataset_config_example.json  --cuda true --cuda_device 0,1,2,3   --worker 4   --output_json_result_path  DCVC_result_msssim.json  --model_type msssim  --recon_bin_path recon_bin_folder_msssim --model_path checkpoints/model_dcvc_quality_0_msssim.pth checkpoints/model_dcvc_quality_1_msssim.pth checkpoints/model_dcvc_quality_2_msssim.pth checkpoints/model_dcvc_quality_3_msssim.pth

It is recommended that the --worker number is equal to your GPU number.

Acknowledgement

The implementation is based on CompressAI and PyTorchVideoCompression. The model weights of intra coding come from CompressAI.

Citation

If you find this work useful for your research, please cite:

@article{li2021deep,
  title={Deep Contextual Video Compression},
  author={Li, Jiahao and Li, Bin and Lu, Yan},
  journal={Advances in Neural Information Processing Systems},
  volume={34},
  year={2021}
}

Official Pytorch implementation for Deep Contextual Video Compression, NeurIPS 2021

Related tags

Overview

Introduction

Prerequisites

Test dataset

Pretrained models

Test DCVC

Acknowledgement

Citation

Owner

Weakly Supervised Learning of Rigid 3D Scene Flow

CS50x-AI - Artificial Intelligence with Python from Harvard University

AdaFocus (ICCV 2021) Adaptive Focus for Efficient Video Recognition

Repo for "TableParser: Automatic Table Parsing with Weak Supervision from Spreadsheets" at [email protected]

code release for USENIX'22 paper `On the Security Risks of AutoML`

Deep Compression for Dense Point Cloud Maps.

Spatial Temporal Graph Convolutional Networks (ST-GCN) for Skeleton-Based Action Recognition in PyTorch

Projects for AI/ML and IoT integration for games and other presented at re:Invent 2021.

optimization routines for hyperparameter tuning

KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)

[CVPR 2021] Scan2Cap: Context-aware Dense Captioning in RGB-D Scans

Image classification for projects and researches

List of all dependencies affected by node-ipc malicious commit

Project dự đoán giá cổ phiếu bằng thuật toán LSTM gồm: code train và code demo

Hierarchical Memory Matching Network for Video Object Segmentation (ICCV 2021)

Double pendulum simulator using a symplectic Euler's method and Hamiltonian mechanics

Spiking Neural Network for Computer Vision using SpikingJelly framework and Pytorch-Lightning

Code for project: "Learning to Minimize Remainder in Supervised Learning".

X-VLM: Multi-Grained Vision Language Pre-Training

Prompts - Read a textfile of prompts and import into anki via ankiconnect