CBREN: Convolutional Neural Networks for Constant Bit Rate Video Quality Enhancement

Last update: Nov 04, 2022

Related tags

Overview

CBREN

This is the Pytorch implementation for our IEEE TCSVT paper : CBREN: Convolutional Neural Networks for Constant Bit Rate Video Quality Enhancement.

Note: different from the paper, this code adds residual blocks to the pixel-domain branch of DRM module, but it has little impact on the effect of the network.

Because the DCN compilation in Windows environment may cause problems, this code may only run in Linux environment.

Requirements

Python 3.8
PyTorch 1.6.0
Numpy 1.19.2
Pillow 7.2.0
OpenCV 4.4.0.44

Prepare

Build

Deformable convolution is used in this code from: https://github.com/chengdazhi/Deformable-Convolution-V2-PyTorch
Run sh make.sh to compile the deformable convolution. If there are compilation errors, delete the 'build/' directory before recompiling.

Datasets

The directories used in the project need to be created manually.
Download HEVC standard test sequence: https://pan.baidu.com/s/1m0jZfkhX_cjaoFrHlMp0Xg Extraction code: 88n9
Hm16.0 is used to compress the standard test sequence.
The original video and compressed video in YUV format are converted to MP4 format by ffmpeg.
We provided the BasketballPass video in MP4 format as a demonstration in this project.
Run tools/video_get_frames.py to obtain the image sequence in PNG format from the video.

Pretrained models

Pretrained models are available: https://pan.baidu.com/s/1sszHgZ1tYVEu8toyUkFaUw Extraction code: i0zs

Run

Runrun.py, and the generated images are saved in results/.
If the size of GPU memory is not large enough to run the sequences A and B, please run run_group_A&B.py.

CBREN: Convolutional Neural Networks for Constant Bit Rate Video Quality Enhancement

Related tags

Overview

CBREN

Requirements

Prepare

Build

Datasets

Pretrained models

Run

Owner

Zhao Hengrun

RIM: Reliable Influence-based Active Learning on Graphs.

Revisiting Video Saliency: A Large-scale Benchmark and a New Model (CVPR18, PAMI19)

It is a system used to detect bone fractures. using techniques deep learning and image processing

A comprehensive list of published machine learning applications to cosmology

Face Mask Detection System built with OpenCV, TensorFlow using Computer Vision concepts

A Lightweight Hyperparameter Optimization Tool 🚀

This is a demo app to be used in the video streaming applications

Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch

Pytorch implementation of “Recursive Non-Autoregressive Graph-to-Graph Transformer for Dependency Parsing with Iterative Refinement”

Task-based end-to-end model learning in stochastic optimization

A baseline code for VSPW

Pytorch Implementation of Value Retrieval with Arbitrary Queries for Form-like Documents.

Fully convolutional networks for semantic segmentation

Adversarial examples to the new ConvNeXt architecture

Deep RGB-D Saliency Detection with Depth-Sensitive Attention and Automatic Multi-Modal Fusion (CVPR'2021, Oral)

Source code for "Pack Together: Entity and Relation Extraction with Levitated Marker"

PCAM: Product of Cross-Attention Matrices for Rigid Registration of Point Clouds

Revisiting Temporal Alignment for Video Restoration

Extracts data from the database for a graph-node and stores it in parquet files

Object Depth via Motion and Detection Dataset