Dataset and Code for ICCV 2021 paper "Real-world Video Super-resolution: A Benchmark Dataset and A Decomposition based Learning Scheme"

Last update: Nov 22, 2022

Related tags

Computer Vision RealVSR

Overview

Dataset and Code for RealVSR

Real-world Video Super-resolution: A Benchmark Dataset and A Decomposition based Learning Scheme
Xi Yang, Wangmeng Xiang, Hui Zeng and Lei Zhang
International Conference on Computer Vision, 2021.

Dataset

The dataset is hosted on Google Drive and Baidu Drive (code: 43ph). Some example scenes are shown below.

The structure of the dataset is illustrated below.

File	Description
GT.zip	All ground truth sequences in RGB format
LQ.zip	All low quality sequences in RGB format
GT_YCbCr.zip	All ground truth sequences in YCbCr format
LQ_YCbCr.zip	All low quality sequences in YCbCr format
GT_test.zip	Ground truth test sequences in RGB format
LQ_test.zip	Low Quality test sequences in RGB format
GT_YCbCr_test.zip	Ground truth test sequences in YCbCr format
LQ_YCbCr_test.zip	Low Quality test sequences in YCbCr format

Code

Dependencies

Linux (tested on Ubuntu 18.04)
Python 3 (tested on python 3.7)
NVIDIA GPU + CUDA (tested on CUDA 10.2 and 11.1)

Installation

# Create a new anaconda python environment (realvsr)
conda create -n realvsr python=3.7 -y

# Activate the created environment
conda activate realvsr

# Install dependencies
pip install -r requirements.txt

# Bulid the DCN module
cd codes/models/archs/dcn
python setup.py develop

Training

Modify the configuration files accordingly in codes/options/train folder and run the following command (current we did not implement distributed training):

python train.py -opt xxxxx.yml

Testing

Test on RealVSR testing set sequences:

Modify the configuration in test_RealVSR_wi_GT.py and run the following command:

python test_RealVSR_wi_GT.py

Test on real-world captured sequences:

Modify the configuration in test_RealVSR_wo_GT.py and run the following command:

python test_RealVSR_wo_GT.py

Pre-trained Models

Some pretrained models could be found on Google Drive and Baidu Drive (code: n1n0).

License

This project is released under the Apache 2.0 license.

Citation

If you find this code useful in your research, please consider citing:

@article{yang2021real,
  title={Real-world Video Super-resolution: A Benchmark Dataset and A Decomposition based Learning Scheme},
  author={YANG, Xi and Xiang, Wangmeng and Zeng, Hui and Zhang, Lei},
  journal=ICCV,
  year={2021}
}

Acknowledgement

This implementation largely depends on EDVR. Thanks for the excellent codebase! You may also consider migrating it to BasicSR.

Dataset and Code for ICCV 2021 paper "Real-world Video Super-resolution: A Benchmark Dataset and A Decomposition based Learning Scheme"

Related tags

Overview

Dataset and Code for RealVSR

Dataset

Code

Dependencies

Installation

Training

Testing

Test on RealVSR testing set sequences:

Test on real-world captured sequences:

Pre-trained Models

License

Citation

Acknowledgement

Owner

Xi Yang

Convolutional Recurrent Neural Network (CRNN) for image-based sequence recognition.

Converts an image into funny, smaller amongus characters

🔎 Like Chardet. 🚀 Package for encoding & language detection. Charset detection.

Convolutional Recurrent Neural Networks(CRNN) for Scene Text Recognition

利用Paddle框架复现CRAFT

Handwritten Character Recognition using CNN

Corner-based Region Proposal Network

基于图像识别的开源RPA工具，理论上可以支持所有windows软件和网页的自动化

Handwritten Text Recognition (HTR) system implemented with TensorFlow.

🖺 OCR using tensorflow with attention

Layout Analysis Evaluator for the ICDAR 2017 competition on Layout Analysis for Challenging Medieval Manuscripts

Using computer vision method to recognize and calcutate the features of the architecture.

CNN+LSTM+CTC based OCR implemented using tensorflow.

Character Segmentation using TensorFlow

ocroseg - This is a deep learning model for page layout analysis / segmentation.

Pytorch implementation of PSEnet with Pyramid Attention Network as feature extractor

Select range and every time the screen changes, OCR is activated.

Python tool that takes the OCR.space JSON output as input and draws a text overlay on top of the image.

Detect textlines in document images

An interactive interface for using OpenCV's GrabCut algorithm for image segmentation.