The official pytorch implemention of the CVPR paper "Temporal Modulation Network for Controllable Space-Time Video Super-Resolution".

Last update: Oct 24, 2022

Related tags

Deep Learning TMNet

Overview

This is the official PyTorch implementation of TMNet in the CVPR 2021 paper "Temporal Modulation Network for Controllable Space-Time VideoSuper-Resolution"[PDF]. Our TMNet can flexibly interpolate intermediate frames for space-time video super-resolution (STVSR).

Requirements
Installation
Demo
Training
Testing
Citations

Requirements

Python 3.6
PyTorch >= 1.1
NVIDIA GPU + CUDA
Deformable Convolution v2, we adopt CharlesShang's implementation in the submodule.
Python packages: pip install numpy opencv-python lmdb pyyaml pickle5 matplotlib seaborn

Installation

First, make sure your machine has a GPU, which is required for the DCNv2 module.

Clone the TMNet repository.

git clone --recursive https://github.com/CS-GangXu/TMNet.git

Compile the DCNv2:

cd $ROOT/codes/models/modules/DCNv2
bash make.sh
python test.py

Demo (To be uploaded at April 24, 2021 11:59PM (Pacific Time))

Training (To be uploaded at April 24, 2021 11:59PM (Pacific Time))

Testing (To be uploaded at April 24, 2021 11:59PM (Pacific Time))

Citations

If you find the code helpful in your research or work, please cite the following papers.

@InProceedings{xu2021temporal,
  author = {Gang Xu and Jun Xu and Zhen Li and Liang Wang and Xing Sun and Mingming Cheng},
  title = {Temporal Modulation Network for Controllable Space-Time VideoSuper-Resolution},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  month = {June},
  year = {2021}
}

@InProceedings{xiang2020zooming,
  author = {Xiang, Xiaoyu and Tian, Yapeng and Zhang, Yulun and Fu, Yun and Allebach, Jan P. and Xu, Chenliang},
  title = {Zooming Slow-Mo: Fast and Accurate One-Stage Space-Time Video Super-Resolution},
  booktitle = {IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)},
  pages={3370--3379},
  month = {June},
  year = {2020}
}

@InProceedings{wang2019edvr,
  author    = {Wang, Xintao and Chan, Kelvin C.K. and Yu, Ke and Dong, Chao and Loy, Chen Change},
  title     = {EDVR: Video restoration with enhanced deformable convolutional networks},
  booktitle = {The IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW)},
  month     = {June},
  year      = {2019},
}

Acknowledgments

Our code is inspired by Zooming-Slow-Mo-CVPR-2020 and EDVR.

Contact

If you have any questions, feel free to E-mail Gang Xu with [email protected].

The official pytorch implemention of the CVPR paper "Temporal Modulation Network for Controllable Space-Time Video Super-Resolution".

Related tags

Overview

Contents

Requirements

Installation

Demo (To be uploaded at April 24, 2021 11:59PM (Pacific Time))

Training (To be uploaded at April 24, 2021 11:59PM (Pacific Time))

Testing (To be uploaded at April 24, 2021 11:59PM (Pacific Time))

Citations

Acknowledgments

Contact

Owner

Gang Xu

Immortal tracker

Bridging Composite and Real: Towards End-to-end Deep Image Matting

Deep Learning (with PyTorch)

A Python package for faster, safer, and simpler ML processes

This repository contains the source codes for the paper AtlasNet V2 - Learning Elementary Structures.

(NeurIPS '21 Spotlight) IQ-Learn: Inverse Q-Learning for Imitation

Patch Rotation: A Self-Supervised Auxiliary Task for Robustness and Accuracy of Supervised Models

working repo for my xumx-sliCQ submissions to the ISMIR 2021 MDX

Self-Supervised Pre-Training for Transformer-Based Person Re-Identification

Learning to Identify Top Elo Ratings with A Dueling Bandits Approach

Official pytorch implementation of paper "Image-to-image Translation via Hierarchical Style Disentanglement".

This repo provides the official code for TransBTS: Multimodal Brain Tumor Segmentation Using Transformer (https://arxiv.org/pdf/2103.04430.pdf).

[ACL 20] Probing Linguistic Features of Sentence-level Representations in Neural Relation Extraction

Flax is a neural network ecosystem for JAX that is designed for flexibility.

This repo holds codes of the ICCV21 paper: Visual Alignment Constraint for Continuous Sign Language Recognition.

Neurons Dataset API - The official dataloader and visualization tools for Neurons Datasets.

Unofficial & improved implementation of NeRF--: Neural Radiance Fields Without Known Camera Parameters

ObjectDetNet is an easy, flexible, open-source object detection framework

Implementation of SE3-Transformers for Equivariant Self-Attention, in Pytorch.

PyTorch implementation of the method described in the paper VoiceLoop: Voice Fitting and Synthesis via a Phonological Loop.