an implementation of Video Frame Interpolation via Adaptive Separable Convolution using PyTorch

Last update: Jan 08, 2023

Overview

This work has now been superseded by: https://github.com/sniklaus/revisiting-sepconv

sepconv-slomo

This is a reference implementation of Video Frame Interpolation via Adaptive Separable Convolution [1] using PyTorch. Given two frames, it will make use of adaptive convolution [2] in a separable manner to interpolate the intermediate frame. Should you be making use of our work, please cite our paper [1].

For a reimplemntation of our work, see: https://github.com/martkartasev/sepconv
And for another adaptation, consider: https://github.com/HyeongminLEE/pytorch-sepconv
For softmax splatting, please see: https://github.com/sniklaus/softmax-splatting

setup

The separable convolution layer is implemented in CUDA using CuPy, which is why CuPy is a required dependency. It can be installed using pip install cupy or alternatively using one of the provided binary packages as outlined in the CuPy repository.

If you plan to process videos, then please also make sure to have pip install moviepy installed.

usage

To run it on your own pair of frames, use the following command. You can either select the l1 or the lf model, please see our paper for more details. In short, the l1 model should be used for quantitative evaluations and the lf model for qualitative comparisons.

python run.py --model lf --one ./images/one.png --two ./images/two.png --out ./out.png

To run in on a video, use the following command.

python run.py --model lf --video ./videos/car-turn.mp4 --out ./out.mp4

For a quick benchmark using examples from the Middlebury benchmark for optical flow, run python benchmark.py. You can use it to easily verify that the provided implementation runs as expected.

video

license

The provided implementation is strictly for academic purposes only. Should you be interested in using our technology for any commercial use, please feel free to contact us.

references

[1]  @inproceedings{Niklaus_ICCV_2017,
         author = {Simon Niklaus and Long Mai and Feng Liu},
         title = {Video Frame Interpolation via Adaptive Separable Convolution},
         booktitle = {IEEE International Conference on Computer Vision},
         year = {2017}
     }

[2]  @inproceedings{Niklaus_CVPR_2017,
         author = {Simon Niklaus and Long Mai and Feng Liu},
         title = {Video Frame Interpolation via Adaptive Convolution},
         booktitle = {IEEE Conference on Computer Vision and Pattern Recognition},
         year = {2017}
     }

acknowledgment

This work was supported by NSF IIS-1321119. The video above uses materials under a Creative Common license or with the owner's permission, as detailed at the end.

an implementation of Video Frame Interpolation via Adaptive Separable Convolution using PyTorch

Related tags

Overview

sepconv-slomo

setup

usage

video

license

references

acknowledgment

Owner

Simon Niklaus

[ICCV21] Official implementation of the "Social NCE: Contrastive Learning of Socially-aware Motion Representations" in PyTorch.

Official PyTorch Implementation of GAN-Supervised Dense Visual Alignment

[ICLR2021oral] Rethinking Architecture Selection in Differentiable NAS

[MedIA2021]MIDeepSeg: Minimally Interactive Segmentation of Unseen Objects from Medical Images Using Deep Learning

Baseline powergrid model for NY

EsViT: Efficient self-supervised Vision Transformers

Machine Learning Time-Series Platform

FedTorch is an open-source Python package for distributed and federated training of machine learning models using PyTorch distributed API

Framework that uses artificial intelligence applied to mathematical models to make predictions

(NeurIPS 2020) Wasserstein Distances for Stereo Disparity Estimation

Potato Disease Classification - Training, Rest APIs, and Frontend to test.

Extremely easy multi instancing software for minecraft speedrunning.

Pydantic models for pywttr and aiopywttr.

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Learning an Adaptive Meta Model-Generator for Incrementally Updating Recommender Systems

Code implementation from my Medium blog post: [Transformers from Scratch in PyTorch]

Pytorch implementation of the paper "COAD: Contrastive Pre-training with Adversarial Fine-tuning for Zero-shot Expert Linking."

PyTorch implementation of Interpretable Explanations of Black Boxes by Meaningful Perturbation

Capsule endoscopy detection DACON challenge

LeViT a Vision Transformer in ConvNet's Clothing for Faster Inference