Deep Sketch-guided Cartoon Video Inbetweening

Last update: Dec 22, 2022

Related tags

Overview

Cartoon Video Inbetweening

Paper | DOI | Video

The source code of Deep Sketch-guided Cartoon Video Inbetweening by Xiaoyu Li, Bo Zhang, Jing Liao, Pedro V. Sander, IEEE Transactions on Visualization and Computer Graphics, 2021.

Prerequisites

Linux or Windows
Python 3
CPU or NVIDIA GPU + CUDA CuDNN

Use the Pre-trained Models

You can download the pre-trained model here.

Run the following commands for evaluating the frame synthesis model and full model:

python eval_synthesis.py
python eval_full.py

The frame synthesis model takes img_0, img_1, ske_t as inputs and synthesizes img_t. The full model takes img_0, img_1, ske_t as inputs and interpolates five frames between img_0 and img_1.

Datasets

A dataset is a directory with the following structure:

dataset
    ├── frame
    │   └── ${clip_id}
    │       └──${image_id}.png
    ├── sketch
    │   └── ${clip_id}
    │       └──${image_id}.png
    └── dismap
        └── ${clip_id}
            └──${image_id}.npy

The sketch images can be generated by the script "sketch.py" and the distance maps can be generated by "dismap.py". Due to the copyright issue of the movie Spirited Away, we can not release our training dataset. You can generate your own dataset if you interest.

Training

Run the following command for training the frame synthesis model and full model:

python train_synthesis.py
python train_full.py

Before you train the full model, you must train the frame synthesis model first and use its parameters to initialize the full model.

Citing

If you find our work useful, please consider citing:

@article{li2021deep,
  author    = {Li, Xiaoyu and Zhang, Bo and Liao, Jing and Sander, Pedro},
  journal   = {IEEE Transactions on Visualization and Computer Graphics},
  year      = {2021},
  publisher = {IEEE}
}

Deep Sketch-guided Cartoon Video Inbetweening

Related tags

Overview

Cartoon Video Inbetweening

Paper | DOI | Video

Prerequisites

Use the Pre-trained Models

Datasets

Training

Citing

Owner

Xiaoyu Li

It's a implement of this paper：Relation extraction via Multi-Level attention CNNs

Spatial Action Maps for Mobile Manipulation (RSS 2020)

Implementation of ICCV19 Paper "Learning Two-View Correspondences and Geometry Using Order-Aware Network"

PyTorch implementation of Barlow Twins.

A PyTorch implementation of "TokenLearner: What Can 8 Learned Tokens Do for Images and Videos?"

Pacman-AI - AI project designed by UC Berkeley. Designed reflex and minimax agents for the game Pacman.

Official PyTorch code for "BAM: Bottleneck Attention Module (BMVC2018)" and "CBAM: Convolutional Block Attention Module (ECCV2018)"

Python package for downloading ECMWF reanalysis data and converting it into a time series format.

An evaluation toolkit for voice conversion models.

The official re-implementation of the Neurips 2021 paper, "Targeted Neural Dynamical Modeling".

Source code for TACL paper "KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation".

Representing Long-Range Context for Graph Neural Networks with Global Attention

Advanced Signal Processing Notebooks and Tutorials

A Pytree Module system for Deep Learning in JAX

This is the repo for our work "Towards Persona-Based Empathetic Conversational Models" (EMNLP 2020)

git《Beta R-CNN: Looking into Pedestrian Detection from Another Perspective》(NeurIPS 2020) GitHub:[fig3]

Face uncertainty quantification or estimation using PyTorch.

The openspoor package is intended to allow easy transformation between different geographical and topological systems commonly used in Dutch Railway

MSG-Transformer: Exchanging Local Spatial Information by Manipulating Messenger Tokens

Official code for 'Pixel-wise Energy-biased Abstention Learning for Anomaly Segmentationon Complex Urban Driving Scenes'