Deep Sketch-guided Cartoon Video Inbetweening

Last update: Dec 22, 2022

Related tags

Overview

Cartoon Video Inbetweening

Paper | DOI | Video

The source code of Deep Sketch-guided Cartoon Video Inbetweening by Xiaoyu Li, Bo Zhang, Jing Liao, Pedro V. Sander, IEEE Transactions on Visualization and Computer Graphics, 2021.

Prerequisites

Linux or Windows
Python 3
CPU or NVIDIA GPU + CUDA CuDNN

Use the Pre-trained Models

You can download the pre-trained model here.

Run the following commands for evaluating the frame synthesis model and full model:

python eval_synthesis.py
python eval_full.py

The frame synthesis model takes img_0, img_1, ske_t as inputs and synthesizes img_t. The full model takes img_0, img_1, ske_t as inputs and interpolates five frames between img_0 and img_1.

Datasets

A dataset is a directory with the following structure:

dataset
    ├── frame
    │   └── ${clip_id}
    │       └──${image_id}.png
    ├── sketch
    │   └── ${clip_id}
    │       └──${image_id}.png
    └── dismap
        └── ${clip_id}
            └──${image_id}.npy

The sketch images can be generated by the script "sketch.py" and the distance maps can be generated by "dismap.py". Due to the copyright issue of the movie Spirited Away, we can not release our training dataset. You can generate your own dataset if you interest.

Training

Run the following command for training the frame synthesis model and full model:

python train_synthesis.py
python train_full.py

Before you train the full model, you must train the frame synthesis model first and use its parameters to initialize the full model.

Citing

If you find our work useful, please consider citing:

@article{li2021deep,
  author    = {Li, Xiaoyu and Zhang, Bo and Liao, Jing and Sander, Pedro},
  journal   = {IEEE Transactions on Visualization and Computer Graphics},
  year      = {2021},
  publisher = {IEEE}
}

Deep Sketch-guided Cartoon Video Inbetweening

Related tags

Overview

Cartoon Video Inbetweening

Paper | DOI | Video

Prerequisites

Use the Pre-trained Models

Datasets

Training

Citing

Owner

Xiaoyu Li

Binary Passage Retriever (BPR) - an efficient passage retriever for open-domain question answering

This repository contains the accompanying code for Deep Virtual Markers for Articulated 3D Shapes, ICCV'21

Cereal box identification in store shelves using computer vision and a single train image per model.

masscan + nmap + Finger

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

[NeurIPS-2021] Mosaicking to Distill: Knowledge Distillation from Out-of-Domain Data

An educational resource to help anyone learn deep reinforcement learning.

Implementation of popular SOTA self-supervised learning algorithms as Fastai Callbacks.

Check out the StyleGAN repo and place it in the same directory hierarchy as the present repo

It is an open dataset for object detection in remote sensing images.

Official code of Team Yao at Multi-Modal-Fact-Verification-2022

PyTorch Live is an easy to use library of tools for creating on-device ML demos on Android and iOS.

Shitty gaze mouse controller

Using LSTM write Tang poetry

Robotics environments

TSDF++: A Multi-Object Formulation for Dynamic Object Tracking and Reconstruction

TResNet: High Performance GPU-Dedicated Architecture

[ICCV'21] Learning Conditional Knowledge Distillation for Degraded-Reference Image Quality Assessment

Pywonderland - A tour in the wonderland of math with python.

MazeRL is an application oriented Deep Reinforcement Learning (RL) framework