official Pytorch implementation of ICCV 2021 paper FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting.

Last update: Dec 27, 2022

Related tags

Deep Learning FuseFormer

Overview

FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting

By Rui Liu, Hanming Deng, Yangyi Huang, Xiaoyu Shi, Lewei Lu, Wenxiu Sun, Xiaogang Wang, Jifeng Dai, Hongsheng Li.

This repo is the official Pytorch implementation of FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting.

Introduction

Usage

Prerequisites

Python >= 3.6
Pytorch >= 1.0 and corresponding torchvision (https://pytorch.org/)

Install

Clone this repo:

git clone https://github.com/ruiliu-ai/FuseFormer.git

Install other packages:

cd FuseFormer
pip install -r requirements.txt

Training

Dataset preparation

Download datasets (YouTube-VOS and DAVIS) into the data folder.

mkdir data

Training script

python train.py -c configs/youtube-vos.json

Test

Download pre-trained model into checkpoints folder.

mkdir checkpoints

Test script

python test.py -c checkpoints/fuseformer.pth -v data/DAVIS/JPEGImages/blackswan -m data/DAVIS/Annotations/blackswan

Citing FuseFormer

If you find FuseFormer useful in your research, please consider citing:

@InProceedings{Liu_2021_FuseFormer,
  title={FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting},
  author={Liu, Rui and Deng, Hanming and Huang, Yangyi and Shi, Xiaoyu and Lu, Lewei and Sun, Wenxiu and Wang, Xiaogang and Dai, Jifeng and Li, Hongsheng},
  booktitle = {International Conference on Computer Vision (ICCV)},
  year={2021}
}

Acknowledement

This code borrows heavily from the video inpainting framework spatial-temporal transformer net.

official Pytorch implementation of ICCV 2021 paper FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting.

Related tags

Overview

FuseFormer: Fusing Fine-Grained Information in Transformers for Video Inpainting

Introduction

Usage

Prerequisites

Install

Training

Dataset preparation

Training script

Test

Test script

Citing FuseFormer

Acknowledement

Owner

Mmdetection3d Noted - MMDetection3D is an open source object detection toolbox based on PyTorch

This is code to fit per-pixel environment map with spherical Gaussian lobes, using LBFGS optimization

RealFormer-Pytorch Implementation of RealFormer using pytorch

MOT-Tracking-by-Detection-Pipeline - For Tracking-by-Detection format MOT (Multi Object Tracking), is it a framework that separates Detection and Tracking processes?

Template repository to build PyTorch projects from source on any version of PyTorch/CUDA/cuDNN.

Predicting a person's gender based on their weight and height

Docker containers of baseline agents for the Crafter environment

Data Consistency for Magnetic Resonance Imaging

Code for CVPR2019 Towards Natural and Accurate Future Motion Prediction of Humans and Animals

Single Image Super-Resolution (SISR) with SRResNet, EDSR and SRGAN

Official Pytorch implementation of "Unbiased Classification Through Bias-Contrastive and Bias-Balanced Learning (NeurIPS 2021)

Pytorch implementation of SELF-ATTENTIVE VAD, ICASSP 2021

This is a deep learning-based method to segment deep brain structures and a brain mask from T1 weighted MRI.

The Malware Open-source Threat Intelligence Family dataset contains 3,095 disarmed PE malware samples from 454 families

Explore the Expression: Facial Expression Generation using Auxiliary Classifier Generative Adversarial Network

Constructing interpretable quadratic accuracy predictors to serve as an objective function for an IQCQP problem that represents NAS under latency constraints and solve it with efficient algorithms.

TipToiDog - Tip Toi Dog With Python

ChineseBERT: Chinese Pretraining Enhanced by Glyph and Pinyin Information

The backbone CSPDarkNet of YOLOX.

Offline Multi-Agent Reinforcement Learning Implementations: Solving Overcooked Game with Data-Driven Method