Hybrid Neural Fusion for Full-frame Video Stabilization

Last update: Jan 04, 2023

Related tags

Deep Learning video-stabilization

Overview

FuSta: Hybrid Neural Fusion for Full-frame Video Stabilization

Project Page | Video | Paper | Google Colab

Setup

Set up the environment for [Yu and Ramamoorthi 2020].

cd CVPR2020CODE_yulunliu_modified
conda create --name FuSta_CVPR2020 python=3.6
conda activate FuSta_CVPR2020
pip install -r requirements_CVPR2020.txt
./install.sh

Download pre-trained checkpoints of [Yu and Ramamoorthi 2020].

wget https://www.cmlab.csie.ntu.edu.tw/~yulunliu/FuSta/CVPR2020_ckpts.zip
unzip CVPR2020_ckpts.zip
cd ..

Set up the environment for FuSta.

conda deactivate
conda create --name FuSta python=3.6
conda activate FuSta
conda install pytorch=1.6.0 torchvision=0.7.0 cudatoolkit=10.1 -c pytorch
conda install matplotlib
conda install tensorboard
conda install scipy
conda install opencv
conda install -c conda-forge cupy cudatoolkit=10.1
pip install PyMaxflow
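
One way to confirm that the FuSta environment resolved correctly is to try importing each package with the environment active. The sketch below is only a convenience check and is not part of the repository: the missing_modules helper and the PYTHON variable are hypothetical names, and the import names (cv2 for opencv, maxflow for PyMaxflow) are the usual ones for those packages rather than something this README states.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of the FuSta repository).
# Prints the name of every module that fails to import with the chosen interpreter.
# The interpreter defaults to `python`; override it with the PYTHON variable.
missing_modules() {
    local py="${PYTHON:-python}"
    local mod
    for mod in "$@"; do
        # A failed import (or a missing interpreter) reports the module name.
        "$py" -c "import ${mod}" >/dev/null 2>&1 || echo "$mod"
    done
}

# Example call, to be run after `conda activate FuSta`:
#   missing_modules torch torchvision matplotlib tensorboard scipy cv2 cupy maxflow
# No output means every listed module imported successfully.
```

If cupy is reported as missing, the cudatoolkit version pinned above is a reasonable first thing to check, since both the PyTorch and cupy installs request cudatoolkit=10.1.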

Running code

Calculate smoothed flow using [Yu and Ramamoorthi 2020].

conda activate FuSta_CVPR2020
cd CVPR2020CODE_yulunliu_modified
python main.py [input_frames_path] [output_frames_path] [output_warping_field_path]

For example:

python main.py ../../NUS/Crowd/0/ NUS_results/Crowd/0/ CVPR2020_warping_field/

Run FuSta video stabilization.

conda deactivate
conda activate FuSta
cd ..
python run_FuSta.py --load [model_checkpoint_path] --input_frames_path [input_frames_path] --warping_field_path [warping_field_path] --output_path [output_frames_path] --temporal_width [temporal_width] --temporal_step [temporal_step]

For example:

python run_FuSta.py --load NeRViS_model/checkpoint/model_epoch050.pth --input_frames_path ../NUS/Crowd/0/ --warping_field_path CVPR2020CODE_yulunliu_modified/CVPR2020_warping_field/ --output_path output/ --temporal_width 41 --temporal_step 4
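
The two stages above run from different working directories, so the same input folder is written as ../../NUS/Crowd/0/ in the first command and ../NUS/Crowd/0/ in the second. The dry-run sketch below only prints the two command lines for one sequence so the paths can be checked before anything is launched. The function names stage1_cmd and stage2_cmd are hypothetical, and the defaults of 41 and 4 are simply copied from the example above, not recommended values.

```shell
#!/usr/bin/env bash
# Dry-run sketch (hypothetical helpers, not part of the FuSta repository).
# Paths are printed as given, so this assumes they contain no spaces.

# Stage 1: run inside CVPR2020CODE_yulunliu_modified with FuSta_CVPR2020 active.
stage1_cmd() {
    local frames="$1" smoothed_out="$2" field_dir="$3"
    printf 'python main.py %s %s %s\n' "$frames" "$smoothed_out" "$field_dir"
}

# Stage 2: run from the repository root with the FuSta environment active.
stage2_cmd() {
    local ckpt="$1" frames="$2" field_dir="$3" out_dir="$4"
    local width="${5:-41}" step="${6:-4}"   # defaults copied from the README example
    printf 'python run_FuSta.py --load %s --input_frames_path %s --warping_field_path %s --output_path %s --temporal_width %s --temporal_step %s\n' \
        "$ckpt" "$frames" "$field_dir" "$out_dir" "$width" "$step"
}

# Reproduce the example commands from this README.
stage1_cmd ../../NUS/Crowd/0/ NUS_results/Crowd/0/ CVPR2020_warping_field/
stage2_cmd NeRViS_model/checkpoint/model_epoch050.pth ../NUS/Crowd/0/ \
    CVPR2020CODE_yulunliu_modified/CVPR2020_warping_field/ output/
```

Note that the warping field directory written by the first stage sits inside CVPR2020CODE_yulunliu_modified, which is why the second stage refers to it with that prefix.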

Citation

@inproceedings{Liu-FuSta-2021,
    author    = {Liu, Yu-Lun and Lai, Wei-Sheng and Yang, Ming-Hsuan and Chuang, Yung-Yu and Huang, Jia-Bin}, 
    title     = {Hybrid Neural Fusion for Full-frame Video Stabilization}, 
    journal   = {arXiv preprint},
    year      = {2021}
}

Acknowledgements

Parts of the code are based on AdaCoF-pytorch. Some functions are borrowed from softmax-splatting, RAFT, and [Yu and Ramamoorthi 2020].

Owner

Yu-Lun Liu
