GitHub repository for "Improving Video Generation for Multi-functional Applications"

Last update: Dec 07, 2022

Related tags

Overview

Improving Video Generation for Multi-functional Applications

GitHub repository for "Improving Video Generation for Multi-functional Applications"

Paper Link

For more information please refer to our homepage.

Requirements

Tensorflow 1.2.1
Python 2.7
ffmpeg

Data Format

Videos are stored as JPEGs of vertically stacked frames. Every frame needs to be at least 64x64 pixels; videos contain between 16 and 32 frames. For an example datasets see: http://carlvondrick.com/tinyvideo/#data

Training

python main_train.py

Important Parameters:

mode: one of 'generate', 'predict', 'bw2rgb', 'inpaint' depending on weather you want to generate videos, predict future frames, colorize videos or do inpainting.
batch_size: Recommended 64, for colorization use 32 for memory issues.
root_dir: root directory of dataset
index_file: must be in root_dir, containing a list of all training data clips; path relative to root_dir.
experiment_name: name of experiment
output_every: output loss to stdout and write to tensorboard summary every xx steps.
sample_every: generate a visual sample every xx steps.
save_model_very: save the model every xx steps.
recover_model: if true recover model and continue training

GitHub repository for "Improving Video Generation for Multi-functional Applications"

Related tags

Overview

Improving Video Generation for Multi-functional Applications

Requirements

Data Format

Training

Owner

Bernhard Kratzwald

nnFormer: Interleaved Transformer for Volumetric Segmentation

This repository contains code released by Google Research.

ALL Snow Removed: Single Image Desnowing Algorithm Using Hierarchical Dual-tree Complex Wavelet Representation and Contradict Channel Loss (HDCWNet)

A fast poisson image editing implementation that can utilize multi-core CPU or GPU to handle a high-resolution image input.

Learning nonlinear operators via DeepONet

AVD Quickstart Containerlab

pytorch implementation of the ICCV'21 paper "MVTN: Multi-View Transformation Network for 3D Shape Recognition"

CoRe: Contrastive Recurrent State-Space Models

OOD Generalization and Detection (ACL 2020)

A pre-trained language model for social media text in Spanish

This is the repo for Uncertainty Quantification 360 Toolkit.

Supplemental learning materials for "Fourier Feature Networks and Neural Volume Rendering"

So-ViT: Mind Visual Tokens for Vision Transformer

Attempt at implementation of a simple GAN using Keras

Structured Edge Detection Toolbox

This example implements the end-to-end MLOps process using Vertex AI platform and Smart Analytics technology capabilities

The Pytorch implementation for "Video-Text Pre-training with Learned Regions"

CCPD: a diverse and well-annotated dataset for license plate detection and recognition

Honours project, on creating a depth estimation map from two stereo images of featureless regions

pq is a jq-like Pickle file viewer