This code extends the neural style transfer image processing technique to video by generating smooth transitions between several reference style images

Last update: Jan 07, 2023

Overview

Neural Style Transfer Transition Video Processing

By Brycen Westgarth and Tristan Jogminas

Description

This code extends the neural style transfer image processing technique to video by generating smooth transitions between a sequence of reference style images across video frames. The generated output video is a highly altered, artistic representation of the input video consisting of constantly changing abstract patterns and colors that emulate the original content of the video. The user's choice of style reference images, style sequence order, and style sequence length allow for infinite user experimentation and the creation of an endless range of artistically interesting videos.

System Requirements

This algorithm is computationally intensive so I highly recommend optimizing its performance by installing drivers for Tensorflow GPU support if you have access to a CUDA compatible GPU. Alternatively, you can take advantage of the free GPU resources available through Google Colab Notebooks. Even with GPU acceleration, the program may take several minutes to render a video.

Colab Notebook Version

Configuration

All configuration of the video properties and input/output file locations can be set by the user in config.py

Configurable Variable in config.py	Description
ROOT_PATH	Path to input/output directory
FRAME_HEIGHT	Sets height dimension in pixels to resize the output video to. Video width will be calculated automatically to preserve aspect ratio. Low values will speed up processing time but reduce output video quality
INPUT_FPS	Defines the rate at which frames are captured from the input video
INPUT_VIDEO_NAME	Filename of input video
STYLE_SEQUENCE	List that contains the indices corresponding to the image files in the 'style_ref' folder. Defines the reference style image transition sequence. Can be arbitrary length, the rate at which the video transitions between styles will be adjusted to fit the video
OUTPUT_FPS	Defines the frame rate of the output video
OUTPUT_VIDEO_NAME	Filename of output video to be created
GHOST_FRAME_TRANSPARENCY	Proportional feedback constant for frame generation. Should be a value between 0 and 1. Affects the amount change that can occur between frames and the smoothness of the transitions.

The user must find and place their own style reference images in the style_ref directory. Style reference images can be arbitrary size. Three example style reference images are given.

Minor video time effects can be created by setting INPUT_FPS and OUTPUT_FPS to different relative values

INPUT_FPS > OUTPUT_FPS creates a slowed time effect
INPUT_FPS = OUTPUT_FPS creates no time effect
INPUT_FPS < OUTPUT_FPS creates a timelapse effect

Usage

$ python3 -m venv env
$ source env/bin/activate
$ pip3 install -r requirements.txt
$ python3 style_frames.py

This code extends the neural style transfer image processing technique to video by generating smooth transitions between several reference style images

Related tags

Overview

Neural Style Transfer Transition Video Processing

Description

System Requirements

Configuration

Usage

Examples

Input Video

Example 1

Reference Style Image Transition Sequence

Output Video

Example 2

Reference Style Image Transition Sequence

Output Video

Example Video made using this program

Owner

Brycen Westgarth

Code for the paper in Findings of EMNLP 2021: "EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge Distillation".

Code for papers "Generation-Augmented Retrieval for Open-Domain Question Answering" and "Reader-Guided Passage Reranking for Open-Domain Question Answering", ACL 2021

This is a project of data parallel that running on NLP tasks.

[EMNLP 2021] Mirror-BERT: Converting Pretrained Language Models to universal text encoders without labels.

CPC-big and k-means clustering for zero-resource speech processing

Multilingual finetuning of Machine Translation model on low-resource languages. Project for Deep Natural Language Processing course.

Open source code for AlphaFold.

SAVI2I: Continuous and Diverse Image-to-Image Translation via Signed Attribute Vectors

Installation, test and evaluation of Scribosermo speech-to-text engine

justCTF [*] 2020 challenges sources

This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Translation".

spaCy-wrap: For Wrapping fine-tuned transformers in spaCy pipelines

Implementation for paper BLEU: a Method for Automatic Evaluation of Machine Translation

Implementation of paper Does syntax matter? A strong baseline for Aspect-based Sentiment Analysis with RoBERTa.

PyTranslator é simultaneamente um editor e tradutor de texto com diversos recursos e interface feito com coração e 100% em Python

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

An automated program that helps customers of Pizza Palour place their pizza orders

Opal-lang - A WIP programming language based on Python

Repository for Project Insight: NLP as a Service

AudioCLIP Extending CLIP to Image, Text and Audio