This project colorizes grayscale images using multiple exemplars.

Last update: Aug 05, 2022

Overview

Multiple Exemplar-based Deep Colorization (PyTorch Implementation)

Jitendra Chautharia (IIT Jodhpur)

Prerequisites

Python 3.6+
NVIDIA GPU + CUDA, cuDNN

Installation

First use the following commands to prepare the environment:

conda create -n ColorVid python=3.6
source activate ColorVid
pip install -r requirements.txt
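
Once the environment is created, a quick check can confirm that the prerequisites above are in place. The following is a minimal sketch, not part of this repository: `check_environment` is a hypothetical helper, and it assumes that `requirements.txt` installs PyTorch under the module name `torch`.

```python
import sys


def check_environment():
    """Report whether the listed prerequisites appear to be met."""
    report = {"python_ok": sys.version_info >= (3, 6)}
    try:
        # Assumption: PyTorch is installed by requirements.txt.
        import torch
    except ImportError:
        report["torch_installed"] = False
        report["cuda_available"] = False
        report["cudnn_available"] = False
        return report
    report["torch_installed"] = True
    # True only if a CUDA-capable GPU and a matching driver are visible.
    report["cuda_available"] = torch.cuda.is_available()
    # True only if this PyTorch build can use cuDNN.
    report["cudnn_available"] = torch.backends.cudnn.is_available()
    return report


if __name__ == "__main__":
    for name, ok in check_environment().items():
        print(f"{name}: {ok}")
```

If any entry prints `False`, it is probably worth resolving before downloading the pretrained models.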

Then download the pretrained models from this link, unzip the archive, and place the files in the corresponding folders:

video_moredata_l1 under the checkpoints folder
vgg19_conv.pth and vgg19_gray.pth under the data folder
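
To verify that the downloaded files ended up in the folders listed above, a small check like the following may help. It is a sketch under the assumption that the `checkpoints` and `data` folders sit directly under the repository root. `missing_model_files` is a hypothetical helper, not part of this project.

```python
from pathlib import Path

# Locations named in the list above, relative to the repository root.
EXPECTED_PATHS = (
    "checkpoints/video_moredata_l1",
    "data/vgg19_conv.pth",
    "data/vgg19_gray.pth",
)


def missing_model_files(repo_root="."):
    """Return the expected paths that do not exist under repo_root."""
    root = Path(repo_root)
    return [p for p in EXPECTED_PATHS if not (root / p).exists()]


if __name__ == "__main__":
    missing = missing_model_files()
    if missing:
        print("Missing pretrained files:", ", ".join(missing))
    else:
        print("All pretrained files are in place.")
```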

Data Preparation

To colorize your own video, you need to extract the video frames and provide a reference image as an example.

Place your target grayscale images into one folder, e.g., ./exp_sample/target
Place your reference images into another folder, e.g., ./exp_sample/references
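
Frame extraction is not covered by this README. One possible approach is to call ffmpeg from Python, as sketched below. This assumes ffmpeg is installed and on the PATH. The helper names and the `%05d.png` naming pattern are illustrative choices, and the frame naming that test.py expects has not been verified here.

```python
import shutil
import subprocess
from pathlib import Path


def build_extract_command(video_path, frames_dir, pattern="%05d.png"):
    """Return the ffmpeg argument list that writes one image per frame."""
    # "ffmpeg -i INPUT OUTPUT_PATTERN" dumps every frame to numbered files.
    return ["ffmpeg", "-i", str(video_path), str(Path(frames_dir) / pattern)]


def extract_frames(video_path, frames_dir):
    """Create frames_dir if needed and extract all frames into it."""
    if shutil.which("ffmpeg") is None:
        raise RuntimeError("ffmpeg was not found on PATH")
    Path(frames_dir).mkdir(parents=True, exist_ok=True)
    subprocess.run(build_extract_command(video_path, frames_dir), check=True)


if __name__ == "__main__":
    # "legacy_clip.mp4" is a placeholder file name.
    extract_frames("legacy_clip.mp4", "./exp_sample/target")
```

Keeping the command builder separate from the function that runs it makes it easy to inspect the exact ffmpeg call before executing anything.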

If you want to retrieve color reference images automatically, you can try the retrieval algorithm from this link, which retrieves similar images from the ImageNet dataset, or this link, which works on your own image database.

Test

python test.py --image-size [image-size] \
               --clip_path [path-to-target-grayscale-image] \
               --ref_path [path-to-reference] \
               --output_path [path-to-output]

We provide several sample video clips with corresponding references. For example, you can colorize a sample legacy video with:

python test.py --clip_path ./exp_sample/target \
               --ref_path ./exp_sample/references \
               --output_path ./exp_sample/output
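
When several clips need to be colorized, the same command can be driven from Python. The sketch below only uses the `--clip_path`, `--ref_path`, and `--output_path` flags shown above. The folder layout, with one sub-folder per clip containing `target` and `references`, is a hypothetical convention, and `build_test_command` and `colorize_clips` are illustrative names.

```python
import subprocess
import sys
from pathlib import Path


def build_test_command(clip_path, ref_path, output_path):
    """Return the argument list for one test.py run."""
    return [
        sys.executable, "test.py",
        "--clip_path", str(clip_path),
        "--ref_path", str(ref_path),
        "--output_path", str(output_path),
    ]


def colorize_clips(root):
    """Run test.py once per sub-folder of root (assumed layout)."""
    for clip_dir in sorted(p for p in Path(root).iterdir() if p.is_dir()):
        cmd = build_test_command(
            clip_dir / "target", clip_dir / "references", clip_dir / "output"
        )
        # check=True stops the loop at the first failing clip.
        subprocess.run(cmd, check=True)


if __name__ == "__main__":
    # "./my_clips" is a placeholder directory.
    colorize_clips("./my_clips")
```

Run this from the repository root so that `test.py` resolves correctly.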

Note that we use 216×384 (height × width) images for training, which have an aspect ratio of 9:16. During inference, we scale the input to this size and then rescale the output back to the original size.
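
The effect of this resizing on inputs of other shapes can be estimated with a little arithmetic. The sketch below assumes a direct resize to 216×384 without cropping or padding, which is a guess about the implementation. The helper names are hypothetical.

```python
from fractions import Fraction

# Training resolution stated above (height, width).
TRAIN_HEIGHT, TRAIN_WIDTH = 216, 384


def reduced_ratio(height, width):
    """Return height:width in lowest terms as a (height, width) tuple."""
    ratio = Fraction(height, width)
    return ratio.numerator, ratio.denominator


def stretch_factor(orig_height, orig_width):
    """Horizontal scale divided by vertical scale for a direct resize.

    1.0 means the input already matches the training aspect ratio;
    values above 1.0 mean the frame is widened relative to its height.
    """
    scale_h = Fraction(TRAIN_HEIGHT, orig_height)
    scale_w = Fraction(TRAIN_WIDTH, orig_width)
    return float(scale_w / scale_h)


if __name__ == "__main__":
    print(reduced_ratio(TRAIN_HEIGHT, TRAIN_WIDTH))  # (9, 16)
    print(stretch_factor(1080, 1920))  # 1.0
    print(stretch_factor(480, 640))
```

A 1080×1920 frame keeps its proportions, while a 480×640 frame is widened by about a third during inference and restored when the output is rescaled.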

Train

We also provide training code for reference. The training can be started by running:

python [training script] --data_root [root of video samples] \
       --data_root_imagenet [root of image samples] \
       --gpu_ids [gpu ids]

We do not provide the full video dataset due to copyright issues. For image samples, we retrieve semantically similar images from ImageNet using this repository. You can still refer to our code to understand the detailed procedure for augmenting the image dataset to mimic video frames.

Owner

Jitendra Chautharia
