Official Pytorch Implementation for Splicing ViT Features for Semantic Appearance Transfer presenting Splice

Last update: Jan 06, 2023

Overview

Splicing ViT Features for Semantic Appearance Transfer [Project Page]

Splice is a method for semantic appearance transfer, as described in Splicing ViT Features for Semantic Appearance Transfer (link to paper).

Given two input images—a source structure image and a target appearance image–our method generates a new image in which the structure of the source image is preserved, while the visual appearance of the target image is transferred in a semantically aware manner. That is, objects in the structure image are “painted” with the visual appearance of semantically related objects in the appearance image. Our method leverages a self-supervised, pre-trained ViT model as an external semantic prior. This allows us to train our generator only on a single input image pair, without any additional information (e.g., segmentation/correspondences), and without adversarial training. Thus, our framework can work across a variety of objects and scenes, and can generate high quality results in high resolution (e.g., HD).

Getting Started

Installation

git clone https://github.com/omerbt/Splice.git
pip install -r requirements.txt

Run examples

Run the following command to start training

python train.py --dataroot datasets/cows

Intermediate results will be saved to /out/output.png during optimization. The frequency of saving intermediate results is indicated in the save_epoch_freq flag of the configuration.

Sample Results

Citation

@article{Splice2022,
    author = {Tumanyan, Narek
              and Bar-Tal, Omer
              and Bagon, Shai
              and Dekel, Tali
              },
    title = {Splicing ViT Features for Semantic Appearance Transfer}, 
    journal = {arXiv preprint arXiv:2201.00424},
    year  = {2022}
}

Official Pytorch Implementation for Splicing ViT Features for Semantic Appearance Transfer presenting Splice

Related tags

Overview

Splicing ViT Features for Semantic Appearance Transfer [Project Page]

Getting Started

Installation

Run examples

Sample Results

Citation

Owner

Omer Bar Tal

For holding anime-related object classification and detection models

Apply a perspective transformation to a raster image inside Inkscape (no need to use an external software such as GIMP or Krita).

[AAAI 2022] Negative Sample Matters: A Renaissance of Metric Learning for Temporal Grounding

TuckER: Tensor Factorization for Knowledge Graph Completion

Torch-mutable-modules - Use in-place and assignment operations on PyTorch module parameters with support for autograd

《Image2Reverb: Cross-Modal Reverb Impulse Response Synthesis》(2021)

PyTorch implementation of MulMON

Repository for paper "Non-intrusive speech intelligibility prediction from discrete latent representations"

Implementation of "Learning Multi-Granular Hypergraphs for Video-Based Person Re-Identification"

We present a regularized self-labeling approach to improve the generalization and robustness properties of fine-tuning.

Event sourced bank - A wide-and-shallow example using the Python event sourcing library

CL-Gym: Full-Featured PyTorch Library for Continual Learning

Custom implementation of Corrleation Module

Callable PyTrees and filtered JIT/grad transformations => neural networks in JAX.

The official implementation of NeMo: Neural Mesh Models of Contrastive Features for Robust 3D Pose Estimation [ICLR-2021]. https://arxiv.org/pdf/2101.12378.pdf

State-to-Distribution (STD) Model

BossNAS: Exploring Hybrid CNN-transformers with Block-wisely Self-supervised Neural Architecture Search

Code release for NeX: Real-time View Synthesis with Neural Basis Expansion

Public repository created to store my custom-made tools for Just Dance (UbiArt Engine)

Towards Improving Embedding Based Models of Social Network Alignment via Pseudo Anchors