HairCLIP: Design Your Hair by Text and Reference Image

Last update: Jan 06, 2023

Overview

This repository hosts the official PyTorch implementation of the paper: "HairCLIP: Design Your Hair by Text and Reference Image".

Our single framework supports hairstyle and hair color editing, individually or jointly, and the conditional inputs can come from either the image or the text domain.

Tianyi Wei¹, Dongdong Chen², Wenbo Zhou¹, Jing Liao³, Zhentao Tan¹, Lu Yuan², Weiming Zhang¹, Nenghai Yu¹
¹University of Science and Technology of China, ²Microsoft Cloud AI, ³City University of Hong Kong

Abstract

Hair editing is an interesting and challenging problem in computer vision and graphics. Many existing methods require well-drawn sketches or masks as conditional inputs for editing; however, these interactions are neither straightforward nor efficient. To free users from the tedious interaction process, this paper proposes a new hair editing interaction mode, which enables manipulating hair attributes individually or jointly based on the texts or reference images provided by users. For this purpose, we encode the image and text conditions in a shared embedding space and propose a unified hair editing framework by leveraging the powerful image-text representation capability of the Contrastive Language-Image Pre-Training (CLIP) model. With the carefully designed network structures and loss functions, our framework can perform high-quality hair editing in a disentangled manner. Extensive experiments demonstrate the superiority of our approach in terms of manipulation accuracy, visual realism of editing results, and irrelevant attribute preservation.
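As a rough illustration of why a shared embedding space lets text and reference images act as interchangeable conditions, the sketch below compares toy vectors with cosine similarity. It is not code from this repository: the 4-dimensional vectors are made-up stand-ins for embeddings that trained CLIP encoders would produce, and the helper names `l2_normalize` and `cosine_similarity` are hypothetical.

```python
import math


def l2_normalize(vec):
    """Scale a vector to unit length so dot products become cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def cosine_similarity(a, b):
    """Cosine similarity between two vectors of equal length."""
    a_n, b_n = l2_normalize(a), l2_normalize(b)
    return sum(x * y for x, y in zip(a_n, b_n))


# Toy stand-ins (assumption): real embeddings are much higher-dimensional
# and come from trained text and image encoders, not hand-written lists.
text_condition = [0.9, 0.1, 0.0, 0.2]       # hypothetical embedding of a text prompt
image_condition = [0.8, 0.2, 0.1, 0.1]      # hypothetical embedding of a matching reference image
unrelated_condition = [0.0, 0.1, 0.9, 0.0]  # hypothetical embedding of an unrelated condition

sim_match = cosine_similarity(text_condition, image_condition)
sim_mismatch = cosine_similarity(text_condition, unrelated_condition)

print(f"text vs. matching image:   {sim_match:.3f}")
print(f"text vs. unrelated vector: {sim_mismatch:.3f}")
```

Because both modalities land in the same space, a downstream editing network can in principle consume either kind of condition through one interface; how HairCLIP actually injects these conditions is described in the paper, not in this sketch.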

Comparison

Comparison to Text-Driven Image Manipulation Methods

Comparison to Hair Transfer Methods

Application

Hair Interpolation

Generalization Ability to Unseen Descriptions

Cross-Modal Conditional Inputs

To Do
