Navigating StyleGAN2 w latent space using CLIP

an attempt to build sth with the official SG2-ADA Pytorch impl kinda inspired by Generating Images from Prompts using CLIP and StyleGAN based on the og projector.py

things learned:

it's better to generate initial w values from a well converged sample rather than starting with random or median ones
optimizing w and noise inputs works better than w alone
default values of 0.02 for LR/noise work fine with portraits

Quick start

clone SG2 repo, copy clip dir from CLIP repo, install pytorch 1.7.1 and stuff
pick a suitable SG2 PKL (eg FFHQ)
pick a seed
run python3 approach.py --network network-snapshot-ffhq.pkl --outdir project --num-steps 100 --text 'an image of a girl with a face resembling Paul Krugman' --psi 0.8 --seed 12345
alternatively, one can start from a w vector stored as .npz python3 approach.py --network network-snapshot-ffhq.pkl --outdir project --num-steps 100 --text 'an image of a girl with a face resembling Paul Krugman' --w w-7660ca0b7e95428cac94c89459b5cebd8a7acbd4.npz

FFHQ test

python3 approach.py --network stylegan2-ffhq-config-f.pkl --outdir ffhq --num-steps 100 --text 'an image of an Instagram influencer girl' --psi 0.7 --seed 32

Navigating StyleGAN2 w latent space using CLIP

Related tags

Overview

Navigating StyleGAN2 w latent space using CLIP

Quick start

FFHQ test

Owner

Mike K.

MTCNN face detection implementation for TensorFlow, as a PIP package.

Vehicle direction identification consists of three module detection , tracking and direction recognization.

本项目是一个带有前端界面的垃圾分类项目，加载了训练好的模型参数，模型为efficientnetb4，暂时为40分类问题。

Source code for our paper "Improving Empathetic Response Generation by Recognizing Emotion Cause in Conversations"

A script written in Python that returns a consensus string and profile matrix of a given DNA string(s) in FASTA format.

Adversarial Autoencoders

Implementation of Graph Transformer in Pytorch, for potential use in replicating Alphafold2

Cross-view Transformers for real-time Map-view Semantic Segmentation (CVPR 2022 Oral)

A Python Reconnection Tool for alt:V

Code for the paper "Reinforced Active Learning for Image Segmentation"

A Python implementation of the Locality Preserving Matching (LPM) method for pruning outliers in image matching.

Implementation of ML models like Decision tree, Naive Bayes, Logistic Regression and many other

Distributed Asynchronous Hyperparameter Optimization better than HyperOpt.

Learning hidden low dimensional dyanmics using a Generalized Onsager Principle and neural networks

PyTorch deep learning projects made easy.

Easy-to-use micro-wrappers for Gym and PettingZoo based RL Environments

EquiBind: Geometric Deep Learning for Drug Binding Structure Prediction

MicroNet: Improving Image Recognition with Extremely Low FLOPs (ICCV 2021)

PyTorch implementation of the Quasi-Recurrent Neural Network - up to 16 times faster than NVIDIA's cuDNN LSTM

CausaLM: Causal Model Explanation Through Counterfactual Language Models