A simple library that implements CLIP guided loss in PyTorch.

Last update: Dec 26, 2022

Overview

pytorch_clip_guided_loss: Pytorch implementation of the CLIP guided loss for Text-To-Image, Image-To-Image, or Image-To-Text generation.

A simple library that implements CLIP guided loss in PyTorch.

Install package

pip install pytorch_clip_guided_loss

Install the latest version

pip install --upgrade git+https://github.com/bes-dev/pytorch_clip_guided_loss.git

Features

The library supports multiple prompts (images or texts) as targets for optimization.
The library automatically detects the language of the input text, and multilingual translate it via google translate.
The library supports the original CLIP model by OpenAI and ruCLIP model by SberAI.

Usage

Simple code

import torch
from pytorch_clip_guided_loss import get_clip_guided_loss

loss_fn = get_clip_guided_loss(clip_type="ruclip", input_range = (-1, 1)).eval().requires_grad_(False)
# text prompt
loss_fn.add_prompt(text="text description of the what we would like to generate")
# image prompt
loss_fn.add_prompt(image=torch.randn(1, 3, 224, 224))

# variable
var = torch.randn(1, 3, 224, 224).requires_grad_(True)
loss = loss_fn(image=var)["loss"]
loss.backward()
print(var.grad)

VQGAN-CLIP

We provide our tiny implementation of the VQGAN-CLIP pipeline for image generation as an example of the usage of our library. To start using our implementation of the VQGAN-CLIP please follow by documentation.

A simple library that implements CLIP guided loss in PyTorch.

Related tags

Overview

pytorch_clip_guided_loss: Pytorch implementation of the CLIP guided loss for Text-To-Image, Image-To-Image, or Image-To-Text generation.

Install package

Install the latest version

Features

Usage

Simple code

VQGAN-CLIP

Owner

Sergei Belousov

Finetune SSL models for MOS prediction

PULSE: Self-Supervised Photo Upsampling via Latent Space Exploration of Generative Models

PyTorch implementation of HDN(Homography Decomposition Networks) for planar object tracking

Code release for ConvNeXt model

Implementation of paper "Towards a Unified View of Parameter-Efficient Transfer Learning"

Deep Reinforcement Learning for Keras.

CoSMA: Convolutional Semi-Regular Mesh Autoencoder. From Paper "Mesh Convolutional Autoencoder for Semi-Regular Meshes of Different Sizes"

This is project is the implementation of the DeepShift: Towards Multiplication-Less Neural Networks paper

Official implementation of SIGIR'2021 paper: "Sequential Recommendation with Graph Neural Networks".

Rethinking Nearest Neighbors for Visual Classification

ICLR2021 (Under Review)

Edison AT is software Depression Assistant personal.

Aligning Latent and Image Spaces to Connect the Unconnectable

PointCNN: Convolution On X-Transformed Points (NeurIPS 2018)

GazeScroller - Using Facial Movements to perform Hands-free Gesture on the system

PyTorch implementation of paper "Neural Scene Flow Fields for Space-Time View Synthesis of Dynamic Scenes", CVPR 2021

A PyTorch Implementation of ViT (Vision Transformer)

Jupyter notebooks showing best practices for using cx_Oracle, the Python DB API for Oracle Database

A Simulation Environment to train Robots in Large Realistic Interactive Scenes

Implementation of 🦩 Flamingo, state-of-the-art few-shot visual question answering attention net out of Deepmind, in Pytorch