Augmented CLIP - Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.

Last update: Sep 13, 2022

Related tags

Overview

Train aug_clip against laion400m-embeddings found here: https://laion.ai/laion-400-open-dataset/ - note that this used the base ViT-B/32 CLIP model.

Sample notebook adapted from Sadnow's 360Diffusion repo, thanks to all involved!

Latest revision: Beta 1.52 (10/11/21): https://colab.research.google.com/github/sadnow/360Diffusion/blob/main/360Diffusion_Public.ipynb

Latest highlights: Full compatibility for both 256 and 512 model for upscaling to 256,512,1024,2048, and 4096px.

Note that 4096 files aren’t quite as pretty as 2048, and they’re massive in file size. 2048 is appealing in most cases. If you intend on upscaling to anything higher than 1024, I recommend using the 512 diffusion model found in the settings-

Credits & Acknowledgements

Katherine Crowson (https://github.com/crowsonkb, https://twitter.com/RiversHaveWings)
Founder of OG Diffusion Notebook Original notebook founder; [I think] has a large involvement in both VQGAN and Diffusion!
Daniel Russell (https://github.com/russelldc, https://twitter.com/danielrussruss) Fast Diffusion Fork Founder Made the OG Fast Diffusion notebook.
Dango233 and nsheppard Contributed to Daniel’s Fast Diffusion Notebook
Sadnow (twitter.com/sadly_existent) 360Diffusion Fork Founder Forked Daniel Russel’s Fast Diffusion Notebook to include Real-ESRGAN integration-
airguitararchon (steven) Init Research
Everyone else on the VQLIPSE Discord (https://www.patreon.com/sportsracer48); Support & Research

Prior release(s): Implemented Daniel Russ’s Perlin revisions, fixed init_bug, 4096 double-pass, VRAM fixes, practical debug_mode (set to higher skip_timestep)

All edits & additions are welcome and appreciated~

Augmented CLIP - Training simple models to predict CLIP image embeddings from text embeddings, and vice versa.

Related tags

Overview

Train aug_clip against laion400m-embeddings found here: https://laion.ai/laion-400-open-dataset/ - note that this used the base ViT-B/32 CLIP model.

Sample notebook adapted from Sadnow's 360Diffusion repo, thanks to all involved!

Owner

Peter Baylies

Evolving neural network parameters in JAX.

RMNA: A Neighbor Aggregation-Based Knowledge Graph Representation Learning Model Using Rule Mining

A simple Python library for stochastic graphical ecological models

The devkit of the nuScenes dataset.

Motion planning algorithms commonly used on autonomous vehicles. (path planning + path tracking)

Where2Act: From Pixels to Actions for Articulated 3D Objects

Garbage Detection system which will detect objects based on whether it is plastic waste or plastics or just garbage.

Ascend your Jupyter Notebook usage

A simple root calculater for python

A series of Jupyter notebooks with Chinese comment that walk you through the fundamentals of Machine Learning and Deep Learning in python using Scikit-Learn and TensorFlow.

Research code of ICCV 2021 paper "Mesh Graphormer"

An onlinel learning to rank python codebase.

Supervised multi-SNE (S-multi-SNE): Multi-view visualisation and classification

Semantic Image Synthesis with SPADE

A simple python library for fast image generation of people who do not exist.

Image processing in Python

Reusable constraint types to use with typing.Annotated

A big endian Gentoo port developed on a Pine64.org RockPro64

Simple-Image-Classification - Simple Image Classification Code (PyTorch)

[Link]deep_portfolo - Use Reforcemet earg ad Supervsed learg to Optmze portfolo allocato []