Colab notebook for openai/glide-text2im.

Last update: Oct 19, 2022

Overview

GLIDE text2im on Colab

This repository provides a Colab notebook to produce images conditioned on text prompts with GLIDE [1].

Usage

Run text2im.ipynb

Tip: press <Ctrl+F9> to run everything.

Results

The process is based on the small, filtered-data GLIDE model, with classifier-free guidance.

Results consist of 64x64 images, and the corresponding 256x256 upsampled versions.

Expected run-time: 2m30s (for the one-time set-up), 1 min (64x64 sampling), 30 sec (256x256 upsampling).

_{Several uncurated samples obtained with the same prompt: "a magnificent French rooster singing".}

Safety considerations

The small model has 300 million parameters, compared to the unreleased 3.5 billion parameter model.

As described in Appendix F.1, the training dataset was filtered so that it would not contain:

images of humans and human-like objects,
images of violent objects,
two prevalent hate symbols in America (swastika and confederate flag).

References

[1] Alex Nichol, Prafulla Dhariwal, Aditya Ramesh, et al. GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models. arXiv preprint 2112.10741. 2021.

Colab notebook for openai/glide-text2im.

Related tags

Overview

GLIDE text2im on Colab

Usage

Results

Safety considerations

References

Owner

Wok

Fashion Recommender System With Python

PyTorch implementation of SampleRNN: An Unconditional End-to-End Neural Audio Generation Model

ChainerRL is a deep reinforcement learning library built on top of Chainer.

Parametric Contrastive Learning (ICCV2021)

HMLET (Hybrid-Method-of-Linear-and-non-linEar-collaborative-filTering-method)

Pytorch Implementation of Residual Vision Transformers(ResViT)

.NET bindings for the Pytorch engine

Point-NeRF: Point-based Neural Radiance Fields

A fast poisson image editing implementation that can utilize multi-core CPU or GPU to handle a high-resolution image input.

Official implementation of UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation

The "breathing k-means" algorithm with datasets and example notebooks

f-BRS: Rethinking Backpropagating Refinement for Interactive Segmentation

PyTorch implementation of the paper:A Convolutional Approach to Melody Line Identification in Symbolic Scores.

Colossal-AI: A Unified Deep Learning System for Large-Scale Parallel Training

Swapping face using Face Mesh with TensorFlow Lite

Training deep models using anime, illustration images.

Cache Requests in Deta Bases and Echo them with Deta Micros

Computer Vision and Pattern Recognition, NUS CS4243, 2022

Wider or Deeper: Revisiting the ResNet Model for Visual Recognition

This project deploys a yolo fastest model in the form of tflite on raspberry 3b+. The model is from another repository of mine called -Trash-Classification-Car