Script and models for clustering LAION-400m CLIP embeddings.

Last update: Oct 04, 2022

Related tags

Overview

clustering-laion400m

Script and models for clustering LAION-400m CLIP embeddings.

Models were fit on the first million or so image embeddings. A subjective description of what the labels appear to be is included in cluster-labels.txt along with counts for the first million or so embeddings (aka the first file).

Precomputed labels are here: https://archive.org/details/laion400m-64-clustering-labels.tar

Run Fit Clusters.ipynb to reproduce the labels or create your own clusters / models. This requires the CLIP embeddings from the LAION 400m open dataset, which can be found here: https://laion.ai/laion-400-open-dataset/

Owner

Peter Baylies

GitHub Repository

NSFW A chatbot based on GPT2-chitchat

DangBot -- 好怪哦，再来一句卡群怪话bot，powered by GPT2 for Chinese chitchat Training Example: python train.py --lr 5e-2 --epochs 30 --max_len 300 --batch_size 8

11 Jul 21, 2022

A deep learning-based translation library built on Huggingface transformers

DL Translate A deep learning-based translation library built on Huggingface transformers and Facebook's mBART-Large 💻 GitHub Repository 📚 Documentat

244 Dec 30, 2022

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

Speech-Backbones This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab. Grad-TTS Official implementation of the Grad-

295 Jan 07, 2023

German Text-To-Speech Engine using Tacotron and Griffin-Lim

jotts JoTTS is a German text-to-speech engine using tacotron and griffin-lim. The synthesizer model has been trained on my voice using Tacotron1. Due

6 Aug 28, 2022

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.

Welcome to Spokestack Python! This library is intended for developing voice interfaces in Python. This can include anything from Raspberry Pi applicat

133 Sep 20, 2022

Python wrapper for Stanford CoreNLP tools v3.4.1

Python interface to Stanford Core NLP tools v3.4.1 This is a Python wrapper for Stanford University's NLP group's Java-based CoreNLP tools. It can eit

610 Sep 07, 2022

WikiPron - a command-line tool and Python API for mining multilingual pronunciation data from Wiktionary

WikiPron WikiPron is a command-line tool and Python API for mining multilingual pronunciation data from Wiktionary, as well as a database of pronuncia

213 Jan 01, 2023

Espial is an engine for automated organization and discovery of personal knowledge

Live Demo (currently not running, on it) Espial is an engine for automated organization and discovery in knowledge bases. It can be adapted to run wit

159 Dec 30, 2022

Transformers Wav2Vec2 + Parlance's CTCDecodeTransformers Wav2Vec2 + Parlance's CTCDecode

🤗 Transformers Wav2Vec2 + Parlance's CTCDecode Introduction This repo shows how 🤗 Transformers can be used in combination with Parlance's ctcdecode

9 Jul 21, 2022

Speech to text streamlit app

Speech to text Streamlit-app! 👄 This speech to text recognition is powered by t

9 Jan 01, 2023

aMLP Transformer Model for Japanese

aMLP-japanese Japanese aMLP Pretrained Model aMLPとは、Liu, Daiらが提案する、Transformerモデルです。ざっくりというと、BERTの代わりに使えて、より性能の良いモデルです。詳しい解説は、こちらの記事などを参考にしてください。この

13 Aug 11, 2022

Code for using and evaluating SpanBERT.

SpanBERT This repository contains code and models for the paper: SpanBERT: Improving Pre-training by Representing and Predicting Spans. If you prefer

798 Dec 30, 2022

NLP techniques such as named entity recognition, sentiment analysis, topic modeling, text classification with Python to predict sentiment and rating of drug from user reviews.

This file contains the following documents sumbited for Baruch CIS9665 group 9 fall 2021. 1. Dataset: drug_reviews.csv 2. python codes for text classi

2 Jan 04, 2023

Script and models for clustering LAION-400m CLIP embeddings.

Related tags

Overview

clustering-laion400m

Owner

Peter Baylies

NSFW A chatbot based on GPT2-chitchat

A deep learning-based translation library built on Huggingface transformers

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

German Text-To-Speech Engine using Tacotron and Griffin-Lim

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.

Python wrapper for Stanford CoreNLP tools v3.4.1

WikiPron - a command-line tool and Python API for mining multilingual pronunciation data from Wiktionary

Espial is an engine for automated organization and discovery of personal knowledge

Transformers Wav2Vec2 + Parlance's CTCDecodeTransformers Wav2Vec2 + Parlance's CTCDecode

Speech to text streamlit app

aMLP Transformer Model for Japanese

Code for using and evaluating SpanBERT.

NLP techniques such as named entity recognition, sentiment analysis, topic modeling, text classification with Python to predict sentiment and rating of drug from user reviews.

Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts

Include MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.

This repository has a implementations of data augmentation for NLP for Japanese.

A python script that will use hydra to get user and password to login to ssh, ftp, and telnet

A simple word search made in python

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

KoBART model on huggingface transformers

Script and models for clustering LAION-400m CLIP embeddings.

Related tags

Overview

clustering-laion400m

Owner

Peter Baylies

**NSFW** A chatbot based on GPT2-chitchat

A deep learning-based translation library built on Huggingface transformers

This is the main repository of open-sourced speech technology by Huawei Noah's Ark Lab.

German Text-To-Speech Engine using Tacotron and Griffin-Lim

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.

Python wrapper for Stanford CoreNLP tools v3.4.1

WikiPron - a command-line tool and Python API for mining multilingual pronunciation data from Wiktionary

Espial is an engine for automated organization and discovery of personal knowledge

Transformers Wav2Vec2 + Parlance's CTCDecodeTransformers Wav2Vec2 + Parlance's CTCDecode

Speech to text streamlit app

aMLP Transformer Model for Japanese

Code for using and evaluating SpanBERT.

NLP techniques such as named entity recognition, sentiment analysis, topic modeling, text classification with Python to predict sentiment and rating of drug from user reviews.

Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts

Include MelGAN, HifiGAN and Multiband-HifiGAN, maybe NHV in the future.

This repository has a implementations of data augmentation for NLP for Japanese.

A python script that will use hydra to get user and password to login to ssh, ftp, and telnet

A simple word search made in python

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

KoBART model on huggingface transformers

NSFW A chatbot based on GPT2-chitchat