Fine-tune GPT-3 with a Google Chat conversation history

Last update: Dec 10, 2022

Related tags

Overview

Google Chat GPT-3

This repo will help you fine-tune GPT-3 with a Google Chat conversation history. The trained model will be able to converse as one or both sides of the conversation in the participants' style.

Download your Chat archive from Google Takeout.
Locate the messages.json file of the conversation you would like to use as a training set.
Use the script to prepare data for training:

python preparer.py --messages <MESSAGES_FILE> --output <TRAINING_FILE>

Test your training data with OpenAI's tool:

openai tools fine_tunes.prepare_data -f <TRAINING_FILE>

You should see: No remediations found.

Fine-tine GPT-3 with your training data:

openai api fine_tunes.create -t <TRAINING_FILE>

You should see: Job complete! Status: succeeded 🎉. Don't forget to note the name of the model.

Try out your model in the Playground or with the CLI:

openai api completions.create -m

Owner

Nate Baer

Software engineer at Procore. CS bachelors from RPI. Software engineering masters from UCI.

GitHub Repository

This repository contains helper functions which can help you generate additional data points depending on your NLP task.

NLP Albumentations For Data Augmentation This repository contains helper functions which can help you generate additional data points depending on you

6 May 22, 2022

An open source library for deep learning end-to-end dialog systems and chatbots.

DeepPavlov is an open-source conversational AI library built on TensorFlow, Keras and PyTorch. DeepPavlov is designed for development of production re

6k Dec 30, 2022

A calibre plugin that generates Word Wise and X-Ray files then sends them to Kindle. Supports KFX, AZW3 and MOBI eBooks. X-Ray supports 18 languages.

WordDumb A calibre plugin that generates Word Wise and X-Ray files then sends them to Kindle. Supports KFX, AZW3 and MOBI eBooks. Languages X-Ray supp

172 Dec 29, 2022

Help you discover excellent English projects and get rid of disturbing by other spoken language

GitHub English Top Charts 「Help you discover excellent English projects and get

544 Jan 09, 2023

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

ALBERT ***************New March 28, 2020 *************** Add a colab tutorial to run fine-tuning for GLUE datasets. ***************New January 7, 2020

3k Dec 26, 2022

The following links explain a bit the idea of semantic search and how search mechanisms work by doing retrieve and rerank

Main Idea The following links explain a bit the idea of semantic search and how search mechanisms work by doing retrieve and rerank Semantic Search Re

2 Jan 28, 2022

Simple Python script to scrape youtube channles of "Parity Technologies and Web3 Foundation" and translate them to well-known braille language or any language

Simple Python script to scrape youtube channles of "Parity Technologies and Web3 Foundation" and translate them to well-known braille language or any

1 Apr 28, 2022

Fine-tune GPT-3 with a Google Chat conversation history

Related tags

Overview

Google Chat GPT-3

Owner

Nate Baer

This repository contains helper functions which can help you generate additional data points depending on your NLP task.

An open source library for deep learning end-to-end dialog systems and chatbots.

A calibre plugin that generates Word Wise and X-Ray files then sends them to Kindle. Supports KFX, AZW3 and MOBI eBooks. X-Ray supports 18 languages.

Help you discover excellent English projects and get rid of disturbing by other spoken language

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

The following links explain a bit the idea of semantic search and how search mechanisms work by doing retrieve and rerank

Simple Python script to scrape youtube channles of "Parity Technologies and Web3 Foundation" and translate them to well-known braille language or any language

ChainKnowledgeGraph, 产业链知识图谱包括A股上市公司、行业和产品共3类实体

Synthetic data for the people.

Translation for Trilium Notes. Trilium Notes 中文版.

The PyTorch based implementation of continuous integrate-and-fire (CIF) module.

⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x using fastT5.

Enterprise Scale NLP with Hugging Face & SageMaker Workshop series

A NLP program: tokenize method, PoS Tagging with deep learning

TalkNet: Audio-visual active speaker detection Model

Artificial Conversational Entity for queries in Eulogio "Amang" Rodriguez Institute of Science and Technology (EARIST)

This repo is to provide a list of literature regarding Deep Learning on Graphs for NLP

FactSumm: Factual Consistency Scorer for Abstractive Summarization

Sequence model architectures from scratch in PyTorch

뉴스 도메인 질의응답 시스템 (21-1학기 졸업 프로젝트)