Fine-tune GPT-3 with a Google Chat conversation history

Overview

Google Chat GPT-3

This repo will help you fine-tune GPT-3 with a Google Chat conversation history. The trained model will be able to converse as one or both sides of the conversation in the participants' style.

  1. Download your Chat archive from Google Takeout.
  2. Locate the messages.json file of the conversation you would like to use as a training set.
  3. Use the script to prepare data for training:
python preparer.py --messages <MESSAGES_FILE> --output <TRAINING_FILE>
  1. Test your training data with OpenAI's tool:
openai tools fine_tunes.prepare_data -f <TRAINING_FILE>

You should see: No remediations found.

  1. Fine-tine GPT-3 with your training data:
openai api fine_tunes.create -t <TRAINING_FILE>

You should see: Job complete! Status: succeeded ЁЯОЙ. Don't forget to note the name of the model.

  1. Try out your model in the Playground or with the CLI:
openai api completions.create -m 
   

   
Owner
Nate Baer
Software engineer at Procore. CS bachelors from RPI. Software engineering masters from UCI.
Nate Baer
Grading tools for Advanced NLP (11-711)Grading tools for Advanced NLP (11-711)

Grading tools for Advanced NLP (11-711) Installation You'll need docker and unzip to use this repo. For docker, visit the official guide to get starte

Hao Zhu 2 Sep 27, 2022
ADCS cert template modification and ACL enumeration

Purpose This tool is designed to aid an operator in modifying ADCS certificate templates so that a created vulnerable state can be leveraged for privi

Fortalice Solutions, LLC 78 Dec 12, 2022
Silero Models: pre-trained speech-to-text, text-to-speech models and benchmarks made embarrassingly simple

Silero Models: pre-trained speech-to-text, text-to-speech models and benchmarks made embarrassingly simple

Alexander Veysov 3.2k Dec 31, 2022
Code for the Python code smells video on the ArjanCodes channel.

7 Python code smells This repository contains the code for the Python code smells video on the ArjanCodes channel (watch the video here). The example

55 Dec 29, 2022
Ongoing research training transformer language models at scale, including: BERT & GPT-2

Megatron (1 and 2) is a large, powerful transformer developed by the Applied Deep Learning Research team at NVIDIA.

NVIDIA Corporation 3.5k Dec 30, 2022
keras implement of transformers for humans

keras implement of transformers for humans

шЛПхЙСцЮЧ(Jianlin Su) 4.8k Jan 03, 2023
Continuously update some NLP practice based on different tasks.

NLP_practice We will continuously update some NLP practice based on different tasks. prerequisites Software pytorch = 1.10 torchtext = 0.11.0 sklear

0 Jan 05, 2022
Open-World Entity Segmentation

Open-World Entity Segmentation Project Website Lu Qi*, Jason Kuen*, Yi Wang, Jiuxiang Gu, Hengshuang Zhao, Zhe Lin, Philip Torr, Jiaya Jia This projec

DV Lab 408 Dec 29, 2022
[ICLR 2021 Spotlight] Pytorch implementation for "Long-tailed Recognition by Routing Diverse Distribution-Aware Experts."

RIDE: Long-tailed Recognition by Routing Diverse Distribution-Aware Experts. by Xudong Wang, Long Lian, Zhongqi Miao, Ziwei Liu and Stella X. Yu at UC

Xudong (Frank) Wang 205 Dec 16, 2022
рдорд░рд╛рдареА рднрд╛рд╖рд╛ рд╡рд╛рдЪрд╡рд┐рдгреНрдпрд╛рдЪрд╛ рдПрдХ рдкреНрд░рдпрд╛рд╕. рдЗрдВрдЧреНрд░рдЬреА рддреЗ рдорд░рд╛рдареАрдЪрд╛ рд╢рдмреНрджрдХреЛрд╢. An attempt to preserve the Marathi language. A lightweight and ad free English to Marathi thesaurus.

For English, scroll down рдорд░рд╛рдареА рд╢рдмреНрдж рдорд░рд╛рдареА рднрд╛рд╖рд╛ рд╡рд╛рдЪрд╡рдгреНрдпрд╛рд╕рд╛рдареА рдореА рд╣рд╛ рдУрдкрди рд╕реЛрд░реНрд╕ рдкреНрд░реЛрдЬреЗрдХреНрдЯ рд╕реБрд░реВ рдХреЗрд▓рд╛ рдЖрд╣реЗ. рдорд╛рдЭреНрдпрд╛ рдорддреЗ, рдЖрдкрд▓реА рднрд╛рд╖рд╛ рд╣рд│реВрд╣рд│реВ рдЖрдгрд┐ рдХреЛрдгрд╛рдЪрд╛рд╣реА рд▓рдХреНрд╖рд╛рдд

рдореБрдХреНрдд рд╕реНрддреНрд░реЛрдд 20 Oct 11, 2022
Data manipulation and transformation for audio signal processing, powered by PyTorch

torchaudio: an audio library for PyTorch The aim of torchaudio is to apply PyTorch to the audio domain. By supporting PyTorch, torchaudio follows the

1.9k Jan 08, 2023
Machine Learning Course Project, IMDB movie review sentiment analysis by lstm, cnn, and transformer

IMDB Sentiment Analysis This is the final project of Machine Learning Courses in Huazhong University of Science and Technology, School of Artificial I

Daniel 0 Dec 27, 2021
Dope Wars game engine on StarkNet L2 roll-up

RYO Dope Wars game engine on StarkNet L2 roll-up. What TI-83 drug wars built as smart contract system. Background mechanism design notion here. Initia

104 Dec 04, 2022
LegalNLP - Natural Language Processing Methods for the Brazilian Legal Language

LegalNLP - Natural Language Processing Methods for the Brazilian Legal Language тЪЦя╕П The library of Natural Language Processing for Brazilian legal lang

Felipe Maia Polo 125 Dec 20, 2022
Journalism AI тАУ Quotes extraction for modular journalism

Quote extraction for modular journalism (JournalismAI collab 2021)

Journalism AI collab 2021 207 Dec 25, 2022
TTS is a library for advanced Text-to-Speech generation.

TTS is a library for advanced Text-to-Speech generation. It's built on the latest research, was designed to achieve the best trade-off among ease-of-training, speed and quality. TTS comes with pretra

Mozilla 6.5k Jan 08, 2023
AIDynamicTextReader - A simple dynamic text reader based on Artificial intelligence

AI Dynamic Text Reader: This is a simple dynamic text reader based on Artificial

Md. Rakibul Islam 1 Jan 18, 2022
Source code for AAAI20 "Generating Persona Consistent Dialogues by Exploiting Natural Language Inference".

Generating Persona Consistent Dialogues by Exploiting Natural Language Inference Source code for RCDG model in AAAI20 Generating Persona Consistent Di

16 Oct 08, 2022
Trained T5 and T5-large model for creating keywords from text

text to keywords Trained T5-base and T5-large model for creating keywords from text. Supported languages: ru Pretraining Large version | Pretraining B

Danil 61 Nov 24, 2022
StarGAN - Official PyTorch Implementation

StarGAN - Official PyTorch Implementation ***** New: StarGAN v2 is available at https://github.com/clovaai/stargan-v2 ***** This repository provides t

Yunjey Choi 5.1k Dec 30, 2022