INTRODUCTION

This is a modification of moein-shariatnia's OpenAI-CLIP repo (https://github.com/moein-shariatnia/OpenAI-CLIP).

Training currently supports the Flickr-8k and Flickr-30k datasets, and the image encoder can be ResNet50 or ViT (vit_base_patch16_384).

The text encoder supports only DistilBERT, as in the moein-shariatnia repo.

ENVIRONMENT SETTING

$ virtualenv .venv --python=python3.6
$ source .venv/bin/activate
$ pip install -r requirements.txt

EXECUTION

Pretrain

$ python3 pretrain.py

Inference

$ python3 inference.py --query={YOUR QUERY}

CAUTION

Set (or check) the following options in config.py before running pretraining or inference:

ex1) dataset ("8k" or "30k"): training dataset (Flickr-8k or Flickr-30k)

ex2) model_name ("resnet50" or "vit_base_patch16_384"): type of image encoder

ex3) pretrained (True or False): whether to start training from the pretrained weights of the text encoder (DistilBERT) and the image encoder (ResNet50 or ViT)

ex4) batch_size: set according to the capacity of your machine
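
As a rough illustration of the four options above, a config.py fragment might look like the sketch below. This is not the repository's actual file: the class name CFG, the helper validate, and the default values shown (including batch_size = 32) are assumptions made for illustration. Only the option names and their allowed values come from the list above.

```python
# Hypothetical sketch of the options listed above; the real config.py may
# be organized differently and will contain more settings.
class CFG:
    dataset = "8k"            # "8k" or "30k"
    model_name = "resnet50"   # "resnet50" or "vit_base_patch16_384"
    pretrained = True         # load pretrained encoder weights or not
    batch_size = 32           # assumed default; tune to your machine


def validate(cfg=CFG):
    """Check the four options against the values this README lists.

    This helper is an illustration only and is not part of the repo.
    """
    if cfg.dataset not in ("8k", "30k"):
        raise ValueError(f"dataset must be '8k' or '30k', got {cfg.dataset!r}")
    if cfg.model_name not in ("resnet50", "vit_base_patch16_384"):
        raise ValueError(f"unsupported model_name: {cfg.model_name!r}")
    if not isinstance(cfg.pretrained, bool):
        raise ValueError("pretrained must be True or False")
    if not isinstance(cfg.batch_size, int) or cfg.batch_size <= 0:
        raise ValueError("batch_size must be a positive integer")
    return True
```

Running a check like this before pretrain.py or inference.py catches a mistyped option early, before any dataset or model weights are loaded.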

Owner

Sangwon Beak
