This is a general repo that helps you develop fast/effective NLP classifiers using Huggingface

Last update: Mar 11, 2022

Related tags

Overview

NLP Classifier

Introduction

This project trains a bert model on any NLP classifcation model. And uses the model in make predictions on new data using batch_inference.py. This architecture can be easily extended to cover a lot more models.

Installation

Set up

$ https://github.com/abdullahtarek/nlp_classifier.git
$ cd nlp_classifier.git
Move the train.csv and test.csv in the data folder

Python

$ pip install -r requirements.txt
$ Copy the training or testing dataset in the "data" folder
$ python training.py or $ python batch_inference.py

Docker

$ docker build . -t nlp_classifier
$ docker run -it -v $DATA_FOLDER:/app/data -v $LOCAL_SAVED_MODEL_FOLDER:/app/saved_models nlp_classifier python batch_inference.py or python training.py

Extra options

Manging Configurations

All configurations are in the conf folder where you can change the data path, model path, etc.
You can also provide the configuration flag while running the script. You can write --help after the python command to see which configs you can change. Example: python3 batch_inference.py --help.

This is a general repo that helps you develop fast/effective NLP classifiers using Huggingface

Related tags

Overview

NLP Classifier

Introduction

Installation

Set up

Python

Docker

Extra options

Manging Configurations

Owner

Abdullah Tarek

An easy-to-use Python module that helps you to extract the BERT embeddings for a large text dataset (Bengali/English) efficiently.

Must-read papers on improving efficiency for pre-trained language models.

This repository structures data in title, summary, tags, sentiment given a fragment of a conversation

Revisiting Pre-trained Models for Chinese Natural Language Processing (Findings of EMNLP 2020)

Code for "Generating Disentangled Arguments with Prompts: a Simple Event Extraction Framework that Works"

profile tools for pytorch nn models

NeurIPS'21: Probabilistic Margins for Instance Reweighting in Adversarial Training (Pytorch implementation).

Poetry PEP 517 Build Backend & Core Utilities

Implementation of the Hybrid Perception Block and Dual-Pruned Self-Attention block from the ITTR paper for Image to Image Translation using Transformers

RIDE automatically creates the package and boilerplate OOP Python node scripts as per your needs

A PyTorch Implementation of End-to-End Models for Speech-to-Text

Problem: Given a nepali news find the category of the news

AEC_DeepModel - Deep learning based acoustic echo cancellation baseline code

Rethinking the Truly Unsupervised Image-to-Image Translation - Official PyTorch Implementation (ICCV 2021)

Open-Source Toolkit for End-to-End Speech Recognition leveraging PyTorch-Lightning and Hydra.

Train 🤗-transformers model with Poutyne.

A Semi-Intelligent ChatBot filled with statistical and economical data for the Premier League.

Learning Spatio-Temporal Transformer for Visual Tracking

Implementation of COCO-LM, Correcting and Contrasting Text Sequences for Language Model Pretraining, in Pytorch

A notebook that shows how to import the IITB English-Hindi Parallel Corpus from the HuggingFace datasets repository