Library of various Few-Shot Learning frameworks for text classification

Last update: Jan 03, 2023

Related tags

Overview

FewShotText

This repository contains code for the paper A Neural Few-Shot Text Classification Reality Check

Environment setup

# Create environment
python3 -m virtualenv .venv --python=python3.6

# Install environment
.venv/bin/pip install -r requirements.txt

# Activate environment
source .venv/bin/activate

Fine-tuning BERT on the MLM task

model_name=bert-base-cased
block_size=256
dataset=OOS
output_dir=transformer_models/${dataset}/fine-tuned

python scripts_transformers/run_language_modeling.py \
        --model_name_or_path ${model_name} \
        --output_dir ${output_dir} \
        --mlm \
        --do_train \
        --train_data_file data/${dataset}/full/full-train.txt  \
        --do_eval \
        --eval_data_file data/${dataset}/full/full-test.txt \
        --overwrite_output_dir \
        --evaluate_during_training \
        --logging_steps=1000 \
        --line_by_line \
        --logging_dir ${output_dir} \
        --block_size ${block_size} \
        --save_steps=1000 \
        --num_train_epochs 20 \
        --save_total_limit 20 \
        --seed 42

Training a few-shot model

To run the paper's experiments, simply use the utils/scripts/runner.sh file.

Reference

If you use the data or codes in this repository, please cite our paper:

@article{dopierre2021neural,
    title={A Neural Few-Shot Text Classification Reality Check},
    author={Dopierre, Thomas and Gravier, Christophe and Logerais, Wilfried},
    journal={arXiv preprint arXiv:2101.12073},
    year={2021}
}

Library of various Few-Shot Learning frameworks for text classification

Related tags

Overview

FewShotText

Environment setup

Fine-tuning BERT on the MLM task

Training a few-shot model

Reference

Owner

Thomas Dopierre

Runtime type annotations for the shape, dtype etc. of PyTorch Tensors.

“英特尔创新大师杯”深度学习挑战赛赛道3：CCKS2021中文NLP地址相关性任务

Code for the ICCV2021 paper "Personalized Image Semantic Segmentation"

Simple-System-Convert--C--F - Simple System Convert With Python

HW3 ― GAN, ACGAN and UDA

FOSS Digital Asset Distribution Platform built on Frappe.

Repo for "Event-Stream Representation for Human Gaits Identification Using Deep Neural Networks"

An air quality monitoring service with a Raspberry Pi and a SDS011 sensor.

A library that can print Python objects in human readable format

A PoC Corporation Relationship Knowledge Graph System on top of Nebula Graph.

TVNet: Temporal Voting Network for Action Localization

U2-Net: Going Deeper with Nested U-Structure for Salient Object Detection

code and models for "Laplacian Pyramid Reconstruction and Refinement for Semantic Segmentation"

Differential fuzzing for the masses!

Official code release for 3DV 2021 paper Human Performance Capture from Monocular Video in the Wild.

DeepVoxels is an object-specific, persistent 3D feature embedding.

Repository for MDPGT

PASTRIE: A Corpus of Prepositions Annotated with Supersense Tags in Reddit International English

Providing the solutions for high-frequency trading (HFT) strategies using data science approaches (Machine Learning) on Full Orderbook Tick Data.

PyTorch Implementation of Realtime Multi-Person Pose Estimation project.

Library of various Few-Shot Learning frameworks for text classification

Related tags

Overview

FewShotText

Environment setup

Fine-tuning BERT on the MLM task

Training a few-shot model

Reference

Owner

Thomas Dopierre

Runtime type annotations for the shape, dtype etc. of PyTorch Tensors.

“英特尔创新大师杯”深度学习挑战赛 赛道3：CCKS2021中文NLP地址相关性任务

Code for the ICCV2021 paper "Personalized Image Semantic Segmentation"

Simple-System-Convert--C--F - Simple System Convert With Python

HW3 ― GAN, ACGAN and UDA

FOSS Digital Asset Distribution Platform built on Frappe.

Repo for "Event-Stream Representation for Human Gaits Identification Using Deep Neural Networks"

An air quality monitoring service with a Raspberry Pi and a SDS011 sensor.

A library that can print Python objects in human readable format

A PoC Corporation Relationship Knowledge Graph System on top of Nebula Graph.

TVNet: Temporal Voting Network for Action Localization

U2-Net: Going Deeper with Nested U-Structure for Salient Object Detection

code and models for "Laplacian Pyramid Reconstruction and Refinement for Semantic Segmentation"

Differential fuzzing for the masses!

Official code release for 3DV 2021 paper Human Performance Capture from Monocular Video in the Wild.

DeepVoxels is an object-specific, persistent 3D feature embedding.

Repository for MDPGT

PASTRIE: A Corpus of Prepositions Annotated with Supersense Tags in Reddit International English

Providing the solutions for high-frequency trading (HFT) strategies using data science approaches (Machine Learning) on Full Orderbook Tick Data.

PyTorch Implementation of Realtime Multi-Person Pose Estimation project.

“英特尔创新大师杯”深度学习挑战赛赛道3：CCKS2021中文NLP地址相关性任务