MetaNLI

Meta-learning algorithms for training cross-lingual NLI (multi-task) models.

Train (source task)

Reptile

To train the model using the Reptile algorithm, run the command below:

python reptile.py \
    --meta_tasks sc_en,sc_de,sc_es,sc_fr \
    --queue_len 4 \
    --temp 5.0 \
    --epochs 1 \
    --meta_lr 1e-5 \
    --scheduler \
    --gamma 0.5 \
    --step_size 4000 \
    --shot 4 \
    --meta_iteration 8000 \
    --log_interval 300
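
For orientation, the Reptile meta-update can be sketched as follows. This is a minimal illustration on a toy quadratic loss, not the code in reptile.py: the function names, the toy task targets, and the inner-loop settings are hypothetical, and the step_decay helper only assumes that --scheduler, --gamma, and --step_size describe a step-decay schedule for the meta learning rate.

```python
import random


def step_decay(base_lr, gamma, step_size, iteration):
    """Assumed schedule: multiply the rate by gamma every step_size iterations."""
    return base_lr * gamma ** (iteration // step_size)


def inner_sgd(theta, target, steps, lr):
    """Adapt a copy of theta to one task with a few SGD steps.

    The toy loss is 0.5 * sum((w - target)^2), so the gradient is (w - target).
    """
    phi = list(theta)
    for _ in range(steps):
        phi = [w - lr * (w - t) for w, t in zip(phi, target)]
    return phi


def reptile_step(theta, target, meta_lr, inner_steps=3, inner_lr=0.1):
    """Reptile update: move theta a fraction meta_lr toward the adapted weights."""
    phi = inner_sgd(theta, target, inner_steps, inner_lr)
    return [w + meta_lr * (p - w) for w, p in zip(theta, phi)]


def train(tasks, meta_iterations, meta_lr, gamma, step_size, seed=0):
    """Sample one task per meta-iteration and apply the Reptile update."""
    rng = random.Random(seed)
    names = sorted(tasks)
    theta = [0.0, 0.0]
    for it in range(meta_iterations):
        lr = step_decay(meta_lr, gamma, step_size, it)
        theta = reptile_step(theta, tasks[rng.choice(names)], lr)
    return theta


if __name__ == "__main__":
    # Hypothetical toy "tasks": each is just a target weight vector.
    toy_tasks = {"sc_en": [1.0, 2.0], "sc_de": [3.0, 2.0]}
    print(train(toy_tasks, meta_iterations=200, meta_lr=0.5, gamma=0.5, step_size=100))
```

Under the step-decay assumption, the values in the command above (--meta_lr 1e-5, --gamma 0.5, --step_size 4000, --meta_iteration 8000) would give a meta learning rate of 1e-5 for the first 4000 meta-iterations and 5e-6 for the rest.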

Prototypical

To train the model using the Prototypical Networks algorithm, run the command below:

python prototype.py \
    --meta_tasks sc_en,sc_de,sc_es,sc_fr \
    --target_task sc_fa \
    --epochs 1 \
    --meta_lr 1e-5 \
    --lambda_1 1 \
    --lambda_2 1 \
    --scheduler \
    --gamma 0.5 \
    --step_size 1000 \
    --shot 8 \
    --query_num 0 \
    --target_shot 8 \
    --meta_iteration 2500 \
    --log_interval 50
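
The core idea of Prototypical Networks can be sketched as below: each class is represented by the mean of its support embeddings, and a query is assigned to the nearest prototype. This is only an illustrative sketch with hand-written 2-D embeddings; the function names and the example labels are hypothetical and are not taken from prototype.py, which presumably embeds sentences with the model being trained.

```python
from collections import defaultdict


def prototypes(support):
    """Average the support embeddings of each class.

    support: list of (embedding, label) pairs -> {label: mean embedding}
    """
    grouped = defaultdict(list)
    for embedding, label in support:
        grouped[label].append(embedding)
    return {
        label: [sum(column) / len(embeddings) for column in zip(*embeddings)]
        for label, embeddings in grouped.items()
    }


def squared_distance(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def classify(query, protos):
    """Return the label whose prototype is closest to the query embedding."""
    return min(protos, key=lambda label: squared_distance(query, protos[label]))


if __name__ == "__main__":
    # Hypothetical support set: two classes with toy 2-D embeddings.
    support_set = [
        ([0.0, 0.0], "entailment"),
        ([2.0, 0.0], "entailment"),
        ([10.0, 10.0], "contradiction"),
    ]
    protos = prototypes(support_set)
    print(classify([1.0, 1.0], protos))
```

In a trainable version, the negative distances would typically be passed through a softmax to obtain class probabilities and a cross-entropy loss over the query examples.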

Zero-shot Test (on target task)

To perform a zero-shot test of the trained model on the target task, run the command below:

python zeroshot.py \
    --load saved/model_sc.pt \
    --task sc_fa

Fine-tune (target task)

To fine-tune the trained model on the target task, run the command below:

python finetune.py \
    --save saved \
    --model_filename fine.pt \
    --load saved/model_sc.pt \
    --task sc_fa \
    --epochs 5 \
    --lr 1e-5


Owner

M.Hassan Mojab
