Code and datasets for our paper "PTR: Prompt Tuning with Rules for Text Classification"

Last update: Dec 30, 2022

Related tags

Overview

PTR

Code and datasets for our paper "PTR: Prompt Tuning with Rules for Text Classification"

If you use the code, please cite the following paper:

@article{han2021ptr,
  title={PTR: Prompt Tuning with Rules for Text Classification},
  author={Han, Xu and Zhao, Weilin and Ding, Ning and Liu, Zhiyuan and Sun, Maosong},
  journal={arXiv preprint arXiv:2105.11259},
  year={2021}
}

Requirements

The model is implemented using PyTorch. The versions of packages used are shown below.

numpy>=1.18.0
scikit-learn>=0.22.1
scipy>=1.4.1
torch>=1.3.0
tqdm>=4.41.1
transformers>=4.0.0

Baselines

Some baselines, especially the baselines using entity markers, come from the project [RE_improved_baseline].

Datasets

We provide all the datasets and prompts used in our experiments.

Run the experiments

(1) For TACRED

mkdir results
cd results
mkdir tacred
cd tacred
mkdir train
mkdir val
mkdir test
cd ..
cd ..
cd code_script
bash run_large_tacred.sh

(2) For TACREV

mkdir results
cd results
mkdir tacrev
cd tacrev
mkdir train
mkdir val
mkdir test
cd ..
cd ..
cd code_script
bash run_large_tacrev.sh

(3) For RETACRED

mkdir results
cd results
mkdir retacred
cd retacred
mkdir train
mkdir val
mkdir test
cd ..
cd ..
cd code_script
bash run_large_retacred.sh

Code and datasets for our paper "PTR: Prompt Tuning with Rules for Text Classification"

Related tags

Overview

PTR

Requirements

Baselines

Datasets

Run the experiments

(1) For TACRED

(2) For TACREV

(3) For RETACRED

Owner

THUNLP

Code for Editing Factual Knowledge in Language Models

translate using your voice

Code for Emergent Translation in Multi-Agent Communication

Predict an emoji that is associated with a text

Synthetic data for the people.

LV-BERT: Exploiting Layer Variety for BERT (Findings of ACL 2021)

This is Assignment1 code for the Web Data Processing System.

This is a project built for FALLABOUT2021 event under SRMMIC, This project deals with NLP poetry generation.

Contract Understanding Atticus Dataset

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

ZUNIT - Toward Zero-Shot Unsupervised Image-to-Image Translation

Indonesia spellchecker with python

Scene Text Retrieval via Joint Text Detection and Similarity Learning

Unsupervised text tokenizer for Neural Network-based text generation.

API for the GPT-J language model 🦜. Including a FastAPI backend and a streamlit frontend

Official PyTorch implementation of Time-aware Large Kernel (TaLK) Convolutions (ICML 2020)

PyTorch implementation of Tacotron speech synthesis model.

Seq2seq attn - Use the Seq2Seq method to implement machine translation and introduce Attention mechanism to improve the results

Tools for curating biomedical training data for large-scale language modeling

VampiresVsWerewolves - Our Implementation of a MiniMax algorithm with alpha beta pruning in the context of an in-class competition