PyTorch implementation of the paper: Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding

Last update: Dec 14, 2022

Overview

Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding

This repository contains the official PyTorch implementation of the paper:

Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding. Xiao Xu*, Libo Qin*, Kaiji Chen, Guoxing Wu, Linlin Li, Wanxiang Che. AAAI 2022. [Paper(Arxiv)] [Paper]

If you use any source code or datasets included in this toolkit in your work, please cite the following paper. The BibTeX entry is listed below:

...

In the following, we walk you through how to use this repository step by step.

Workflow

Architecture

Results

Preparation

Our code is based on the following packages:

numpy==1.19.5
tqdm==4.50.2
pytorch==1.7.0
python==3.7.3
cudatoolkit==11.0.3
transformers==4.1.1

We strongly recommend using Anaconda to manage your Python environment.
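To confirm that an environment matches the pinned versions above, a small check along the following lines may help. This is a minimal sketch, not part of the repository: the names `PINNED`, `installed_version`, and `check_environment` are hypothetical, and it assumes the conda package `pytorch` is visible to Python under the distribution name `torch`. `python` and `cudatoolkit` are not Python distributions, so they are not checked here beyond printing the interpreter version.

```python
import sys

try:  # Python 3.8+
    from importlib.metadata import PackageNotFoundError, version
except ImportError:  # Python 3.7: assumes the importlib_metadata backport is installed
    from importlib_metadata import PackageNotFoundError, version

# Versions copied from the package list above ("pytorch" is assumed to map to "torch").
PINNED = {
    "numpy": "1.19.5",
    "tqdm": "4.50.2",
    "torch": "1.7.0",
    "transformers": "4.1.1",
}


def installed_version(name):
    """Return the installed version string, or None if the package is missing."""
    try:
        return version(name)
    except PackageNotFoundError:
        return None


def check_environment(pinned=PINNED, lookup=installed_version):
    """Return {name: (expected, found)} for every package that differs."""
    mismatches = {}
    for name, expected in pinned.items():
        found = lookup(name)
        # torch builds may carry a local suffix such as "1.7.0+cu110"; ignore it.
        if found is None or found.split("+")[0] != expected:
            mismatches[name] = (expected, found)
    return mismatches


if __name__ == "__main__":
    print("python", sys.version.split()[0], "(expected 3.7.3)")
    for name, (expected, found) in check_environment().items():
        print(f"{name}: expected {expected}, found {found}")
```

An empty result from `check_environment()` means the four checked packages match the pinned versions.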

We download the Chinese pretrained model checkpoints from the following links:

How to Run it

The script train.py is the main entry point of the project. You can run the experiments with the following commands.

# LSTM w/o Profile on TITAN Xp
python train.py -g -fs -es -uf -bs 8 -lr 0.0006
# LSTM w/ Profile on TITAN Xp
python train.py -g -fs -es -uf -ui -bs 8 -lr 0.0004
# BERT w/o Profile on Tesla V100s PCIE 32GB
python train.py -g -fs -es -uf -up -mt XLNet -bs 8 -lr 0.001 -blr 4e-05
# BERT w/ Profile on Tesla V100 PCIE 32GB
python train.py -g -fs -es -uf -up -ui -mt ELECTRA -bs 8 -lr 0.0008 -blr 4e-05
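If you want to launch all four configurations in sequence, a small wrapper such as the one below may be convenient. It is a sketch under assumptions, not part of the repository: the experiment labels and the helpers `build_command` and `run_all` are hypothetical, and the flag strings are copied verbatim from the commands above without interpreting them. It assumes train.py is in the current working directory.

```python
import subprocess
import sys

# Flag strings copied verbatim from the commands above; the keys are
# hypothetical labels chosen for this sketch.
EXPERIMENTS = {
    "lstm_no_profile": "-g -fs -es -uf -bs 8 -lr 0.0006",
    "lstm_profile": "-g -fs -es -uf -ui -bs 8 -lr 0.0004",
    "pretrained_no_profile": "-g -fs -es -uf -up -mt XLNet -bs 8 -lr 0.001 -blr 4e-05",
    "pretrained_profile": "-g -fs -es -uf -up -ui -mt ELECTRA -bs 8 -lr 0.0008 -blr 4e-05",
}


def build_command(name, script="train.py"):
    """Build the argument list for one experiment, using the current interpreter."""
    return [sys.executable, script] + EXPERIMENTS[name].split()


def run_all(dry_run=True):
    """Print each command; execute it only when dry_run is False."""
    for name in EXPERIMENTS:
        cmd = build_command(name)
        print(name, "->", " ".join(cmd))
        if not dry_run:
            # check=True stops the loop if a training run exits with an error.
            subprocess.run(cmd, check=True)


if __name__ == "__main__":
    run_all(dry_run=True)
```

With `dry_run=True` the wrapper only prints the commands, which is a cheap way to verify them before committing GPU time.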

If you have any questions, please open an issue on the project or email me or lbqin, and we will reply soon.

Acknowledgement

We are highly grateful for the public code of Stack-Propagation!

A Stack-Propagation Framework with Token-Level Intent Detection for Spoken Language Understanding. Libo Qin, Wanxiang Che, Yangming Li, Haoyang Wen and Ting Liu. (EMNLP 2019). Long paper. [pdf] [code]
We are highly grateful for the open-source knowledge graphs!
- CN-DBpedia
- OwnThink
