Deep Learning for Natural Language Processing SS 2021 (TU Darmstadt)

Last update: Jan 26, 2022

Task

Training huge unsupervised deep neural networks has yielded strong progress in the field of Natural Language Processing (NLP). Using these extensively pre-trained networks for particular NLP applications is the current state-of-the-art approach. In this project, we address the task of ranking candidate clarifying questions for a given query. We fine-tuned a pre-trained BERT model to rank the candidate clarifying questions by treating the ranking as a classification problem. The resulting model achieves a top-5 accuracy of 0.4565 on the provided benchmark dataset.
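To illustrate what ranking "in a classification manner" can look like, here is a minimal sketch. It assumes the classifier emits two logits (negative, positive) per query/question pair and that candidates are ordered by the softmax probability of the positive class. The function names and the logits are hypothetical and are not taken from the project code.

```python
import math

def positive_probability(logits):
    """Softmax probability of the positive class for a (negative, positive) logit pair."""
    neg, pos = logits
    m = max(neg, pos)  # subtract the max for numerical stability
    e_neg, e_pos = math.exp(neg - m), math.exp(pos - m)
    return e_pos / (e_neg + e_pos)

def rank_questions(questions, logits_per_question, k=5):
    """Return the k questions with the highest positive-class probability."""
    scored = [
        (positive_probability(logits), question)
        for question, logits in zip(questions, logits_per_question)
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [question for _, question in scored[:k]]

# Made-up logits for illustration only; a real run would take them from the classifier.
questions = [
    "do you like using computers",
    "do you want to see some closeup of a turbine",
    "are you referring to a software",
]
logits = [(-1.0, 2.0), (1.5, -0.5), (0.0, 0.5)]
top = rank_questions(questions, logits, k=2)
print(top)
```

Sorting by the positive-class probability turns a binary classifier into a ranker without changing the training objective.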

Installation

This project was originally developed with Python 3.8, PyTorch 1.7, and CUDA 11.0. The training requires one NVIDIA GeForce RTX 1080 (11GB memory).

Create conda environment:

conda create --name dl4nlp python=3.8
conda activate dl4nlp

Install the dependencies:

pip install -r requirements.txt

Run

We use a pretrained BERT-Base by Hugging Face and fine-tune it on the given training dataset. To run training, please use the following command:

python main.py --train
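Fine-tuning a classifier for this task needs labelled query/question pairs. The sketch below shows one plausible way to derive them from a question bank, using the `--true-label` and `--false-label` values documented in this README. It is an assumption about the data preparation, not the project's actual pipeline, and `build_pairs` is a hypothetical helper.

```python
def build_pairs(query, relevant_questions, question_bank, true_label=1, false_label=0):
    """Label every question in the bank as relevant or not for the given query."""
    relevant = set(relevant_questions)
    return [
        (query, question, true_label if question in relevant else false_label)
        for question in question_bank
    ]

# Tiny hand-made question bank for illustration only.
question_bank = [
    "do you like using computers",
    "do you want to see some closeup of a turbine",
    "are you referring to a software",
]
pairs = build_pairs(
    "Tell me about Computers",
    ["do you like using computers", "are you referring to a software"],
    question_bank,
)
for pair in pairs:
    print(pair)
```

Labelling the whole bank gives far more negatives than positives, so a real setup would probably sample or reweight the negatives.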

For evaluation on the test set, please use the following command:

python main.py --test
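The evaluation reports a top-k accuracy. A minimal sketch of such a metric follows, assuming a query counts as a hit when at least one of its relevant clarifying questions appears among the k highest-ranked candidates. The exact definition used by `main.py` is not stated here, so treat this as one common reading; `top_k_accuracy` is a hypothetical function.

```python
def top_k_accuracy(ranked_per_query, relevant_per_query, k=5):
    """Fraction of queries with at least one relevant question in the top k."""
    if not ranked_per_query:
        return 0.0
    hits = 0
    for ranked, relevant in zip(ranked_per_query, relevant_per_query):
        if any(question in relevant for question in ranked[:k]):
            hits += 1
    return hits / len(ranked_per_query)

# Two toy queries with ranked candidate ids and their relevant ids.
ranked_per_query = [["q3", "q1", "q7"], ["q2", "q5", "q9"]]
relevant_per_query = [{"q3"}, {"q5"}]
acc_at_1 = top_k_accuracy(ranked_per_query, relevant_per_query, k=1)
acc_at_2 = top_k_accuracy(ranked_per_query, relevant_per_query, k=2)
print(acc_at_1, acc_at_2)
```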

Arguments for training and/or testing:

--train: Run training on training dataset. Default: True
--val: Run evaluation during training on validation dataset. Default: True
--test: Run evaluation on test dataset. Default: True
--cuda-devices: Set GPU index. Default: 0
--cpu: Run everything on CPU. Default: False
--data-parallel: Use DataParallel. Default: False
--data-root: Path to dataset folder. Default: data
--train-file-name: Name of training file in data-root. Default: training.tsv
--test-file-name: Name of test file in data-root. Default: test_set.tsv
--question-bank-name: Name of question bank file in data-root. Default: question_bank.tsv
--checkpoints-root: Path to checkpoints folder. Default: checkpoints
--checkpoint-name: File name of checkpoint in checkpoints-root to start training or use for testing. Default: None
--runs-root: Path to output runs folder for tensorboard. Default: runs
--txt-root: Path to output txt folder for evaluation results. Default: txt
--lr: Learning rate. Default: 1e-5
--betas: Betas for optimization. Default: (0.9, 0.999)
--weight-decay: Weight decay. Default: 1e-2
--val-start: Set at which epoch to start validation. Default: 0
--val-step: Set the interval, in epochs, between validation runs. Default: 1
--val-split: Fraction of the training dataset held out for validation. Default: 0.005
--num-epochs: Number of epochs for training. Default: 10
--batch-size: Samples per batch. Default: 32
--num-workers: Number of workers. Default: 4
--top-k-accuracy: Evaluation metric with flexible top-k-accuracy. Default: 50
--true-label: True label in dataset. Default: 1
--false-label: False label in dataset. Default: 0
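For orientation, a subset of the options above could be declared with `argparse` roughly as follows. This is a sketch that mirrors the documented names and defaults; how `main.py` actually parses its arguments is not shown here, and `build_parser` is a hypothetical name.

```python
import argparse

def build_parser():
    """Declare a subset of the documented options with their documented defaults."""
    parser = argparse.ArgumentParser(description="Sketch of a subset of the documented options")
    parser.add_argument("--data-root", type=str, default="data")
    parser.add_argument("--lr", type=float, default=1e-5)
    parser.add_argument("--betas", type=float, nargs=2, default=(0.9, 0.999))
    parser.add_argument("--weight-decay", type=float, default=1e-2)
    parser.add_argument("--num-epochs", type=int, default=10)
    parser.add_argument("--batch-size", type=int, default=32)
    parser.add_argument("--val-split", type=float, default=0.005)
    parser.add_argument("--top-k-accuracy", type=int, default=50)
    return parser

# Parse an explicit argument list instead of sys.argv so the sketch runs anywhere.
args = build_parser().parse_args(["--lr", "3e-5", "--betas", "0.9", "0.98"])
print(args.lr, args.betas, args.batch_size)
```

Note that `argparse` maps dashes to underscores, so `--batch-size` is read back as `args.batch_size`.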

Example output

User query:

Tell me about Computers

Proposed clarifying questions:

do you like using computers
do you want to know how to do computer programming
do you want to see some closeup of a turbine
are you looking for information on different computer programming languages
are you referring to a software

Owner

Oliver Hahn
