Grading tools for Advanced NLP (11-711)

Installation

This repo requires Docker and unzip. To install Docker, follow the official guide. On Ubuntu, unzip can be installed with sudo apt-get install unzip.
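To confirm that both prerequisites are on your PATH before installing, a minimal sketch like the following may help. The require_cmd helper is a hypothetical name used for illustration and is not part of this repo.

```shell
# Hypothetical helper (not part of the toolkit): report whether a command
# is available on PATH. Returns 0 if found, 1 otherwise.
require_cmd() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "found: $1"
    else
        echo "missing: $1" >&2
        return 1
    fi
}

# Check the two prerequisites named above without aborting on the first miss.
for cmd in docker unzip; do
    require_cmd "$cmd" || echo "install $cmd before continuing"
done
```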

Install the Python package with:

git clone https://github.com/ProKil/anlp-grading-tools
cd anlp-grading-tools
pip install -e .

Usage

To evaluate your code, first set the following environment variables in test.sh.

ANLP_TMP_DIR: create a new folder (e.g. mkdir tmp) and set this variable to the absolute path of that folder.

SUBMISSION_DIR: this should point to the folder containing your submission zip file. Note that the toolkit will automatically evaluate all zip files in the folder.

SCORES_DIR: this should point to an empty folder. Your score will be logged in a text file there.

DATA_DIR: this should point to the data folder of minnn-assignment. Please copy the original minnn-assignment/classifier.py to minnn-assignment/data/classifier_orig.py so the toolkit can test whether your code runs with the original classifier.

Example code to prepare the folders:

mkdir tmp
mkdir scores
cp -r path/to/minnn-assignment/data ./
cp path/to/minnn-assignment/classifier.py data/classifier_orig.py
mkdir submission
cp your/submission.zip submission
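With the folders above in place, the four variables could take values like the ones sketched below. This assumes the commands were run from the current directory and that test.sh accepts ordinary shell assignments; the exact syntax inside test.sh may differ, in which case copy the values rather than the export lines. ROOT is a hypothetical helper variable.

```shell
# Assumption: the folders tmp, submission, scores and data were created
# in the current working directory, as in the preparation commands.
ROOT="$(pwd)"

# Absolute paths for the folders prepared in the previous step.
export ANLP_TMP_DIR="$ROOT/tmp"
export SUBMISSION_DIR="$ROOT/submission"
export SCORES_DIR="$ROOT/scores"
export DATA_DIR="$ROOT/data"
```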

Now evaluate your code by running bash test.sh. Your scores will be written to SCORES_DIR/andrewid. It is normal to get 0 for the last two scores (the correct labels for the IMDb test set are not available), but the first two should show reasonable accuracies (~40).
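If the run fails early or the scores look wrong, one thing to rule out is the folder setup. The sketch below checks the requirements listed under Usage. check_setup is a hypothetical helper, not something the toolkit provides, and it takes the four folder paths as arguments instead of reading them from test.sh.

```shell
# Hypothetical helper (not part of the toolkit): checks the folder setup
# described under Usage and prints one line per problem found.
# Arguments: tmp dir, submission dir, scores dir, data dir.
check_setup() {
    tmp_dir=$1 submission_dir=$2 scores_dir=$3 data_dir=$4
    status=0

    # All four paths must be existing directories.
    for dir in "$tmp_dir" "$submission_dir" "$scores_dir" "$data_dir"; do
        if [ ! -d "$dir" ]; then
            echo "not a directory: $dir"
            status=1
        fi
    done

    # The toolkit evaluates every zip file in the submission folder.
    if ! ls "$submission_dir"/*.zip >/dev/null 2>&1; then
        echo "no zip files in: $submission_dir"
        status=1
    fi

    # The scores folder is expected to start out empty.
    if [ -n "$(ls -A "$scores_dir" 2>/dev/null)" ]; then
        echo "scores folder is not empty: $scores_dir"
        status=1
    fi

    # The original classifier should have been copied into the data folder.
    if [ ! -f "$data_dir/classifier_orig.py" ]; then
        echo "missing: $data_dir/classifier_orig.py"
        status=1
    fi

    return "$status"
}
```

One way to use it is to call check_setup with the four paths before a fresh run and only run bash test.sh if it returns 0. After a completed run the scores folder is no longer empty, so that check will report it.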

Troubleshooting

Writing files inside ANLP_TMP_DIR and SCORES_DIR may require elevated permissions. You can either use sudo, or open a shell in the container with docker run -v FOLDER_TO_WRITE:/mnt -it --entrypoint /bin/bash anlp and then cd /mnt to write those files.
You may run into other permission issues with Docker. If so, please refer to this page on using Docker without sudo.

Owner

Hao Zhu
