FastTrees

This repository contains the experimental code supporting the FastTrees paper by Bill Pung.

Software Requirements

Python 3.6, NLTK and PyTorch 0.4 are required for the current codebase.
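To confirm the environment before running anything, a small check along these lines may help. It is only a sketch: the helper names `installed_version` and `report_environment` are hypothetical and not part of this repository, and it reports versions without enforcing the ones listed above.

```python
import importlib
import sys


def installed_version(module_name):
    """Return the module's __version__ string, or None if the import fails."""
    try:
        module = importlib.import_module(module_name)
    except Exception:
        # Covers a missing package as well as a broken installation.
        return None
    return getattr(module, "__version__", "unknown")


def report_environment():
    """Collect the versions relevant to the requirements listed above."""
    return {
        "python": "{}.{}".format(*sys.version_info[:2]),
        "nltk": installed_version("nltk"),
        "torch": installed_version("torch"),
    }


if __name__ == "__main__":
    for name, version in report_environment().items():
        print("{}: {}".format(name, version or "not installed"))
```

Compare the printed versions with Python 3.6 and PyTorch 0.4 by hand; newer PyTorch releases may not run this codebase unchanged.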

Steps

Install PyTorch 0.4 and NLTK
Download the PTB data. Note that the two tasks, language modeling and unsupervised parsing, share the same model structure but require different formats of the PTB data. Language modeling needs the standard 10,000-word Penn Treebank corpus, and parsing needs the parsed Penn Treebank data. Place train.txt, test.txt and valid.txt in ./data/penn/
Scripts and commands
- Train Language Modeling with FastTrees: python main.py --model FT --batch_size 20 --dropout 0.45 --dropouth 0.3 --dropouti 0.5 --wdrop 0.45 --chunk_size 10 --seed 141 --epoch 1000 --data /path/to/your/data
- Train Language Modeling with Conv FastTrees: python main.py --model FFT --batch_size 20 --dropout 0.45 --dropouth 0.3 --dropouti 0.5 --wdrop 0.45 --chunk_size 10 --seed 141 --epoch 1000 --data /path/to/your/data
- Train Language Modeling with Faster FastTrees: python main.py --model CFT --batch_size 20 --dropout 0.45 --dropouth 0.3 --dropouti 0.5 --wdrop 0.45 --chunk_size 10 --seed 141 --epoch 1000 --data /path/to/your/data
- Train Language Modeling with ON-LSTM: python main.py --model ON --batch_size 20 --dropout 0.45 --dropouth 0.3 --dropouti 0.5 --wdrop 0.45 --chunk_size 10 --seed 141 --epoch 1000 --data /path/to/your/data
- Train Language Modeling with LSTM: python main.py --model LSTM --batch_size 20 --dropout 0.45 --dropouth 0.3 --dropouti 0.5 --wdrop 0.45 --chunk_size 10 --seed 141 --epoch 1000 --data /path/to/your/data
- Test Unsupervised Parsing: python test_phrase_grammar.py --cuda
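The training commands above differ only in the value of --model, so they can be generated from one shared argument list. The sketch below checks that the data files are in place and prints one command per model without running anything. The helper names (`missing_data_files`, `build_command`) are hypothetical, and the file names train.txt, valid.txt and test.txt are assumed from the data step above.

```python
import shlex
from pathlib import Path

# Hyperparameters shared by every training command listed above.
COMMON_ARGS = [
    "--batch_size", "20",
    "--dropout", "0.45",
    "--dropouth", "0.3",
    "--dropouti", "0.5",
    "--wdrop", "0.45",
    "--chunk_size", "10",
    "--seed", "141",
    "--epoch", "1000",
]

# Values of --model that appear in the commands above.
MODELS = ["FT", "FFT", "CFT", "ON", "LSTM"]

# Assumed file names for the language modeling data.
EXPECTED_FILES = ["train.txt", "valid.txt", "test.txt"]


def missing_data_files(data_dir):
    """Return the expected PTB files that are absent from data_dir."""
    root = Path(data_dir)
    return [name for name in EXPECTED_FILES if not (root / name).is_file()]


def build_command(model, data_dir):
    """Build the argument list for one training run."""
    if model not in MODELS:
        raise ValueError("unknown model: {}".format(model))
    return (["python", "main.py", "--model", model]
            + COMMON_ARGS
            + ["--data", str(data_dir)])


if __name__ == "__main__":
    data_dir = "./data/penn/"
    missing = missing_data_files(data_dir)
    if missing:
        print("Missing in {}: {}".format(data_dir, ", ".join(missing)))
    # Print the commands only; launching them is left to the user.
    for model in MODELS:
        command = build_command(model, data_dir)
        print(" ".join(shlex.quote(arg) for arg in command))
```

Each argument list can be passed to `subprocess.run` to launch a run, although at 1000 epochs per model it is probably better to start them one at a time.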

Source

The code is partly adapted from the code for the word-level language model and unsupervised parsing experiments in the paper "Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks", which was originally forked from the LSTM and QRNN Language Model Toolkit for PyTorch.

Parallel Latent Tree-Induction for Faster Sequence Encoding

