Semi-SDP

Semi-supervised parser for semantic dependency parsing.

This repo contains the code used for the semi-supervised semantic dependency parser in Jia et al. (2020), Semi-Supervised Semantic Dependency Parsing Using CRF Autoencoders. Part of the codebase is extended from Parser-v3.

Requirement

python3
tensorflow-gpu=1.13.1

How to use

Training

Our semi-sdp parser can be trained by simply running

python3 main.py train UnlabelGraphParserNetwork  --force --config_file $CONFIGFILE

CONFIGFILE contains all hyperparameters of the model, we give an example file in the directory 'config/'. By default, if the save directory already exists, you'll get a prompt warning you that the system will delete it if you continue and giving you one last chance to opt-out. If you are debugging or want to run the program in the background, add the --force flag.
Our model was trained with Glove embedding.
To train with the unlabeled data, note the parameters in Flag subsection in the config file. Set fix_label_data=True under the Flag and the labeled_num means how many sentences have labels in your training set (Need to put labeled data in front of unlabeled data). For other parameters that need to be modified, please refer to the paper and code.

Data

Format of datasets used in this code is CoNLL-U, we give an example in the directory 'data/'. The path of training/develop set is set in the CONFIGFILE, these can be changed according to yourself setting. Depending on the memory size (24G) of the running platform, we use labeled sentences less than 60 in length. Script that filters sentence according to lengths is simple, we give an exampel in the directory 'scripts/select_sents.py'.

Parsing

The trained model can be run by calling

python main.py --save_dir $SAVEDIR run --output_dir TestResult $DATAFILE

This will save parsed sentences in DATAFILE to the TestResult/ directory--make sure no files in different directories have the same basename though, or one will get overwritten! The sentences in DATAFILE is in CoNLL-U format. SAVEDIR is the directory that your trained model saved.

Semi-SDP Semi-supervised parser for semantic dependency parsing.

Related tags

Overview

Semi-SDP

Requirement

How to use

Training

Data

Parsing

Owner

Toontown House CT Edition

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

A modular application for performing anomaly detection in networks

🔅 Shapash makes Machine Learning models transparent and understandable by everyone

A Neural Net Training Interface on TensorFlow, with focus on speed + flexibility

Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.

(Python, R, C/C++) Isolation Forest and variations such as SCiForest and EIF, with some additions (outlier detection + similarity + NA imputation)

Populating 3D Scenes by Learning Human-Scene Interaction https://posa.is.tue.mpg.de/

Deploy a ML inference service on a budget in less than 10 lines of code.

This python-based package offers a way of creating a parametric OpenMC plasma source from plasma parameters.

Simple implementation of OpenAI CLIP model in PyTorch.

A pure PyTorch batched computation implementation of "CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition"

Real-CUGAN - Real Cascade U-Nets for Anime Image Super Resolution

Time-stretch audio clips quickly with PyTorch (CUDA supported)! Additional utilities for searching efficient transformations are included.

A Player for Kanye West's Stem Player. Sort of an emulator.

A Review of Deep Learning Techniques for Markerless Human Motion on Synthetic Datasets

The Hailo Model Zoo includes pre-trained models and a full building and evaluation environment

Conjugated Discrete Distributions for Distributional Reinforcement Learning (C2D)

mmfewshot is an open source few shot learning toolbox based on PyTorch

Assessing syntactic abilities of BERT