Constituency Tree Labeling Tool

The purpose of this package is to solve the constituency tree labeling problem.

Look from the dataset labeled by NLTK,it is a bit counter-intuitive and it is very troublesome to label.

Then this package provides a LabelTree, you can use this class to generate dataset, for example, convert example1 and convert example2, and then use the label_tree_to_nltk method to convert them into data conforming to the NLTK label format. Then this package provides a LabelTree, you can use this class to generate dataset, for example, convert example1 and convert example2, and then use the label_tree_to_nltk method to convert them into data conforming to the NLTK label format.

examples

example1

NLTK example 1

     TOP      
      |        
    IP-HLN    
  ____|_____   
 IP   IP    IP
 |    |     |  
 VP   VP    VP
 |    |     |  
 VA   VA    VA
 |    |     |  
 清新   清新    清新

convert example 1

example2

NLTK example 2

                      TOP                 
                       |                   
                     IP-HLN               
                 ______|________________   
              IP-TPC              |     | 
     ___________|______           |     |  
    |                  VP         |     | 
    |            ______|_____     |     |  
    |         PP-DIR         |    |     | 
    |       ____|______      |    |     |  
NP-PN-SBJ  |           NP    VP NP-SBJ  VP
    |      |           |     |    |     |  
    NR     P           NN    VV   NN    VV
    |      |           |     |    |     |  
    广西     对           外     开放   成绩    斐然

convert example 2

More example you can see test.

成分分析树标注工具

这个包的目的在于标注成分分析树。

从nltk标注出来的数据集来看，有点反直觉，标注起来很麻烦。那么此包提供一个LabelTree，您可以通过这个类来生成例如convert example1以及convert example2，然后通过label_tree_to_nltk方法将其转换成符合nltk标注格式的数据出来。

Constituency Tree Labeling Tool

Related tags

Overview

Constituency Tree Labeling Tool

examples

example1

example2

成分分析树标注工具

Owner

张宇

Perform sentiment analysis and keyword extraction on Craigslist listings

PyTorch Implementation of "Non-Autoregressive Neural Machine Translation"

Translation for Trilium Notes. Trilium Notes 中文版.

Code for the paper "Language Models are Unsupervised Multitask Learners"

This is a project of data parallel that running on NLP tasks.

Language-Agnostic SEntence Representations

Beta Distribution Guided Aspect-aware Graph for Aspect Category Sentiment Analysis with Affective Knowledge. Proceedings of EMNLP 2021

Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow

Fast, general, and tested differentiable structured prediction in PyTorch

An implementation of WaveNet with fast generation

An evaluation toolkit for voice conversion models.

Knowledge Management for Humans using Machine Learning & Tags

Tools to download and cleanup Common Crawl data

HuggingSound: A toolkit for speech-related tasks based on HuggingFace's tools

Stuff related to Ben Eater's 8bit breadboard computer

PyTorch Implementation of "Bridging Pre-trained Language Models and Hand-crafted Features for Unsupervised POS Tagging" (Findings of ACL 2022)

Trains an OpenNMT PyTorch model and SentencePiece tokenizer.

Deep Learning for Natural Language Processing - Lectures 2021

Contract Understanding Atticus Dataset

超轻量级bert的pytorch版本，大量中文注释，容易修改结构，持续更新