SpanNER: Named Entity Re-/Recognition as Span Prediction

Last update: Dec 17, 2022

This repository contains the code for our paper SpanNER: Named Entity Re-/Recognition as Span Prediction (ACL 2021).

The model designed in this work has been deployed into ExplainaBoard.

Overview

We investigate the complementary advantages of systems based on two paradigms: span prediction models and sequence labeling frameworks. We then show that a span prediction model can also serve as a system combiner that re-recognizes named entities from the outputs of different systems. We implement 154 systems on 11 datasets covering three languages. Comprehensive results show the effectiveness of span prediction models both as base NER systems and as system combiners.
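To make the span prediction paradigm concrete, the following minimal sketch enumerates the candidate spans of a sentence. A span prediction model would then assign each candidate a label (an entity type or a non-entity label). The function name enumerate_spans and the max_len parameter are illustrative assumptions and are not taken from this repository's code.

```python
from typing import List, Tuple


def enumerate_spans(tokens: List[str], max_len: int = 4) -> List[Tuple[int, int]]:
    """Return every (start, end) token span, end inclusive, with at most max_len tokens.

    Hypothetical helper for illustration only; the repository's own
    span enumeration may differ.
    """
    spans = []
    for start in range(len(tokens)):
        # The last admissible end index is limited by max_len and the sentence length.
        for end in range(start, min(start + max_len, len(tokens))):
            spans.append((start, end))
    return spans


tokens = ["Barack", "Obama", "visited", "Paris"]
candidate_spans = enumerate_spans(tokens, max_len=2)
for start, end in candidate_spans:
    print((start, end), " ".join(tokens[start:end + 1]))
```

Capping the span length keeps the number of candidates roughly linear in the sentence length instead of quadratic, at the cost of missing entities longer than the cap.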

Demo

SpanNER is deployed in ExplainaBoard.

Quick Installation

python3
PyTorch
pytorch-lightning

Run the following command to install the dependencies:

pip3 install -r requirements.txt

Data Preprocessing

The dataset needs to be preprocessed before running the model. We provide dataprocess/bio2spannerformat.py for reference, which uses CoNLL-2003 as an example. First, download the datasets and convert them into the BIO2 tagging format. We provide the CoNLL-2003 dataset in BIO format in the data/conll03_bio folder, and its preprocessed version in the data/conll03 folder.
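The core idea behind the preprocessing step, turning token-level BIO2 tags into labeled spans, can be sketched as follows. This is only an illustration under stated assumptions: the function bio_to_spans and the (start, end, label) output are hypothetical, and the exact file format produced by dataprocess/bio2spannerformat.py may differ.

```python
from typing import List, Optional, Tuple


def bio_to_spans(tags: List[str]) -> List[Tuple[int, int, str]]:
    """Convert BIO2 tags into (start, end, label) spans with an inclusive end index.

    Hypothetical helper, not the repository's converter.
    """
    spans = []
    start: Optional[int] = None
    label: Optional[str] = None
    for i, tag in enumerate(tags):
        # An I- tag continues the open span only if its type matches.
        continues = tag.startswith("I-") and label == tag[2:]
        if start is not None and not continues:
            spans.append((start, i - 1, label))
            start, label = None, None
        # A B- tag opens a span; a stray I- tag is treated as an opening tag too.
        if tag.startswith("B-") or (tag.startswith("I-") and start is None):
            start, label = i, tag[2:]
    if start is not None:
        # Close a span that runs to the end of the sentence.
        spans.append((start, len(tags) - 1, label))
    return spans


tags = ["B-PER", "I-PER", "O", "B-LOC"]
spans = bio_to_spans(tags)
print(spans)
```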

The download links of the datasets used in this work are shown as follows:

Prepare Models

For English datasets, we use BERT-Large.

For Dutch and Spanish datasets, we use BERT-Multilingual-Base.

How to Run?

We use CoNLL-2003 as an example. You may need to change DATA_DIR, PRETRAINED, dataname, and n_class to your own dataset path, pre-trained model path, dataset name, and number of labels in the dataset, respectively.

./run_conll03_spanner.sh

System Combination

Base Model

We provide 12 base models (result files) for the CoNLL-2003 dataset in combination/results. More base models (result files) can be downloaded from ExplainaBoard-download.

Combination

Put your different base models (result-files) in the data/results folder, then run:

python comb_voting.py

We provide four system combination methods:

SpanNER,
Majority voting (VM),
Weighted voting based on overall F1-score (VOF1),
Weighted voting based on class F1-score (VCF1).
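The voting-based combiners can be sketched as below. This is a minimal illustration and not the code in comb_voting.py: the function combine, the span-to-label dictionaries, and the F1 values are all assumptions made for the example. Each system votes for a label on every candidate span (a system that did not predict a span is assumed to vote "O"), and votes are weighted by a caller-supplied function.

```python
from collections import defaultdict
from typing import Callable, Dict, List, Tuple

Span = Tuple[int, int]  # (start, end), end inclusive


def combine(predictions: List[Dict[Span, str]],
            weight: Callable[[int, str], float]) -> Dict[Span, str]:
    """Pick, for each span, the label with the highest summed vote weight.

    weight(system_index, label) returns the weight of one system's vote.
    Hypothetical helper; ties are broken by vote order, which is arbitrary.
    """
    all_spans = set().union(*[p.keys() for p in predictions])
    combined = {}
    for span in all_spans:
        scores = defaultdict(float)
        for i, pred in enumerate(predictions):
            label = pred.get(span, "O")  # assumed: a missing span counts as an "O" vote
            scores[label] += weight(i, label)
        best = max(scores, key=scores.get)
        if best != "O":
            combined[span] = best
    return combined


# Three toy systems predicting labels for spans of the same sentence.
systems = [
    {(0, 1): "PER", (3, 3): "LOC"},
    {(0, 1): "PER", (3, 3): "ORG"},
    {(0, 1): "PER", (3, 3): "LOC", (2, 2): "MISC"},
]

# Majority voting: every vote has the same weight.
majority = combine(systems, lambda i, label: 1.0)

# Weighting by overall F1: made-up scores, for illustration only.
overall_f1 = [0.90, 0.95, 0.88]
weighted = combine(systems, lambda i, label: overall_f1[i])

print(majority)
print(weighted)
```

A class-based variant would pass a weight function that looks up a per-system, per-label score instead of a single score per system. How "O" votes are weighted in that case is a design choice this sketch does not settle.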

Results at a Glance

Bib

@article{fu2021spanner,
  title={SpanNer: Named Entity Re-/Recognition as Span Prediction},
  author={Fu, Jinlan and Huang, Xuanjing and Liu, Pengfei},
  journal={arXiv preprint arXiv:2106.00641},
  year={2021}
}

Owner

NeuLab
