TFPNER: Exploration on the Named Entity Recognition of Token Fused with Part-of-Speech

Last update: Feb 07, 2022

Related tags

Overview

TFPNER

TFPNER: Exploration on the Named Entity Recognition of Token Fused with Part-of-Speech

Named entity recognition (NER), which aims at identifying real-world entity mentions from texts, is a fundamental task in natural language processing with a wide range of applications. Previous approaches mainly focus on the original pure sentence but the Part of speech (POS) contains rich semantic information and contribute to the success of the Natural Language Processing task. To further improve the performance of the NER task, we proposed the five methods that employed POS tags fused with the original tokens based on the BERT model to achieve the NER task, including concatenating token and POS as one or two sentences, adding POS embedding as one of the embedding elements, model ensemble, and conduct the multi-attention between the token representations and POS representations. In this work, we addressed the CoNLL-2003 and Groningen Meaning Bank (GMB) datasets which can provide both NER tags and POS tags. From our experiments on two datasets, part of the proposed methods can show performance improvement in comparison with the baseline methods.

This is the project I worked with Haoqing Tang, the extraordinary computer scientist in CV & NLP area, during the interesting and memorable Master study period.

TFPNER: Exploration on the Named Entity Recognition of Token Fused with Part-of-Speech

Related tags

Overview

TFPNER

TFPNER: Exploration on the Named Entity Recognition of Token Fused with Part-of-Speech

This is the project I worked with Haoqing Tang, the extraordinary computer scientist in CV & NLP area, during the interesting and memorable Master study period.

Owner

CATs: Semantic Correspondence with Transformers

Code to use Augmented Shapiro Wilks Stopping, as well as code for the paper "Statistically Signifigant Stopping of Neural Network Training"

Text editor on python to convert english text to malayalam(Romanization/Transiteration).

A python project made to generate code using either OpenAI's codex or GPT-J (Although not as good as codex)

Resources for "Natural Language Processing" Coursera course.

Pervasive Attention: 2D Convolutional Networks for Sequence-to-Sequence Prediction

Text to speech is a process to convert any text into voice. Text to speech project takes words on digital devices and convert them into audio. Here I have used Google-text-to-speech library popularly known as gTTS library to convert text file to .mp3 file. Hope you like my project!

Disfl-QA: A Benchmark Dataset for Understanding Disfluencies in Question Answering

تولید اسم های رندوم فینگیلیش

MMDA - multimodal document analysis

✔👉A Centralized WebApp to Ensure Road Safety by checking on with the activities of the driver and activating label generator using NLP.

Example code for "Real-World Natural Language Processing"

Text Classification Using LSTM

GraphNLI: A Graph-based Natural Language Inference Model for Polarity Prediction in Online Debates

A python package to fine-tune transformer-based models for named entity recognition (NER).

A Japanese tokenizer based on recurrent neural networks

Legal text retrieval for python

BERT-based Financial Question Answering System

Unsupervised text tokenizer focused on computational efficiency

Creating an Audiobook (mp3 file) using a Ebook (epub) using BeautifulSoup and Google Text to Speech