Utilizing RBERT model for KLUE Relation Extraction task

Last update: Nov 15, 2022

Related tags

Text Data & NLP KLUE-RBERT

Overview

RBERT for Relation Extraction task for KLUE

Project Description

Relation Extraction task is one of the task of Korean Language Understanding Evaluation(KLUE) Benchmark.
Relation extraction can be defined as multiclass classification task for relationship between subject entity and object entity.
Classes are such as no_relation, per:employee_of, org:founded_by... totaling 30 labels.
This repo contains custom fine-tuning method utilizing monologg's R-BERT Implementation.
Custom punctuations with Pororo NER has been added to the dataset prior to the model's training.
If you want to refer to the experimentation note such as punctuation method of the entity, please refer to the blog post

Arguments Usage

Argument	type	Default	Explanation
batch_size	int	40	batch size for training and inferece
num_folds	int	5	number of fold for Stratified KFold
num_train_epochs	int	5	number of epochs for training
loss	str	focalloss	loss function
gamma	float	1.0	focalloss's gamma value
optimizer	str	adamp	optimizer for training
scheduler	str	get_cosine_schedule_with_warmup	learning rate scheduler
learning_rate	float	0.00005	initial learning rate
weight_decay	float	0.01	Loss function's weight decay, preventing overfit
warmup_step	int	500
debug	bool	false	debug with CPU device for better error representation
dropout_rate	float	0.1
save_steps	int	100	number of steps for saving the model
evaluation_steps	int	100	number of step until the evaluation
metric_for_best_model	str	eval/loss	the metric for determining which is the best model
load_best_model_at_end	bool	True

References

Authorship

Hardware

GPU : Tesla V100 32GB

Utilizing RBERT model for KLUE Relation Extraction task

Related tags

Overview

RBERT for Relation Extraction task for KLUE

Project Description

Arguments Usage

References

Authorship

Hardware

Owner

snoop2head

HuggingSound: A toolkit for speech-related tasks based on HuggingFace's tools

hashily is a Python module that provides a variety of text decoding and encoding operations.

Develop open-source Python Arabic NLP libraries that the Arab world will easily use in all Natural Language Processing applications

A pytorch implementation of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".

This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Translation".

⚖️ A Statutory Article Retrieval Dataset in French.

Build Text Rerankers with Deep Language Models

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含自然语言处理各领域的面试题积累。

The repository for the paper: Multilingual Translation via Grafting Pre-trained Language Models

This repository contains the code for "Generating Datasets with Pretrained Language Models".

Submit issues and feature requests for our API here.

Finding Label and Model Errors in Perception Data With Learned Observation Assertions

Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.

This repository contains the code for EMNLP-2021 paper "Word-Level Coreference Resolution"

The Internet Archive Research Assistant - Daily search Internet Archive for new items matching your keywords

A fast and easy implementation of Transformer with PyTorch.

L3Cube-MahaCorpus a Marathi monolingual data set scraped from different internet sources.

Open Source Neural Machine Translation in PyTorch

Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.

Sentiment Analysis Project using Count Vectorizer and TF-IDF Vectorizer

Utilizing RBERT model for KLUE Relation Extraction task

Related tags

Overview

RBERT for Relation Extraction task for KLUE

Project Description

Arguments Usage

References

Authorship

Hardware

Owner

snoop2head

HuggingSound: A toolkit for speech-related tasks based on HuggingFace's tools

hashily is a Python module that provides a variety of text decoding and encoding operations.

Develop open-source Python Arabic NLP libraries that the Arab world will easily use in all Natural Language Processing applications

A pytorch implementation of the ACL2019 paper "Simple and Effective Text Matching with Richer Alignment Features".

This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Translation".

⚖️ A Statutory Article Retrieval Dataset in French.

Build Text Rerankers with Deep Language Models

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含 自然语言处理各领域的 面试题积累。

The repository for the paper: Multilingual Translation via Grafting Pre-trained Language Models

This repository contains the code for "Generating Datasets with Pretrained Language Models".

Submit issues and feature requests for our API here.

Finding Label and Model Errors in Perception Data With Learned Observation Assertions

Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.

This repository contains the code for EMNLP-2021 paper "Word-Level Coreference Resolution"

The Internet Archive Research Assistant - Daily search Internet Archive for new items matching your keywords

A fast and easy implementation of Transformer with PyTorch.

L3Cube-MahaCorpus a Marathi monolingual data set scraped from different internet sources.

Open Source Neural Machine Translation in PyTorch

Repository for fine-tuning Transformers 🤗 based seq2seq speech models in JAX/Flax.

Sentiment Analysis Project using Count Vectorizer and TF-IDF Vectorizer

本项目是作者们根据个人面试和经验总结出的自然语言处理(NLP)面试准备的学习笔记与资料，该资料目前包含自然语言处理各领域的面试题积累。