Finetuning Pipeline

Last update: Dec 13, 2022

Related tags

Overview

KLUE Baseline

KLUE-baseline contains the baseline code for the Korean Language Understanding Evaluation (KLUE) benchmark. See our paper for more details about KLUE and the baselines.

Dependencies

Make sure you have installed the packages listed in requirements.txt.

pip install -r requirements.txt

All expereiments are tested under Python 3.7 environment.

KLUE Benchmark Datasets

All train/dev sets of KLUE tasks are publicly available in this repo. You can access them by using git submodules. To clone the repo with datasets:

git clone --recursive https://github.com/KLUE-benchmark/KLUE-Baseline.git

or just download datasets after cloned this repo:

git submodule update --init --recursive

All test sets are not publicly available. To measure performance of your model on test set, you should first train your model on train set and submit the model to our submission system. Alternatively, you can compare dev set performances with our baseline models. They are also reported in our paper.

Train

To reproduce our baselines, run run_all.sh.

NOTE: klue/roberta models accept input length at most 510 tokens. Details are explained here.

Reference

If you use this code or KLUE, please cite:

@misc{park2021klue,
      title={KLUE: Korean Language Understanding Evaluation}, 
      author={Sungjoon Park and Jihyung Moon and Sungdong Kim and Won Ik Cho and Jiyoon Han and Jangwon Park and Chisung Song and Junseong Kim and Yongsook Song and Taehwan Oh and Joohong Lee and Juhyun Oh and Sungwon Lyu and Younghoon Jeong and Inkwon Lee and Sangwoo Seo and Dongjun Lee and Hyunwoo Kim and Myeonghwa Lee and Seongbo Jang and Seungwon Do and Sunkyoung Kim and Kyungtae Lim and Jongwon Lee and Kyumin Park and Jamin Shin and Seonghyun Kim and Lucy Park and Alice Oh and Jung-Woo Ha and Kyunghyun Cho},
      year={2021},
      eprint={2105.09680},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

Contribution

Feel free to leave issues if there are any questions or comments. To contribute, please run make style before creating pull requests.

Finetuning Pipeline

Related tags

Overview

KLUE Baseline

Dependencies

KLUE Benchmark Datasets

Train

Reference

Contribution

Owner

The official pytorch implemention of the CVPR paper "Temporal Modulation Network for Controllable Space-Time Video Super-Resolution".

AITUS - An atomatic notr maker for CYTUS

Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit

Everything about being a TA for ITP/AP course!

Learning 3D Part Assembly from a Single Image

AI创造营：Metaverse启动机之重构现世，结合PaddlePaddle 和 Wechaty 创造自己的聊天机器人

AutoML library for deep learning

JDet is Object Detection Framework based on Jittor.

Training Very Deep Neural Networks Without Skip-Connections

Official implementation of "Generating 3D Molecules for Target Protein Binding"

Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!

Emotional conditioned music generation using transformer-based model.

Code for the CIKM 2019 paper "DSANet: Dual Self-Attention Network for Multivariate Time Series Forecasting".

Image process framework based on plugin like imagej, it is esay to glue with scipy.ndimage, scikit-image, opencv, simpleitk, mayavi...and any libraries based on numpy

“英特尔创新大师杯”深度学习挑战赛赛道3：CCKS2021中文NLP地址相关性任务

MILK: Machine Learning Toolkit

MINERVA: An out-of-the-box GUI tool for offline deep reinforcement learning

Pytorch Implementation for NeurIPS (oral) paper: Pixel Level Cycle Association: A New Perspective for Domain Adaptive Semantic Segmentation

This repository contains the map content ontology used in narrative cartography

wmctrl ported to Python Ctypes

Finetuning Pipeline

Related tags

Overview

KLUE Baseline

Dependencies

KLUE Benchmark Datasets

Train

Reference

Contribution

Owner

The official pytorch implemention of the CVPR paper "Temporal Modulation Network for Controllable Space-Time Video Super-Resolution".

AITUS - An atomatic notr maker for CYTUS

Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit

Everything about being a TA for ITP/AP course!

Learning 3D Part Assembly from a Single Image

AI创造营 ：Metaverse启动机之重构现世，结合PaddlePaddle 和 Wechaty 创造自己的聊天机器人

AutoML library for deep learning

JDet is Object Detection Framework based on Jittor.

Training Very Deep Neural Networks Without Skip-Connections

Official implementation of "Generating 3D Molecules for Target Protein Binding"

Capture all information throughout your model's development in a reproducible way and tie results directly to the model code!

Emotional conditioned music generation using transformer-based model.

Code for the CIKM 2019 paper "DSANet: Dual Self-Attention Network for Multivariate Time Series Forecasting".

Image process framework based on plugin like imagej, it is esay to glue with scipy.ndimage, scikit-image, opencv, simpleitk, mayavi...and any libraries based on numpy

“英特尔创新大师杯”深度学习挑战赛 赛道3：CCKS2021中文NLP地址相关性任务

MILK: Machine Learning Toolkit

MINERVA: An out-of-the-box GUI tool for offline deep reinforcement learning

Pytorch Implementation for NeurIPS (oral) paper: Pixel Level Cycle Association: A New Perspective for Domain Adaptive Semantic Segmentation

This repository contains the map content ontology used in narrative cartography

wmctrl ported to Python Ctypes

AI创造营：Metaverse启动机之重构现世，结合PaddlePaddle 和 Wechaty 创造自己的聊天机器人

“英特尔创新大师杯”深度学习挑战赛赛道3：CCKS2021中文NLP地址相关性任务