UIT-ViSD4SA PACLIC 35

General Introduction

This repository contains the data of the paper: Span Detection for Vietnamese Aspect-Based Sentiment Analysis.

UIT-ViSD4SA is a benchmark Vietnamese smartphone feedback dataset for ABSA and span detection. UIT-ViSD4SA consisting of 35,396 human-annotated spans on 11,122 feedback comments, and each is manually annotated according to its spans towards ten fine-grained aspect categories with sentiment polarities. We split the dataset into a training set (7,784), a development set (1,113) and a test set (2,225) randomly.

Data Example

Read File

!pip install jsonlines

import jsonlines

data = []

with jsonlines.open('train.jsonl') as f:

    for line in f.iter():
       
        data.append((line['text'], {'labels': line['labels']}))

Citation

Please cite the following paper if you found it useful in your work.

@misc{nguyen2021span,
      title={Span Detection for Aspect-Based Sentiment Analysis in Vietnamese}, 
      author={Kim Thi-Thanh Nguyen and Sieu Khai Huynh and Luong Luc Phan and Phuc Huynh Pham and Duc-Vu Nguyen and Kiet Van Nguyen},
      year={2021},
      eprint={2110.07833},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

ViSD4SA, a Vietnamese Span Detection for Aspect-based sentiment analysis dataset

Related tags

Overview

UIT-ViSD4SA PACLIC 35

General Introduction

Data Example

Read File

Citation

Contact

Owner

Nguyễn Thị Thanh Kim

GUI for TOAD-GAN, a PCG-ML algorithm for Token-based Super Mario Bros. Levels.

PyTorch code accompanying the paper "Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning" (NeurIPS 2021).

Official implementation for Multi-Modal Interaction Graph Convolutional Network for Temporal Language Localization in Videos

Abstractive opinion summarization system (SelSum) and the largest dataset of Amazon product summaries (AmaSum). EMNLP 2021 conference paper.

Instance-wise Feature Importance in Time (FIT)

Churn-Prediction-Project - In this project, a churn prediction model is developed for a private bank as a term project for Data Mining class.

Implementation of a Transformer that Ponders, using the scheme from the PonderNet paper

Code and data for ACL2021 paper Cross-Lingual Abstractive Summarization with Limited Parallel Resources.

TransCD: Scene Change Detection via Transformer-based Architecture

Model-based Reinforcement Learning Improves Autonomous Racing Performance

Video lie detector using xgboost - A video lie detector using OpenFace and xgboost

Code for paper: Group-CAM: Group Score-Weighted Visual Explanations for Deep Convolutional Networks

Simulating Sycamore quantum circuits classically using tensor network algorithm.

Tool for working with Y-chromosome data from YFull and FTDNA

GNEE - GAT Neural Event Embeddings

A hyperparameter optimization framework

A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.

Adversarial Self-Defense for Cycle-Consistent GANs

Code for this paper The Lottery Ticket Hypothesis for Pre-trained BERT Networks.

本项目是一个带有前端界面的垃圾分类项目，加载了训练好的模型参数，模型为efficientnetb4，暂时为40分类问题。