Reimplementation of the paper `Human Attention Maps for Text Classification: Do Humans and Neural Networks Focus on the Same Words? (ACL2020)`

Last update: Dec 13, 2021

Overview

Human Attention for Text Classification

Re-implementation of the paper Human Attention Maps for Text Classification: Do Humans and Neural Networks Focus on the Same Words? (ACL2020).

Install requirements

$ poetry install

Download and Split Yelp dataset

Download from Yelp.com

https://www.yelp.com/dataset/download

Split the dataset

The Yelp dataset is so large that it is divided into subsets in advance.
- After that, we can get tng.jsonl, val.jsonl, and tst.jsonl from data directory.

$ allennlp split-dataset \
    --input-file data/yelp_academic_dataset_review.json \
    --output-dir data/ \
    --tng-ratio 0.8 \
    --val-ratio 0.1 \
    --tst_ratio 0.1

Preprocess HAM dataset

$ allennlp preprocess-ham-dataset \
    --ham-dataset-dir data/ham-dataset/raw_data/ \
    --output-dir data/

Train RNN model

$ CUDA_VISIBLE_DEVICES=0 allennlp train config/base.jsonnet -s outputs -o '{"trainer": {"cuda_device": 0}}'

Reference

Sen, Cansu, et al. "Human Attention Maps for Text Classification: Do Humans and Neural Networks Focus on the Same Words?." Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics. 2020.

Reimplementation of the paper `Human Attention Maps for Text Classification: Do Humans and Neural Networks Focus on the Same Words? (ACL2020)`

Related tags

Overview

Human Attention for Text Classification

Install requirements

Download and Split Yelp dataset

Download from Yelp.com

Split the dataset

Preprocess HAM dataset

Train RNN model

Reference

Owner

Shunsuke KITADA

Official implementation of UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation

A check for whether the dependency jobs are all green.

Locally Differentially Private Distributed Deep Learning via Knowledge Distillation (LDP-DL)

A curated list of awesome papers for Semantic Retrieval (TOIS Accepted: Semantic Models for the First-stage Retrieval: A Comprehensive Review).

Temporal Segment Networks (TSN) in PyTorch

COPA-SSE contains crowdsourced explanations for the Balanced COPA dataset

Universal Probability Distributions with Optimal Transport and Convex Optimization

[ICLR'21] FedBN: Federated Learning on Non-IID Features via Local Batch Normalization

Rule Based Classification Project

GarmentNets: Category-Level Pose Estimation for Garments via Canonical Space Shape Completion

FlingBot: The Unreasonable Effectiveness of Dynamic Manipulations for Cloth Unfolding

UMPNet: Universal Manipulation Policy Network for Articulated Objects

Change is Everywhere: Single-Temporal Supervised Object Change Detection in Remote Sensing Imagery (ICCV 2021)

A PyTorch implementation of SlowFast based on ICCV 2019 paper "SlowFast Networks for Video Recognition"

Assessing the Influence of Models on the Performance of Reinforcement Learning Algorithms applied on Continuous Control Tasks

An implementation of the 1. Parallel, 2. Streaming, 3. Randomized SVD using MPI4Py

The Submission for SIMMC 2.0 Challenge 2021

Cascaded Pyramid Network (CPN) based on Keras (Tensorflow backend)

Fine-tuning StyleGAN2 for Cartoon Face Generation

QuakeLabeler is a Python package to create and manage your seismic training data, processes, and visualization in a single place — so you can focus on building the next big thing.