Which Apple Keeps Which Doctor Away? Colorful Word Representations with Visual Oracles

Last update: Apr 14, 2022

AppleLM

Which Apple Keeps Which Doctor Away? Colorful Word Representations with Visual Oracles (TASLP 2022)

Setup

This implementation is based on Transformers.

Preparation

Download GLUE datasets

The datasets can be downloaded automatically with the script from https://github.com/nyu-mll/GLUE-baselines:

git clone https://github.com/nyu-mll/GLUE-baselines.git
python GLUE-baselines/download_glue_data.py --data_dir glue_data --tasks all

We recommend placing the glue_data folder under data/. The directory layout then looks like:

AppleLM
└───data
│   └───glue_data
│       │   CoLA/
│       │   MRPC/
│       │   ...
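
A quick way to confirm this layout before launching a run is a small check like the one below. It is only a minimal sketch: the helper name `missing_paths` and the tuple of required tasks are assumptions made for illustration and are not part of the repository.

```python
from pathlib import Path

# Task folders named in the layout above; extend the tuple with other GLUE tasks as needed.
EXPECTED_TASKS = ("CoLA", "MRPC")


def missing_paths(root, tasks=EXPECTED_TASKS):
    """Return the expected task directories that do not exist under root/data/glue_data."""
    glue_dir = Path(root) / "data" / "glue_data"
    return [str(glue_dir / task) for task in tasks if not (glue_dir / task).is_dir()]


if __name__ == "__main__":
    # Run from the repository root (assumed here to be the AppleLM folder).
    missing = missing_paths(".")
    if missing:
        print("Missing task folders:")
        for path in missing:
            print("  " + path)
    else:
        print("GLUE layout looks complete.")
```

An empty result means every listed task folder sits where the training command expects it.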

Visual Features

Pre-extracted visual features can be downloaded from Google Drive; they are borrowed from the Multi30K repo.

The image embedding layer looks up these features by index. Extract train-resnet50-avgpool.npy and put it in the data/ folder.
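
One way to picture the indexing step: if line i of a caption file corresponds to row i of the feature matrix, a word-to-row lookup table lets a model fetch image features for the words in a sentence. The sketch below builds such a table in plain Python. The line-to-row alignment, the function names `build_word_index` and `lookup_rows`, the `max_rows` default, and the toy captions are all assumptions for illustration; the repository's actual retrieval logic may differ.

```python
from collections import defaultdict


def build_word_index(captions, stopwords=frozenset()):
    """Map each non-stopword token to the caption line numbers (feature rows) containing it."""
    index = defaultdict(list)
    for row, caption in enumerate(captions):
        # set() avoids adding the same row twice when a word repeats in one caption
        for token in sorted(set(caption.lower().split())):
            if token not in stopwords:
                index[token].append(row)
    return dict(index)


def lookup_rows(sentence, index, max_rows=3):
    """Collect up to max_rows distinct feature rows for the words in a sentence."""
    rows = []
    for token in sentence.lower().split():
        for row in index.get(token, []):
            if row not in rows:
                rows.append(row)
            if len(rows) == max_rows:
                return rows
    return rows


# Toy captions standing in for a tokenized caption file (hypothetical data).
captions = ["a man eats a red apple", "a doctor talks to a man", "two dogs run"]
index = build_word_index(captions, stopwords={"a", "to"})
print(lookup_rows("the doctor likes an apple", index))
```

The returned row numbers would then select rows of the feature matrix, for example after loading the .npy file into an array.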

Training & Evaluation

export GLUE_DIR=data/glue_data/
export CUDA_VISIBLE_DEVICES="0"
export TASK_NAME=CoLA
python ./examples/run_glue_visual-tfidf_att.py \
    --model_type bert \
    --model_name_or_path bert-large-uncased-whole-word-masking \
    --task_name $TASK_NAME \
    --do_eval \
    --do_lower_case \
    --data_dir $GLUE_DIR/$TASK_NAME \
    --max_seq_length 128 \
    --per_gpu_eval_batch_size=32   \
    --per_gpu_train_batch_size=16   \
    --learning_rate 1e-5 \
    --eval_all_checkpoints \
    --save_steps 500 \
    --max_steps 5336 \
    --warmup_steps 320 \
    --image_dir data/train.lc.norm.tok.en \
    --image_embedding_file data/train-resnet50-avgpool.npy \
    --num_img 3 \
    --tfidf 5 \
    --image_merge att-gate \
    --stopwords_dir data/stopwords-en.txt \
    --output_dir experiments/CoLA_bert_wwm
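
For intuition about `--learning_rate`, `--warmup_steps`, and `--max_steps`: the stock Transformers GLUE example pairs these flags with a linear warmup followed by a linear decay to zero. Assuming this script keeps that schedule (which is not confirmed here), the learning rate at a given step can be sketched as follows. The function name `linear_schedule_lr` is hypothetical.

```python
def linear_schedule_lr(step, base_lr=1e-5, warmup_steps=320, max_steps=5336):
    """Learning rate under linear warmup then linear decay to zero (assumed schedule)."""
    if step < warmup_steps:
        # Ramp up from 0 to base_lr over the warmup phase
        return base_lr * step / max(1, warmup_steps)
    # Decay linearly from base_lr at the end of warmup to 0 at max_steps
    remaining = max(0, max_steps - step)
    return base_lr * remaining / max(1, max_steps - warmup_steps)


for step in (0, 160, 320, 2828, 5336):
    print(step, linear_schedule_lr(step))
```

Under this assumption the rate peaks at the base value when warmup ends (step 320) and reaches zero at step 5336, so checkpoints saved late in training see very small updates.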

Reference

Please kindly cite this paper in your publications if it helps your research:

@ARTICLE{zhang2022which,
  author={Zhang, Zhuosheng and Yu, Haojie and Zhao, Hai and Utiyama, Masao},
  journal={IEEE/ACM Transactions on Audio, Speech, and Language Processing}, 
  title={Which Apple Keeps Which Doctor Away? Colorful Word Representations With Visual Oracles}, 
  year={2022},
  volume={30},
  number={},
  pages={49-59},
  doi={10.1109/TASLP.2021.3130972}
}

Owner

Zhuosheng Zhang
