Code for Emergent Translation in Multi-Agent Communication

Last update: Jul 15, 2022

Related tags

Overview

Emergent Translation in Multi-Agent Communication

PyTorch implementation of the models described in the paper Emergent Translation in Multi-Agent Communication.

We present code for training and decoding both word- and sentence-level models and baselines, as well as preprocessed datasets.

Dependencies

Python

Python 2.7
PyTorch 0.2
Numpy

GPU

CUDA (we recommend using the latest version. The version 8.0 was used in all our experiments.)

Related code

For preprocessing, we used scripts from Moses and Subword-NMT.

Downloading Datasets

The original corpora can be downloaded from (Bergsma500, Multi30k, MS COCO). For the preprocessed corpora see below.

	Dataset
Bergsma500	Data
Multi30k	Data
MS COCO	Data

Before you run the code

Download the datasets and place them in /data/word (Bergsma500) and /data/sentence (Multi30k and MS COCO)
Set correct path in scr_path() from /scr/word/util.py and scr_path(), multi30k_reorg_path() and coco_path() from /src/sentence/util.py

Word-level Models

Running nearest neighbour baselines

$ python word/bergsma_bli.py

Running our models

$ python word/train_word_joint.py --l1 <L1> --l2 <L2>

where <L1> and <L2> are any of {en, de, es, fr, it, nl}

Sentence-level Models

Baseline 1 : Nearest neighbour

$ python sentence/baseline_nn.py --dataset <DATASET> --task <TASK> --src <SRC> --trg <TRG>

Baseline 2 : NMT with neighbouring sentence pairs

$ python sentence/nmt.py --dataset <DATASET> --task <TASK> --src <SRC> --trg <TRG> --nn_baseline

Baseline 3 : Nakayama and Nishida, 2017

$ python sentence/train_naka_encdec.py --dataset <DATASET> --task <TASK> --src <SRC> --trg <TRG> --train_enc_how <ENC_HOW> --train_dec_how <DEC_HOW>

where <ENC_HOW> is either two or three, and <DEC_HOW> is either img, des, or both.

Our models :

$ python sentence/train_seq_joint.py --dataset <DATASET> --task <TASK>

Aligned NMT :

$ python sentence/nmt.py --dataset <DATASET> --task <TASK> --src <SRC> --trg <TRG>

where <DATASET> is multi30k or coco, and <TASK> is either 1 or 2 (only applicable for Multi30k).

Dataset & Related Code Attribution

Moses is licensed under LGPL, and Subword-NMT is licensed under MIT License.
MS COCO and Multi30k are licensed under Creative Commons.

Citation

If you find the resources in this repository useful, please consider citing:

@inproceedings{Lee:18,
  author    = {Jason Lee and Kyunghyun Cho and Jason Weston and Douwe Kiela},
  title     = {Emergent Translation in Multi-Agent Communication},
  year      = {2018},
  booktitle = {Proceedings of the International Conference on Learning Representations},
}

Code for Emergent Translation in Multi-Agent Communication

Related tags

Overview

Emergent Translation in Multi-Agent Communication

Dependencies

Python

GPU

Related code

Downloading Datasets

Before you run the code

Word-level Models

Running nearest neighbour baselines

Running our models

Sentence-level Models

Baseline 1 : Nearest neighbour

Baseline 2 : NMT with neighbouring sentence pairs

Baseline 3 : Nakayama and Nishida, 2017

Our models :

Aligned NMT :

Dataset & Related Code Attribution

Citation

Owner

Facebook Research

Residual2Vec: Debiasing graph embedding using random graphs

ConvBERT: Improving BERT with Span-based Dynamic Convolution

This repository contains the codes for LipGAN. LipGAN was published as a part of the paper titled "Towards Automatic Face-to-Face Translation".

This repo is to provide a list of literature regarding Deep Learning on Graphs for NLP

Official Pytorch implementation of Test-Agnostic Long-Tailed Recognition by Test-Time Aggregating Diverse Experts with Self-Supervision.

This repository describes our reproducible framework for assessing self-supervised representation learning from speech

BERN2: an advanced neural biomedical namedentity recognition and normalization tool

FedNLP: A Benchmarking Framework for Federated Learning in Natural Language Processing

A Streamlit web app that generates Rick and Morty stories using GPT2.

Korea Spell Checker

RuCLIP tiny (Russian Contrastive Language–Image Pretraining) is a neural network trained to work with different pairs (images, texts).

The SVO-Probes Dataset for Verb Understanding

KoBART model on huggingface transformers

Tutorial to pretrain & fine-tune a 🤗 Flax T5 model on a TPUv3-8 with GCP

Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.

Club chatbot

This is an incredibly powerful calculator that is capable of many useful day-to-day functions.

A Multilingual Latent Dirichlet Allocation (LDA) Pipeline with Stop Words Removal, n-gram features, and Inverse Stemming, in Python.

Implementation for paper BLEU: a Method for Automatic Evaluation of Machine Translation

Code and checkpoints for training the transformer-based Table QA models introduced in the paper TAPAS: Weakly Supervised Table Parsing via Pre-training.