Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"

Last update: May 18, 2022

Related tags

Deep Learning multiDDS

Overview

Balancing Training for Multilingual Neural Machine Translation

Implementation of the paper

Balancing Training for Multilingual Neural Machine Translation

Xinyi Wang, Yulia Tsvetkov, Graham Neubig

Data:

The preprocessed and binarized data for fairseq can be downloaded here

To process data from scrach, see the script

util_scripts/prepare_multilingual_data.sh

Training Scripts:

The training scripts for many-to-one translation of the related language group (Related M2O) is under the directory job_scripts/related_ted8_m2o/.

Our methods:

MultiDDS-S:

job_scripts/related_ted8_m2o/multidds_s.sh

MultiDDS:

job_scripts/related_ted8_m2o/multidds.sh

Baselines:

Proportional:

job_scripts/related_ted8_m2o/proportional.sh

Temperature:

job_scripts/related_ted8_m2o/temperature.sh

The scripts for Related O2M is under the directory job_scripts/related_ted8_o2m/

The scripts for Diverse M2O is under the directory job_scripts/diverse_ted8_m2o/

The scripts for Diverse O2M is under the directory job_scripts/diverse_ted8_o2m/

Inference Scripts:

Each of the experiment script directory contains a trans.sh file to translate the test set. To translate the test set for the Related M2O MultiDDS-S

job_scripts/related_ted8_m2o/trans.sh checkpoints/related_ted8_m2o/multidds_s/

To translate other experiment, simply replace the argument with the experiment checkpoint directory.

Citation

Please cite as:

@inproceedings{wang2020multiDDS,
  title = {Balancing Training for Multilingual Neural Machine Translation},
  author = {Xinyi Wang, Yulia Tsvetkov, Graham Neubig},
  booktitle = {ACL},
  year = {2020},
}

Code for the paper "Balancing Training for Multilingual Neural Machine Translation, ACL 2020"

Related tags

Overview

Balancing Training for Multilingual Neural Machine Translation

Data:

Training Scripts:

Inference Scripts:

Citation

Owner

Xinyi Wang

Repository for MDPGT

This repository contains code accompanying the paper "An End-to-End Chinese Text Normalization Model based on Rule-Guided Flat-Lattice Transformer"

Repository of best practices for deep learning in Julia, inspired by fastai

Implementation of Pooling by Sliced-Wasserstein Embedding (NeurIPS 2021)

Txt2Xml tool will help you convert from txt COCO format to VOC xml format in Object Detection Problem.

Synthetic Humans for Action Recognition, IJCV 2021

RIFE - Real-Time Intermediate Flow Estimation for Video Frame Interpolation

Efficient Deep Learning Systems course

ProMP: Proximal Meta-Policy Search

FedCV: A Federated Learning Framework for Diverse Computer Vision Tasks

minimizer-space de Bruijn graphs (mdBG) for whole genome assembly

Explaining in Style: Training a GAN to explain a classifier in StyleSpace

This is the implementation of the paper "Self-supervised Outdoor Scene Relighting"

Machine learning and Deep learning models, deploy on telegram (the best social media)

UniMoCo: Unsupervised, Semi-Supervised and Full-Supervised Visual Representation Learning

An Implementation of Fully Convolutional Networks in Tensorflow.

Retinal vessel segmentation based on GT-UNet

Implementation of Analyzing and Improving the Image Quality of StyleGAN (StyleGAN 2) in PyTorch

Semantic Edge Detection with Diverse Deep Supervision

Bayesian dessert for Lasagne