The implementation of Parameter Differentiation based Multilingual Neural Machine Translation

Last update: Dec 17, 2022

Overview

The implementation of Parameter Differentiation based Multilingual Neural Machine Translation .

Requirement:

apex
fairseq
scikit-learn
pytorch

Process data following https://github.com/pytorch/fairseq/tree/main/examples/translation#multilingual-translation.
Training:

data_bin=    # data path 
lang_pairs=  # comma separated language pairs

fairseq-train $data_path \
    --task parameter_differentiation_task --lang-pairs $lang_pairs --encoder-langtok tgt \
    --criterion label_smoothed_cross_entropy --label-smoothing 0.1 \
    --optimizer adam --lr 0.0015 --adam-betas '(0.9,0.98)' \
    --lr-scheduler inverse_sqrt --warmup-updates 4000 --warmup-init-lr 1e-07 \
    --arch parameter_differentiation_base_model \
    --max-tokens 8192 \
    --user-dir $PWD

Decoding

source_lang=
target_lang=
model_path=
fairseq-generate $data_path --path $model_path \
    --task parameter_differentiation_task --lang-pairs $lang_pairs --encoder-langtok tgt \
    --beam 4 --lenpen 0.6 --remove-bpe sentencepiece \
    --source-lang $source_lang --target-lang $target_lang > result.$source_lang-$target_lang.txt

The implementation of Parameter Differentiation based Multilingual Neural Machine Translation

Related tags

Overview

Owner

Qian Wang

Sploitus - Command line search tool for sploitus.com. Think searchsploit, but with more POCs

PyTorch implementation and pretrained models for XCiT models. See XCiT: Cross-Covariance Image Transformer

Indonesia spellchecker with python

Yet another Python binding for fastText

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

pkuseg多领域中文分词工具; The pkuseg toolkit for multi-domain Chinese word segmentation

Finetune gpt-2 in google colab

HuggingSound: A toolkit for speech-related tasks based on HuggingFace's tools

[NeurIPS 2021] Code for Learning Signal-Agnostic Manifolds of Neural Fields

Text classification is one of the popular tasks in NLP that allows a program to classify free-text documents based on pre-defined classes.

The official repository of the ISBI 2022 KNIGHT Challenge

Residual2Vec: Debiasing graph embedding using random graphs

HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis

Syntax-aware Multi-spans Generation for Reading Comprehension (TASLP 2022)

Trex is a tool to match semantically similar functions based on transfer learning.

Implementation of TTS with combination of Tacotron2 and HiFi-GAN

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

Sentence Embeddings with BERT & XLNet

Get list of common stop words in various languages in Python

This repository contains the code, data, and models of the paper titled "CrossSum: Beyond English-Centric Cross-Lingual Abstractive Text Summarization for 1500+ Language Pairs".