Code for the paper "Adapting Monolingual Models: Data can be Scarce when Language Similarity is High"

Last update: Aug 02, 2021

Overview

Wietse de Vries • Martijn Bartelds • Malvina Nissim • Martijn Wieling

Adapting Monolingual Models: Data can be Scarce when Language Similarity is High

This repository contains everything that is needed to replicate the results in the paper:

📝 Adapting Monolingual Models: Data can be Scarce when Language Similarity is High

Models

The best fine-tuned models for Gronings and West Frisian are available on the HuggingFace model hub:

Lexical layers

These models are identical to BERTje, but with different lexical layers (bert.embeddings.word_embeddings).

🤗 GroNLP/bert-base-dutch-cased (Dutch; source language)
🤗 GroNLP/bert-base-dutch-cased-gronings (Gronings)
🤗 GroNLP/bert-base-dutch-cased-frisian (West Frisian)
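
As a minimal sketch of how these checkpoints might be loaded, the snippet below uses the `transformers` library's `AutoTokenizer` and `AutoModel` classes. The hub IDs are copied from the list above; the dictionary keys and the helper names (`load_lexical_model`, `word_embedding_shape`) are illustrative assumptions and are not part of this repository.

```python
# Hub IDs copied from the list above; the keys are illustrative labels only.
LEXICAL_MODELS = {
    "dutch": "GroNLP/bert-base-dutch-cased",
    "gronings": "GroNLP/bert-base-dutch-cased-gronings",
    "frisian": "GroNLP/bert-base-dutch-cased-frisian",
}


def load_lexical_model(language):
    """Return (tokenizer, model) for one of the languages listed above.

    Hypothetical helper: it downloads the checkpoint from the model hub,
    so it needs network access and the `transformers` package.
    """
    # Imported lazily so the mapping can be inspected without transformers.
    from transformers import AutoModel, AutoTokenizer

    model_id = LEXICAL_MODELS[language]
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModel.from_pretrained(model_id)
    return tokenizer, model


def word_embedding_shape(model):
    """Return the shape of the lexical layer (the input word embeddings)."""
    # get_input_embeddings() returns the word embedding module of the model.
    return tuple(model.get_input_embeddings().weight.shape)
```

Calling `load_lexical_model("gronings")` and passing the returned model to `word_embedding_shape` gives a quick way to inspect the lexical layer that distinguishes each checkpoint from BERTje.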

POS tagging

These models share the same fine-tuned Transformer layers and classification head, combined with the retrained lexical layers from the models above.

🤗 GroNLP/bert-base-dutch-cased-upos-alpino (Dutch)
🤗 GroNLP/bert-base-dutch-cased-upos-alpino-gronings (Gronings)
🤗 GroNLP/bert-base-dutch-cased-upos-alpino-frisian (West Frisian)
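
The taggers above could be tried out with the `transformers` token-classification pipeline, roughly as sketched below. The hub IDs come from the list above; the dictionary keys and the helpers `load_pos_tagger` and `format_tags` are hypothetical names introduced for illustration. This is not the evaluation code used in the paper.

```python
# Hub IDs copied from the list above; the keys are illustrative labels only.
POS_MODELS = {
    "dutch": "GroNLP/bert-base-dutch-cased-upos-alpino",
    "gronings": "GroNLP/bert-base-dutch-cased-upos-alpino-gronings",
    "frisian": "GroNLP/bert-base-dutch-cased-upos-alpino-frisian",
}


def load_pos_tagger(language):
    """Build a token-classification pipeline for the given language.

    Hypothetical helper: it downloads the checkpoint from the model hub,
    so it needs network access and the `transformers` package.
    """
    # Imported lazily so the helpers below work without transformers.
    from transformers import pipeline

    return pipeline("token-classification", model=POS_MODELS[language])


def format_tags(predictions):
    """Reduce pipeline output to (token, tag) pairs.

    Each prediction is a dict with at least the keys "word" and "entity".
    Tokens are subword pieces, so one word may yield several pairs.
    """
    return [(p["word"], p["entity"]) for p in predictions]
```

A typical call would be `format_tags(load_pos_tagger("frisian")(sentence))`, where `sentence` is a plain string in the target language.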

Development

Conda/mamba dependencies are listed in environment.yml. This repository contains all scripts and configs that are needed to replicate the results in the paper. A more extensive usage guide will be provided later.

BibTeX entry

The paper is to appear in Findings of ACL 2021. The preprint can be cited as:

@misc{devries2021adapting,
      title={{Adapting Monolingual Models: Data can be Scarce when Language Similarity is High}}, 
      author={Wietse de Vries and Martijn Bartelds and Malvina Nissim and Martijn Wieling},
      year={2021},
      eprint={2105.02855},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

Owner

Wietse de Vries