Machine translation models released by the Gourmet project

Last update: Dec 08, 2021

Gourmet Models

Overview

The Gourmet project has released several machine translation models for translating low-resource languages. This repository contains information about the models, as well as sample code showing how they can be used. The models themselves are packaged as Docker images, which can be downloaded from a separate site (see below).

Some of the models are described in our public deliverables. See the integration report for details of model training, and the evaluation report for details of evaluation.

Using the Models

The model download links are here. Once downloaded, the model can be deployed using:

docker load <

To launch the translation server, use:

docker run -p 4000:4000 -i --rm

This exposes the model on port 4000. Docker's -p option takes the form host_port:container_port, so change the first number above if you want to use a different port on the host.

To test the model, use the sample client like this:

$ ./gourmet-client.py 
2021-11-15 14:54:10 DEBUG: __main__:  Connecting to translation server at localhost4000
2021-11-15 14:54:10 DEBUG: urllib3.connectionpool:  Starting new HTTP connection (1): localhost:4000
2021-11-15 14:54:14 DEBUG: urllib3.connectionpool:  http://localhost:4000 "POST /translation HTTP/1.1" 200 88
Translation: An gudanar da taron sauyin yanayi a Glasgow
Time: 2242
Error: None

This example uses the English-to-Hausa model. You can use the -n and -p arguments to change the host and port that the client connects to. The source text is hard-coded in the client but easily changed.
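As a rough illustration of how the host and port determine where the client connects, here is a minimal Python sketch. It is not the code of gourmet-client.py: the function names and the long option names (--host, --port) are assumptions made for this example. Only the -n and -p flags, the default localhost:4000, and the /translation path come from the text and log output above.

```python
import argparse


def build_translation_url(host: str, port: int) -> str:
    """Return the URL of the /translation endpoint seen in the log output."""
    return f"http://{host}:{port}/translation"


def parse_args(argv=None) -> argparse.Namespace:
    """Parse -n (host) and -p (port); the long option names are hypothetical."""
    parser = argparse.ArgumentParser(description="Locate a translation server")
    parser.add_argument("-n", "--host", default="localhost", help="server host name")
    parser.add_argument("-p", "--port", type=int, default=4000, help="server port")
    return parser.parse_args(argv)


# Example: the defaults used in the session shown above.
args = parse_args(["-n", "localhost", "-p", "4000"])
print(build_translation_url(args.host, args.port))
# prints http://localhost:4000/translation
```

The request body that the server expects is not documented here, so the sketch stops at building the URL; see gourmet-client.py for the actual request.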

Some of the models support additional arguments, which can be passed to the container at launch time as environment variables using docker's -e argument. See the models page for more details.
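The following Python sketch shows one way to assemble such a launch command with -e options. The function name, the image name "example-image", and the variable EXAMPLE_OPTION are placeholders invented for this example. The real image names and supported variables are listed on the models page.

```python
import shlex


def docker_run_command(image, host_port=4000, env=None, container_port=4000):
    """Build the argument list for `docker run` with optional -e variables.

    Docker's -p option maps host_port:container_port; the container side
    is assumed to stay at 4000, as in the command shown earlier.
    """
    cmd = ["docker", "run", "-p", f"{host_port}:{container_port}"]
    for key, value in (env or {}).items():
        # Each environment variable is passed as a separate -e KEY=VALUE pair.
        cmd += ["-e", f"{key}={value}"]
    cmd += ["-i", "--rm", image]
    return cmd


# Hypothetical image and variable names, for illustration only.
command = docker_run_command(
    "example-image", host_port=5000, env={"EXAMPLE_OPTION": "1"}
)
print(shlex.join(command))
# prints docker run -p 5000:4000 -e EXAMPLE_OPTION=1 -i --rm example-image
```

Building the command as a list keeps each argument separate, so it can be passed directly to subprocess.run without shell quoting issues.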

Computational Requirements

All models are configured to use the CPU for translation. A GPU is not required, and the models will not use it.

Licence

CC-BY

Acknowledgements

This project has received funding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 825299.

Owner

Edinburgh NLP
