Translate

Command-line interface to translation pipelines, powered by Huggingface transformers. This tool can download translation models, and then using them to translate sentences offline. By default, tries using models from Helsinki-NLP (each model is about 300MB large).

Install

$ git clone https://github.com/Teuze/translate
$ cd translate
$ pip3 install --user -r requirements.py

If you want to be able to use this script from anywhere in your system, you can symlink or copy the translate script file into one of your path folders, like for example $HOME/.local/bin.

Usage

Listing available and installed translation models :

$ # Also available on https://huggingface.co/models
$ ./translate model list online | less
$ ./translate model list local | less

Downloading models :

$ ./translate download model "Helsinki-NLP/opus-mt-en-es"
$ ./translate download model "Helsinki-NLP/opus-mt-fr-en"

Using models to translate from CLI arguments or from standard input :

$ ./translate text -e "Helsinki-NLP/opus-mt-en-es" "Hello World!"
¡Hola Mundo!
$ echo "Ceci est une phrase d'exemple simple" | ./translate text -s fr -t en
This is a simple example sentence

Partially offline multi-language translator built upon Huggingface transformers.

Related tags

Overview

Translate

Install

Usage

Owner

Richard Jarry

Code to use Augmented Shapiro Wilks Stopping, as well as code for the paper "Statistically Signifigant Stopping of Neural Network Training"

⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).

Built for cleaning purposes in military institutions

Generate vector graphics from a textual caption

NeuTex: Neural Texture Mapping for Volumetric Neural Rendering

Python powered crossword generator with database with 20k+ polish words

A programming language with logic of Python, and syntax of all languages.

PyTorch implementation of the paper: Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding

Write Alphabet, Words and Sentences with your eyes.

Beyond Paragraphs: NLP for Long Sequences

StarGAN - Official PyTorch Implementation

Partially offline multi-language translator built upon Huggingface transformers.

A CRM department in a local bank works on classify their lost customers with their past datas. So they want predict with these method that average loss balance and passive duration for future.

A Japanese tokenizer based on recurrent neural networks

This project uses word frequency and Term Frequency-Inverse Document Frequency to summarize a text.

Search with BERT vectors in Solr and Elasticsearch

Wake: Context-Sensitive Automatic Keyword Extraction Using Word2vec

Shared code for training sentence embeddings with Flax / JAX

Syntax-aware Multi-spans Generation for Reading Comprehension (TASLP 2022)

Deep Learning Topics with Computer Vision & NLP