🌐 Translation microservice powered by AI

Last update: Nov 22, 2022

Related tags

Text Data & NLP translate

Overview

Dot Translate

🌐 A microservice for quick and local translation using A.I.

This service starts a local webserver used for neural machine translation.

🚀 Features

	Dot Translate
🔒	No tracking or telemetry data is collected from you
🆓	Always free
⚡️	Fast on low-compute machines
📝	Accurate and keeps your prompt meaningful
💻	Open-source and open for contributions

For inference, all models are ran on the CPU. Every model utilized in this service are 8-bit quantized, which results in decreased latency and storage costs.

🔧 Contributing

We accept all positive contributions that affects this repository and service as a whole; we accept trained .argosmodels files via pull request.

Language	Source -> Target	Target -> Source
🇳🇱	nl -> en	en -> nl

❤️ Acknowledgements

Argos Translate, which is built on OpenNMT, is widely used in this repository for translation.

📜 Licenses

Dot Translate is licensed under the MIT license.

Sequence-to-sequence framework with a focus on Neural Machine Translation based on Apache MXNet

Sockeye This package contains the Sockeye project, an open-source sequence-to-sequence framework for Neural Machine Translation based on Apache MXNet

1.1k Dec 27, 2022

An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)

VizSeq is a Python toolkit for visual analysis on text generation tasks like machine translation, summarization, image captioning, speech translation

409 Oct 28, 2022

Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.

Summarization, translation, Q&A, text generation and more at blazing speed using a T5 version implemented in ONNX. This package is still in alpha stag

211 Dec 28, 2022

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

TextBlob: Simplified Text Processing Homepage: https://textblob.readthedocs.io/ TextBlob is a Python (2 and 3) library for processing textual data. It

7.5k Feb 17, 2021

Open Source Neural Machine Translation in PyTorch

OpenNMT-py: Open-Source Neural Machine Translation OpenNMT-py is the PyTorch version of the OpenNMT project, an open-source (MIT) neural machine trans

4.8k Feb 18, 2021

Sequence-to-sequence framework with a focus on Neural Machine Translation based on Apache MXNet

Sockeye This package contains the Sockeye project, an open-source sequence-to-sequence framework for Neural Machine Translation based on Apache MXNet

986 Feb 17, 2021

An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)

VizSeq is a Python toolkit for visual analysis on text generation tasks like machine translation, summarization, image captioning, speech translation

310 Feb 1, 2021

Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.

Summarization, translation, Q&A, text generation and more at blazing speed using a T5 version implemented in ONNX. This package is still in alpha stag

137 Feb 1, 2021

A deep learning-based translation library built on Huggingface transformers

DL Translate A deep learning-based translation library built on Huggingface transformers and Facebook's mBART-Large 💻 GitHub Repository 📚 Documentat

244 Dec 30, 2022

Comments

Great Project!

Hi!

This looks like an awesome project! LibreTranslate is great but I'm partial towards the minimalist style.

Feel free to pass relevant support requests my way, I'm normally pretty responsive on GitHub and the LibreTranslate Forum.

Best,

P.J.

opened by argosopentech 1
Todo: fallback when out-of-memory or kill

when someone toys with the api around too much, it can cause the server to go out of memory, killing the flask app. as a temporary fix, going to decrease batch size.

future reference code: https://gist.github.com/kevinxhan/6c0bbc68f2ea6b2f4a620e5413c98fb8

opened by johnpaulbin 1

Releases(v2.0)

v2.0(Dec 4, 2021)
Dot Translate 2.0 Release

Bug fixes and overall improvement:

No longer using databases (keeping it open)

Returning JSON instead of plain text (for unicode errors)

More polished

Startup Dot Translate in just 4 simple steps:

git clone https://github.com/dothq/translate.git cd translate/ sudo docker build -t translate . sudo docker run -d -p 3000:3000 translate
Source code(tar.gz)
Source code(zip)

v1.0(Nov 27, 2021)

Dot Translate 1.0 Release

Startup Dot Translate with just 4 simple commands:

git clone https://github.com/dothq/translate.git
cd translate/
sudo docker build -t translate .
sudo docker run -d -p 3000:3000 translate

Source code(tar.gz)
Source code(zip)

November-2021(Nov 25, 2021)

This release contains .argosmodel files officially released in November 2021.

This release contains: en-cy cy-en en-nl nl-en
Source code(tar.gz)
Source code(zip)
cy_en.argosmodel(60.16 MB)
en_cy.argosmodel(59.51 MB)
en_nl.argosmodel(60.22 MB)
nl_en.argosmodel(61.37 MB)

Owner

Dot HQ

🚀 Makers of the privacy-focused web browser, Dot.

GitHub Repository

This is a modification of the OpenAI-CLIP repository of moein-shariatnia

2 Mar 04, 2022

मराठी भाषा वाचविण्याचा एक प्रयास. इंग्रजी ते मराठीचा शब्दकोश. An attempt to preserve the Marathi language. A lightweight and ad free English to Marathi thesaurus.

For English, scroll down मराठी शब्द मराठी भाषा वाचवण्यासाठी मी हा ओपन सोर्स प्रोजेक्ट सुरू केला आहे. माझ्या मते, आपली भाषा हळूहळू आणि कोणाचाही लक्षात

20 Oct 11, 2022

Built for cleaning purposes in military institutions

Ferramenta do AL Construído para fins de limpeza em instituições militares. Instalação Requer python = 3.2 pip install -r requirements.txt Usagem Exe

0 Aug 13, 2022

This repository is home to the Optimus data transformation plugins for various data processing needs.

Transformers Optimus's transformation plugins are implementations of Task and Hook interfaces that allows execution of arbitrary jobs in optimus. To i

37 Dec 14, 2022

A Python 3.6+ package to run .many files, where many programs written in many languages may exist in one file.

6 May 22, 2022

MEDIALpy: MEDIcal Abbreviations Lookup in Python

A small python package that allows the user to look up common medical abbreviations.

7 Nov 09, 2022

基于百度的语音识别，用python实现，pyaudio+pyqt

Speech-recognition 基于百度的语音识别，python3.8(conda)+pyaudio+pyqt+baidu-aip 百度有面向python

1 Jan 03, 2022

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

LightSpeech UnOfficial PyTorch implementation of LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search.

54 Dec 03, 2022

A repo for materials relating to the tutorial of CS-332 NLP

CS-332-NLP A repo for materials relating to the tutorial of CS-332 NLP Contents Tutorial 1: Introduction Corpus Regular expression Tokenization Tutori

9 Feb 15, 2022

MRC approach for Aspect-based Sentiment Analysis (ABSA)

B-MRC MRC approach for Aspect-based Sentiment Analysis (ABSA) Paper: Bidirectional Machine Reading Comprehension for Aspect Sentiment Triplet Extracti

1 Apr 05, 2022

A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN

artificial intelligence cosmic love and attention fire in the sky a pyramid made of ice a lonely house in the woods marriage in the mountains lantern

2.3k Jan 01, 2023

Poetry PEP 517 Build Backend & Core Utilities

Poetry Core A PEP 517 build backend implementation developed for Poetry. This project is intended to be a light weight, fully compliant, self-containe

293 Jan 02, 2023

⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x using fastT5.

Reduce T5 model size by 3X and increase the inference speed up to 5X. Install Usage Details Functionalities Benchmarks Onnx model Quantized onnx model

399 Jan 05, 2023

Data loaders and abstractions for text and NLP

torchtext This repository consists of: torchtext.data: Generic data loaders, abstractions, and iterators for text (including vocabulary and word vecto

3.2k Dec 30, 2022

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

ParlAI (pronounced “par-lay”) is a python framework for sharing, training and testing dialogue models, from open-domain chitchat, to task-oriented dia

9.7k Jan 09, 2023

LOT: A Benchmark for Evaluating Chinese Long Text Understanding and Generation

LOT: A Benchmark for Evaluating Chinese Long Text Understanding and Generation Tasks | Datasets | LongLM | Baselines | Paper Introduction LOT is a ben

46 Dec 28, 2022

Predicting the usefulness of reviews given the review text and metadata surrounding the reviews.

Predicting Yelp Review Quality Table of Contents Introduction Motivation Goal and Central Questions The Data Data Storage and ETL EDA Data Pipeline Da

3 Nov 27, 2022

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.

Welcome to Spokestack Python! This library is intended for developing voice interfaces in Python. This can include anything from Raspberry Pi applicat

133 Sep 20, 2022

Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks

Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks. It takes raw videos/images + text as inputs, and outputs task predictions. ClipB

612 Jan 04, 2023

Text-to-Speech for Belarusian language

title emoji colorFrom colorTo sdk app_file pinned Belarusian TTS 🐸 green green gradio app.py false Belarusian TTS 📢 🤖 Belarusian TTS (text-to-speec

1 Nov 27, 2021

🌐 Translation microservice powered by AI

Related tags

Overview

Dot Translate

🚀 Features

🔧 Contributing

❤️ Acknowledgements

📜 Licenses

You might also like...

Sequence-to-sequence framework with a focus on Neural Machine Translation based on Apache MXNet

An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)

Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.

Simple, Pythonic, text processing--Sentiment analysis, part-of-speech tagging, noun phrase extraction, translation, and more.

Open Source Neural Machine Translation in PyTorch

Sequence-to-sequence framework with a focus on Neural Machine Translation based on Apache MXNet

An Analysis Toolkit for Natural Language Generation (Translation, Captioning, Summarization, etc.)

Summarization, translation, sentiment-analysis, text-generation and more at blazing speed using a T5 version implemented in ONNX.

A deep learning-based translation library built on Huggingface transformers

Comments

Great Project!

Todo: fallback when out-of-memory or kill

Releases(v2.0)

v2.0(Dec 4, 2021)

Dot Translate 2.0 Release

v1.0(Nov 27, 2021)

Dot Translate 1.0 Release

November-2021(Nov 25, 2021)

Owner

Dot HQ

This is a modification of the OpenAI-CLIP repository of moein-shariatnia

मराठी भाषा वाचविण्याचा एक प्रयास. इंग्रजी ते मराठीचा शब्दकोश. An attempt to preserve the Marathi language. A lightweight and ad free English to Marathi thesaurus.

Built for cleaning purposes in military institutions

This repository is home to the Optimus data transformation plugins for various data processing needs.

A Python 3.6+ package to run .many files, where many programs written in many languages may exist in one file.

MEDIALpy: MEDIcal Abbreviations Lookup in Python

基于百度的语音识别，用python实现，pyaudio+pyqt

LightSpeech: Lightweight and Fast Text to Speech with Neural Architecture Search

A repo for materials relating to the tutorial of CS-332 NLP

MRC approach for Aspect-based Sentiment Analysis (ABSA)

A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN

Poetry PEP 517 Build Backend & Core Utilities

⚡ boost inference speed of T5 models by 5x & reduce the model size by 3x using fastT5.

Data loaders and abstractions for text and NLP

A framework for training and evaluating AI models on a variety of openly available dialogue datasets.

LOT: A Benchmark for Evaluating Chinese Long Text Understanding and Generation

Predicting the usefulness of reviews given the review text and metadata surrounding the reviews.

Spokestack is a library that allows a user to easily incorporate a voice interface into any Python application with a focus on embedded systems.

Official PyTorch code for ClipBERT, an efficient framework for end-to-end learning on image-text and video-text tasks

Text-to-Speech for Belarusian language