Takes a string and puts it through different languages in Google Translate a requested amount of times, returning nonsense.

Last update: Aug 29, 2022

Related tags

Overview

PythonTextObfuscator

Takes a string and puts it through different languages in Google Translate a requested amount of times, returning nonsense.

Requirements:

python3

For the Selenium Obfuscator:

    -Selenium
    
    -Firefox
    
    -Geckodriver

In the Selenium Obfuscator:

-The major benefit is that you can translate excel documents, the downside is that after 10 or so document translations, Google blocks your ip for a while.

-Translation is generally slower and more limited using selenium as a browser tab is being used to scrape the data. Also beware of RAM usage.

-May no longer be supported in the future due to its drawbacks.

In the Urllib Obfuscator:

-Translation is generally faster and uses very little resources as only html is downloaded through a request. Multiprocessing also allows simultanious requests and can be used to the full extent without worrying about RAM usage.

—Split by length is faster and uses less requests (better for longer texts)

—Split by newline is slower and uses more requests but adds much more translation variety.

-Reminder: Since google has a url request limit, you'll need to switch VPN locations when the request limit is hit.

    ——Don't worry too much though, as it takes quite a bit of requests to get to that point, and the block only lasts for around an hour.

Translate - a PyTorch Language Library

NOTE PyTorch Translate is now deprecated, please use fairseq instead. Translate - a PyTorch Language Library Translate is a library for machine transl

678 Feb 15, 2021

Auto translate textbox from Japanese to English or Indonesia

priconne-auto-translate Auto translate textbox from Japanese to English or Indonesia How to use Install python first, Anaconda is recommended Install

5 Aug 25, 2022

translate using your voice

speech-to-text-translator Usage translate using your voice description this project makes translating a word easy, all you have to do is speak and...

1 Oct 18, 2021

translate using your voice

speech-to-text-translator Usage translate using your voice description this project makes translating a word easy, all you have to do is speak and...

1 Oct 18, 2021

This program do translate english words to portuguese

Python-Dictionary This program is used to translate english words to portuguese. Web-Scraping This program use BeautifulSoap to make web scraping, so

1 Oct 10, 2022

Translate U is capable of translating the text present in an image from one language to the other.

Translate U is capable of translating the text present in an image from one language to the other. The app uses OCR and Google translate to identify and translate across 80+ languages.

1 Dec 22, 2021

Graphical user interface for Argos Translate

Argos Translate GUI Website | GitHub | PyPI Graphical user interface for Argos Translate. Install pip3 install argostranslategui

16 Dec 7, 2022

Use the state-of-the-art m2m100 to translate large data on CPU/GPU/TPU. Super Easy!

Easy-Translate is a script for translating large text files in your machine using the M2M100 models from Facebook/Meta AI. We also privide a script fo

41 Dec 15, 2022

Search for documents in a domain through Google. The objective is to extract metadata

MetaFinder - Metadata search through Google _____ __ ___________ .__ .___ / \

85 Dec 16, 2022

Comments

Attempt to decode JSON with unexpected mimetype: text/plain

I'm not sure what's causing this, as the last time I tried this release, this issue was not present. If it's accessing content server-side, then it might be that the server has had a config change resulting in it returning a different mimetype?

I get the error message below consistently in the console, with %2E being added to the end of the URL each time. It does seem like some translation does happen; in this case, I inputted "Test", and the URL ended with "Hlola".

https://translate.alefvanoon.xyz/api/v1/zu/mi/Hlola%2E 0, message='Attempt to decode JSON with unexpected mimetype: text/plain; charset=utf-8', url=URL('https://translate.alefvanoon.xyz/api/v1/zu/mi/Hlola')

From what I've gathered looking online, the issue lies in either line 13, line 469, or both.

return (await response.json())['translation'].replace('/','⁄')

text = (await response.json())['translation'].replace('/','⁄')

Some of the solutions online referred to adding "content_type=None" or "content_type='text/plain'" into the brackets after "json", but this only seemed to cause further issues for me.

opened by UltraHylia 2
Program Freezes Up and Looping Error

When you have Chinese (Simplified) and/or Chinese (Traditional) enabled in the language selector, the program can freeze and an error loops in the console. It happens no matter what other languages are enabled.

https://user-images.githubusercontent.com/60769253/197659506-38871035-e311-4710-9eb9-ac2d7387841f.mp4

opened by DerpTaco99921 0

Releases(v0.4)

v0.4(Feb 2, 2022)

Rebuilt from the ground up with a new GUI and translation method.

Changes:

-Improved GUI.

-Translations are retrieved from a front-end to Google Translate called Lingva, which removes the issue with being blocked for doing too many requests.

-Translations are done in an asynchronous function using aiohttp instead of a process pool, which is optimal for large bulk translations.

-Removed selenium obfuscation.

Additions: -Importing and saving text files. -Language Selector to activate or deactivate any individual language. -Language setting for the result. -Three different split methods: ____-Initial ________-Text is split by length before being passed into the obfuscate function. ________-Faster as less requests are made. ________-Different languages for each piece. ________-Tabs not preserved. ____-Continuous ________-Text is split by length inside the obfuscate function. ________-Faster as less requests are made. ________-Same languages for each piece. ________-Tabs not preserved. ____-Newline ________-Text is split by newlines and tabs. ________-Slower as more requests are made. ________-Every single line is translated with different languages. ________-Tabs preserved. -Translation Generator which creates a .csv file containing multiple translations of the same text: ____-Repeat mode obfuscates the original text each time, adding the result in each new column. ____-Continue mode obfuscates the results from each subsequent obfuscation, adding the result in each new column.
Source code(tar.gz)
Source code(zip)
Python.Text.Obfuscator.v0.4.zip(15.75 KB)
v0.3.1c-r2(Dec 23, 2021)

Recursive error hotfix; more needs to be done when I get a chance.
Source code(tar.gz)
Source code(zip)
Python.Text.Obfuscator.v0.3.1c-r2.zip(51.72 KB)
v0.3.1c(Dec 23, 2021)

Newlines no longer get messed up in Urllib Obfuscator. Added a choice to split by length or by newlines. —Split by length is faster and uses less requests (better for longer texts) —Split by newline is slower and uses more requests but adds much more translation variety. Reminder: Since google has a URL request limit, you'll need to switch VPN locations when the request limit is hit.
Source code(tar.gz)
Source code(zip)
Python.Text.Obfuscator.v0.3.1c.zip(51.63 KB)
v0.3.1b(Dec 23, 2021)

Fixed some problems with the excel obfuscator.
Source code(tar.gz)
Source code(zip)
Python.Text.Obfuscator.v0.3.1b.zip(15.45 KB)
v0.3.1a(Dec 23, 2021)

Both urllib and selenium obfuscators (including excel obfuscator). Fixes many issues.
Source code(tar.gz)
Source code(zip)
Python.Text.Obfuscator.v0.3.1.zip(14.38 KB)
v0.3(Dec 23, 2021)

I made massive improvements to the speed of the obfuscation thanks to learning about urllib.

For example, I did translated the same ~2300 character long string of text 10 times in the old and new version; the old one took 38.8 seconds while the new one took only 6.8 seconds.

In addition, the capacity to add a larger amount of characters is far increased as it doesn't require Firefox tabs to be open and eating up ram.

As a test I translated the entire Among Us Wikipedia page 50 times (with a character count of over 60 thousand!), and it only took only 114 seconds to finish translating. Using the old obfuscator I wouldn't be able to translate more than half that amount, and it would take ages to complete (Like 10 mins or more).

Unfortunately for this version the Excel Obfuscator is removed until I can figure out how to get it to work in urllib, if I can't then I'll probably add it back it with Selenium.

At least if you couldn't get selenium to work on your computer for the previous versions you don't have to worry about getting it for this.
Source code(tar.gz)
Source code(zip)
Python.Text.Obfuscator.v0.3.zip(5.73 KB)
v0.2.2(Dec 23, 2021)

Made some improvements to the Excel Translator, such as better error handling.
Source code(tar.gz)
Source code(zip)
Python.Text.Obfuscator.v0.2.2.zip(6.09 KB)
v0.2.1b(Dec 23, 2021)

Small update: fixes stuck at accept google policy.
Source code(tar.gz)
Source code(zip)
Python.Text.Obfuscator.v0.2.1b.zip(5.92 KB)
v0.2.1a(Dec 23, 2021)

Fixed TimeoutExceptions for the string translations (textbox input) obfuscation. You can now do as many translations as you want without worrying about encountering an error. Same for amount of characters (as long as your PC can handle of course). As for excel translations they remain unchanged — since I can't do anything about Google's Document translation limit — so just switch locations on VPN like usual after 10 translations for the Excel Obfuscator.
Source code(tar.gz)
Source code(zip)
Python.Text.Obfuscator.v0.2.1.zip(5.88 KB)
v0.2(Dec 23, 2021)

Excel documents can be auto obfuscated now; just make sure you have a VPN ready since Google puts limits on document translation.
Source code(tar.gz)
Source code(zip)
Python.Text.Obfuscator.v0.2.zip(4.98 KB)
v0.1b(Dec 23, 2021)

Fixed some problems like the formatting breaking (linebreaks not preserved, etc.)
Source code(tar.gz)
Source code(zip)
Python.Text.Obfuscator.v0.1b.zip(3.18 KB)
v0.1a(Dec 23, 2021)

First version; used Selenium, Gecko Driver, and Firefox.
Source code(tar.gz)
Source code(zip)
Python.Text.Obfuscator.v0.1.zip(3.43 KB)

Owner

GitHub Repository

Active learning for text classification in Python

Active Learning allows you to efficiently label training data in a small-data scenario.

375 Dec 28, 2022

Wind Speed Prediction using LSTMs in PyTorch

Implementation of Deep-Forecast using PyTorch Deep Forecast: Deep Learning-based Spatio-Temporal Forecasting Adapted from original implementation Setu

151 Dec 14, 2022

Sentiment-Analysis and EDA on the IMDB Movie Review Dataset

Sentiment-Analysis and EDA on the IMDB Movie Review Dataset The main part of the work focuses on the exploration and study of different approaches whi

1 Jan 12, 2022

BERT score for text generation

BERTScore Automatic Evaluation Metric described in the paper BERTScore: Evaluating Text Generation with BERT (ICLR 2020). News: Features to appear in

1k Jan 08, 2023

Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.

TextDistance TextDistance -- python library for comparing distance between two or more sequences by many algorithms. Features: 30+ algorithms Pure pyt

3k Jan 06, 2023

KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)

KoGPT KoGPT (Korean Generative Pre-trained Transformer) https://github.com/kakaobrain/kogpt https://huggingface.co/kakaobrain/kogpt Model Descriptions

797 Dec 26, 2022

SAINT PyTorch implementation

SAINT-pytorch A Simple pyTorch implementation of "Towards an Appropriate Query, Key, and Value Computation for Knowledge Tracing" based on https://arx

63 Dec 25, 2022

PyTorch implementation of NATSpeech: A Non-Autoregressive Text-to-Speech Framework

A Non-Autoregressive Text-to-Speech (NAR-TTS) framework, including official PyTorch implementation of PortaSpeech (NeurIPS 2021) and DiffSpeech (AAAI 2022)

760 Jan 03, 2023

GrammarTagger — A Neural Multilingual Grammar Profiler for Language Learning

GrammarTagger — A Neural Multilingual Grammar Profiler for Language Learning GrammarTagger is an open-source toolkit for grammatical profiling for lan

27 Jan 05, 2023

Transformer - A TensorFlow Implementation of the Transformer: Attention Is All You Need

[UPDATED] A TensorFlow Implementation of Attention Is All You Need When I opened this repository in 2017, there was no official code yet. I tried to i

3.8k Dec 26, 2022

A collection of models for image - text generation in ACM MM 2021.

Bi-directional Image and Text Generation UMT-BITG (image & text generator) Unifying Multimodal Transformer for Bi-directional Image and Text Generatio

63 Oct 30, 2022

Diaformer: Automatic Diagnosis via Symptoms Sequence Generation

Diaformer Diaformer: Automatic Diagnosis via Symptoms Sequence Generation (AAAI 2022) Diaformer is an efficient model for automatic diagnosis via symp

20 Dec 13, 2022

Mednlp - Medical natural language parsing and utility library

Medical natural language parsing and utility library A natural language medical

3 Aug 24, 2022

This is the offline-training-pipeline for our project.

offline-training-pipeline This is the offline-training-pipeline for our project. We adopt the offline training and online prediction Machine Learning

0 Apr 22, 2022

Japanese Long-Unit-Word Tokenizer with RemBertTokenizerFast of Transformers

Japanese-LUW-Tokenizer Japanese Long-Unit-Word (国語研長単位) Tokenizer for Transformers based on 青空文庫 Basic Usage from transformers import RemBertToken

3 Dec 22, 2021

A Python package implementing a new model for text classification with visualization tools for Explainable AI :octocat:

A Python package implementing a new model for text classification with visualization tools for Explainable AI 🍣 Online live demos: http://tworld.io/s

285 Jan 02, 2023

T‘rex Park is a Youzan sponsored project. Offering Chinese NLP and image models pretrained from E-commerce datasets

T‘rex Park is a Youzan sponsored project. Offering Chinese NLP and image models pretrained from E-commerce datasets (product titles, images, comments, etc.).

55 Nov 22, 2022

🕹 An esoteric language designed so that the program looks like the transcript of a Pokémon battle

PokéBattle is an esoteric language designed so that the program looks like the transcript of a Pokémon battle. Original inspiration and specification

9 Jan 11, 2022

Open solution to the Toxic Comment Classification Challenge

Starter code: Kaggle Toxic Comment Classification Challenge More competitions 🎇 Check collection of public projects 🎁 , where you can find multiple

153 Jun 22, 2022

Global Rhythm Style Transfer Without Text Transcriptions

Global Prosody Style Transfer Without Text Transcriptions This repository provides a PyTorch implementation of AutoPST, which enables unsupervised glo

193 Dec 30, 2022

Takes a string and puts it through different languages in Google Translate a requested amount of times, returning nonsense.

Related tags

Overview

PythonTextObfuscator

You might also like...

Translate - a PyTorch Language Library

Auto translate textbox from Japanese to English or Indonesia

translate using your voice

translate using your voice

This program do translate english words to portuguese

Translate U is capable of translating the text present in an image from one language to the other.

Graphical user interface for Argos Translate

Use the state-of-the-art m2m100 to translate large data on CPU/GPU/TPU. Super Easy!

Search for documents in a domain through Google. The objective is to extract metadata

Comments

Attempt to decode JSON with unexpected mimetype: text/plain

Program Freezes Up and Looping Error

Releases(v0.4)

v0.4(Feb 2, 2022)

v0.3.1c-r2(Dec 23, 2021)

v0.3.1c(Dec 23, 2021)

v0.3.1b(Dec 23, 2021)

v0.3.1a(Dec 23, 2021)

v0.3(Dec 23, 2021)

v0.2.2(Dec 23, 2021)

v0.2.1b(Dec 23, 2021)

v0.2.1a(Dec 23, 2021)

v0.2(Dec 23, 2021)

v0.1b(Dec 23, 2021)

v0.1a(Dec 23, 2021)

Owner

Active learning for text classification in Python

Wind Speed Prediction using LSTMs in PyTorch

Sentiment-Analysis and EDA on the IMDB Movie Review Dataset

BERT score for text generation

Compute distance between sequences. 30+ algorithms, pure python implementation, common interface, optional external libs usage.

KakaoBrain KoGPT (Korean Generative Pre-trained Transformer)

SAINT PyTorch implementation

PyTorch implementation of NATSpeech: A Non-Autoregressive Text-to-Speech Framework

GrammarTagger — A Neural Multilingual Grammar Profiler for Language Learning

Transformer - A TensorFlow Implementation of the Transformer: Attention Is All You Need

A collection of models for image - text generation in ACM MM 2021.

Diaformer: Automatic Diagnosis via Symptoms Sequence Generation

Mednlp - Medical natural language parsing and utility library

This is the offline-training-pipeline for our project.

Japanese Long-Unit-Word Tokenizer with RemBertTokenizerFast of Transformers

A Python package implementing a new model for text classification with visualization tools for Explainable AI :octocat:

T‘rex Park is a Youzan sponsored project. Offering Chinese NLP and image models pretrained from E-commerce datasets

🕹 An esoteric language designed so that the program looks like the transcript of a Pokémon battle

Open solution to the Toxic Comment Classification Challenge

Global Rhythm Style Transfer Without Text Transcriptions