Neural machine translation between the writings of Shakespeare and modern English using TensorFlow

Last update: Dec 28, 2022

Overview

Shakespeare translations using TensorFlow

This is an example of using the new Google's TensorFlow library on monolingual translation going from modern English to Shakespeare based on research from Wei Xu.

Prepare

First download the TensorFlow library depending on your platform:

pip install https://storage.googleapis.com/tensorflow/mac/tensorflow-0.5.0-py2-none-any.whl # for mac
pip install https://storage.googleapis.com/tensorflow/linux/cpu/tensorflow-0.5.0-cp27-none-linux_x86_64.whl # for ubuntu

Grabs parallel data.
Gets train, dev split.
Builds vocabulary
Converts parallel data into ids

From the root directory:

python -m tensorshake.get_data
python -m tensorshake.prepare_corpus

Delete /cache to start anew.

Train

Use the example BASH script to train the model. This saves the check points in the --train_dir directory. If you run it again, the training process continues from the check point. To restart with fresh parameters, simply delete/rename the check points.

./run.sh

Results

Benchmarks from original paper. (Shakespeare -> Modern English)

Input	Output
i will bite thee by the ear for that jest .	i ’ ll bite you by the ear for that joke .
what further woe conspires against mine age ?	what ’ s true despair conspires against my old age ?
how doth my lady ?	how is my lady ?
hast thou slain tybalt ?	have you killed tybalt ?
an i might live to see thee married once , i have my wish .	if i could live to see you married, i ’ ve my wish .
benvolio , who began this bloody fray ?	benvolio , who started this bloody fight itself ?
what is your will ?	what do you want ?
call her forth to me .	bring her out to me .

Cherrypicked examples from this repo (Modern English -> Shakespeare)

Input	Output
but you’re not listening to me.	but you do not hear me .
Gregory, on my word, we will not be humiliated, like carrying coal.	regory , we 'll not carry coals .
but he got the promotion.	he is the friend .
i can hit quickly, if i'm motivated.	i strike , i am moved .
Did you just give us the finger, sir?	have you leave the thumb , sir ?
You don’t know what you’re doing!	you do not what you know you .
have you killed Tybalt?	hast thou slain tybalt ?
Why, Romeo, are you crazy?	why , art thou mad , mad ?

Pre-Trained Models

Here is a link for an example model: https://s3-us-west-2.amazonaws.com/foxtype-nlp/tensorshake/model_cache.zip

Possible improvements

word embeddings
beam search
language model reranking

Neural machine translation between the writings of Shakespeare and modern English using TensorFlow

Related tags

Overview

Shakespeare translations using TensorFlow

Prepare

Train

Results

Pre-Trained Models

Possible improvements

Owner

Motoki Wu

Bayesian optimization in PyTorch

StableSims is an open-source project aimed at simulating MakerDAO's Dai stablecoin system

(CVPR 2022 - oral) Multi-View Depth Estimation by Fusing Single-View Depth Probability with Multi-View Geometry

A plug-and-play library for neural networks written in Python

SimpleDepthEstimation - An unified codebase for NN-based monocular depth estimation methods

[NeurIPS 2021] A weak-shot object detection approach by transferring semantic similarity and mask prior.

Unofficial PyTorch implementation of SimCLR by Google Brain

Code of paper: "DropAttack: A Masked Weight Adversarial Training Method to Improve Generalization of Neural Networks"

A pytorch implementation of Pytorch-Sketch-RNN

BC3407-Group-5-Project - BC3407 Group Project With Python

GPU-accelerated Image Processing library using OpenCL

Dense Gaussian Processes for Few-Shot Segmentation

Rot-Pro: Modeling Transitivity by Projection in Knowledge Graph Embedding

Project repo for Learning Category-Specific Mesh Reconstruction from Image Collections

Introducing neural networks to predict stock prices

official code for dynamic convolution decomposition

Alphabetical Letter Recognition

This repository is a basic Machine Learning train & validation Template (Using PyTorch)

YOLTv4 builds upon YOLT and SIMRDWN, and updates these frameworks to use the most performant version of YOLO, YOLOv4

Research code for CVPR 2021 paper "End-to-End Human Pose and Mesh Reconstruction with Transformers"