Code and models used in "MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".

Last update: Dec 29, 2022

Multilingual Unsupervised Sentence Simplification

Code and pretrained models to reproduce experiments in "MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".

Prerequisites

Linux with Python 3.6 or above.
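The prerequisites above can be checked before installing. The following is a minimal sketch, not part of the MUSS repository; the function name `check_prerequisites` is hypothetical, and it only tests the two conditions stated here (Linux, Python 3.6 or above).

```python
import sys

# Minimum Python version stated in the prerequisites.
MIN_PYTHON = (3, 6)


def check_prerequisites(version_info=None, platform=None):
    """Return a list of human-readable problems (empty when none are found).

    Hypothetical helper: arguments default to the running interpreter
    and can be overridden for testing.
    """
    version_info = sys.version_info if version_info is None else version_info
    platform = sys.platform if platform is None else platform

    problems = []
    if tuple(version_info[:2]) < MIN_PYTHON:
        problems.append("Python %d.%d or above is required" % MIN_PYTHON)
    if not platform.startswith("linux"):
        problems.append("Linux is the only platform listed in the prerequisites")
    return problems


if __name__ == "__main__":
    for problem in check_prerequisites():
        print("warning:", problem)
```

Running the file directly prints one warning per unmet prerequisite and nothing otherwise.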

Installing

git clone git@github.com:facebookresearch/muss.git
cd muss/
pip install -e .

How to use

Some scripts might still contain a few bugs. If you notice anything wrong, feel free to open an issue or submit a pull request.

Simplify sentences from a file using pretrained models

# English
python scripts/simplify.py scripts/examples.en --model-name muss_en_wikilarge_mined
# French
python scripts/simplify.py scripts/examples.fr --model-name muss_fr_mined
# Spanish
python scripts/simplify.py scripts/examples.es --model-name muss_es_mined

Pretrained models should be downloaded automatically, but you can also find them here:
muss_en_wikilarge_mined
muss_en_mined
muss_fr_mined
muss_es_mined
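To run the commands above from Python, for example over several files, one option is a thin wrapper around `scripts/simplify.py`. This is a sketch under stated assumptions rather than an API provided by MUSS: the helper names (`build_simplify_command`, `write_sentences`, `simplify_file`) are hypothetical, the language keys in `MODELS` are illustrative, and it assumes the input file holds one sentence per line and that the script writes its simplifications to standard output.

```python
import subprocess
import sys
from pathlib import Path

# Model names taken from the commands above; the language keys are illustrative.
MODELS = {
    "en": "muss_en_wikilarge_mined",
    "fr": "muss_fr_mined",
    "es": "muss_es_mined",
}


def build_simplify_command(input_path, language):
    """Build the argument list for scripts/simplify.py (hypothetical helper)."""
    if language not in MODELS:
        raise ValueError("no pretrained model listed for language %r" % language)
    return [
        sys.executable,
        "scripts/simplify.py",
        str(input_path),
        "--model-name",
        MODELS[language],
    ]


def write_sentences(sentences, path):
    """Write sentences to a file, one per line (assumed input format)."""
    path = Path(path)
    text = "\n".join(sentence.strip() for sentence in sentences) + "\n"
    path.write_text(text, encoding="utf-8")
    return path


def simplify_file(input_path, language, repo_dir="."):
    """Run the script from the repository root and return what it prints.

    Assumption: simplifications are written to standard output.
    """
    result = subprocess.run(
        build_simplify_command(input_path, language),
        cwd=repo_dir,
        check=True,
        stdout=subprocess.PIPE,
        universal_newlines=True,  # text mode; works on Python 3.6
    )
    return result.stdout
```

For instance, `simplify_file("scripts/examples.fr", "fr", repo_dir="muss")` would run the French command shown above from inside the cloned repository.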

Mine the data

python scripts/mine_sequences.py

Train the models

python scripts/train_model.py

Evaluate simplifications

Please head over to EASSE for Sentence Simplification evaluation.

License

The MUSS license is CC-BY-NC. See the LICENSE file for more details.

Authors

Louis Martin

Citation

If you use MUSS in your research, please cite "MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases":

@article{martin2021muss,
  title={MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases},
  author={Martin, Louis and Fan, Angela and de la Clergerie, {\'E}ric and Bordes, Antoine and Sagot, Beno{\^\i}t},
  journal={arXiv preprint arXiv:2005.00352},
  year={2021}
}

Owner

Facebook Research
