Code and models used in "MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".

Last update: Dec 29, 2022

Overview

Multilingual Unsupervised Sentence Simplification

Code and pretrained models to reproduce experiments in "MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".

Prerequisites

Linux with Python 3.6 or above.

Installing

git clone git@github.com:facebookresearch/muss.git
cd muss/
pip install -e .

How to use

Some scripts might still contain a few bugs. If you notice anything wrong, feel free to open an issue or submit a pull request.

Simplify sentences from a file using pretrained models

# English
python scripts/simplify.py scripts/examples.en --model-name muss_en_wikilarge_mined
# French
python scripts/simplify.py scripts/examples.fr --model-name muss_fr_mined
# Spanish
python scripts/simplify.py scripts/examples.es --model-name muss_es_mined

Pretrained models should be downloaded automatically, but you can also find them here:
muss_en_wikilarge_mined
muss_en_mined
muss_fr_mined
muss_es_mined
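
The three commands above differ only in the input file and the model name, so a small wrapper can pick the model from a language code. The following is a minimal sketch and not part of the MUSS repository: `DEFAULT_MODELS` and `build_simplify_command` are hypothetical names, and the mapping simply mirrors the commands shown above (`muss_en_mined` is also listed and could be substituted for English).

```python
# Hypothetical helper, not part of MUSS: builds the simplify.py command line
# for a given language, using the model names from the commands above.

# Language code -> pretrained model name (assumed defaults, copied from the README commands).
DEFAULT_MODELS = {
    "en": "muss_en_wikilarge_mined",
    "fr": "muss_fr_mined",
    "es": "muss_es_mined",
}


def build_simplify_command(input_path, language):
    """Return the argument list that runs scripts/simplify.py for one language."""
    if language not in DEFAULT_MODELS:
        raise ValueError(f"No pretrained model listed for language {language!r}")
    return [
        "python",
        "scripts/simplify.py",
        input_path,
        "--model-name",
        DEFAULT_MODELS[language],
    ]


if __name__ == "__main__":
    for language in DEFAULT_MODELS:
        command = build_simplify_command(f"scripts/examples.{language}", language)
        # Only print the command here; nothing is executed.
        print(" ".join(command))
```

To actually run a command, the returned list can be passed to `subprocess.run(command, check=True)` from the root of the cloned repository.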

Mine the data

python scripts/mine_sequences.py

Train the models

python scripts/train_model.py

Evaluate simplifications

Please head over to EASSE for Sentence Simplification evaluation.

License

The MUSS license is CC-BY-NC. See the LICENSE file for more details.

Authors

Louis Martin ([email protected])

Citation

If you use MUSS in your research, please cite "MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases":

@article{martin2021muss,
  title={MUSS: Multilingual Unsupervised Sentence Simplification by Mining Paraphrases},
  author={Martin, Louis and Fan, Angela and de la Clergerie, {\'E}ric and Bordes, Antoine and Sagot, Beno{\^\i}t},
  journal={arXiv preprint arXiv:2005.00352},
  year={2021}
}

Owner

Facebook Research
