THOR: Transformer with Stochastic Experts

This PyTorch package implements Taming Sparsely Activated Transformer with Stochastic Experts.

Installation

The most convenient way to run the code is to use this docker image: tartarusz/adv-train:azure-pytorch-apex-v1.7.0. The image supports running on Microsoft Azure.
Our implementation is based on Fairseq.

Instructions

Download Fairseq (v1.0.0+) to the current directory.
Run pip install -e . to install the package locally.
To run a sample translation task on IWSLT'14 De-En, first follow the instructions here to download and tokenize the data, then use bash preprocess.sh to pre-process the tokenized data.
Run bash run.sh to train a THOR model.

Notes

Contact Information

For personal communication related to this package, please contact Simiao Zuo ([email protected]), Xiaodong Liu ([email protected]), or Jian Jiao ([email protected]).

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.opensource.microsoft.com.

When you submit a pull request, a CLA bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., status check, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact [email protected] with any additional questions or comments.

Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft trademarks or logos is subject to and must follow Microsoft's Trademark & Brand Guidelines. Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship. Any use of third-party trademarks or logos are subject to those third-party's policies.

This package implements THOR: Transformer with Stochastic Experts.

Related tags

Overview

THOR: Transformer with Stochastic Experts

Installation

Instructions

Notes

Contact Information

Contributing

Trademarks

Owner

Microsoft

Hyperparameter tuning for humans

Tiny-NewsRec: Efﬁcient and Effective PLM-based News Recommendation

FlingBot: The Unreasonable Effectiveness of Dynamic Manipulations for Cloth Unfolding

Keeper for Ricochet Protocol, implemented with Apache Airflow

Files for a tutorial to train SegNet for road scenes using the CamVid dataset

Source code for CAST - Crisis Domain Adaptation Using Sequence-to-sequence Transformers (Accepted to ISCRAM 2021, CorePaper).

OpenMMLab 3D Human Parametric Model Toolbox and Benchmark

Neural Logic Inductive Learning

Attempt at implementation of a simple GAN using Keras

Unofficial PyTorch implementation of MobileViT based on paper "MobileViT: Light-weight, General-purpose, and Mobile-friendly Vision Transformer".

MIMIC Code Repository: Code shared by the research community for the MIMIC-III database

pytorch implementation of "Contrastive Multiview Coding", "Momentum Contrast for Unsupervised Visual Representation Learning", and "Unsupervised Feature Learning via Non-Parametric Instance-level Discrimination"

Asterisk is a framework to generate high-quality training datasets at scale

Unofficial implementation of the ImageNet, CIFAR 10 and SVHN Augmentation Policies learned by AutoAugment using pillow

Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.

Code accompanying the paper "Knowledge Base Completion Meets Transfer Learning"

Learning from graph data using Keras

PyTorch implementation of federated learning framework based on the acceleration of global momentum

Torch Implementation of "Photo-Realistic Single Image Super-Resolution Using a Generative Adversarial Network"

Arbitrary Distribution Modeling with Censorship in Real Time 59 2 60 3 Bidding Advertising for KDD'21