Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms

Last update: Dec 05, 2022

Overview

FNet: Mixing Tokens with Fourier Transforms

Pytorch implementation of Fnet : Mixing Tokens with Fourier Transforms.

Citation:

@misc{leethorp2021fnet,
      title={FNet: Mixing Tokens with Fourier Transforms}, 
      author={James Lee-Thorp and Joshua Ainslie and Ilya Eckstein and Santiago Ontanon},
      year={2021},
      eprint={2105.03824},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

Owner

Rishikesh (ऋषिकेश)

GitHub Repository

构建一个多源（公众号、RSS）、干净、个性化的阅读环境

2C 构建一个多源（公众号、RSS）、干净、个性化的阅读环境作为一名微信公众号的重度用户，公众号一直被我设为汲取知识的地方。随着使用程度的增加，相信大家或多或少会有一个比较头疼的问题——广告问题。假设你关注的公众号有十来个，若一个公众号两周接一次广告，理论上你会面临二十多次广告，实际上会更多，运

678 Dec 28, 2022

An implementation of model parallel GPT-3-like models on GPUs, based on the DeepSpeed library. Designed to be able to train models in the hundreds of billions of parameters or larger.

GPT-NeoX An implementation of model parallel GPT-3-like models on GPUs, based on the DeepSpeed library. Designed to be able to train models in the hun

3.1k Jan 08, 2023

Unofficial implementation of Google's FNet: Mixing Tokens with Fourier Transforms

Related tags

Overview

FNet: Mixing Tokens with Fourier Transforms

Citation:

Owner

Rishikesh (ऋषिकेश)

构建一个多源（公众号、RSS）、干净、个性化的阅读环境

An implementation of model parallel GPT-3-like models on GPUs, based on the DeepSpeed library. Designed to be able to train models in the hundreds of billions of parameters or larger.

Augmenty is an augmentation library based on spaCy for augmenting texts.

Code to reproduce the results of the paper 'Towards Realistic Few-Shot Relation Extraction' (EMNLP 2021)

小布助手对话短文本语义匹配的一个baseline

code for "AttentiveNAS Improving Neural Architecture Search via Attentive Sampling"

This is the offline-training-pipeline for our project.

Nystromformer: A Nystrom-based Algorithm for Approximating Self-Attention

This repository contains the code for "Generating Datasets with Pretrained Language Models".

Training and evaluation codes for the BertGen paper (ACL-IJCNLP 2021)

Code for using and evaluating SpanBERT.

DELTA is a deep learning based natural language and speech processing platform.

Korean stereoypte detector with TUNiB-Electra and K-StereoSet

The SVO-Probes Dataset for Verb Understanding

The source code of "Language Models are Few-shot Multilingual Learners" (MRL @ EMNLP 2021)

Code for the paper "A Simple but Tough-to-Beat Baseline for Sentence Embeddings".

Sample data associated with the Aurora-BP study

This code is the implementation of Text Emotion Recognition (TER) with linguistic features

Research code for "What to Pre-Train on? Efficient Intermediate Task Selection", EMNLP 2021

RecipeReduce: Simplified Recipe Processing for Lazy Programmers