Rotary Transformer

Last update: Jan 03, 2023

Related tags

Overview

Rotary Transformer

Rotary Transformer，简称RoFormer，是我们自研的语言模型之一，主要是为Transformer结构设计了新的旋转式位置编码（Rotary Position Embedding，RoPE）。RoPE具有良好的理论性质，且是目前唯一一种可以应用到线性Attention的绝对位置编码，目前来看实验结果也颇为不错。

详细介绍：https://kexue.fm/archives/8265

依赖

bert4keras 0.10.4

参考配置：在24G显存的3090上，跑maxlen=1024，batch_size能跑到8以上。

下载

chinese_roformer_L-12_H-768_A-12.zip(提取码：xy9x)

引用

Bibtex：

@techreport{zhuiyiroformer,
  title={RoFormer: Transformer with Rotary Position Embeddings - ZhuiyiAI},
  author={Jianlin Su},
  year={2021},
  url="https://github.com/ZhuiyiTechnology/roformer",
}

联系

邮箱：[email protected] 追一科技：https://zhuiyi.ai

Owner

Zhuiyi Technology is a leading enterprise intelligent service AI company in China. We focus on deep learning and NLP.

GitHub Repository

🔅 Shapash makes Machine Learning models transparent and understandable by everyone

🎉 What's new ? Version New Feature Description Tutorial 1.6.x Explainability Quality Metrics To help increase confidence in explainability methods, y

2.1k Dec 27, 2022

UV matrix decompostion using movielens dataset

UV-matrix-decompostion-with-kfold UV matrix decompostion using movielens dataset upload the 'ratings.dat' file install the following python libraries

2 Oct 18, 2022

Keras Implementation of Neural Style Transfer from the paper "A Neural Algorithm of Artistic Style"

Neural Style Transfer & Neural Doodles Implementation of Neural Style Transfer from the paper A Neural Algorithm of Artistic Style in Keras 2.0+ INetw

2.2k Dec 31, 2022

Simple ray intersection library similar to coldet - succedeed by libacc

Ray Intersection This project offers a header only acceleration structure library including implementations for a BVH- and KD-Tree. Applications may i

29 Jun 23, 2022

A Context-aware Visual Attention-based training pipeline for Object Detection from a Webpage screenshot!

CoVA: Context-aware Visual Attention for Webpage Information Extraction Abstract Webpage information extraction (WIE) is an important step to create k

41 Jan 01, 2023

Docker containers of baseline agents for the Crafter environment

Crafter Baselines This repository contains Docker containers for running various baselines on the Crafter environment. Reward Agents DreamerV2 based o

17 Sep 25, 2022

This is a GUI interface which can process forest fire detection, smoke detection and fire segmentation

This is a GUI interface which can process forest fire detection, smoke detection and fire segmentation. Yolov5 is used to detect fire and smoke and unet is used to segment fire.

7 Jan 08, 2023

Extracts data from the database for a graph-node and stores it in parquet files

subgraph-extractor Extracts data from the database for a graph-node and stores it in parquet files Installation For developing, it's recommended to us

0 Jan 10, 2022

Keqing Chatbot With Python

KeqingChatbot A public running instance can be found on telegram as @keqingchat_bot. Requirements Python 3.8 or higher. A bot token. Local Deploy git

2 Jan 16, 2022

Spectral Tensor Train Parameterization of Deep Learning Layers

Spectral Tensor Train Parameterization of Deep Learning Layers This repository is the official implementation of our AISTATS 2021 paper titled "Spectr

12 Oct 23, 2022

PaddleRobotics is an open-source algorithm library for robots based on Paddle, including open-source parts such as human-robot interaction, complex motion control, environment perception, SLAM positioning, and navigation.

简体中文 | English PaddleRobotics paddleRobotics是基于paddle的机器人开源算法库集，包括人机交互、复杂运动控制、环境感知、slam定位导航等开源算法部分。人机交互主动多模交互技术TFVT-HRI 主动多模交互技术是通过视觉、语音、触摸传感器等输入机器人

185 Dec 26, 2022

Rotary Transformer

Related tags

Overview

Rotary Transformer

依赖

下载

引用

联系

Owner

🔅 Shapash makes Machine Learning models transparent and understandable by everyone

UV matrix decompostion using movielens dataset

Keras Implementation of Neural Style Transfer from the paper "A Neural Algorithm of Artistic Style"

Simple ray intersection library similar to coldet - succedeed by libacc

A Context-aware Visual Attention-based training pipeline for Object Detection from a Webpage screenshot!

Docker containers of baseline agents for the Crafter environment

This is a GUI interface which can process forest fire detection, smoke detection and fire segmentation

Extracts data from the database for a graph-node and stores it in parquet files

Keqing Chatbot With Python

Spectral Tensor Train Parameterization of Deep Learning Layers

PaddleRobotics is an open-source algorithm library for robots based on Paddle, including open-source parts such as human-robot interaction, complex motion control, environment perception, SLAM positioning, and navigation.

PFLD pytorch Implementation

VOS: Learning What You Don’t Know by Virtual Outlier Synthesis

Two types of Recommender System : Content-based Recommender System and Colaborating filtering based recommender system

Differentiable Quantum Chemistry (only Differentiable Density Functional Theory and Hartree Fock at the moment)

SketchEdit: Mask-Free Local Image Manipulation with Partial Sketches

A simple python library for fast image generation of people who do not exist.

Solution of Kaggle competition: Sartorius - Cell Instance Segmentation

This code uses generative adversarial networks to generate diverse task allocation plans for Multi-agent teams.

Pytorch implementation of SimSiam Architecture