SimBERT升级版（SimBERTv2）！

Last update: Dec 23, 2022

Related tags

Text Data & NLP roformer-sim

Overview

RoFormer-Sim

RoFormer-Sim，又称SimBERTv2，是我们之前发布的SimBERT模型的升级版。

介绍

https://kexue.fm/archives/8454

训练

tensorflow 1.14 + keras 2.3.1 + bert4keras 0.10.6

下载

chinese_roformer-sim-char_L-12_H-768_A-12.zip(提取码：2cgz)

引用

Bibtex：

@techreport{roformer-sim,
  title={RoFormer-Sim: Integrating Retrieval and Generation into RoFormer},
  author={Jianlin Su},
  year={2021},
  url="https://github.com/ZhuiyiTechnology/roformer-sim",
}

联系

邮箱：[email protected]

链接

追一科技：https://zhuiyi.ai

Owner

Zhuiyi Technology is a leading enterprise intelligent service AI company in China. We focus on deep learning and NLP.

GitHub Repository

Arabic speech recognition, classification and text-to-speech.

klaam Arabic speech recognition, classification and text-to-speech using many advanced models like wave2vec and fastspeech2. This repository allows tr

177 Dec 27, 2022

中文无监督SimCSE Pytorch实现

A PyTorch implementation of unsupervised SimCSE SimCSE: Simple Contrastive Learning of Sentence Embeddings 1. 用法无监督训练 python train_unsup.py ./data/ne

99 Dec 23, 2022

Modified GPT using average pooling to reduce the softmax attention memory constraints.

NLP-GPT-Upsampling This repository contains an implementation of Open AI's GPT Model. In particular, this implementation takes inspiration from the Ny

1 Dec 03, 2021

a chinese segment base on crf

Genius Genius是一个开源的python中文分词组件，采用 CRF(Conditional Random Field)条件随机场算法。 Feature 支持python2.x、python3.x以及pypy2.x。支持简单的pinyin分词支持用户自定义break 支持用户自定义合并词

237 Nov 04, 2022

Persian Bert For Long-Range Sequences

ParsBigBird: Persian Bert For Long-Range Sequences The Bert and ParsBert algorithms can handle texts with token lengths of up to 512, however, many ta

63 Dec 14, 2022

Python SDK for working with Voicegain Speech-to-Text

Voicegain Speech-to-Text Python SDK Python SDK for the Voicegain Speech-to-Text API. This API allows for large vocabulary speech-to-text transcription

3 Dec 14, 2022

NSFW A chatbot based on GPT2-chitchat

DangBot -- 好怪哦，再来一句卡群怪话bot，powered by GPT2 for Chinese chitchat Training Example: python train.py --lr 5e-2 --epochs 30 --max_len 300 --batch_size 8

11 Jul 21, 2022

Protein Language Model

ProteinLM We pretrain protein language model based on Megatron-LM framework, and then evaluate the pretrained model results on TAPE (Tasks Assessing P

77 Dec 27, 2022

API for the GPT-J language model 🦜. Including a FastAPI backend and a streamlit frontend

gpt-j-api 🦜 An API to interact with the GPT-J language model. You can use and test the model in two different ways: Streamlit web app at http://api.v

276 Dec 31, 2022

⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).

BERT-of-Theseus Code for paper "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing". BERT-of-Theseus is a new compressed BERT by progre

284 Nov 25, 2022

SimBERT升级版（SimBERTv2）！

Related tags

Overview

RoFormer-Sim

介绍

训练

下载

引用

联系

链接

Owner

Arabic speech recognition, classification and text-to-speech.

中文无监督SimCSE Pytorch实现

Modified GPT using average pooling to reduce the softmax attention memory constraints.

a chinese segment base on crf

Persian Bert For Long-Range Sequences

Python SDK for working with Voicegain Speech-to-Text

NSFW A chatbot based on GPT2-chitchat

Protein Language Model

API for the GPT-J language model 🦜. Including a FastAPI backend and a streamlit frontend

⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).

Must-read papers on improving efficiency for pre-trained language models.

숭실대학교 컴퓨터학부 전공종합설계프로젝트

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

[EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

A cross platform OCR Library based on PaddleOCR & OnnxRuntime

A complete NLP guideline for enthusiasts

Official implementation of Meta-StyleSpeech and StyleSpeech

Sentiment-Analysis and EDA on the IMDB Movie Review Dataset

SimBERT升级版（SimBERTv2）！

Related tags

Overview

RoFormer-Sim

介绍

训练

下载

引用

联系

链接

Owner

Arabic speech recognition, classification and text-to-speech.

中文无监督SimCSE Pytorch实现

Modified GPT using average pooling to reduce the softmax attention memory constraints.

a chinese segment base on crf

Persian Bert For Long-Range Sequences

Python SDK for working with Voicegain Speech-to-Text

**NSFW** A chatbot based on GPT2-chitchat

Protein Language Model

API for the GPT-J language model 🦜. Including a FastAPI backend and a streamlit frontend

⛵️The official PyTorch implementation for "BERT-of-Theseus: Compressing BERT by Progressive Module Replacing" (EMNLP 2020).

Must-read papers on improving efficiency for pre-trained language models.

숭실대학교 컴퓨터학부 전공종합설계프로젝트

ALBERT: A Lite BERT for Self-supervised Learning of Language Representations

[EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction

Unofficial Implementation of Zero-Shot Text-to-Speech for Text-Based Insertion in Audio Narration

Unofficial Parallel WaveGAN (+ MelGAN & Multi-band MelGAN & HiFi-GAN & StyleMelGAN) with Pytorch

A cross platform OCR Library based on PaddleOCR & OnnxRuntime

A complete NLP guideline for enthusiasts

Official implementation of Meta-StyleSpeech and StyleSpeech

Sentiment-Analysis and EDA on the IMDB Movie Review Dataset

NSFW A chatbot based on GPT2-chitchat