Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

Last update: Dec 25, 2022

Related tags

Overview

ConSERT

Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

Requirements

torch==1.6.0
cudatoolkit==10.0.103
cudnn==7.6.5
sentence-transformers==0.3.9
transformers==3.4.0
tensorboardX==2.1
pandas==1.1.5
sentencepiece==0.1.85
matplotlib==3.4.1
apex==0.1.0

Get Started

Download pre-trained language model (e.g. bert-base-uncased) from HuggingFace's Library
Download STS datasets to ./data folder using SentEval toolkit

Run the following script to run the unsupervised experiment:

python3 main.py --no_pair --seed 1 --use_apex_amp --apex_amp_opt_level O1 --batch_size 96 --max_seq_length 64 --evaluation_steps 200 --add_cl --cl_loss_only --cl_rate 0.15 --temperature 0.1 --learning_rate 0.0000005 --train_data stssick --num_epochs 10 --da_final_1 feature_cutoff --da_final_2 shuffle --cutoff_rate_final_1 0.2 --model_name_or_path [PRETRAINED_BERT_FOLDER] --model_save_path ./output/unsup-base-feature_cutoff-shuffle --force_del --no_dropout --patience 10

where [PRETRAINED_BERT_FOLDER] should be replaced to the folder that contains downloaded pre-trained language model

Citation

@article{yan2021consert,
  title={ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer},
  author={Yan, Yuanmeng and Li, Rumei and Wang, Sirui and Zhang, Fuzheng and Wu, Wei and Xu, Weiran},
  journal={arXiv preprint arXiv:2105.11741},
  year={2021}
}

Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

Related tags

Overview

ConSERT

Requirements

Get Started

Citation

Owner

Yan Yuanmeng

Autoregressive Entity Retrieval

List of GSoC organisations with number of times they have been selected.

🤗Transformers: State-of-the-art Natural Language Processing for Pytorch and TensorFlow 2.0.

German Text-To-Speech Engine using Tacotron and Griffin-Lim

Refactored version of FastSpeech2

nlpcommon is a python Open Source Toolkit for text classification.

PRAnCER is a web platform that enables the rapid annotation of medical terms within clinical notes.

fastai ulmfit - Pretraining the Language Model, Fine-Tuning and training a Classifier

Semantic search through a vectorized Wikipedia (SentenceBERT) with the Weaviate vector search engine

Unsupervised text tokenizer focused on computational efficiency

Rhythm-Finder is a unsupervised ML driven python powered web-application that can find the songs that suits you.

Pretty-doc - Composable text objects with python

A method to generate speech across multiple speakers

Creating an LSTM model to generate music

Simple virtual assistant using pyttsx3 and speech recognition optionally with pywhatkit and pther libraries.

Seq2seq attn - Use the Seq2Seq method to implement machine translation and introduce Attention mechanism to improve the results

AEC_DeepModel - Deep learning based acoustic echo cancellation baseline code

自然言語で書かれた時間情報表現を抽出/規格化するルールベースの解析器

Code for "Finetuning Pretrained Transformers into Variational Autoencoders"

Implementation of TF-IDF algorithm to find documents similarity with cosine similarity