Code for our ACL 2021 paper - ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer

Last update: Dec 25, 2022

Requirements

torch==1.6.0
cudatoolkit==10.0.103
cudnn==7.6.5
sentence-transformers==0.3.9
transformers==3.4.0
tensorboardX==2.1
pandas==1.1.5
sentencepiece==0.1.85
matplotlib==3.4.1
apex==0.1.0
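
To compare an existing environment against the pins above, a small script along these lines may help. It is only a sketch and not part of the repository: `PINNED` and `check_pins` are names made up here, and it assumes Python 3.8+ for `importlib.metadata`. It leaves out cudatoolkit and cudnn (assumed to be installed through conda rather than pip) and apex (often built from source, so its reported version may differ).

```python
from importlib import metadata

# Pip-installable pins copied from the list above (hypothetical helper dict).
PINNED = {
    "torch": "1.6.0",
    "sentence-transformers": "0.3.9",
    "transformers": "3.4.0",
    "tensorboardX": "2.1",
    "pandas": "1.1.5",
    "sentencepiece": "0.1.85",
    "matplotlib": "3.4.1",
}


def check_pins(pins, get_version=metadata.version):
    """Return {package: (expected, installed_or_None)} for every mismatch."""
    mismatches = {}
    for name, expected in pins.items():
        try:
            # Drop local build tags such as "+cu101" before comparing.
            installed = get_version(name).split("+")[0]
        except metadata.PackageNotFoundError:
            installed = None
        if installed != expected:
            mismatches[name] = (expected, installed)
    return mismatches


if __name__ == "__main__":
    for name, (expected, installed) in check_pins(PINNED).items():
        print(f"{name}: expected {expected}, found {installed or 'not installed'}")
```

An empty result means every listed package matches its pin.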

Get Started

Download a pre-trained language model (e.g. bert-base-uncased) from HuggingFace's model library.
Download the STS datasets to the ./data folder using the SentEval toolkit.

Run the following command to launch the unsupervised experiment:

python3 main.py \
  --no_pair --seed 1 \
  --use_apex_amp --apex_amp_opt_level O1 \
  --batch_size 96 --max_seq_length 64 --evaluation_steps 200 \
  --add_cl --cl_loss_only --cl_rate 0.15 --temperature 0.1 \
  --learning_rate 0.0000005 --train_data stssick --num_epochs 10 \
  --da_final_1 feature_cutoff --da_final_2 shuffle --cutoff_rate_final_1 0.2 \
  --model_name_or_path [PRETRAINED_BERT_FOLDER] \
  --model_save_path ./output/unsup-base-feature_cutoff-shuffle \
  --force_del --no_dropout --patience 10

where [PRETRAINED_BERT_FOLDER] should be replaced with the path to the folder that contains the downloaded pre-trained language model.
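
For readers who want a feel for what the command trains, the snippet below is a dependency-free sketch of the ideas named in the paper: two augmented views per sentence (feature cutoff and token shuffling) and an NT-Xent contrastive loss with a temperature. It is not the repository's code. The function names are hypothetical, the link between these functions and the `--da_final_1`, `--da_final_2`, `--cutoff_rate_final_1` and `--temperature` flags is an assumption based on the flag names, and plain Python lists stand in for tensors.

```python
import math
import random


def feature_cutoff(token_embeddings, rate, rng):
    """Zero the same random subset of feature dimensions for every token."""
    dim = len(token_embeddings[0])
    dropped = set(rng.sample(range(dim), int(dim * rate)))
    return [[0.0 if d in dropped else x for d, x in enumerate(token)]
            for token in token_embeddings]


def shuffle_tokens(token_embeddings, rng):
    """Return the tokens in a random order (the input list is left untouched)."""
    shuffled = list(token_embeddings)
    rng.shuffle(shuffled)
    return shuffled


def cosine(u, v):
    """Cosine similarity of two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    return dot / (math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v)))


def nt_xent_loss(view_1, view_2, temperature=0.1):
    """Average NT-Xent loss over 2N sentence embeddings.

    view_1[i] and view_2[i] are the two views of sentence i (the positive pair);
    every other embedding in the batch acts as a negative.
    """
    n = len(view_1)
    reps = list(view_1) + list(view_2)
    total = 0.0
    for i in range(2 * n):
        positive = (i + n) % (2 * n)
        logits = [cosine(reps[i], reps[k]) / temperature
                  for k in range(2 * n) if k != i]
        pos_logit = cosine(reps[i], reps[positive]) / temperature
        # Log-sum-exp with max subtraction for numerical stability.
        peak = max(logits)
        log_denominator = peak + math.log(sum(math.exp(x - peak) for x in logits))
        total += log_denominator - pos_logit
    return total / (2 * n)


if __name__ == "__main__":
    rng = random.Random(1)
    tokens = [[rng.gauss(0, 1) for _ in range(10)] for _ in range(4)]
    print(feature_cutoff(tokens, 0.2, rng)[0])
    print(nt_xent_loss([[1.0, 0.0], [0.0, 1.0]], [[0.9, 0.1], [0.1, 0.9]]))
```

A lower temperature sharpens the softmax over similarities, so hard negatives weigh more heavily in the loss.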

Citation

@article{yan2021consert,
  title={ConSERT: A Contrastive Framework for Self-Supervised Sentence Representation Transfer},
  author={Yan, Yuanmeng and Li, Rumei and Wang, Sirui and Zhang, Fuzheng and Wu, Wei and Xu, Weiran},
  journal={arXiv preprint arXiv:2105.11741},
  year={2021}
}


Owner

Yan Yuanmeng
