Unified Pre-training for Self-Supervised Learning and Supervised Learning for ASR

Last update: Jan 09, 2023

Related tags

Overview

UniSpeech

The family of UniSpeech:

UniSpeech (ICML 2021): Unified Pre-training for Self-Supervised Learning and Supervised Learning for ASR

UniSpeech-SAT (ICASSP 2022 Submission): Universal Speech Representation Learning with Speaker Aware Pre-Training

Pre-trained models

We strongly suggest using our UniSpeech-SAT model for speaker related tasks, since it shows very powerful performance on various speaker related benchmarks.

Model	Dataset	Model
UniSpeech Base	1500 hrs CommonVoice	download
UniSpeech Large	1500 hrs CommonVoice	download
UniSpeech-SAT Base	960 hrs LibriSpeech	download
UniSpeech-SAT Base+	60k hrs Libri-Light + 10k hrs GigaSpeech + 24k hrs VoxPopuli	download
UniSpeech-SAT Large	60k hrs Libri-Light + 10k hrs GigaSpeech + 24k hrs VoxPopuli	download

License

This project is licensed under the license found in the LICENSE file in the root directory of this source tree. Portions of the source code are based on the FAIRSEQ project.

Microsoft Open Source Code of Conduct

Contact Information

For help or issues using UniSpeech models, please submit a GitHub issue.

For other communications related to UniSpeech, please contact Yu Wu ([email protected]).

Unified Pre-training for Self-Supervised Learning and Supervised Learning for ASR

Related tags

Overview

UniSpeech

Pre-trained models

License

Contact Information

Owner

Microsoft

Official Implementation for HyperStyle: StyleGAN Inversion with HyperNetworks for Real Image Editing

Differentiable Simulation of Soft Multi-body Systems

This repository contains the official implementation code of the paper Improving Multimodal Fusion with Hierarchical Mutual Information Maximization for Multimodal Sentiment Analysis, accepted at EMNLP 2021.

A curated list of Machine Learning and Deep Learning tutorials in Jupyter Notebook format ready to run in Google Colaboratory

TANL: Structured Prediction as Translation between Augmented Natural Languages

License Plate Detection Application

PyTorch version of the paper 'Enhanced Deep Residual Networks for Single Image Super-Resolution' (CVPRW 2017)

State-Relabeling Adversarial Active Learning

Neural network chess engine trained on Gary Kasparov's games.

Official code for the paper: Deep Graph Matching under Quadratic Constraint (CVPR 2021)

SUPERVISED-CONTRASTIVE-LEARNING-FOR-PRE-TRAINED-LANGUAGE-MODEL-FINE-TUNING - The Facebook paper about fine tuning RoBERTa with contrastive loss

Minecraft Hack Detection With Python

Code repository for Self-supervised Structure-sensitive Learning, CVPR'17

Denoising Diffusion Implicit Models

A robust camera and Lidar fusion based velocity estimator to undistort the pointcloud.

Yolo ros - YOLO-ROS for HUAWEI ATLAS200

G-NIA model from "Single Node Injection Attack against Graph Neural Networks" (CIKM 2021)

IDA file loader for UF2, created for the DEFCON 29 hardware badge

Minimal deep learning library written from scratch in Python, using NumPy/CuPy.

Label-Free Model Evaluation with Semi-Structured Dataset Representations