STT for TorchScript is a port of Coqui STT based on DeepSpeech to PyTorch.

Last update: Oct 18, 2021

Related tags

Text Data & NLP st3

Overview

st3

STT for TorchScript is a port of Coqui STT based on DeepSpeech to PyTorch.

Currently it supports converting pbmm models to pt scripts with integrated beam search.

Check out the first pre-release: https://github.com/proger/st3/releases

PyTorch impelementations of BERT-based Spelling Error Correction Models

59 Jun 29, 2021

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

VAENAR-TTS - PyTorch Implementation PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

67 Nov 14, 2022

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

WaveGlow A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis Quick Start: Install requirements: pip install

204 Jul 14, 2022

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Deepvoice3_pytorch PyTorch implementation of convolutional networks-based text-to-speech synthesis models: arXiv:1710.07654: Deep Voice 3: Scaling Tex

1.8k Dec 30, 2022

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation, available for both PyTorch and Tensorflow.

730 Jan 9, 2023

A Word Level Transformer layer based on PyTorch and 🤗 Transformers.

Transformer Embedder A Word Level Transformer layer based on PyTorch and 🤗 Transformers. How to use Install the library from PyPI: pip install transf

27 Nov 20, 2022

The PyTorch based implementation of continuous integrate-and-fire (CIF) module.

CIF-PyTorch This is a PyTorch based implementation of continuous integrate-and-fire (CIF) module for end-to-end (E2E) automatic speech recognition (AS

24 Dec 29, 2022

An example project using OpenPrompt under pytorch-lightning for prompt-based SST2 sentiment analysis model

pl_prompt_sst An example project using OpenPrompt under the framework of pytorch-lightning for a training prompt-based text classification model on SS

5 Oct 21, 2022

PyTorch implementation of the paper: Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding

Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding This repository contains the official PyTorch implementation of th

26 Dec 14, 2022

Releases(english1)

english1(Sep 13, 2021)
This is a conversion of Coqui English STT v0.9.3 model to TorchScript, allowing to deploy a speech recognizer as a single file. The TorchScript bundle is self-contained and runs DeepSpeech frontend and beam search returning 10 best results. LM Scorer is not supported at the moment.

To run, download the pt file and save the following code to recognize.py and make sure you have torchaudio installed using pip3 install torchaudio:

import torch, torchaudio, sys waveform, sr = torchaudio.load(sys.argv[1], normalize=True) assert sr == 16000 model = torch.jit.load('coqui-stt-0.9.3-models.pt') for transcript, scores in model(waveform.squeeze()): print(transcript, scores)

Now you can run the model on English recordings like below. Any format supported by TorchAudio backend should work.

python3 recognize.py sample.wav
Source code(tar.gz)
Source code(zip)
coqui-stt-0.9.3-models.pt(180.26 MB)

Owner

Vlad Ki

GitHub Repository

[ICCV 2021] Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-identification

Counterfactual Attention Learning Created by Yongming Rao*, Guangyi Chen*, Jiwen Lu, Jie Zhou This repository contains PyTorch implementation for ICCV

89 Dec 18, 2022

BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents

BROS (BERT Relying On Spatiality) is a pre-trained language model focusing on text and layout for better key information extraction from documents. Given the OCR results of the document image, which

94 Dec 30, 2022

Python-zhuyin - An open source Python library that provides a unified interface for converting between Chinese pinyin and Zhuyin (bopomofo)

2 Dec 29, 2022

Ελληνικά νέα (Python script) / Greek News Feed (Python script)

Ελληνικά νέα (Python script) / Greek News Feed (Python script) Ελληνικά English Το 2017 είχα υλοποιήσει ένα Python script για να εμφανίζει τα τωρινά ν

1 Jun 14, 2022

MiCECo - Misskey Custom Emoji Counter

MiCECo Misskey Custom Emoji Counter Introduction This little script counts custo

7 Dec 25, 2022

Tools and data for measuring the popularity & growth of various programming languages.

growth-data Tools and data for measuring the popularity & growth of various programming languages. Install the dependencies $ pip install -r requireme

3 Jan 06, 2022

Repositório da disciplina no semestre 2021-2

Avisos! Nenhum aviso! Compiladores 1 Este é o Git da disciplina Compiladores 1. Aqui ficará o material produzido em sala de aula assim como tarefas, w

6 May 13, 2022

Ongoing research training transformer language models at scale, including: BERT & GPT-2

What is this fork of Megatron-LM and Megatron-DeepSpeed This is a detached fork of https://github.com/microsoft/Megatron-DeepSpeed, which in itself is

316 Jan 03, 2023

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

English | 简体中文 | 繁體中文 | 한국어 State-of-the-art Machine Learning for JAX, PyTorch and TensorFlow 🤗 Transformers provides thousands of pretrained models

77.1k Dec 31, 2022

💫 Industrial-strength Natural Language Processing (NLP) in Python

spaCy: Industrial-strength NLP spaCy is a library for advanced Natural Language Processing in Python and Cython. It's built on the very latest researc

24.9k Jan 02, 2023

Product-Review-Summarizer - Created a product review summarizer which clustered thousands of product reviews and summarized them into a maximum of 500 characters, saving precious time of customers and helping them make a wise buying decision.

Product-Review-Summarizer - Created a product review summarizer which clustered thousands of product reviews and summarized them into a maximum of 500 characters, saving precious time of customers an

1 Jan 01, 2022

An official repository for tutorials of Probabilistic Modelling and Reasoning (2021/2022) - a University of Edinburgh master's course.

PMR computer tutorials on HMMs (2021-2022) This is a repository for computer tutorials of Probabilistic Modelling and Reasoning (2021/2022) - a Univer

10 Dec 06, 2022

STT for TorchScript is a port of Coqui STT based on DeepSpeech to PyTorch.

Related tags

Overview

st3

You might also like...

PyTorch impelementations of BERT-based Spelling Error Correction Models

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

A PyTorch implementation of the WaveGlow: A Flow-based Generative Network for Speech Synthesis

PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models

Transformers4Rec is a flexible and efficient library for sequential and session-based recommendation, available for both PyTorch and Tensorflow.

A Word Level Transformer layer based on PyTorch and 🤗 Transformers.

The PyTorch based implementation of continuous integrate-and-fire (CIF) module.

An example project using OpenPrompt under pytorch-lightning for prompt-based SST2 sentiment analysis model

PyTorch implementation of the paper: Text is no more Enough! A Benchmark for Profile-based Spoken Language Understanding

Releases(english1)

english1(Sep 13, 2021)

Owner

Vlad Ki

[ICCV 2021] Counterfactual Attention Learning for Fine-Grained Visual Categorization and Re-identification

BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents

Python-zhuyin - An open source Python library that provides a unified interface for converting between Chinese pinyin and Zhuyin (bopomofo)

Ελληνικά νέα (Python script) / Greek News Feed (Python script)

MiCECo - Misskey Custom Emoji Counter

Tools and data for measuring the popularity & growth of various programming languages.

Repositório da disciplina no semestre 2021-2

Ongoing research training transformer language models at scale, including: BERT & GPT-2

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

💫 Industrial-strength Natural Language Processing (NLP) in Python

Product-Review-Summarizer - Created a product review summarizer which clustered thousands of product reviews and summarized them into a maximum of 500 characters, saving precious time of customers and helping them make a wise buying decision.

Learning to Rewrite for Non-Autoregressive Neural Machine Translation

Official PyTorch implementation of Time-aware Large Kernel (TaLK) Convolutions (ICML 2020)

A very simple framework for state-of-the-art Natural Language Processing (NLP)

Paradigm Shift in NLP - "Paradigm Shift in Natural Language Processing".

Image2pcl - Enter the metaverse with 2D image to 3D projections

DLO8012: Natural Language Processing & CSL804: Computational Lab - II

Yet Another Sequence Encoder - Encode sequences to vector of vector in python !

PyTorch impelementations of BERT-based Spelling Error Correction Models.

An official repository for tutorials of Probabilistic Modelling and Reasoning (2021/2022) - a University of Edinburgh master's course.