wenet-kws

Production First and Production Ready End-to-End Keyword Spotting Toolkit.

The goal of this toolkit it to...

Small footprint keyword spotting (KWS), or specifically wake-up word (WuW) detection is a typical and important module in internet of things (IoT) devices. It provides a way for users to control IoT devices with a hands-free experience. A WuW detection system usually runs locally and persistently on IoT devices, which requires low consumptional power, less model parameters, low computational comlexity and to detect predefined keyword in a streaming way, i.e., requires low latency.

Typical Scenario

We are going to support the following typical applications of wakeup word:

Single wake-up word
Multiple wake-up words
Customizable wake-up word
Personalized wake-up word, i.e. combination of wake-up word detection and voiceprint

Dataset

We plan to support a variaty of open source wake-up word datasets, include but not limited to:

All the well-trained models on these dataset will be made public avaliable.

Runtime

We plan to support a variaty of hardwares and platforms, including:

Web browser
x86
Android
Raspberry Pi

Production First and Production Ready End-to-End Keyword Spotting Toolkit

Related tags

Overview

wenet-kws

Typical Scenario

Dataset

Runtime

Owner

Header-only C++ HNSW implementation with python bindings

Yet Another Sequence Encoder - Encode sequences to vector of vector in python !

Chinese Grammatical Error Diagnosis

A multi-lingual approach to AllenNLP CoReference Resolution along with a wrapper for spaCy.

A highly sophisticated sequence-to-sequence model for code generation

Translators - is a library which aims to bring free, multiple, enjoyable translation to individuals and students in Python

Use fastai-v2 with HuggingFace's pretrained transformers

Write Python in Urdu - اردو میں کوڈ لکھیں

Telegram AI chat bot written in Python using Pyrogram

Free and Open Source Machine Translation API. 100% self-hosted, offline capable and easy to setup.

GPT-3 command line interaction

DensePhrases provides answers to your natural language questions from the entire Wikipedia in real-time

This is the Alpha of Nutte language, she is not complete yet / Essa é a Alpha da Nutte language, não está completa ainda

Code for our paper "Mask-Align: Self-Supervised Neural Word Alignment" in ACL 2021

Large-scale pretraining for dialogue

Applied Natural Language Processing in the Enterprise - An O'Reilly Media Publication

NumPy String-Indexed is a NumPy extension that allows arrays to be indexed using descriptive string labels

🤗 Transformers: State-of-the-art Natural Language Processing for Pytorch, TensorFlow, and JAX.

In this project, we compared Spanish BERT and Multilingual BERT in the Sentiment Analysis task.

Tools for curating biomedical training data for large-scale language modeling