Recognition of 38 speech commands in russian. Based on Yandex Cup 2021 ML Challenge: ASR

Last update: May 05, 2022

Overview

Speech_38_ru_commands

Recognition of 38 speech commands in russian. Based on Yandex Cup 2021 ML Challenge: ASR

Программа умеет распознавать 38 ключевых слов на русском языке , произнесенных в микрофон из списка:

дальше, вперед, назад, вверх, вниз, выше, ниже, домой, громче, тише, лайк, дизлайк, следующий, предыдущий, сначала, перемотай, выключи, стоп, хватит, замолчи, заткнись, останови, пауза, включи, смотреть, продолжи, играй, запусти, ноль, один, два, три, четыре, пять, шесть, семь, восемь, девять.

Используемая модель была подготовлена для соревнования Yandex Cup 2021 ML Challenge: ASR. Получило 3 место из 54 участников. с показателем точности 92.01

Скачать модель по ссылке https://disk.yandex.ru/d/L053qF-0OPKlog

Пример запуска программы:

python speech_38_ru_commands.py --porog 1.2

где , число 1.2 - это порог уверенности в команде. Можно задавать в диапазоне 0.0 - 7.9999

Recognition of 38 speech commands in russian. Based on Yandex Cup 2021 ML Challenge: ASR

Related tags

Overview

Speech_38_ru_commands

Owner

Andrey

Dé op-de-vlucht Pieton vertaler. Wereldwijd gebruikt door meer dan 1.000+ succesvolle bedrijven!

Large-scale open domain KNOwledge grounded conVERsation system based on PaddlePaddle

A look-ahead multi-entity Transformer for modeling coordinated agents.

AI Assistant for Building Reliable, High-performing and Fair Multilingual NLP Systems

AllenNLP integration for Shiba: Japanese CANINE model

Protein Language Model

A simple recipe for training and inferencing Transformer architecture for Multi-Task Learning on custom datasets. You can find two approaches for achieving this in this repo.

The official implementation of "BERT is to NLP what AlexNet is to CV: Can Pre-Trained Language Models Identify Analogies?, ACL 2021 main conference"

Retraining OpenAI's GPT-2 on Discord Chats

Perform sentiment analysis on textual data that people generally post on websites like social networks and movie review sites.

中文空间语义理解评测

Data preprocessing rosetta parser for python

A complete NLP guideline for enthusiasts

Indonesia spellchecker with python

CMeEE 数据集医学实体抽取

Toolkit for Machine Learning, Natural Language Processing, and Text Generation, in TensorFlow. This is part of the CASL project: http://casl-project.ai/

A retro text-to-speech bot for Discord

Subtitle Workshop (subshop): tools to download and synchronize subtitles

Unsupervised Document Expansion for Information Retrieval with Stochastic Text Generation

Korean stereoypte detector with TUNiB-Electra and K-StereoSet