Maix Speech AI lib, including ASR, chat, TTS etc.

Last update: Dec 25, 2022

Related tags

Text Data & NLP Maix-Speech

Overview

Maix-Speech

中文 | English

Brief

Now only support Chinese, See 中文

Build

Clone code by:

git clone https://github.com/sipeed/Maix-Speech

Compile

x86x64

cd projects/asr

python project.py clean_conf
python project.py menuconfig

python project.py build
# python project.py build --verbose

./build/asr

python project.py clean
python project.py distclean
# python project.py clean_conf

R329

cd projects/asr
python project.py --toolchain /opt/toolchain/bin --toolchain-prefix aarch64-openwrt-linux- config
python project.py menuconfig
python project.py build

More project structure usage

See github.com/Neutree/c_cpp_project_framework

License

Apache 2.0

A fast Text-to-Speech (TTS) model. Work well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far). 快速语音合成模型，适用于英语、普通话/中文、日语、韩语、俄语和藏语（当前已测试）。

简体中文 | English 并行语音合成 [TOC] 新进展 2021/04/20 合并 wavegan 分支到 main 主分支，删除 wavegan 分支！ 2021/04/13 创建 encoder 分支用于开发语音风格迁移模块！ 2021/04/13 softdtw 分支支持使用 Sof

161 Dec 19, 2022

Command Line Text-To-Speech using Google TTS

cli-tts Thanks to gTTS by @pndurette! This is an interactive command line text-to-speech tool using Google TTS. Just type text and the voice will be p

3 Nov 11, 2022

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

VAENAR-TTS - PyTorch Implementation PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

67 Nov 14, 2022

Chinese real time voice cloning (VC) and Chinese text to speech (TTS).

Chinese real time voice cloning (VC) and Chinese text to speech (TTS). 好用的中文语音克隆兼中文语音合成系统，包含语音编码器、语音合成器、声码器和可视化模块。

6 Nov 8, 2022

Universal End2End Training Platform, including pre-training, classification tasks, machine translation, and etc.

背景安装教程快速上手（一）预训练模型（二）机器翻译（三）文本分类 TenTrans 进阶 1. 多语言机器翻译 2. 跨语言预训练背景 TrenTrans是一个统一的端到端的多语言多任务预训练平台，支持多种预训练方式，以及序列生成和自然语言理解任务。安装教程 git clone git

Tencent Minority-Mandarin Translation Team

42 Dec 20, 2022

A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models

wav2vec-toolkit A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models This repository accompanies the

29 Oct 23, 2022

Model for recasing and repunctuating ASR transcripts

Recasing and punctuation model based on Bert Benoit Favre 2021 This system converts a sequence of lowercase tokens without punctuation to a sequence o

88 Dec 29, 2022

A demo of chinese asr

chinese_asr_demo 一个端到端的中文语音识别模型训练、测试框架具备数据预处理、模型训练、解码、计算wer等等功能训练数据训练数据采用thchs_30，

4 Dec 9, 2021

An end to end ASR Transformer model training repo

END TO END ASR TRANSFORMER 本项目基于transformer 6*encoder+6*decoder的基本结构构造的端到端的语音识别系统 Model Instructions 1.数据准备: 自行下载数据，遵循文件结构如下： ├── data │ ├── train │

10 Jul 19, 2022

Comments

Cross-compile issue

Looks like the build system does not respect toolchain/arch setting in menuconfig. Even I set it to arm, it still tries to use x86 toolchain (host system toolchain).

[ 53%] Building C object asr_lib/CMakeFiles/asr_lib.dir/src/asr.c.o cc1: error: bad value (‘armv7-a’) for ‘-march=’ switch cc1: note: valid arguments to ‘-march=’ switch are: nocona core2 nehalem corei7 westmere sandybridge corei7-avx ivybridge core-avx-i haswell core-avx2 broadwell skylake skylake-avx512 cannonlake icelake-client rocketlake icelake-server cascadelake tigerlake cooperlake sapphirerapids alderlake bonnell atom silvermont slm goldmont goldmont-plus tremont knl knm x86-64 x86-64-v2 x86-64-v3 x86-64-v4 eden-x2 nano nano-1000 nano-2000 nano-3000 nano-x2 eden-x4 nano-x4 k8 k8-sse3 opteron opteron-sse3 athlon64 athlon64-sse3 athlon-fx amdfam10 barcelona bdver1 bdver2 bdver3 bdver4 znver1 znver2 znver3 btver1 btver2 native make[2]: *** [asr_lib/CMakeFiles/asr_lib.dir/build.make:76: asr_lib/CMakeFiles/asr_lib.dir/src/asr.c.o] Error 1 make[1]: *** [CMakeFiles/Makefile2:246: asr_lib/CMakeFiles/asr_lib.dir/all] Error 2

opened by mike-2020 0

Releases(v0.1.0)

v0.1.0(Oct 16, 2021)

2021.10.16 upload initial AM&LM files am_xxyy[q].zip: acoustic model, the q means quant to int8 model (about 1/4 size of float model) lmS.zip: Small version language model, 12+6+8MB lmM.zip: Middle version language model, 104+59+72MB lmL.zip: Large version language model, 750+403+72MB
（mega: https://mega.nz/folder/0lwGFLgI#omc_tYjwkl1kLKhZqUG5Ng） font.zip: optional simple GBK font file for embedded display Or you can download from BaiduYunPan: 链接：https://pan.baidu.com/s/1DzTcgi-Q7_T8mnKHfj1R8Q 提取码：wcq8

Files below: ↓↓↓↓↓↓
Source code(tar.gz)
Source code(zip)
am_3316.zip(11.52 MB)
am_3324.zip(20.44 MB)
am_3332.zip(31.48 MB)
am_7332.zip(50.36 MB)
font.zip(1.73 MB)
lmM.zip(146.50 MB)
lmS.zip(15.81 MB)
maix_asr_nezha_D1(2.14 MB)
r329_7332_192.bin(10.98 MB)
r329_toolchain.tar.gz(54.65 MB)

Owner

Sipeed

AIoT opensource hardware platform

GitHub Repository

Official codebase for Can Wikipedia Help Offline Reinforcement Learning?

82 Dec 19, 2022

Autoregressive Entity Retrieval

The GENRE (Generative ENtity REtrieval) system as presented in Autoregressive Entity Retrieval implemented in pytorch. @inproceedings{decao2020autoreg

611 Dec 16, 2022

Linear programming solver for paper-reviewer matching and mind-matching

Paper-Reviewer Matcher A python package for paper-reviewer matching algorithm based on topic modeling and linear programming. The algorithm is impleme

66 Jul 05, 2022

🕹 An esoteric language designed so that the program looks like the transcript of a Pokémon battle

PokéBattle is an esoteric language designed so that the program looks like the transcript of a Pokémon battle. Original inspiration and specification

9 Jan 11, 2022

Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of speech tagging and word segmentation.

Pytorch-NLU，一个中文文本分类、序列标注工具包，支持中文长文本、短文本的多类、多标签分类任务，支持中文命名实体识别、词性标注、分词等序列标注任务。 Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classifi

186 Dec 24, 2022

CCF BDCI BERT系统调优赛题baseline（Pytorch版本）

CCF BDCI BERT系统调优赛题baseline（Pytorch版本）此版本基于Pytorch后端的huggingface进行实现。由于此实现使用了Oneflow的dataloader作为数据读入的方式，因此也需要安装Oneflow。其它框架的数据读取可以参考OneflowDataloade

9 Oct 13, 2022

Poetry PEP 517 Build Backend & Core Utilities

Poetry Core A PEP 517 build backend implementation developed for Poetry. This project is intended to be a light weight, fully compliant, self-containe

293 Jan 02, 2023

An official repository for tutorials of Probabilistic Modelling and Reasoning (2021/2022) - a University of Edinburgh master's course.

PMR computer tutorials on HMMs (2021-2022) This is a repository for computer tutorials of Probabilistic Modelling and Reasoning (2021/2022) - a Univer

10 Dec 06, 2022

Guide: Finetune GPT2-XL (1.5 Billion Parameters) and GPT-NEO (2.7 B) on a single 16 GB VRAM V100 Google Cloud instance with Huggingface Transformers using DeepSpeed

Guide: Finetune GPT2-XL (1.5 Billion Parameters) and GPT-NEO (2.7 Billion Parameters) on a single 16 GB VRAM V100 Google Cloud instance with Huggingfa

289 Jan 06, 2023

Maix Speech AI lib, including ASR, chat, TTS etc.

Related tags

Overview

Maix-Speech

Brief

Build

Clone code by:

Compile

More project structure usage

License

You might also like...

A fast Text-to-Speech (TTS) model. Work well for English, Mandarin/Chinese, Japanese, Korean, Russian and Tibetan (so far). 快速语音合成模型，适用于英语、普通话/中文、日语、韩语、俄语和藏语（当前已测试）。

Command Line Text-To-Speech using Google TTS

PyTorch Implementation of VAENAR-TTS: Variational Auto-Encoder based Non-AutoRegressive Text-to-Speech Synthesis.

Chinese real time voice cloning (VC) and Chinese text to speech (TTS).

Universal End2End Training Platform, including pre-training, classification tasks, machine translation, and etc.

A collection of scripts to preprocess ASR datasets and finetune language-specific Wav2Vec2 XLSR models

Model for recasing and repunctuating ASR transcripts

A demo of chinese asr

An end to end ASR Transformer model training repo

Comments

Cross-compile issue

Releases(v0.1.0)

v0.1.0(Oct 16, 2021)

Owner

Sipeed

Official codebase for Can Wikipedia Help Offline Reinforcement Learning?

Autoregressive Entity Retrieval

Linear programming solver for paper-reviewer matching and mind-matching

🕹 An esoteric language designed so that the program looks like the transcript of a Pokémon battle

Ptorch NLU, a Chinese text classification and sequence annotation toolkit, supports multi class and multi label classification tasks of Chinese long text and short text, and supports sequence annotation tasks such as Chinese named entity recognition, part of speech tagging and word segmentation.

CCF BDCI BERT系统调优赛题baseline（Pytorch版本）

Poetry PEP 517 Build Backend & Core Utilities

An official repository for tutorials of Probabilistic Modelling and Reasoning (2021/2022) - a University of Edinburgh master's course.

Guide: Finetune GPT2-XL (1.5 Billion Parameters) and GPT-NEO (2.7 B) on a single 16 GB VRAM V100 Google Cloud instance with Huggingface Transformers using DeepSpeed

ConferencingSpeech2022; Non-intrusive Objective Speech Quality Assessment (NISQA) Challenge

The entmax mapping and its loss, a family of sparse softmax alternatives.

A raytrace framework using taichi language

A simple word search made in python

Submit issues and feature requests for our API here.

A full spaCy pipeline and models for scientific/biomedical documents.

Repository for the paper "Optimal Subarchitecture Extraction for BERT"

Searching keywords in PDF file folders

NeuTex: Neural Texture Mapping for Volumetric Neural Rendering

Russian words synonyms and antonyms

Revisiting Pre-trained Models for Chinese Natural Language Processing (Findings of EMNLP 2020)