無料で使える中品質なテキスト読み上げソフトウェア、VOICEVOXのコア

Last update: Aug 29, 2022

Related tags

Audio voicevox_core

Overview

VOICEVOX CORE

VOICEVOX の音声合成コア。

Releases にビルド済みのコアライブラリ（.so/.dll）があります。

依存関係

CUDA 11.1 と CUDNN のインストールと LibTorch のダウンロードが必要です。

API

core.h をご参照ください。

サンプルの実行

まず Releases からコアライブラリが入った zip をダウンロードしておきます。

Python 3

ソースコードから実行

cd example/python

# example/python のディレクトリにコアライブラリが入った zip ファイルを展開

# Windowsの場合、DLLからLIBファイルの作成
./makelib.bat core

# 環境構築
pip install -r requirements.txt
python setup.py install  # Linuxの場合は先頭に `LIBRARY_PATH="$LIBRARY_PATH:."` が必要

# # うまく行かないときは毎回以下を実行すると良いかも
# python setup.py clean
# rm -r build *.cpp

# 実行（Windowsの場合）
PATH="$PATH:$HOME/libtorch/lib/" python run.py \
    --text "これは本当に実行できているんですか" \
    --speaker_id 1

# 実行（Windows以外の場合）
LD_LIBRARY_PATH="$LD_LIBRARY_PATH:/libtorch/lib/" python run.py \
    --text "これは本当に実行できているんですか" \
    --speaker_id 1

# 引数の紹介
# --text 読み上げるテキスト
# --speaker_id 話者ID
# --use_gpu GPUを使う
# --f0_speaker_id 音高の話者ID（デフォルト値はspeaker_id）
# --f0_correct 音高の補正値（デフォルト値は0。+-0.3くらいで結果が大きく変わります）

「ImportError: DLL load failed: 指定されたモジュールが見つかりません。」というエラーが出た場合は libtorch のパスが間違っているかもしれません。

Docker から

# イメージのビルド
docker build -t voicevox_core example/python

# コンテナの起動(音声を保存しておくボリュームを作成)
docker run -it -v ~/voicevox:/root/voice voicevox_core bash

# テスト音声 `おはようございます-1.wav` を生成
python run.py --text おはようございます --speaker_id 1
mv *.wav ~/voice
exit

# 音声の再生
aplay ~/voice/おはようございます-1.wav

その他の言語

サンプルコードを実装された際はぜひお知らせください。こちらに追記させて頂きます。

事例紹介

VOICEVOX ENGINE SHARP @yamachu ･･･ VOICEVOX ENGINE の C# 実装
Node VOICEVOX Engine @y-chan ･･･ VOICEVOX ENGINE の Node.js/C++ 実装

ライセンス

サンプルコードおよび core.h は MIT LICENSE です。

Releases にあるビルド済みのコアライブラリは別ライセンスなのでご注意ください。

無料で使える中品質なテキスト読み上げソフトウェア、VOICEVOXのコア

Related tags

Overview

VOICEVOX CORE

依存関係

API

サンプルの実行

Python 3

ソースコードから実行

Docker から

その他の言語

事例紹介

ライセンス

You might also like...

Releases(0.13.0-check-directml-rust.0)

0.13.0-check-directml-rust.0(Aug 18, 2022)

check-2022-04-11(Apr 10, 2022)

Owner

Hiroshiba

pyo is a Python module written in C to help digital signal processing script creation.

GNU Radio – the Free and Open Software Radio Ecosystem

This bot can stream audio or video files and urls in telegram voice chats

Anki vector Music ❤ is the best and only Telegram VC player with playlists, Multi Playback, Channel play and more

TONet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music

An AI for Music Generation

:sound: Play and Record Sound with Python :snake:

Pyroomacoustics is a package for audio signal processing for indoor applications. It was developed as a fast prototyping platform for beamforming algorithms in indoor scenarios.

A simple voice detection system which can be applied practically for designing a device with capability to detect a baby’s cry and automatically turning on music

A lightweight yet powerful audio-to-MIDI converter with pitch bend detection

Hide Your Secret Message in any Wave Audio File.

python wrapper for rubberband

Pyrogram bot to automate streaming music in voice chats

Audio processor to map oracle notes in the VoG raid in Destiny 2 to call outs.

A simple python script to play bell sound in your system infinitely, just for fun and experimental purposes

User-friendly Voice Cloning Application

This Bot can extract audios and subtitles from video files

DaisyXmusic ❤ A bot that can play music on Telegram Group and Channel Voice Chats

Minimal command-line music player written in Python

Synchronize a local directory of songs' (MP3, MP4) metadata (genre, ratings) and playlists with a Plex server.