PI Zero W Audio Book

Motivation and requirements

My dad is practically blind and, at 80, has trouble hearing and operating tiny or complicated electronic controls. Touch screens, smartphones, keyboards, and small MP3 players are completely out of the picture. As an initial assessment of whether he could operate an audio book player at all, I tried a small dumb MP3 player (Sencor) with 5 buttons (prev, next, play|pause, volume up/down). Even though he used it, he struggled with the controls: each button on the small player was overloaded with 2-3 functions, which was too much. It also lacked a fundamental feature, remote book updates. So I decided to build a custom player with the following requirements:

volume control is an analog knob (ideally it turns off all the way to the left)
keep the number of buttons to a minimum (spaced far apart - resilient to random touch)
allow remote content changes - wifi
open content (not locked to a publisher)
does not need to be battery operated
minimal level of state indicators
sufficient output volume to drive speakers/headphones

Install

Dependencies

Use venv for managing dependencies

python3 -m venv env
Activate the env with `source env/bin/activate`
pip3 install gpiozero
pip3 install python-mpd2
pip3 install google-cloud-texttospeech

knihaui.py

The pi user on the Raspberry Pi Zero has this repo checked out under the knihaui folder.
There is also a /data folder at the filesystem root, writable by the pi user.
/etc/rc.local is modified to disable video output, set PCM volume to 100, set up the IO pins, and set permissions on /data.
wifi_restart.sh and its related service definition automatically ping and restart wifi.
/etc/systemd/system/knihaui.service takes care of running the UI.
The service is enabled with systemctl enable knihaui.
MPD is installed and enabled on the system, running on port 6600 and using /data as its media directory.
Unused or extra components are disabled. We keep avahi for name discovery.
To prolong SD card lifetime, download overlayfs and use it as per the instructions in the readme.
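
As a rough illustration of how a button UI can drive MPD, here is a minimal sketch using gpiozero and python-mpd2. It is not the actual knihaui.py: the pin numbers, the button names, and the `make_actions` helper are assumptions made for the example. The hardware imports live inside `main()` so the button-to-command mapping can be tried without a Pi.

```python
from typing import Callable, Dict

# Hypothetical BCM pin numbers; the real wiring is not described in this doc.
BUTTON_PINS = {"prev": 17, "play_pause": 27, "next": 22}


def make_actions(client) -> Dict[str, Callable[[], None]]:
    """Map button names to MPD commands on an MPDClient-like object."""

    def toggle() -> None:
        # MPD reports "play", "pause" or "stop" in status()["state"].
        if client.status().get("state") == "play":
            client.pause(1)
        else:
            client.play()

    return {
        "prev": lambda: client.previous(),
        "play_pause": toggle,
        "next": lambda: client.next(),
    }


def main() -> None:
    # Imported here so make_actions can be exercised without the hardware.
    from signal import pause
    from gpiozero import Button
    from mpd import MPDClient

    client = MPDClient()
    client.connect("localhost", 6600)  # MPD port from the setup above
    actions = make_actions(client)

    buttons = []
    for name, pin in BUTTON_PINS.items():
        button = Button(pin, bounce_time=0.05)
        button.when_pressed = actions[name]
        buttons.append(button)  # keep references so the handlers stay registered

    pause()  # sleep until a signal arrives; handlers run in gpiozero threads
```

Calling `main()` on the Pi starts the loop. A long-running service would also need to reconnect when MPD drops an idle connection, which this sketch leaves out.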

newsgen.py

Download the project credentials from Google Cloud to `env/newsgen-credentials.json`. To run:
export GOOGLE_APPLICATION_CREDENTIALS=env/newsgen-credentials.json
source env/bin/activate
Running python3 newsgen.py creates /tmp/news.mp3 if successful
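
For orientation, the text-to-speech step could look roughly like the sketch below. It is not the actual newsgen.py: the `synthesize` and `write_atomically` helpers and the `sk-SK` language code are assumptions for illustration. The file is written to a temporary name and then renamed, so a player never picks up a half-written MP3.

```python
import os
import tempfile


def write_atomically(path: str, data: bytes) -> None:
    """Write data to a temp file in the target directory, then rename it into place."""
    directory = os.path.dirname(os.path.abspath(path))
    fd, tmp_path = tempfile.mkstemp(dir=directory, suffix=".tmp")
    try:
        with os.fdopen(fd, "wb") as handle:
            handle.write(data)
        os.chmod(tmp_path, 0o644)  # mkstemp creates 0600; let other users read it
        os.replace(tmp_path, path)  # atomic on POSIX within one filesystem
    except BaseException:
        if os.path.exists(tmp_path):
            os.remove(tmp_path)
        raise


def synthesize(text: str, out_path: str = "/tmp/news.mp3",
               language_code: str = "sk-SK") -> None:
    """Turn text into an MP3 with Google Cloud Text-to-Speech (assumed settings)."""
    # Imported lazily so write_atomically works without the Google client installed.
    from google.cloud import texttospeech

    # The client reads GOOGLE_APPLICATION_CREDENTIALS from the environment.
    client = texttospeech.TextToSpeechClient()
    response = client.synthesize_speech(
        input=texttospeech.SynthesisInput(text=text),
        voice=texttospeech.VoiceSelectionParams(language_code=language_code),
        audio_config=texttospeech.AudioConfig(
            audio_encoding=texttospeech.AudioEncoding.MP3
        ),
    )
    write_atomically(out_path, response.audio_content)
```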

Listen to Example brief in Slovak here

Automate with crontab.

V0

V0 was a set of scripts that sliced larger audio books into small, manageable files suitable for dumb players. This also made it possible to prepend a spoken "chapter X" announcement to the start of each slice.
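
The slicing idea can be sketched as follows. This is not the original V0 script: the use of ffmpeg, the 10-minute default slice length, and the `chapter_NNN.mp3` naming are assumptions chosen for the example. The functions only build the commands; running them is left to the caller.

```python
from typing import List, Tuple


def slice_plan(total_seconds: int, slice_seconds: int) -> List[Tuple[int, int]]:
    """Return (start, length) pairs that cover the whole recording."""
    if total_seconds < 0 or slice_seconds <= 0:
        raise ValueError("total_seconds must be >= 0 and slice_seconds > 0")
    return [
        (start, min(slice_seconds, total_seconds - start))
        for start in range(0, total_seconds, slice_seconds)
    ]


def ffmpeg_commands(src: str, total_seconds: int,
                    slice_seconds: int = 600) -> List[List[str]]:
    """Build one ffmpeg command per slice, copying the audio without re-encoding."""
    commands = []
    plan = slice_plan(total_seconds, slice_seconds)
    for index, (start, length) in enumerate(plan, start=1):
        out = f"chapter_{index:03d}.mp3"  # hypothetical naming scheme
        commands.append(
            ["ffmpeg", "-ss", str(start), "-t", str(length),
             "-i", src, "-c", "copy", out]
        )
    return commands
```

Each command can be passed to `subprocess.run`. Prepending the spoken chapter announcement would be an extra concatenation step that is not shown here.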

V1

V1 is the physical build with buttons that my dad is using right now.

Build hardware using Pi zero W
Python UI that reads the buttons and controls MPD
Test remote upgrade capability - SSH
Add support for internet radios (SRo and Radio Litera)
Document the Raspbian system modifications in this doc

V2

HW: Add serial port output to external connector for improved troubleshooting
HW: Replace potentiometer with rotary encoder and set master volume directly using Alsa
HW: Add rocker switch with indicator to allow turn off/on and immediate powered-on indication
OS: Serial console
SW: rotary switch volume control
SW: user request to have information about the day available as another station
OS: read-only mount mode to prolong SD card lifetime
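
The planned rotary encoder volume control might be approached along these lines. This is only a sketch of one option, not a committed design: the pin numbers, the 16-step range, the `PCM` control name, and the helper names are all assumptions.

```python
import subprocess
from typing import List

MAX_STEPS = 16  # assumed encoder range: steps run from -16 to +16


def steps_to_percent(steps: int, max_steps: int = MAX_STEPS) -> int:
    """Map encoder steps in [-max_steps, max_steps] to a 0-100 volume percentage."""
    clamped = max(-max_steps, min(max_steps, steps))
    return round((clamped + max_steps) * 100 / (2 * max_steps))


def amixer_command(percent: int, control: str = "PCM") -> List[str]:
    """Build the amixer call that sets an ALSA mixer control to a percentage."""
    return ["amixer", "sset", control, f"{percent}%"]


def main() -> None:
    # Imported here so the helpers above can be tried without the hardware.
    from signal import pause
    from gpiozero import RotaryEncoder

    encoder = RotaryEncoder(5, 6, max_steps=MAX_STEPS)  # hypothetical BCM pins

    def on_rotate() -> None:
        percent = steps_to_percent(encoder.steps)
        subprocess.run(amixer_command(percent), check=False)

    encoder.when_rotated = on_rotate
    pause()
```

With this mapping the encoder starts at its midpoint, so the volume is 50% until the knob is first turned. Restoring the last volume across reboots would need extra state.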

Audio book player for visually impaired seniors.


Owner

Andrej Hosna
