spafe: Simplified Python Audio-Features Extraction

Last update: Jan 01, 2023

Overview

spafe: Simplified Python Audio-Features Extraction

spafe aims to simplify features extractions from mono audio files. The library can extract of the following features: BFCC, LFCC, LPC, LPCC, MFCC, IMFCC, MSRCC, NGCC, PNCC, PSRCC, PLP, RPLP, Frequency-stats etc. It also provides various filterbank modules (Mel, Bark and Gammatone filterbanks) and other spectral statistics.

Installation

Dependencies

spafe requires:

Python (>= 3.5)
NumPy (>= 1.17.2)
SciPy (>= 1.3.1)

User installation

If you already have a working installation of numpy and scipy, you can simply install spafe using pip:

pip install -U spafe

or conda (not available at the moment):

conda install spafe

How to use

Various examples on how to use spafe filter banks or feature extraction techniques are available under examples.

Contributing

Contributions are welcome and encouraged. To learn more about how to contribute to spafe please refer to the Contributing guidelines

Comments

Comparing Spafe with librosa and surfboard

This is both a comment for the paper and for the repo's README: the first python lib that came to my mind when I saw the review request was librosa. Could you - in the paper's statement of need - elaborate on why this lib is relevant in comparison to what librosa already provides? This would neatly fit with the other challengers (namely, Bob, SpeechPy and python_speech_features). The same argument could be made with the less well-known surfboard.

As a side note and quasi-request, it'd be nice to have such a comparison outlined in the README. If I were someone that came across this repo, i'd like to know how this is not "yet another audio feature extraction lib", and how this one might fit my needs (especially in comparison to other libs). If I were of bad faith, i'd say i'd like to see a bit of marketing :smiling_imp:

opened by hadware 8
Run examples has TypeError:

TypeError: dct(): incompatible function arguments. The following argument types are supported: 1. (a: array, type: int, axes: object = None, inorm: int = 0, out: object = None, nthreads: int = 1) -> array

opened by Amforever 8
segmentation fault

segmentation fault occured when my program attempted to access the frames which returned by the function stride_trick the signal length is 1852416 in "stride_trick" a.size returns 3704832
question

opened by weixiu00 6
Type hinting and general code review
Reference Issue

This is part of the code review for JOSS: https://github.com/openjournals/joss-reviews/issues/4739

This PR is based on the edits from #44 . That other PR should probably be merged before this one.

What does this implement/fix? Explain your changes.

This PR has has three aims:

I'll add type hints to the function signatures (I hope you don't mind, but I really do love type hints, @hbredin might testify on this)

Adding type hints sort of forces me to comb through the code and get a better understanding it of it

If I see fishy things I might :

either add a # TODO or # WARNING comment for you to solve. You can probably solve them by directly pushing in this PR if you wish

correct them myself if they're extremely obvious. In this case you might want to check that I haven't done anything bad.
opened by hadware 4
Fixes to setup, dependencies and CI
Reference Issue

This is a PR relating to https://github.com/openjournals/joss-reviews/issues/4739#issuecomment-1260768703

First of all, since this is our first contact, I have to say that this lib is very well made :smiley: , thanks a lot for your work! The code is very clean, the API is very clear, the tests are very exhaustive, and the documentation seems to be very complete.

The changes in this PR are somewhat nickpicky, but I think they really help making the lib more "standard".

What does this implement/fix? Explain your changes.

This fixes and "standardizes" a couple of things:

I move most of the constants from the spafe/version.py to the setup.py , as this was not a standard thing to do. If this choice was very opinionated (i.e., you had a very good reason to do so), you can obviously revert that.

I put the test and documentation dependencies in tests and docs , in the extras_require field. This is somewhat more standard, and allows the user to run pip install spafe[docs] or pip install spafe[tests] which I think is a bit cleaner. Note that running pip install spafe[anything] installs the "main" dependencies by default.

I left the "main" dependencies in the requirements.txt, as i think this to be pretty practical.

Having the ability to plot things is very good, but requiring the install of matplotlib by default is, I think, a bit bloated. I made this dependency optional, and added its install through pip install spafe[plotting]. You should probably say something about that in the install instructions.

I relaxed the required versions for scipy/numpy ( >= instead of == , this is very useful if spafe has to share its env with some other libraries)

I fixed the github actions in several ways:

I used a pip install instead of conda install, this makes the tests setup faster and much simpler

The old config didn't use the python-version matrix

I removed the tests for python 3.5 and 3.6 (they wouldn't be able to run anyway, the required numpy/scipy versions were not available for theses versions of python).

I added an action for the documentation. This couldn't run with the others actions, as you don't want a PR branch to activate a doc build.
opened by hadware 4
The discription of " gammatone_fbanks.gammatone_filter_banks" of document is out of the version

By written in pycharm of the piece of the code in the document, I found that the "return" of the method is not a array of fbanks but a array of "a array of fbanks" and another array that I dont know its mean, I guess it is an array of center frequencies by print it out. Please fix up the document.

opened by WindDevil 3

IndexError: index 3806427 is out of bounds for axis 0 with size 39168

  File "/usr/local/lib/python3.9/site-packages/spafe/features/spfeats.py", line 279, in extract_feats
    feats["mode_frequency"] = frequencies[amplitudes.argmax()]
IndexError: index 3806427 is out of bounds for axis 0 with size 39168

  File "/usr/local/lib/python3.9/site-packages/spafe/features/spfeats.py", line 270, in extract_feats
    feats["peak_frequency"] = frequencies[np.argmax(amplitudes)]
IndexError: index 3806427 is out of bounds for axis 0 with size 39168

We are getting in the wrong way the argmax() of amplitudes when amplitudes dimensions is greater than 1.

Moreover, mode_frequency and peak_frequency have the same value, because amplitudes.argmax() and np.argmax(amplitudes) are the same, but this is a different issue https://github.com/SuperKogito/spafe/issues/34

opened by Helias 3

Value error: negative dimensions are not allowed when using spafe library
I am working in audio files and I use this code to extract lfcc features:

audio_path = '03-01-02-02-01-01-01_norm.wav'

fs, sig = scipy.io.wavfile.read(audio_path)

lfcc = lfcc(sig, 13)

but I have this value error:

negative dimensions are not allowed

I think that it is because my sig array has negative values, can you give me a way to fix it? Thank you for yor help
question
opened by barrydjenaba 3
'value error: shapes not aligned' while using pncc

pnccs = pncc(sig,fs,13) Traceback (most recent call last): File "", line 1, in File "D:\old data\Python\Python35\lib\site-packages\spafe\features\pncc.py", line 205, in pncc b=gammatone_filter.T) File "<array_function internals>", line 6, in dot ValueError: shapes (400,46) and (257,26) not aligned: 46 (dim 1) != 257 (dim 0)

bug spafe.features

opened by mohammadalihumayun 3
MFE not found

i try this https://spafe.readthedocs.io/en/latest/features/mfcc.html, but got this "ImportError: cannot import name 'mfe' from 'spafe.features.mfcc'"
bug documentation question

opened by ibnudaqiqil 3
fix: non-int issue and remove useless variables
Reference Issue

closes https://github.com/SuperKogito/spafe/issues/14

What does this implement/fix? Explain your changes.

Cast to int frame_length and frame_step as suggested from the issue.

Remove unused variables

Any other comments?

Thanks https://github.com/aacarneiro/spafe/commit/95a8c785a7982db697a2ee5ff9f57845f5f56f5f @aacarneiro
opened by Helias 2

Releases(v0.2.0)

v0.2.0(Jul 13, 2022)
What's new?

Added spectrogram implementations.

Added predefined filter banks input to the spafe features functions for faster batch processing.

Added the Constant Q Cepstral Coefficients (CQCC) implementation.

Improved, restyled & added references and examples to the documentation.

Improved and simplified the code and the tests .

Inspected Cython and GPU accelerations to code (dropped).

Tested support for different sampling rates (8kHz, 16kHz, 32kHz, 44100Hz, 48kHz).

Converters bug fixed.

Documentation available at https://superkogito.github.io/spafe/v0.2.0/index.html

New contributors:

Stefano Borzì https://github.com/Helias

Christian Heider Nielsen https://github.com/cnheider

Source code(tar.gz)
Source code(zip)
v0.1.2(Jul 12, 2022)
Fixed framing copping bug.

Re-implemented dominant frequencies extraction module.

Fixed MFCC liftering.

Fixed windowing.

Setup CI; automated testing and coverage.

Setup formatting and automated reviews.

Documentation is available at https://superkogito.github.io/spafe/v0.1.2/index.html

Source code(tar.gz)
Source code(zip)
v0.1.1(Jul 12, 2022)
Only includes small text and Readme fixes in comparison to 0.1.0.

Documentation available at https://spafe.readthedocs.io/en/latest/.

Source code(tar.gz)
Source code(zip)
v0.1.0(Jul 12, 2022)
Initial release of Spafe.

Structure:

spafe.frequencies

spafe.frequencies.dominant_frequencies

spafe.frequencies.fundamental_frequencies

spafe.features

spafe.features.bfcc

spafe.features.gfcc

spafe.features.lfcc

spafe.features.lpc

spafe.features.mfcc

spafe.features.msrcc

spafe.features.ngcc

spafe.features.pncc

spafe.features.psrcc

spafe.features.rplp

spafe.features.spfeats

spafe.fbanks

spafe.fbanks.bark_fbanks

spafe.fbanks.gammatone_fbanks

spafe.fbanks.linear_fbanks

spafe.fbanks.mel_fbanks

spafe.utils

spafe.utils.cepstral

spafe.utils.converters

spafe.utils.filters

spafe.utils.levinsondr

spafe.utils.preprocessing

spafe.utils.spectral

spafe.utils.vis

Documentation available at https://spafe.readthedocs.io/en/latest/.

Source code(tar.gz)
Source code(zip)

Owner

Ayoub Malek

MSc. in EE & IT from TUM, ML engineer, programming enthusiast and coffee addict.

GitHub Repository https://spafe.readthedocs.io

Spotify Song Recommendation Program

Spotify-Song-Recommendation-Program Made by Esra Nur Özüm Written in Python The aim of this project was to build a recommendation system that recommen

1 Jun 30, 2022

A small project where I identify notes and key harmonies in a piece of music and use them further to recreate and generate the same piece of music through Python

5 Oct 07, 2022

An audio-solving python funcaptcha solving module

funcapsolver funcapsolver is a funcaptcha audio-solving module, which allows captchas to be interacted with and solved with the use of google's speech

8 Nov 21, 2022

The official repository for Audio ALBERT

AALBERT Here is also the official repository of AALBERT, which is Pytorch lightning reimplementation of the paper, Audio ALBERT: A Lite Bert for Self-

55 Dec 11, 2022

Mentos Music Bot With Python

Mentos Music Bot For Any Query Join Our Support Group 👥 Special Thanks - @OfficialYukki Hey Welcome To Here 💫 💫 You Can Make Your Own Music Bot Fo

13 Oct 21, 2022

A Quick Music Player Made Fully in Python

Quick Music Player Made Fully In Python. Pure Python, cross platform, single function module with no dependencies for playing sounds. Installation & S

1 Dec 24, 2021

Identify the emotion of multiple speakers in an Audio Segment

MevonAI - Speech Emotion Recognition

111 Jan 07, 2023

OpenClubhouse - A third-part web application based on flask to play Clubhouse audio.

1.1k Jan 05, 2023

Python module for handling audio metadata

Mutagen is a Python module to handle audio metadata. It supports ASF, FLAC, MP4, Monkey's Audio, MP3, Musepack, Ogg Opus, Ogg FLAC, Ogg Speex, Ogg The

1.1k Dec 31, 2022

Expressive Digital Signal Processing (DSP) package for Python

AudioLazy Development Last release PyPI status Real-Time Expressive Digital Signal Processing (DSP) Package for Python! Laziness and object representa

642 Dec 26, 2022

Telegram Bot to play music in VoiceChat with Channel Support and autostarts Radio.

VCPlayerBot Telegram bot to stream videos in telegram voicechat for both groups and channels. Supports live streams, YouTube videos and telegram media

1 Oct 15, 2021

Praat in Python, the Pythonic way

Parselmouth - Praat in Python, the Pythonic way Parselmouth is a Python library for the Praat software. Though other attempts have been made at portin

786 Jan 09, 2023

Music generation using ml / dl

Data analysis Document here the project: deep_music Description: Project Description Data Source: Type of analysis: Please document the project the be

0 Jul 03, 2022

DCL - An easy to use diacritic library used for diacritic and accent manipulation.

Diacritics Library This library is used for adding, and removing diacritics from strings. Getting started Start by importing the module: import dcl DC

6 Jun 03, 2022

Synchronize a local directory of songs' (MP3, MP4) metadata (genre, ratings) and playlists with a Plex server.

PlexMusicSync Synchronize a local directory of songs' (MP3, MP4) metadata (genre, ratings) and playlists (m3u, m3u8) with a Plex server. The song file

9 Jul 07, 2022

Sync Toolbox - Python package with reference implementations for efficient, robust, and accurate music synchronization based on dynamic time warping (DTW)

66 Jan 02, 2023

spafe: Simplified Python Audio-Features Extraction

Related tags

Overview

spafe: Simplified Python Audio-Features Extraction

Installation

Dependencies

User installation

How to use

Contributing

Comments

Reference Issue

What does this implement/fix? Explain your changes.

Reference Issue

What does this implement/fix? Explain your changes.

Reference Issue

What does this implement/fix? Explain your changes.

Any other comments?

Releases(v0.2.0)

v0.2.0(Jul 13, 2022)

What's new?

New contributors:

v0.1.2(Jul 12, 2022)

v0.1.1(Jul 12, 2022)

v0.1.0(Jul 12, 2022)

Owner

Ayoub Malek

Spotify Song Recommendation Program

A small project where I identify notes and key harmonies in a piece of music and use them further to recreate and generate the same piece of music through Python

An audio-solving python funcaptcha solving module

The official repository for Audio ALBERT

Mentos Music Bot With Python

A Quick Music Player Made Fully in Python

Identify the emotion of multiple speakers in an Audio Segment

OpenClubhouse - A third-part web application based on flask to play Clubhouse audio.

Python module for handling audio metadata

Expressive Digital Signal Processing (DSP) package for Python

Telegram Bot to play music in VoiceChat with Channel Support and autostarts Radio.

Praat in Python, the Pythonic way

Music generation using ml / dl

DCL - An easy to use diacritic library used for diacritic and accent manipulation.

Synchronize a local directory of songs' (MP3, MP4) metadata (genre, ratings) and playlists with a Plex server.

Sync Toolbox - Python package with reference implementations for efficient, robust, and accurate music synchronization based on dynamic time warping (DTW)

Mina - A Telegram Music Bot 5 mandatory Assistant written in Python using Pyrogram and Py-Tgcalls

This is my voice assistant Patric!

Tradutor de um arquivo MIDI para ser usado em um simulador RISC-V(RARS)

Implementation of "Slow-Fast Auditory Streams for Audio Recognition, ICASSP, 2021" in PyTorch