commonfate 📦commonfate 📦 - Common Fate Model and Transform.

Last update: Jan 08, 2022

Related tags

Overview

Common Fate Transform and Model for Python

This package is a python implementation of the Common Fate Transform and Model to be used for audio source separation as described in an ICASSP 2016 paper "Common Fate Model for Unison source Separation".

Common Fate Transform

The Common Fate Transform is based on a signal representation that divides a complex spectrogram into a grid of patches of arbitrary size. These complex patches are then processed by a two-dimensional discrete Fourier transform, forming a tensor representation which reveals spectral and temporal modulation textures.

Common Fate Model

An adapted factorization model similar to the PARAFAC/CANDECOMP factorisation allows to decompose the common fate transform tesnor into different time-varying harmonic sources based on their particular common modulation profile: hence the name Common Fate Model.

Usage

See the full API documentation at http://aliutkus.github.io/commonfate.

Applying the Common Fate Transform

import commonfate

# # forward transform

# STFT Parameters

framelength = 1024
hopsize = 256
X = commonfate.transform.forward(signal, framelength, hopsize)

# Patch Parameters
W = (32, 48)
mhop = (16, 24)

Z = commonfate.transform.forward(X, W, mhop, real=False)

# inverse transform of cft
Y = commonfate.transform.inverse(
    Z, fdim=2, hop=mhop, shape=X.shape, real=False
)
# back to time domain
y = commonfate.transform.inverse(
    Y, fdim=1, hop=hopsize, shape=x.shape
)

Fitting the Common Fate Model

import commonfate

# initialiase and fit the common fate model
cfm = commonfate.model.CFM(z, nb_components=10, nb_iter=100).fit()

# get the fitted factors
(A, H, C) = cfm.factors

# returns the of z approximation using the fitted factors
z_hat = cfm.approx()

Decompose an audio signal using CFT and CFM

commonfate has a built-in wrapper which computes the Common Fate Transform, fits the model according to the Common Fate Model and return the synthesised time domain signal components obtained through wiener / soft mask filtering.

The following example requires to install pysoundfile.

import commonfate
import soundfile as sf

# loading signal
(audio, fs) = sf.read(filename, always_2d=True)

# decomposes the audio signal into
# (nb_components, nb_samples, nb_channels)
components = decompose.process(
    audio,
    nb_iter=100,
    nb_components=10,
    n_fft=1024,
    n_hop=256,
    cft_patch=(32, 48),
    cft_hop=(16, 24)
)

# write out the third component to wave file
sf.write(
    "comp_3.wav",
    components[2, ...],
    fs
)

Optimisations

The current common fate model implementation makes heavily use of the Einstein Notation. We use the numpy einsum module which can be slow on large tensors. To speed up the computation time we recommend to install Daniel Smith's opt_einsum package.

Installation via pip

pip install -e 'git+https://github.com/dgasmith/opt_einsum.git#egg=opt_einsum'

commonfate automatically detects if the package is installed.

References

You can download and read the paper here. If you use this package, please reference to the following publication:

@inproceedings{stoeter2016cfm,
  TITLE = {{Common Fate Model for Unison source Separation}},
  AUTHOR = {St{\"o}ter, Fabian-Robert and Liutkus, Antoine and Badeau, Roland and Edler, Bernd and Magron, Paul},
  BOOKTITLE = {{41st International Conference on Acoustics, Speech and Signal Processing (ICASSP)}},
  ADDRESS = {Shanghai, China},
  PUBLISHER = {{IEEE}},
  SERIES = {Proceedings of the 41st International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
  YEAR = {2016},
  KEYWORDS = {Non-Negative tensor factorization ; Sound source separation ; Common Fate Model},
}

Comments

Change factorisation notation so that it matches the paper

https://github.com/aliutkus/commonfate/blob/master/commonfate/model.py#L121

is different to paper where we state:

the output should therefore be changed to (A, H, C)
enhancement

opened by faroit 1
Raised a MemoryError after called decompose.process

code: (audio, fs) = sf.read('1.wav', always_2d=True) components = commonfate.decompose.process( audio, nb_components=10, ) sf.write( "comp_3.wav", components[2, ...], fs ) raise errors: Traceback (most recent call last): File "F:/py_project/independent-project.git/music_ctl_lettin/music_feature_engineering/process_music.py", line 190, in nb_components=10, File "C:\Users\Besitzer\AppData\Local\Programs\Python\Python36-32\lib\site-packages\commonfate\decompose.py", line 63, in process n_hop, File "C:\Users\Besitzer\AppData\Local\Programs\Python\Python36-32\lib\site-packages\commonfate\transform.py", line 325, in forward stft = fftFunction(stft, frameShape, axes=range(len(frameShape))) File "C:\Users\Besitzer\AppData\Local\Programs\Python\Python36-32\lib\site-packages\numpy\fft\fftpack.py", line 1099, in rfftn a = rfft(a, s[-1], axes[-1], norm) File "C:\Users\Besitzer\AppData\Local\Programs\Python\Python36-32\lib\site-packages\numpy\fft\fftpack.py", line 372, in rfft _real_fft_cache) File "C:\Users\Besitzer\AppData\Local\Programs\Python\Python36-32\lib\site-packages\numpy\fft\fftpack.py", line 83, in _raw_fft r = work_function(a, wsave) MemoryError

Any ideas?

opened by okideal 1

commonfate 📦commonfate 📦 - Common Fate Model and Transform.

Related tags

Overview

Common Fate Transform and Model for Python

Common Fate Transform

Common Fate Model

Usage

Applying the Common Fate Transform

Fitting the Common Fate Model

Decompose an audio signal using CFT and CFM

Optimisations

Installation via pip

References

You might also like...

C++ library for audio and music analysis, description and synthesis, including Python bindings

An app made in Python using the PyTube and Tkinter libraries to download videos and MP3 audio.

Small Python application that links a Digico console and Reaper, handling automatic marker insertion and tracking.

Anki vector Music ❤ is the best and only Telegram VC player with playlists, Multi Playback, Channel play and more

Just-Music - Spotify API Driven Music Web app, that allows to listen and control and share songs

Audio fingerprinting and recognition in Python

Python library for audio and music analysis

?️ Open Source Audio Matching and Mastering

Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications

Comments

Change factorisation notation so that it matches the paper

Raised a MemoryError after called decompose.process

Releases(0.1.3)

0.1.3(Mar 24, 2016)

0.1.2(Mar 22, 2016)

0.1.1(Mar 22, 2016)

0.1(Mar 22, 2016)

Owner

Fabian-Robert Stöter

Carnatic Notes Predictor for audio files

Audio book player for senior visually impaired.

TONet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music

This Bot can extract audios and subtitles from video files

Mentos Music Bot With Python

Speech recognition module for Python, supporting several engines and APIs, online and offline.

python script for getting mp3 files from yaoutube playlist

Marsyas - Music Analysis, Retrieval and Synthesis for Audio Signals

Audio Retrieval with Natural Language Queries: A Benchmark Study

❤️ Hi There Im Cozmo Music Bot A next gen powerful telegram group Music bot for get your Songs and music @Venuja_Sadew

Sync Toolbox - Python package with reference implementations for efficient, robust, and accurate music synchronization based on dynamic time warping (DTW)

Code for csig audio deepfake detection

Accompanying code for our paper "Point Cloud Audio Processing"

Mousai is a simple application that can identify song like Shazam

LibXtract is a simple, portable, lightweight library of audio feature extraction functions.

Generating a structured library of .wav samples with Python.

GiantMIDI-Piano is a classical piano MIDI dataset contains 10,854 MIDI files of 2,786 composers

Simple discord bot by @merive 🤖

This is a realtime voice translator program which gets input from user at any language and converts it to the desired language that the user asks

Library for Python 3 to communicate with the Google Chromecast.