Implementing a simplified copy of Shazam application from scratch using MinHashing and LSH.

Last update: Nov 17, 2022

Overview

Building Shazam from scratch

In this repository we tried to implement a simplified copy of the Shazam application able to tell you the name of a song listening to a short sample.

Overview

Converting the songs from mp3 to wav with Librosa and extraction of the peaks
MinHashing with permutations on the shingles matrix
Locality sensitive hashing to divide the songs in buckets
Shazam!

pickle is a folder that contains the songs peaks, the shingles array and the shingle matrix in pickle format.
ShazamLSH.ipynb is the main notebook that only contains the explanation of the steps and some comments
function.py contains all the implemented function needed to execute the notebook

Resources

This is the dataset we used and processed:

https://www.kaggle.com/dhrumil140396/mp3s32k

We also share some useful links can help to understand what is the process behind Min Hashing and LSH in order to recognise song:

Implementing a simplified copy of Shazam application from scratch using MinHashing and LSH.

Related tags

Overview

Building Shazam from scratch

Overview

Contents

Resources

Owner

Arturo Ghinassi

Implementation for paper MLP-Mixer: An all-MLP Architecture for Vision

Spectral normalization (SN) is a widely-used technique for improving the stability and sample quality of Generative Adversarial Networks (GANs)

This repository is an implementation of paper : Improving the Training of Graph Neural Networks with Consistency Regularization

A GUI for Face Recognition, based upon Docker, Tkinter, GPU and a camera device.

Official Pytorch implementation of the paper: "Locally Shifted Attention With Early Global Integration"

Facial recognition project

Extracting knowledge graphs from language models as a diagnostic benchmark of model performance.

hySLAM is a hybrid SLAM/SfM system designed for mapping

Justmagic - Use a function as a method with this mystic script, like in Nim

YoloAll is a collection of yolo all versions. you you use YoloAll to test yolov3/yolov5/yolox/yolo_fastest

An easy way to build PyTorch datasets. Modularly build datasets and automatically cache processed results

Mitsuba 2: A Retargetable Forward and Inverse Renderer

Annealed Flow Transport Monte Carlo

Source code of all the projects of Udacity Self-Driving Car Engineer Nanodegree.

A LiDAR point cloud cluster for panoptic segmentation

A PyTorch-based library for semi-supervised learning

A simple, high level, easy-to-use open source Computer Vision library for Python.

[ACM MM 2021] Diverse Image Inpainting with Bidirectional and Autoregressive Transformers

Fermi Problems: A New Reasoning Challenge for AI

A simple consistency training framework for semi-supervised image semantic segmentation