This is a python script to navigate and extract the FSD50K dataset

Last update: Nov 23, 2021

Related tags

Data Analysis fsd50k_extractor

Overview

FSD50K navigator

This is a script I use to navigate the sound dataset from FSK50K. https://zenodo.org/record/4060432#.YYYJ9HtBxH4

To use this download the files from datasets. The categories is in the Ground truth file to be downloaded in the links above.

These have several script

find_audio.py - This is the script you want
convert.py - this only exist so that I can convert data for a Wio Terminal project. https://wiki.seeedstudio.com/Wio-Terminal-TinyML-EI-3/
sound_category.py - this is to show the category of a file

Usage

installation

pipenv install
Start the shell with pipenv shell then use the script as described below.

find_audio.py

Usage

Basic use - python find_audio.py categories
Show all label of a file - python find_audio.py categories -s
Exclude file with certain categories - python find_audio.py categories -x xcategories
Search Multiple file - python find_audio.py cat1,cat2,etc
Search Exclude Multiple file - python find_audio.py cat1,cat2,etc -x xcat1,xcat2
Help - python find_audio.py -h

convert.py

Usage is similar for find_audio.py except for -p, -i and -o

Basic use - python convert.py categories -p new_category_name -i source_audio_directory -o output_directory

sound_category.py

Basic use - python sound_category.py audio_name

Owner

sweemeng

sweemeng

GitHub Repository

simple way to build the declarative and destributed data pipelines with python

unipipeline simple way to build the declarative and distributed data pipelines. Why you should use it Declarative strict config Scaffolding Fully type

0 Jan 26, 2022

Zipline, a Pythonic Algorithmic Trading Library

Zipline is a Pythonic algorithmic trading library. It is an event-driven system for backtesting. Zipline is currently used in production as the backte

15.7k Jan 07, 2023

For making Tagtog annotation into csv dataset

tagtog_relation_extraction for making Tagtog annotation into csv dataset How to Use On Tagtog 1. Go to Project Downloads 2. Download all documents,

4 Dec 28, 2021

This cosmetics generator allows you to generate the new Fortnite cosmetics, Search pak and search cosmetics!

COSMETICS GENERATOR This cosmetics generator allows you to generate the new Fortnite cosmetics, Search pak and search cosmetics! Remember to put the l

11 Dec 13, 2022

small package with utility functions for analyzing (fly) calcium imaging data

fly2p Tools for analyzing two-photon (2p) imaging data collected with Vidrio Scanimage software and micromanger. Loading scanimage data relies on scan

3 Dec 14, 2022

Desafio 1 ~ Bantotal

Challenge 01 | Bantotal Please read the instructions for the challenge by selecting your preferred language below: Español Português License Copyright

44 Sep 28, 2022

Instant search for and access to many datasets in Pyspark.

SparkDataset Provides instant access to many datasets right from Pyspark (in Spark DataFrame structure). Drop a star if you like the project. 😃 Motiv

31 Dec 16, 2022

Churn prediction with PySpark

It is expected to develop a machine learning model that can predict customers who will leave the company.

3 Aug 13, 2021

Template for a Dataflow Flex Template in Python

Dataflow Flex Template in Python This repository contains a template for a Dataflow Flex Template written in Python that can easily be used to build D

5 Apr 28, 2022

A distributed block-based data storage and compute engine

Nebula is an extremely-fast end-to-end interactive big data analytics solution. Nebula is designed as a high-performance columnar data storage and tabular OLAP engine.

131 Dec 26, 2022

The Master's in Data Science Program run by the Faculty of Mathematics and Information Science

The Master's in Data Science Program run by the Faculty of Mathematics and Information Science is among the first European programs in Data Science and is fully focused on data engineering and data a

2 Jun 17, 2022

Py-price-monitoring - A Python price monitor

A Python price monitor This project was focused on Brazil, so the monitoring is

1 Jan 04, 2022

An orchestration platform for the development, production, and observation of data assets.

Dagster An orchestration platform for the development, production, and observation of data assets. Dagster lets you define jobs in terms of the data f

6.2k Jan 08, 2023

Fancy data functions that will make your life as a data scientist easier.

WhiteBox Utilities Toolkit: Tools to make your life easier Fancy data functions that will make your life as a data scientist easier. Installing To ins

3 Oct 03, 2022

Kennedy Institute of Rheumatology University of Oxford Project November 2019

TradingBot6M Kennedy Institute of Rheumatology University of Oxford Project November 2019 Run Change api.txt to binance api key: https://www.binance.c

2 Nov 16, 2021

A lightweight interface for reading in output from the Weather Research and Forecasting (WRF) model into xarray Dataset

xwrf A lightweight interface for reading in output from the Weather Research and Forecasting (WRF) model into xarray Dataset. The primary objective of

43 Nov 29, 2022

Hg002-qc-snakemake - HG002 QC Snakemake

HG002 QC Snakemake To Run Resources and data specified within snakefile (hg002QC

2 Feb 16, 2022

Data processing with Pandas.

Processing-data-with-python This is a simple example showing how to use Pandas to create a dataframe and the processing data with python. The jupyter

1 Jan 23, 2022

Retentioneering: product analytics, data-driven customer journey map optimization, marketing analytics, web analytics, transaction analytics, graph visualization, and behavioral segmentation with customer segments in Python.

What is Retentioneering? Retentioneering is a Python framework and library to assist product analysts and marketing analysts as it makes it easier to

581 Jan 07, 2023

fds is a tool for Data Scientists made by DAGsHub to version control data and code at once.

Fast Data Science, AKA fds, is a CLI for Data Scientists to version control data and code at once, by conveniently wrapping git and dvc

359 Dec 22, 2022