Replication Package for AequeVox:Automated Fariness Testing for Speech Recognition Systems

Last update: Aug 28, 2022

Related tags

Deep Learning AequeVox

Overview

AequeVox

Replication Package for AequeVox:Automated Fariness Testing for Speech Recognition Systems

README under development.

Python Packages Required

numpy
scipy
math
librosa
random
time
json
threading
re
nltk

ASR Specific Packages

Google Cloud

speech
Storage

Microsoft Azure

Azure.cognitiveservices.speech

IBM Cloud

ibm_watson
ibm_watson.websocket
Ibm_cloud_sdk_core.authenticators

The code is separated into 2 sections, Generation and Analysis.

Generation:

transGen.py

Lists all transformation types and magnitudes to be used. Can be modified as necessary.
Requires the specification of file names of all the original speech files.

Generates transformed speech files with form {Original File Name}{Transformation Type Abbreviation}{Magnitude of Transformation Parameter, theta}.wav

List of Abbreviations.

A - Amplitude
C - Clipping
D - Drop
F - Frame
HP - Highpass
LP - LP
N - Noise
S - Scale

GCP_Recog.py

Requires Google cloud client libraries and associated keys.

Takes a group name and the list of all original files in the group to generate transcripts.

MS_Recog.py

Requires Microsoft Azure client libraries and associated key and region.

Takes a group name and the list of all original files in the group to generate transcripts.

IBM_Recog.py

Requires IBM client libraries and associated key and service URL..

Takes a group name and the list of all original files in the group to generate transcripts.

compASR.py

Takes the names of two ASR systems and group names to generate a distance metric. Result yields text files with distance metrics for specified groups.

Users are requested to use the distance metrics to calculate the D values for each transformation.

Replication Package for AequeVox:Automated Fariness Testing for Speech Recognition Systems

Related tags

Overview

AequeVox

Owner

Sai Sathiesh

The official pytorch implementation of our paper "Is Space-Time Attention All You Need for Video Understanding?"

PyTorch implementation for the Neuro-Symbolic Sudoku Solver leveraging the power of Neural Logic Machines (NLM)

Probabilistic Gradient Boosting Machines

(JMLR' 19) A Python Toolbox for Scalable Outlier Detection (Anomaly Detection)

Hamiltonian Dynamics with Non-Newtonian Momentum for Rapid Sampling

1st Place Solution to ECCV-TAO-2020: Detect and Represent Any Object for Tracking

A simple Tensorflow based library for deep and/or denoising AutoEncoder.

MADE (Masked Autoencoder Density Estimation) implementation in PyTorch

Arch-Net: Model Distillation for Architecture Agnostic Model Deployment

Code for the Convolutional Vision Transformer (ConViT)

Face recognition system using MTCNN, FACENET, SVM and FAST API to track participants of Big Brother Brasil in real time.

Minimal deep learning library written from scratch in Python, using NumPy/CuPy.

Instant-nerf-pytorch - NeRF trained SUPER FAST in pytorch

Implementation of Uformer, Attention-based Unet, in Pytorch

Tensorflow implementation of "BEGAN: Boundary Equilibrium Generative Adversarial Networks"

[WACV 2020] Reducing Footskate in Human Motion Reconstruction with Ground Contact Constraints

This is the official implementation of VaxNeRF (Voxel-Accelearated NeRF).

This is the code for ACL2021 paper A Unified Generative Framework for Aspect-Based Sentiment Analysis

Multi-Glimpse Network With Python

Using OpenAI's CLIP to upscale and enhance images