PyTorch implementation of "A Two-Stage End-to-End System for Speech-in-Noise Hearing Aid Processing"

Last update: Aug 19, 2022

Overview

Implementation of the Sheffield entry for the first Clarity enhancement challenge (CEC1)

This repository contains the PyTorch implementation of "A Two-Stage End-to-End System for Speech-in-Noise Hearing Aid Processing", the Sheffield entry for the first Clarity enhancement challenge (CEC1). The system consists of a Conv-TasNet based denoising module, and a finite-inpulse-response (FIR) filter based amplification module. A differentiable approximation to the Cambridge MSBG model released in the CEC1 is used in the loss function.

Requirements

To run the training recipe of the amplification module, the MSBG package and PyTorch STOI are needed.

Training

To build the overall system, the Conv-TasNet based denoising module needs to be trained in the first stage, and the scripts are in the recipe_den_convtasnet. The FIR based amplification module is trained in the second stage, and the scripts are in the recipe_amp_fir. The MBSTOI folder contains the MBSTOI implementation from the CEC1 project, with also the DBSTOI implementation.

References

[1] Luo Y, Mesgarani N. Conv-tasnet: Surpassing ideal time–frequency magnitude masking for speech separation[J]. IEEE/ACM transactions on audio, speech, and language processing, 2019, 27(8): 1256-1266.
[2] Andersen A H, de Haan J M, Tan Z H, et al. Refinement and validation of the binaural short time objective intelligibility measure for spatially diverse conditions[J]. Speech Communication, 2018, 102: 1-13.
[3] C.H.Taal, R.C.Hendriks, R.Heusdens, J.Jensen 'A Short-Time Objective Intelligibility Measure for Time-Frequency Weighted Noisy Speech', ICASSP 2010, Texas, Dallas.

Citation

If you use this work, please cite:

@article{tutwo,
  title={A Two-Stage End-to-End System for Speech-in-Noise Hearing Aid Processing},
  author={Tu, Zehai and Zhang, Jisi and Ma, Ning and Barker, Jon},
  year={2021},
  booktitle={The Clarity Workshop on Machine Learning Challenges for Hearing Aids (Clarity-2021)},
}

PyTorch implementation of "A Two-Stage End-to-End System for Speech-in-Noise Hearing Aid Processing"

Related tags

Overview

Implementation of the Sheffield entry for the first Clarity enhancement challenge (CEC1)

Requirements

Training

References

Citation

Owner

The code release of paper Low-Light Image Enhancement with Normalizing Flow

yolov5 deepsort 行人车辆跟踪检测计数

A Transformer-Based Siamese Network for Change Detection

In this project, we'll be making our own screen recorder in Python using some libraries.

Experiments with differentiable stacks and queues in PyTorch

A machine learning malware analysis framework for Android apps.

Sandbox for training deep learning networks

ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees

Predictive Maintenance LSTM

RoFormer_pytorch

[EMNLP 2021] MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations

[NeurIPS 2021] Shape from Blur: Recovering Textured 3D Shape and Motion of Fast Moving Objects

RepMLP: Re-parameterizing Convolutions into Fully-connected Layers for Image Recognition

A simple and extensible library to create Bayesian Neural Network layers on PyTorch.

Code and data for the paper "Hearing What You Cannot See"

Multi-Anchor Active Domain Adaptation for Semantic Segmentation (ICCV 2021 Oral)

On the Limits of Pseudo Ground Truth in Visual Camera Re-Localization

Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games

A collection of awesome resources image-to-image translation.

tensorflow implementation of 'YOLO : Real-Time Object Detection'

PyTorch implementation of "A Two-Stage End-to-End System for Speech-in-Noise Hearing Aid Processing"

Related tags

Overview

Implementation of the Sheffield entry for the first Clarity enhancement challenge (CEC1)

Requirements

Training

References

Citation

Owner

The code release of paper Low-Light Image Enhancement with Normalizing Flow

yolov5 deepsort 行人 车辆 跟踪 检测 计数

A Transformer-Based Siamese Network for Change Detection

In this project, we'll be making our own screen recorder in Python using some libraries.

Experiments with differentiable stacks and queues in PyTorch

A machine learning malware analysis framework for Android apps.

Sandbox for training deep learning networks

ResNEsts and DenseNEsts: Block-based DNN Models with Improved Representation Guarantees

Predictive Maintenance LSTM

RoFormer_pytorch

[EMNLP 2021] MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations

[NeurIPS 2021] Shape from Blur: Recovering Textured 3D Shape and Motion of Fast Moving Objects

RepMLP: Re-parameterizing Convolutions into Fully-connected Layers for Image Recognition

A simple and extensible library to create Bayesian Neural Network layers on PyTorch.

Code and data for the paper "Hearing What You Cannot See"

Multi-Anchor Active Domain Adaptation for Semantic Segmentation (ICCV 2021 Oral)

On the Limits of Pseudo Ground Truth in Visual Camera Re-Localization

Code for Towards Unifying Behavioral and Response Diversity for Open-ended Learning in Zero-sum Games

A collection of awesome resources image-to-image translation.

tensorflow implementation of 'YOLO : Real-Time Object Detection'

yolov5 deepsort 行人车辆跟踪检测计数