A Convolutional Transformer for Keyword Spotting

Last update: Jan 27, 2022

Overview

☢️ Audiomer ☢️

Audiomer: A Convolutional Transformer for Keyword Spotting

[ `arXiv` ]	[ `Previous SOTA` ]	[ `Model Architecture` ]

Results on SpeechCommands

Model Architecture

Performer Conv-Attention

Usage

To reproduce the results in the paper, follow these steps:

To download the Speech Commands v2 dataset, run: `python3 datamodules/SpeechCommands12.py`
To train Audiomer-S and Audiomer-L on all three datasets, three times each, run: `python3 run_expts.py`
To evaluate a model on a dataset, run: `python3 evaluate.py --checkpoint_path /path/to/checkpoint.ckpt --model <model type> --dataset <name of dataset>`
For example: `python3 evaluate.py --checkpoint_path ./epoch=300.ckpt --model S --dataset SC20`
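
If you want to launch evaluations from a script, the command above can be assembled programmatically. The following is a minimal sketch, not part of the repository: `build_evaluate_command` is a hypothetical helper name, and it assumes only the three flags shown above (`--checkpoint_path`, `--model`, `--dataset`) and that it is run from the repository root.

```python
import shlex


def build_evaluate_command(checkpoint_path, model, dataset, python="python3"):
    """Build the argv list for evaluate.py (hypothetical helper, not in the repo).

    Only the flags documented in the usage instructions are used.
    """
    return [
        python, "evaluate.py",
        "--checkpoint_path", str(checkpoint_path),
        "--model", model,
        "--dataset", dataset,
    ]


# Values taken from the example in the usage instructions.
cmd = build_evaluate_command("./epoch=300.ckpt", "S", "SC20")

# Print a shell-safe version of the command for inspection.
print(" ".join(shlex.quote(part) for part in cmd))

# To actually run the evaluation from the repository root, uncomment:
# import subprocess
# subprocess.run(cmd, check=True)
```

Passing the arguments as a list rather than a single string avoids shell quoting problems with checkpoint names such as `epoch=300.ckpt`.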

System requirements

NVIDIA GPU with CUDA
Python 3.6 or higher
pytorch_lightning
torchaudio
performer_pytorch
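
Before training, it can help to confirm that the environment meets these requirements. The sketch below is an illustration under stated assumptions, not a script shipped with the repository: `check_environment` is a hypothetical name, the package names are the import names listed above, and the CUDA check assumes PyTorch is importable as `torch`.

```python
import importlib.util
import sys

# Import names taken from the requirements list above.
REQUIRED_PACKAGES = ("pytorch_lightning", "torchaudio", "performer_pytorch")


def check_environment(packages=REQUIRED_PACKAGES, min_python=(3, 6)):
    """Return a dict mapping each requirement to True/False (hypothetical helper)."""
    report = {"python": sys.version_info >= min_python}

    # find_spec locates a package without importing it.
    for name in packages:
        report[name] = importlib.util.find_spec(name) is not None

    # Only query CUDA if torch is installed and imports cleanly.
    report["cuda"] = False
    if importlib.util.find_spec("torch") is not None:
        try:
            import torch
            report["cuda"] = bool(torch.cuda.is_available())
        except (ImportError, OSError):
            pass
    return report


for requirement, ok in check_environment().items():
    print(f"{requirement}: {'ok' if ok else 'MISSING'}")
```

Any line reported as `MISSING` points to a package to install or, for `cuda`, to a GPU or driver that PyTorch cannot see.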
