FMFCC-A

This project is the description of FMFCC-A (audio track of FMFCC) dataset and Challenge resluts.

The FMFCC-A dataset is shared through BaiduCloud (website: https://pan.baidu.com/s/1CGPkC8VfjXVBZjluEHsW6g , password: IIES). The FMFCC-A dataset is by far the largest publicly available Mandarin dataset for synthetic speech detection, which contains 40,000 synthesized Mandarin utterances that generated by 11 Mandarin TTS systems and two Mandarin VC systems, and 10,000 genuine Mandarin utterances collected from 58 speakers. In addition, the official website of FMFCC-A (Audio track of the first fake media forensic challenge of China Society of Image and Graphics) is http://fmfcc.net/ . We hope that the FMFCC-A dataset can fill the gap of lack of Mandarin datasets for synthetic speech detection under various audio post-processing operations.

If you find the code or dataset is usefull, please cite the following papers: FMFCC-A: A Challenging Mandarin Dataset for Synthetic Speech Detection

The description of FMFCC-A (audio track of FMFCC) dataset and Challenge resluts.

Related tags

Overview

FMFCC-A

Owner

Non-Official Pytorch implementation of "Face Identity Disentanglement via Latent Space Mapping" https://arxiv.org/abs/2005.07728 Using StyleGAN2 instead of StyleGAN

Info and sample codes for "NTU RGB+D Action Recognition Dataset"

Source code of NeurIPS 2021 Paper ''Be Confident! Towards Trustworthy Graph Neural Networks via Confidence Calibration''

Code release for The Devil is in the Channels: Mutual-Channel Loss for Fine-Grained Image Classification (TIP 2020)

Stereo Radiance Fields (SRF): Learning View Synthesis for Sparse Views of Novel Scenes

ShuttleNet: Position-aware Fusion of Rally Progress and Player Styles for Stroke Forecasting in Badminton (AAAI 2022)

A Pytorch implementation of the multi agent deep deterministic policy gradients (MADDPG) algorithm

make ASCII Art by Deep Learning

CAPITAL: Optimal Subgroup Identification via Constrained Policy Tree Search

Classifying cat and dog images using Kaggle dataset

Deep Dual Consecutive Network for Human Pose Estimation (CVPR2021)

toroidal - a lightweight transformer library for PyTorch

[TIP 2021] SADRNet: Self-Aligned Dual Face Regression Networks for Robust 3D Dense Face Alignment and Reconstruction

Orthogonal Over-Parameterized Training

Deep ViT Features as Dense Visual Descriptors

A TensorFlow implementation of Neural Program Synthesis from Diverse Demonstration Videos

An implementation of the research paper "Retina Blood Vessel Segmentation Using A U-Net Based Convolutional Neural Network"

Implementation of Analyzing and Improving the Image Quality of StyleGAN (StyleGAN 2) in PyTorch

Wav2Vec for speech recognition, classification, and audio classification

SSPNet: Scale Selection Pyramid Network for Tiny Person Detection from UAV Images.