Year: 2022
Region: International
Access: Open
License: CC-BY-4.0
Language: English
FAD: A Chinese Dataset for Fake Audio Detection
Haoxin Ma; Jiangyan Yi
Fake audio detection is a growing concern, and some relevant datasets have been designed for research. But there is no standard public Chinese dataset under additive-noise conditions. In this paper, we aim to fill this gap and design a Chinese fake audio detection dataset (FAD) for studying more generalized detection methods. Twelve mainstream speech generation techniques are used to generate fake audio. To simulate real-life scenarios, three noise datasets are selected for noise addition at five different signal-to-noise ratios. The FAD dataset can be used not only for fake audio detection, but also for detecting the algorithms behind fake utterances for audio forensics. Baseline results are presented with analysis. The results show that fake audio detection methods with good generalization remain challenging.
The FAD dataset is publicly available. The source code of the baselines is available on GitHub: https://github.com/ADDchallenge/FAD
The FAD dataset is designed for evaluating fake audio detection methods, fake-algorithm recognition, and other related studies. To better study the robustness of these methods under the noisy conditions encountered in real life, we construct a corresponding noisy dataset. The full FAD dataset thus comes in two versions: a clean version and a noisy version. Both versions are divided into disjoint training, development, and test sets in the same way, with no speaker overlap across the three subsets. Each test set is further divided into seen and unseen test sets. The unseen test sets evaluate the generalization of the methods to unknown types; notably, both the real audio and the fake audio in the unseen test set are unknown to the model.
For the noisy speech part, we select three noise databases for simulation. Additive noise is added to each audio in the clean dataset at 5 different SNRs. The additive noise of the unseen test set and that of the remaining subsets come from different noise databases. In each version of the FAD dataset, there are 138,400 utterances in the training set, 14,400 in the development set, 42,000 in the seen test set, and 21,000 in the unseen test set. More detailed statistics are given in Table 2.
Clean Real Audios Collection
To eliminate the interference of irrelevant factors, we collect clean real audio from two sources: 5 open resources from the OpenSLR platform (http://www.openslr.org/12/) and one self-recorded dataset.
Clean Fake Audios Generation
We select 11 representative speech synthesis methods to generate fully fake audio, plus one method that produces partially fake audio.
Noisy Audios Simulation
Noisy audio quantifies the robustness of the methods under noisy conditions. To simulate real-life scenarios, we sample noise signals and add them to the clean audio at 5 different SNRs: 0 dB, 5 dB, 10 dB, 15 dB, and 20 dB. Additive noise is selected from three noise databases: PNL 100 Nonspeech Sounds, NOISEX-92, and TAU Urban Acoustic Scenes.
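The SNR-controlled noise addition described above can be sketched as follows. This is a minimal illustration of the standard mixing formula, not the authors' actual pipeline; the function name and array-based interface are assumptions.

```python
import numpy as np

def mix_at_snr(clean: np.ndarray, noise: np.ndarray, snr_db: float) -> np.ndarray:
    """Scale `noise` so the mixture reaches the requested SNR, then add it.

    SNR(dB) = 10 * log10(P_clean / P_noise), so the noise is scaled by
    sqrt(P_clean / (P_noise * 10**(snr_db / 10))).
    """
    # Tile or trim the noise to match the clean signal's length.
    if len(noise) < len(clean):
        noise = np.tile(noise, int(np.ceil(len(clean) / len(noise))))
    noise = noise[: len(clean)]

    p_clean = np.mean(clean ** 2)
    p_noise = np.mean(noise ** 2)
    scale = np.sqrt(p_clean / (p_noise * 10 ** (snr_db / 10)))
    return clean + scale * noise

# The five SNR levels used for FAD's noisy version.
SNRS_DB = [0, 5, 10, 15, 20]
```

Applying `mix_at_snr` once per SNR level to each clean utterance yields the five noisy conditions described above.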
This dataset is licensed under a CC BY-NC-ND 4.0 license.
You can cite the data using the following BibTeX entry:
@inproceedings{ma2022fad,
title={FAD: A Chinese Dataset for Fake Audio Detection},
author={Haoxin Ma and Jiangyan Yi and Chenglong Wang and Xunrui Yan and Jianhua Tao and Tao Wang and Shiming Wang and Le Xu and Ruibo Fu},
booktitle={Submitted to the 36th Conference on Neural Information Processing Systems (NeurIPS 2022) Track on Datasets and Benchmarks},
year={2022},
}
Data generation history
Related project/paper information
- Project/paper information from which this research data was derived.
- Related project/paper information referenced in producing this research data.
Source information
- http://dx.doi.org/10.5281/zenodo.6641573
- http://dx.doi.org/10.5281/zenodo.6635521
- http://dx.doi.org/10.5281/zenodo.6623227
- Provider/Repository: OpenAIRE
- License: CC-BY-4.0