CN-Celeb
许可协议:
CC BY-SA 4.0
Overview
This data is a large-scale speaker recognition dataset collected 'in the wild'. The dataset contains more than 130,000 utterances from 1,000 Chinese celebrities, and covers 11 different genres in real world. All the audio files are coded as single channel and sampled at 16kHz with 16-bit precision.
Citation
Please use the following citation when referencing the dataset:
@misc{fan2019cnceleb,
title={CN-CELEB: a challenging Chinese speaker recognition dataset},
author={Yue Fan and Jiawen Kang and Lantian Li and Kaicheng Li and Haolin Chen and Sitong
Cheng and Pengyuan Zhang and Ziya Zhou and Yunqi Cai and Dong Wang},
year={2019},
eprint={1911.01799},
archivePrefix={arXiv},
primaryClass={eess.AS}
}