CN-Celeb
Audio
NLP
|...
许可协议: CC BY-SA 4.0

Overview

This data is a large-scale speaker recognition dataset collected 'in the wild'. The dataset contains more than 130,000 utterances from 1,000 Chinese celebrities, and covers 11 different genres in real world. All the audio files are coded as single channel and sampled at 16kHz with 16-bit precision.

Citation

Please use the following citation when referencing the dataset:

@misc{fan2019cnceleb,
  title={CN-CELEB: a challenging Chinese speaker recognition dataset},
  author={Yue Fan and Jiawen Kang and Lantian Li and Kaicheng Li and Haolin Chen and Sitong
Cheng and Pengyuan Zhang and Ziya Zhou and Yunqi Cai and Dong Wang},
  year={2019},
  eprint={1911.01799},
  archivePrefix={arXiv},
  primaryClass={eess.AS}
}

License

CC BY-SA 4.0

数据概要
数据格式
Audio,
数据量
--
文件大小
29.66GB
发布方
CSLT at Tsinghua University
The Center for Speech and Language Technology (CSLT), Tsinghua University, was established with the goal of conducting cut-edging research on intelligent human-machine interactions, particularly the research on speech and language techniques.
数据集反馈
立即开始构建AI