graviti
产品服务
解决方案
知识库
公开数据集
关于我们
aidatatang200zh
Audio
许可协议: CC-BY-NC-ND 4.0

Overview

Aidatatang_200zh is a free Chinese Mandarin speech corpus provided by Beijing DataTang Technology Co., Ltd.The corpus is a subset of a much bigger data (free 1505 hours Chinese Mandarin speech corpus) set which was recorded in the same environment as this open source data. Please visit the website DataTang for more details.

Data Format

The contents and the corresponding descriptions of the corpus include:

  • The corpus contains 200 hours of acoustic data, which is mostly mobile recorded data.
  • 600 speakers from different accent areas in China are invited to participate in the recording.
  • The transcription accuracy for each sentence is larger than 98%.
  • Recordings are conducted in a quiet indoor environment.
  • The database is divided into training set, validation set, and testing set in a ratio of 7: 1: 2.
  • Detail information such as speech data coding and speaker information is preserved in the metadata file.
  • Segmented transcripts are also provided.

License

All datasets on this page are copyright by us and published under the CC BY-NC-ND 4.0 license.

数据概要
数据格式
audio,
数据量
--
文件大小
17.47GB
发布方
DataTang
DataTang is a community of creators-of world-changers and future-builders.
| 数据量 -- | 大小 17.47GB
aidatatang200zh
Audio
许可协议: CC-BY-NC-ND 4.0

Overview

Aidatatang_200zh is a free Chinese Mandarin speech corpus provided by Beijing DataTang Technology Co., Ltd.The corpus is a subset of a much bigger data (free 1505 hours Chinese Mandarin speech corpus) set which was recorded in the same environment as this open source data. Please visit the website DataTang for more details.

Data Format

The contents and the corresponding descriptions of the corpus include:

  • The corpus contains 200 hours of acoustic data, which is mostly mobile recorded data.
  • 600 speakers from different accent areas in China are invited to participate in the recording.
  • The transcription accuracy for each sentence is larger than 98%.
  • Recordings are conducted in a quiet indoor environment.
  • The database is divided into training set, validation set, and testing set in a ratio of 7: 1: 2.
  • Detail information such as speech data coding and speaker information is preserved in the metadata file.
  • Segmented transcripts are also provided.

License

All datasets on this page are copyright by us and published under the CC BY-NC-ND 4.0 license.

0
立即开始构建AI
graviti
wechat-QR
长按保存识别二维码,关注Graviti公众号

Copyright@Graviti
沪ICP备19019574号
沪公网安备 31011002004865号