graviti
产品服务
解决方案
知识库
公开数据集
关于我们
AISHELL1
Audio
许可协议: Apache License 2.0

Overview

This Open Source Mandarin Speech Corpus, AISHELL-ASR0009-OS1, is 178 hours long. It is a part of AISHELL-ASR0009, of which utterance contains 11 domains, including smart home, autonomous driving, and industrial production. The whole recording was put in quiet indoor environment, using 3 different devices at the same time: high fidelity microphone (44.1kHz, 16-bit,); Android-system mobile phone (16kHz, 16-bit), iOS-system mobile phone (16kHz, 16-bit). Audios in high fidelity were re-sampled to 16kHz to build AISHELL- ASR0009-OS1. 400 speakers from different accent areas in China were invited to participate in the recording. The manual transcription accuracy rate is above 95%, through professional speech annotation and strict quality inspection. The corpus is divided into training, development and testing sets.

Data Format

/readme.txt

/SPEECHDATA

​ +—— /S0252

​ +—— /S0252_mic #高保真数据

​ +—— BAC009S0252W0001.wav

​ +—— BAC009S0252W0001.txt

Citation

Please use the following citation when referencing the dataset:

@article{DBLP:journals/corr/abs-1709-05522,
  author    = {Hui Bu and
               Jiayu Du and
               Xingyu Na and
               Bengu Wu and
               Hao Zheng},
  title     = {{AISHELL-1:} An Open-Source Mandarin Speech Corpus and {A} Speech
               Recognition Baseline},
  journal   = {CoRR},
  volume    = {abs/1709.05522},
  year      = {2017},
  url       = {http://arxiv.org/abs/1709.05522},
  archivePrefix = {arXiv},
  eprint    = {1709.05522},
  timestamp = {Mon, 13 Aug 2018 16:46:31 +0200},
  biburl    = {https://dblp.org/rec/journals/corr/abs-1709-05522.bib},
  bibsource = {dblp computer science bibliography, https://dblp.org}
}

License

This dataset is published under Apache License 2.0 License.

数据概要
数据格式
audio,
数据量
--
文件大小
14.51GB
发布方
AISHELL
Aishell is an innovative company focusing on artificial intelligence, big data and technical services.
| 数据量 -- | 大小 14.51GB
AISHELL1
Audio
许可协议: Apache License 2.0

Overview

This Open Source Mandarin Speech Corpus, AISHELL-ASR0009-OS1, is 178 hours long. It is a part of AISHELL-ASR0009, of which utterance contains 11 domains, including smart home, autonomous driving, and industrial production. The whole recording was put in quiet indoor environment, using 3 different devices at the same time: high fidelity microphone (44.1kHz, 16-bit,); Android-system mobile phone (16kHz, 16-bit), iOS-system mobile phone (16kHz, 16-bit). Audios in high fidelity were re-sampled to 16kHz to build AISHELL- ASR0009-OS1. 400 speakers from different accent areas in China were invited to participate in the recording. The manual transcription accuracy rate is above 95%, through professional speech annotation and strict quality inspection. The corpus is divided into training, development and testing sets.

Data Format

/readme.txt

/SPEECHDATA

​ +—— /S0252

​ +—— /S0252_mic #高保真数据

​ +—— BAC009S0252W0001.wav

​ +—— BAC009S0252W0001.txt

Citation

Please use the following citation when referencing the dataset:

@article{DBLP:journals/corr/abs-1709-05522,
  author    = {Hui Bu and
               Jiayu Du and
               Xingyu Na and
               Bengu Wu and
               Hao Zheng},
  title     = {{AISHELL-1:} An Open-Source Mandarin Speech Corpus and {A} Speech
               Recognition Baseline},
  journal   = {CoRR},
  volume    = {abs/1709.05522},
  year      = {2017},
  url       = {http://arxiv.org/abs/1709.05522},
  archivePrefix = {arXiv},
  eprint    = {1709.05522},
  timestamp = {Mon, 13 Aug 2018 16:46:31 +0200},
  biburl    = {https://dblp.org/rec/journals/corr/abs-1709-05522.bib},
  bibsource = {dblp computer science bibliography, https://dblp.org}
}

License

This dataset is published under Apache License 2.0 License.

0
立即开始构建AI
graviti
wechat-QR
长按保存识别二维码,关注Graviti公众号

Copyright@Graviti
沪ICP备19019574号
沪公网安备 31011002004865号