LibriSpeech ASR
Audio
NLP
|...
许可协议: CC BY 4.0

Overview

LibriSpeech is a corpus of approximately 1000 hours of 16kHz read English speech, prepared by Vassil Panayotov with the assistance of Daniel Povey. The data is derived from read audiobooks from the LibriVox project, and has been carefully segmented and aligned.

Citation

Please use the following citation when referencing the dataset:

@inproceedings{inproceedings,
author = {Panayotov, Vassil and Chen, Guoguo and Povey, Daniel and Khudanpur, Sanjeev},
year = {2015},
month = {04},
pages = {5206-5210},
title = {Librispeech: An ASR corpus based on public domain audio books},
doi = {10.1109/ICASSP.2015.7178964}
}

License

CC BY 4.0

数据概要
数据格式
Audio,
数据量
--
文件大小
140.02GB
发布方
Center for Language and Speech Processing
The Johns Hopkins Center for Language and Speech Processing (CLSP) is an interdisciplinary research and educational center focused on the science and technology of language and speech.
数据集反馈
立即开始构建AI