graviti
产品服务
解决方案
知识库
公开数据集
关于我们
Sinhala [si-lk] ASR
许可协议: CC-BY-SA 4.0

Overview

This dataset was collected for speech technology research.

This dataset was collected from native Sinhala speakers who volunteered to supply the data. The audio was recorded on standard consumer smartphones, in various environments. The audio is delivered in a downsampled lossless format (16kHz, 16 bit, mono, FLAC audio).

Some quality checks have been done on the data, but there might still be mistranscriptions or artifacts in the audio.

Citation

Please use the following citation when referencing the dataset:

@inproceedings{47392,
title = {Crowd-Sourced Speech Corpora for Javanese, Sundanese,  Sinhala, Nepali, and Bangladeshi Bengali},
author = {Oddur Kjartansson and Supheakmungkol Sarin and Knot Pipatsrisawat and Martin Jansche and Linne Ha},
year = {2018},
URL = {http://dx.doi.org/10.21437/SLTU.2018-11},
booktitle = {Proc. The 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages},
pages = {52--55}
}
数据概要
数据格式
sound,
数据量
21.8K
文件大小
--
| 数据量 21.8K | 大小 --
Sinhala [si-lk] ASR
许可协议: CC-BY-SA 4.0

Overview

This dataset was collected for speech technology research.

This dataset was collected from native Sinhala speakers who volunteered to supply the data. The audio was recorded on standard consumer smartphones, in various environments. The audio is delivered in a downsampled lossless format (16kHz, 16 bit, mono, FLAC audio).

Some quality checks have been done on the data, but there might still be mistranscriptions or artifacts in the audio.

Citation

Please use the following citation when referencing the dataset:

@inproceedings{47392,
title = {Crowd-Sourced Speech Corpora for Javanese, Sundanese,  Sinhala, Nepali, and Bangladeshi Bengali},
author = {Oddur Kjartansson and Supheakmungkol Sarin and Knot Pipatsrisawat and Martin Jansche and Linne Ha},
year = {2018},
URL = {http://dx.doi.org/10.21437/SLTU.2018-11},
booktitle = {Proc. The 6th Intl. Workshop on Spoken Language Technologies for Under-Resourced Languages},
pages = {52--55}
}
0
立即开始构建AI
graviti
wechat-QR
长按保存识别二维码,关注Graviti公众号

Copyright@Graviti
沪ICP备19019574号
沪公网安备 31011002004865号