graviti
产品服务
解决方案
知识库
公开数据集
关于我们
Korean Single Speaker Speech Dataset
Aesthetics
|...
许可协议: CC-BY-SA 4.0

Overview

[Updated on September 28, 2019] KSS Dataset: Korean Single speaker Speech Dataset

KSS Dataset is designed for the Korean text-to-speech task. It consists of audio files recorded by a professional female voice actoress and their aligned text extracted from my books. As a copyright holder, by courtesy of the publishers, I release this dataset to the public. To my best knowledge, this is the first publicly available speech dataset for Korean.

File Format

Each line in transcript.v.1.3.txt is delimited by | into six fields.

  • A. Audio file path
  • B. Original script
  • C. Expanded script
  • D. Decomposed script
  • E. Audio duration (seconds)
  • F. English translation

e.g.,

1/1_0470.wav|저는 보통 20분 정도 낮잠을 잡니다.|저는 보통 이십 분 정도 낮잠을 잡니다.|저는 보통 이십 분 정도 낮잠을 잡니다.|4.1|I usually take a nap for 20 minutes.

Specification

License

NC-SA 4.0. You CANNOT use this dataset for ANY COMMERCIAL purpose. Otherwise, you can freely use this.

Citation

If you want to cite KSS Dataset, please refer to this:

Kyubyong Park, KSS Dataset: Korean Single speaker Speech Dataset, https://kaggle.com/bryanpark/korean-single-speaker-speech-dataset, 2018

Reference

Check out this for a project using this KSS Dataset.

Contact

You can contact me at kbpark.linguist@gmail.com.

April, 2018.

Kyubyong Park

数据概要
数据格式
image,
数据量
12.855K
文件大小
366.69MB
发布方
Kyubyong Park
| 数据量 12.855K | 大小 366.69MB
Korean Single Speaker Speech Dataset
Aesthetics
许可协议: CC-BY-SA 4.0

Overview

[Updated on September 28, 2019] KSS Dataset: Korean Single speaker Speech Dataset

KSS Dataset is designed for the Korean text-to-speech task. It consists of audio files recorded by a professional female voice actoress and their aligned text extracted from my books. As a copyright holder, by courtesy of the publishers, I release this dataset to the public. To my best knowledge, this is the first publicly available speech dataset for Korean.

File Format

Each line in transcript.v.1.3.txt is delimited by | into six fields.

  • A. Audio file path
  • B. Original script
  • C. Expanded script
  • D. Decomposed script
  • E. Audio duration (seconds)
  • F. English translation

e.g.,

1/1_0470.wav|저는 보통 20분 정도 낮잠을 잡니다.|저는 보통 이십 분 정도 낮잠을 잡니다.|저는 보통 이십 분 정도 낮잠을 잡니다.|4.1|I usually take a nap for 20 minutes.

Specification

License

NC-SA 4.0. You CANNOT use this dataset for ANY COMMERCIAL purpose. Otherwise, you can freely use this.

Citation

If you want to cite KSS Dataset, please refer to this:

Kyubyong Park, KSS Dataset: Korean Single speaker Speech Dataset, https://kaggle.com/bryanpark/korean-single-speaker-speech-dataset, 2018

Reference

Check out this for a project using this KSS Dataset.

Contact

You can contact me at kbpark.linguist@gmail.com.

April, 2018.

Kyubyong Park

0
立即开始构建AI
graviti
wechat-QR
长按保存识别二维码,关注Graviti公众号

Copyright@Graviti
沪ICP备19019574号
沪公网安备 31011002004865号