The LJ Speech Dataset
Created by Hello Dataset / Robert

Overview

This is a public domain speech dataset consisting of 13,100 short audio clips of a single speaker reading passages from 7 non-fiction books. A transcription is provided for each clip. Clips vary in length from 1 to 10 seconds and have a total length of approximately 24 hours.

Data Collection

Total Clips: 13,100
Total Words: 225,715
Total Characters: 1,308,678
Total Duration: 23:55:17
Mean Clip Duration: 6.57 sec
Min Clip Duration: 1.11 sec
Max Clip Duration: 10.10 sec
Mean Words per Clip: 17.23
Distinct Words: 13,821
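
The two mean values in the statistics above can be cross-checked against the totals. The following is a small sketch of that arithmetic in Python; the variable names are illustrative and not part of the dataset.

```python
# Totals as listed in the statistics above.
total_clips = 13_100
total_words = 225_715
hours, minutes, seconds = 23, 55, 17  # Total Duration 23:55:17

# Convert the total duration to seconds, then divide by the clip count.
total_seconds = hours * 3600 + minutes * 60 + seconds
mean_clip_seconds = total_seconds / total_clips
mean_words_per_clip = total_words / total_clips

print(f"total duration: {total_seconds} s")
print(f"mean clip duration: {mean_clip_seconds:.2f} s")
print(f"mean words per clip: {mean_words_per_clip:.2f}")
```

Both results round to the listed figures of 6.57 seconds and 17.23 words per clip.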

Data Format

Metadata is provided in transcripts.csv. This file consists of one record per line, delimited by the pipe character (0x7c). The fields are:

  1. ID: this is the name of the corresponding .wav file
  2. Transcription: words spoken by the reader (UTF-8)
  3. Normalized Transcription: transcription with numbers, ordinals, and monetary units expanded into full words (UTF-8).
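
A minimal sketch of reading this three-field, pipe-delimited layout with Python's standard `csv` module follows. The `Record` type, the `read_records` helper, and the two sample lines are hypothetical and made up for illustration; they are not taken from the dataset. Quoting is disabled on the assumption that quotation marks in a transcription are literal text rather than CSV quoting.

```python
import csv
import io
from typing import Iterable, Iterator, NamedTuple


class Record(NamedTuple):
    """One metadata line (hypothetical type, for illustration only)."""
    clip_id: str
    transcription: str
    normalized: str


def read_records(lines: Iterable[str]) -> Iterator[Record]:
    """Yield one Record per non-empty pipe-delimited line."""
    # QUOTE_NONE: treat quotation marks as ordinary characters (assumption).
    reader = csv.reader(lines, delimiter="|", quoting=csv.QUOTE_NONE)
    for row in reader:
        if not row:
            continue  # skip blank lines
        if len(row) != 3:
            raise ValueError(f"expected 3 fields, got {len(row)}: {row!r}")
        yield Record(*row)


# Made-up sample lines in the documented layout (not real dataset content).
SAMPLE = (
    "clip-0001|He paid $5 in 1868.|He paid five dollars in eighteen sixty-eight.\n"
    'clip-0002|She said "no".|She said "no".\n'
)

records = list(read_records(io.StringIO(SAMPLE)))
for record in records:
    print(record.clip_id, "->", record.normalized)
```

To read the real file, the in-memory sample could be replaced with `open("transcripts.csv", encoding="utf-8", newline="")`. Whether the ID field includes the `.wav` extension should be checked against the actual data.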

Each audio file is a single-channel 16-bit PCM WAV with a sample rate of 22050 Hz.
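
The stated audio format can be checked with Python's standard `wave` module. The sketch below is illustrative only: `describe_wav` and `matches_expected` are hypothetical helper names, and the clip it inspects is one second of synthetic silence generated in memory, not a file from the dataset.

```python
import io
import wave

# Format stated above: mono, 16-bit (2 bytes per sample), 22050 Hz.
EXPECTED = {"channels": 1, "sample_width_bytes": 2, "sample_rate_hz": 22050}


def describe_wav(source) -> dict:
    """Return basic format information for a WAV path or file-like object."""
    with wave.open(source, "rb") as wav:
        return {
            "channels": wav.getnchannels(),
            "sample_width_bytes": wav.getsampwidth(),
            "sample_rate_hz": wav.getframerate(),
            "duration_sec": wav.getnframes() / wav.getframerate(),
        }


def matches_expected(info: dict) -> bool:
    """True if channels, sample width, and sample rate match EXPECTED."""
    return all(info[key] == value for key, value in EXPECTED.items())


# Build a synthetic one-second silent clip in memory for demonstration.
buffer = io.BytesIO()
with wave.open(buffer, "wb") as out:
    out.setnchannels(1)
    out.setsampwidth(2)
    out.setframerate(22050)
    out.writeframes(b"\x00\x00" * 22050)
buffer.seek(0)

info = describe_wav(buffer)
print(info, "matches expected format:", matches_expected(info))
```

Passing the path of a downloaded clip to `describe_wav` instead of the buffer would run the same check on real data.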

Citation

@misc{ljspeech17,
  author       = {Keith Ito and Linda Johnson},
  title        = {The LJ Speech Dataset},
  howpublished = {\url{https://keithito.com/LJ-Speech-Dataset/}},
  year         = 2017
}

License

Custom

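Dataset Information

Application: NLP
Annotation Type: Text
License: Custom
Updated: 2021-03-24 22:54:05

Data Summary

Data Format: Audio
Data Count: 0
File Size: 3MB
Annotation Count: 0
Copyright Holder: Keith Ito
Annotator: Unknown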