graviti logo产品公开数据集关于我们
登录
381
0
17
AISHELL1
创建来自Hello Dataset / Robert
概要
活动

Overview

This Open Source Mandarin Speech Corpus, AISHELL-ASR0009-OS1, is 178 hours long. It is a part of AISHELL-ASR0009, of which utterance contains 11 domains, including smart home, autonomous driving, and industrial production. The whole recording was put in quiet indoor environment, using 3 different devices at the same time: high fidelity microphone (44.1kHz, 16-bit,); Android-system mobile phone (16kHz, 16-bit), iOS-system mobile phone (16kHz, 16-bit). Audios in high fidelity were re-sampled to 16kHz to build AISHELL- ASR0009-OS1.400 speakers from different accent areas in China were invited to participate in the recording. The manual transcription accuracy rate is above 95%,through professional speech annotation and strict quality inspection. The corpus is divided into training, development and testing sets.

Data Format

/readme.txt

/SPEECHDATA

​ +—— /S0252

​ +—— /S0252_mic #高保真数据

​ +—— BAC009S0252W0001.wav

​ +—— BAC009S0252W0001.txt

Citation

Please use the following citation when referencing the dataset:

@article{DBLP:journals/corr/abs-1709-05522,
  author    = {Hui Bu and
               Jiayu Du and
               Xingyu Na and
               Bengu Wu and
               Hao Zheng},
  title     = {{AISHELL-1:} An Open-Source Mandarin Speech Corpus and {A} Speech
               Recognition Baseline},
  journal   = {CoRR},
  volume    = {abs/1709.05522},
  year      = {2017},
  url       = {http://arxiv.org/abs/1709.05522},
  archivePrefix = {arXiv},
  eprint    = {1709.05522},
  timestamp = {Mon, 13 Aug 2018 16:46:31 +0200},
  biburl    = {https://dblp.org/rec/journals/corr/abs-1709-05522.bib},
  bibsource = {dblp computer science bibliography, https://dblp.org}
}

License

Apache-2.0

数据集信息
应用场景NLP
标注类型Audio
LicenseApache-2.0
更新时间2021-03-24 22:46:50
数据概要
数据格式Audio
数据数量0
文件大小15MB
标注数量0
版权归属方
AISHELL
标注方
未知
了解更多和支持
立即开始构建AI
免费开始联系我们