graviti logo产品公开数据集关于我们
登录
292
0
13
Kannada-MNIST
创建来自Hello Dataset
概要
代码
活动

Overview

Here, we disseminate a new handwritten digits-dataset, termed Kannada-MNIST, for the Kannada script, that can potentially serve as a direct drop-in replacement for the original MNIST dataset.

Data Collection

This dataset is based off of the efforts of 65 volunteers from Bangalore, India, who are native speakers and users of the Kannada language and the script. This was curated to serve as a direct one-to-one drop-in replacement for the original MNIST dataset (akin to Fashion-MNIST and K-MNIST datasets).

65 volunteers were recruited in Bangalore, India, who were native speakers of the language as well as day-to-day users of the numeral script. Each volunteer filled out an A3 sheet containing a 32 × 40 grid. This yielded filled-out A3 sheets containing 128 instances of each number which we posit is large enough to capture most of the natural intra-volunteer variations of the glyph shapes. All of the sheets thus collected were scanned at 600 dots-per-inch resolution using the Konica Accurio-Press-C6085 scanner that yielded 65 4963 × 3509 png images.

Data Format

The main Kannada-MNIST dataset that consists of a training set of 60000 28 × 28 gray-scale sample images.

Citation

Please use the following citation when referencing the dataset:

@article{prabhu2019kannada,
  title={Kannada-MNIST: A new handwritten digits dataset for the Kannada language},
  author={Prabhu, Vinay Uday},
  journal={arXiv preprint arXiv:1908.01242},
  year={2019}
}
🎉感谢Hello Dataset的贡献
数据集信息
应用场景MNIST
标注类型Classification
任务类型暂无
LicenseUnknown
更新时间2021-03-24 22:54:31
数据概要
数据格式Image
数据数量60k
已标注数量0
文件大小64KB
版权归属方
Vinay Uday Prabhu
标注方
未知
了解更多和支持
相关数据集
MultiMNIST
创建来自Robert
EMNIST
创建来自Robert
MNIST
创建来自AChenQ
立即开始构建AI
免费开始联系我们