Overview

We introduce RP2K, a new large-scale retail product dataset for fine-grained image classification. Unlike previous datasets, which focus on relatively few products, RP2K contains more than 500,000 images of retail products on shelves, covering 2000 different products. Our dataset aims to advance research in retail object recognition, which has many applications such as automatic shelf auditing and image-based product information retrieval.

Our dataset has the following properties: (1) It is by far the largest dataset in terms of product categories. (2) All images are captured manually in physical retail stores under natural lighting, matching the conditions of real applications. (3) We provide rich annotations for each object, including size, shape and flavor/scent. We believe our dataset could benefit both computer vision research and the retail industry.

Overview information of the RP2K dataset

Categorized information of the RP2K dataset

Data Collection

Pipeline of our data collection process. Our photo collectors were first sent to over 500 different retail stores, where they collected over 10k high-resolution shelf images. We then used a pre-trained detection model to extract the bounding boxes of potential objects of interest. After that, our human annotators discarded the incorrect bounding boxes, including heavily occluded images and images that do not show a valid retail product. The remaining images were then labeled by the annotators.
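The cropping and filtering steps described above can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' code: the `Box` type, the `crop_regions` helper and the `min_side` threshold are hypothetical, and the image is represented as a plain nested list of pixel rows so that no imaging library is needed. The detection model and the human review step are not modeled.

```python
from dataclasses import dataclass
from typing import List, Sequence


@dataclass(frozen=True)
class Box:
    """Axis-aligned box in pixels (assumed layout: x1/y1 inclusive, x2/y2 exclusive)."""
    x1: int
    y1: int
    x2: int
    y2: int


def clamp_box(box: Box, width: int, height: int) -> Box:
    """Clip a box so that it lies inside an image of the given size."""
    return Box(
        max(0, min(box.x1, width)),
        max(0, min(box.y1, height)),
        max(0, min(box.x2, width)),
        max(0, min(box.y2, height)),
    )


def crop_regions(
    image: Sequence[Sequence[int]],
    boxes: Sequence[Box],
    min_side: int = 2,  # hypothetical threshold for "too small to be useful"
) -> List[List[List[int]]]:
    """Crop each detected box from the shelf image, skipping degenerate boxes."""
    height = len(image)
    width = len(image[0]) if height else 0
    crops = []
    for box in boxes:
        b = clamp_box(box, width, height)
        if b.x2 - b.x1 < min_side or b.y2 - b.y1 < min_side:
            continue  # box collapsed after clipping, or is too small
        crops.append([list(row[b.x1:b.x2]) for row in image[b.y1:b.y2]])
    return crops
```

In a real pipeline the surviving crops would still go to human annotators, who reject occluded or invalid objects before labeling.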

Data Annotation

Our dataset contains two components: the original shelf images and the individual object images cropped from the shelf images. The shelf images are labeled with the shelf type, store ID, and a list of bounding boxes of objects of interest. For each image cropped from its bounding box, we provide rich annotations, including the SKU ID, product name, brand, product type, shape, size, flavor/scent and the bounding box reference to its corresponding shelf image. Fig. 5 shows some sample attributes of the object images. Note that some attributes may not be applicable to particular products.
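One way to picture the two annotation levels is as a pair of record types. The sketch below is an assumption made for illustration: the class names, field names and the `resolve_bbox` helper are hypothetical and do not reflect the actual file layout of the dataset. Attributes that may not apply to a product are modeled as optional.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional, Tuple

BBox = Tuple[int, int, int, int]  # assumed (x1, y1, x2, y2) in pixels


@dataclass
class ShelfImage:
    """Shelf-level annotation: shelf type, store ID and detected boxes."""
    image_id: str
    shelf_type: str
    store_id: str
    boxes: List[BBox] = field(default_factory=list)


@dataclass
class ObjectImage:
    """Object-level annotation for one crop taken from a shelf image."""
    sku_id: str
    product_name: str
    brand: str
    product_type: str
    shelf_image_id: str  # reference back to the source shelf image
    bbox_index: int      # position of this object's box in ShelfImage.boxes
    shape: Optional[str] = None
    size: Optional[str] = None
    flavor_scent: Optional[str] = None  # not applicable to every product


def resolve_bbox(obj: ObjectImage, shelves: Dict[str, ShelfImage]) -> BBox:
    """Look up the bounding box that an object image was cropped from."""
    return shelves[obj.shelf_image_id].boxes[obj.bbox_index]
```

Storing a reference to the shelf image instead of copying the box keeps the shelf-level and object-level annotations consistent with each other.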

We also provide a meta-category label for each object image, in two different ways. The first categorizes objects by product type, which reflects the placement of the products, i.e., products of the same type are usually placed on the same or nearby shelves. We include 6 meta-categories by product type: dairy, liquor, beer, cosmetics, non-alcoholic drinks and seasoning.

The second categorizes objects by product shape. We include 7 shapes (bottle, can, box, bag, jar, handled bottle and pack), which cover all shapes that appear in our dataset. These 7 shapes are also used to train our pre-annotation detector. Sample images for the different meta-categories are shown in Fig. 4.

Besides these two meta-categorization methods, our rich labels give users the option to evaluate their algorithms at a customized fine-grained level.
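Evaluating at a chosen granularity can be sketched as mapping each label to a group key before comparing. The snippet below is a hedged illustration only: the two tuples list the meta-categories named above, while `accuracy_at_level` and the dictionary keys (`sku`, `brand`, `shape`) are hypothetical names, not part of any released tooling.

```python
from typing import Callable, Dict, Hashable, Sequence

# Meta-categories as listed in the dataset description.
PRODUCT_TYPES = ("dairy", "liquor", "beer", "cosmetics",
                 "non-alcoholic drinks", "seasoning")
SHAPES = ("bottle", "can", "box", "bag", "jar", "handled bottle", "pack")

Label = Dict[str, str]  # assumed: one dict of attributes per object image


def accuracy_at_level(
    truth: Sequence[Label],
    predicted: Sequence[Label],
    to_group: Callable[[Label], Hashable],
) -> float:
    """Fraction of predictions that fall in the same group as the ground truth."""
    if len(truth) != len(predicted):
        raise ValueError("truth and predicted must have the same length")
    if not truth:
        raise ValueError("cannot compute accuracy on an empty set")
    hits = sum(to_group(t) == to_group(p) for t, p in zip(truth, predicted))
    return hits / len(truth)


# Example: the same predictions scored at SKU level and at brand level.
truth = [
    {"sku": "A1", "brand": "X", "shape": "bottle"},
    {"sku": "B1", "brand": "Y", "shape": "can"},
]
predicted = [
    {"sku": "A2", "brand": "X", "shape": "bottle"},  # wrong SKU, right brand
    {"sku": "B1", "brand": "Y", "shape": "can"},
]
sku_accuracy = accuracy_at_level(truth, predicted, lambda l: l["sku"])
brand_accuracy = accuracy_at_level(truth, predicted, lambda l: l["brand"])
```

Passing a different `to_group` function, for example one returning a `(brand, shape)` tuple, yields an intermediate level without changing the evaluation code.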

Data Format

Sample images from our dataset. Precise retail product recognition on shelves is considered highly challenging because: (a) Products from the same line may come in different sizes that usually look similar but are priced differently, and the image size does not reflect the real size of the product. (b) Manufacturers usually make multiple flavors for one product line, and their appearance differs only subtly on the labels. (c) Product images may be captured at different camera angles depending on their placement on the shelves, and the image can also be stretched by camera distortion.

Citation

This dataset and the code packages are free for academic use. You can run them at your own risk. For other purposes, please contact the corresponding author, Jingtian Peng (pjt@pinlandata.com).

@article{peng2020rp2k,
 title={RP2K: A Large-Scale Retail Product Dataset for Fine-Grained Image Classification},
 author={Peng, Jingtian and Xiao, Chang and Wei, Xun and Li, Yifan},
 journal={arXiv preprint arXiv:2006.12634},
 year={2020}
}
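
Dataset Information

Application scenario: Smart Retailing
Annotation type: Classification
Task type: None
License: Custom
Last updated: 2021-03-24 23:09:31

Data Summary

Data format: Image
Number of data items: 378.54K
Number of annotated items: 380535
File size: 6GB
Copyright owner: Pinlan
Annotated by: Testin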