MVTec Densely Segmented Supermarket Dataset (MVTec D2S)
The Densely Segmented Supermarket (D2S) dataset is a benchmark for instance-aware semantic segmentation in an industrial domain. It contains 21,000 high-resolution images with pixel-wise labels of all object instances. The objects comprise groceries and everyday products from 60 categories. The benchmark is designed to resemble the real-world setting of an automatic checkout, inventory, or warehouse system. The training images contain only objects of a single class on a homogeneous background, while the validation and test sets are much more complex and diverse. To further benchmark the robustness of instance segmentation methods, the scenes are acquired under different lighting conditions, rotations, and backgrounds.
We ensure that there are no ambiguities in the labels and that every instance is labeled comprehensively. The annotations are pixel-precise and allow using crops of single instances for artificial data augmentation. The dataset covers several challenges highly relevant in the field, such as a limited amount of training data and a high diversity in the test and validation sets.
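To illustrate how pixel-precise masks make it possible to reuse single-instance crops for augmentation, here is a minimal sketch that pastes a masked crop onto another image. It is an assumption-laden illustration and not necessarily the procedure described in the paper: the function name `paste_instance`, the nested-list image representation, and the toy values are all hypothetical and chosen only to keep the example free of dependencies.

```python
def paste_instance(background, source, mask, top, left):
    """Return a copy of `background` with the masked pixels of `source` pasted in.

    Images are row-major nested lists; `mask` holds 0/1 for each source pixel.
    (top, left) is where the crop's upper-left corner lands in the background.
    Pixels that would fall outside the background are skipped.
    """
    out = [row[:] for row in background]  # leave the input untouched
    height, width = len(out), len(out[0])
    for r, mask_row in enumerate(mask):
        for c, inside in enumerate(mask_row):
            y, x = top + r, left + c
            if inside and 0 <= y < height and 0 <= x < width:
                out[y][x] = source[r][c]
    return out


# Toy data (hypothetical, not taken from D2S): a 3x4 background and a 2x2 crop.
background = [[0] * 4 for _ in range(3)]
crop = [[5, 6], [7, 8]]
crop_mask = [[1, 0], [1, 1]]  # the upper-right crop pixel is not part of the instance
augmented = paste_instance(background, crop, crop_mask, top=1, left=2)
```

Because only pixels inside the mask are copied, the background around the pasted object is preserved. A real pipeline would typically work on image arrays and also write a new annotation entry for the pasted instance.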
More information can be found in the corresponding paper and the video below.
For ease of use, the data is provided in the same format as the well-known COCO dataset (http://cocodataset.org).
- Images: The ‘images’ folder contains all images, including the artificially augmented ones described in the paper.
- Download Images (6.0 GB)
- Annotations: Contains the annotations for different training and validation splits.
- Download Annotations (40 MB)
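Since the annotations follow the COCO layout, they can be read with any JSON parser. The sketch below assumes the standard COCO top-level keys (`images`, `annotations`, `categories`) and groups the annotations by image. The helper names `load_coco` and `index_coco` are hypothetical, and the sample dictionary is hand-made for illustration, not real D2S data. No actual annotation file name is assumed.

```python
import json
from collections import defaultdict


def load_coco(path):
    """Read a COCO-style annotation file from `path` (any downloaded split)."""
    with open(path, "r", encoding="utf-8") as f:
        return json.load(f)


def index_coco(coco):
    """Map category ids to names and group annotations by image id."""
    categories = {c["id"]: c["name"] for c in coco.get("categories", [])}
    per_image = defaultdict(list)
    for ann in coco.get("annotations", []):
        per_image[ann["image_id"]].append(ann)
    return categories, dict(per_image)


# Hand-made sample in COCO layout (hypothetical values, not real D2S data).
sample = {
    "images": [{"id": 1, "file_name": "example.jpg", "width": 4, "height": 3}],
    "categories": [{"id": 7, "name": "hypothetical_product"}],
    "annotations": [
        {"id": 10, "image_id": 1, "category_id": 7,
         "bbox": [0, 0, 2, 2], "area": 4, "iscrowd": 0},
        {"id": 11, "image_id": 1, "category_id": 7,
         "bbox": [2, 1, 2, 2], "area": 4, "iscrowd": 0},
    ],
}

categories, per_image = index_coco(sample)
for image in sample["images"]:
    names = [categories[a["category_id"]] for a in per_image.get(image["id"], [])]
    print(image["file_name"], len(names), "instances:", names)
```

For a downloaded split, `index_coco(load_coco(path))` would yield the same two structures. Existing COCO tooling should also work with the files, provided they use only standard COCO fields.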
If you use the D2S dataset in scientific work, please cite:
Patrick Follmann, Tobias Böttger, Philipp Härtinger, Rebecca König, Markus Ulrich: MVTec D2S: Densely Segmented Supermarket Dataset; in: European Conference on Computer Vision (ECCV), 569-585, 2018.
If you use the D2S amodal dataset in scientific work, please cite:
Patrick Follmann, Rebecca König, Philipp Härtinger, Michael Klostermann: Learning to See the Invisible: End-to-End Trainable Amodal Instance Segmentation; CoRR abs/1804.08864, 2018.
The data is released under the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License (CC BY-NC-SA 4.0). To use the data in a way that falls under the commercial-use clause of the license, please contact us.
If you have any questions or comments about the dataset, feel free to contact us via this form.