Dataset Versions

v4

2023-04-17 11:35pm

Generated on Apr 17, 2023

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
Other Formats
Choose another format.

Dataset Split

Train Set %
0Images
Valid Set %
0Images
Test Set 100%
3999Images

Preprocessing

Static Crop: 6-92% Horizontal Region, 0-100% Vertical Region
Modify Classes: 9 remapped, 8 dropped

Augmentations

Outputs per training example: 3
Flip: Horizontal, Vertical
90° Rotate: Upside Down
Saturation: Between -16% and +16%
Brightness: Between -19% and +19%
Exposure: Between -11% and +11%
Blur: Up to 6px
Noise: Up to 5% of pixels