Dataset Versions

v37

2023-09-27 9:17am

Generated on Sep 27, 2023

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (introduced by the PASCAL VOC challenge and also used by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
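
As a rough illustration of what working with the Pascal VOC XML export might look like, the sketch below reads one annotation with Python's standard library. The sample XML, the file name `example.jpg`, the label `widget`, and the helper `parse_voc` are all hypothetical and were written by hand; they are not taken from this dataset.

```python
import xml.etree.ElementTree as ET

# Hypothetical annotation written by hand for illustration only;
# it is not a file from this dataset.
SAMPLE_VOC = """
<annotation>
  <filename>example.jpg</filename>
  <size><width>620</width><height>620</height><depth>3</depth></size>
  <object>
    <name>widget</name>
    <bndbox><xmin>10</xmin><ymin>20</ymin><xmax>110</xmax><ymax>220</ymax></bndbox>
  </object>
</annotation>
"""


def parse_voc(xml_text):
    """Return the file name, image size, and labeled boxes of one annotation."""
    root = ET.fromstring(xml_text.strip())
    boxes = []
    for obj in root.findall("object"):
        bndbox = obj.find("bndbox")
        boxes.append({
            "label": obj.findtext("name"),
            # Some exporters write coordinates as floats, so parse via float first.
            "xmin": int(float(bndbox.findtext("xmin"))),
            "ymin": int(float(bndbox.findtext("ymin"))),
            "xmax": int(float(bndbox.findtext("xmax"))),
            "ymax": int(float(bndbox.findtext("ymax"))),
        })
    return {
        "filename": root.findtext("filename"),
        "width": int(root.findtext("size/width")),
        "height": int(root.findtext("size/height")),
        "boxes": boxes,
    }


annotation = parse_voc(SAMPLE_VOC)
print(annotation["filename"], len(annotation["boxes"]))
```

In a real export there would be one such XML file per image, so the same helper could be applied to each file's contents in turn.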

Dataset Split

Train Set: 5 images (3%)
Valid Set: 1 image (1%)
Test Set: 170 images (97%)
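
The percentages above can be reproduced from the image counts. The short sketch below assumes they are each count divided by the 176-image total and rounded to the nearest whole percent; the helper name `split_percentages` is made up for this example.

```python
def split_percentages(counts):
    """Round each split's share of the total to a whole percent."""
    total = sum(counts.values())
    return {name: round(100 * n / total) for name, n in counts.items()}


counts = {"train": 5, "valid": 1, "test": 170}
print(split_percentages(counts))
# → {'train': 3, 'valid': 1, 'test': 97}
```

Because each share is rounded independently, the three figures add up to 101% rather than 100%.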

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 620x620
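
A stretch resize ignores the original aspect ratio, so the horizontal and vertical scale factors generally differ. The sketch below shows how a bounding box could be mapped into the 620x620 frame under that assumption; `stretch_box` and the example image size are hypothetical, and this is not necessarily how the export tool handles annotations internally.

```python
def stretch_box(box, src_size, dst_size=(620, 620)):
    """Scale an (xmin, ymin, xmax, ymax) box from src_size to dst_size.

    Each axis is scaled independently, which is what "stretch" implies.
    """
    src_w, src_h = src_size
    dst_w, dst_h = dst_size
    sx = dst_w / src_w  # horizontal scale factor
    sy = dst_h / src_h  # vertical scale factor
    xmin, ymin, xmax, ymax = box
    return (xmin * sx, ymin * sy, xmax * sx, ymax * sy)


# Hypothetical 1240x310 source image: x is halved, y is doubled.
print(stretch_box((100, 50, 620, 155), src_size=(1240, 310)))
# → (50.0, 100.0, 310.0, 310.0)
```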

Augmentations

Outputs per training example: 5
Flip: Horizontal
Hue: Between -13° and +13°
Brightness: Between -13% and +13%
Exposure: Between -5% and +5%
Blur: Up to 1px
Noise: Up to 1% of pixels
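
To make the ranges above concrete, the sketch below draws one set of augmentation parameters per output, five per training example. It only samples parameter values and does not touch image data. The uniform sampling, the 50% flip probability, and the names `sample_augmentation` and `augment_plan` are assumptions made for illustration, not the generating tool's documented behavior.

```python
import random


def sample_augmentation(rng):
    """Draw one parameter set within the ranges listed for this version."""
    return {
        "flip_horizontal": rng.random() < 0.5,    # assumption: flip half the time
        "hue_degrees": rng.uniform(-13, 13),      # Hue: -13° to +13°
        "brightness_pct": rng.uniform(-13, 13),   # Brightness: -13% to +13%
        "exposure_pct": rng.uniform(-5, 5),       # Exposure: -5% to +5%
        "blur_px": rng.uniform(0, 1),             # Blur: up to 1px
        "noise_fraction": rng.uniform(0, 0.01),   # Noise: up to 1% of pixels
    }


def augment_plan(n_outputs=5, seed=0):
    """Return n_outputs parameter sets for a single training example."""
    rng = random.Random(seed)  # seeded so the plan is reproducible
    return [sample_augmentation(rng) for _ in range(n_outputs)]


for params in augment_plan():
    print(params)
```

Each dictionary could then be handed to whatever image library applies the actual transforms.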