Dataset Versions

v2

2023-10-05 4:36pm

Generated on Oct 5, 2023

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
Other Formats
Choose another format.

Dataset Split

Train Set %
0Images
Valid Set 21%
35Images
Test Set 79%
133Images

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 640x640

Augmentations

Outputs per training example: 2
Flip: Horizontal
Crop: 0% Minimum Zoom, 15% Maximum Zoom
Rotation: Between -16° and +16°
Brightness: Between -15% and +15%
Exposure: Between -24% and +24%
Blur: Up to 2px
Noise: Up to 2% of pixels