Dataset Versions

v3

2023-11-23 5:56pm

Generated on Nov 23, 2023

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
Other Formats
Choose another format.

Dataset Split

Train Set %
0Images
Valid Set 51%
405Images
Test Set 49%
390Images

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 640x640

Augmentations

Outputs per training example: 3
Rotation: Between -33° and +33°
Saturation: Between -50% and +50%
Noise: Up to 12% of pixels
Cutout: 8 boxes with 16% size each