Dataset Versions

v5

Validation_Main2

Generated on Aug 3, 2024

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
Other Formats
Choose another format.

Dataset Split

Train Set %
0Images
Valid Set 100%
937Images
Test Set %
0Images

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 128x128

Augmentations

Outputs per training example: 3
Shear: ±10° Horizontal, ±10° Vertical
Brightness: Between -5% and +5%
Exposure: Between -6% and +6%
Blur: Up to 0.7px