Dataset Versions

v4

2024-05-29 3:50pm

Generated on May 29, 2024

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
Other Formats
Choose another format.

Dataset Split

Train Set %
0Images
Valid Set 65%
146Images
Test Set 35%
78Images

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 640x640

Augmentations

Outputs per training example: 2
Flip: Horizontal
90° Rotate: Clockwise, Counter-Clockwise
Grayscale: Apply to 15% of images
Hue: Between -15° and +15°
Saturation: Between -25% and +25%
Brightness: Between -15% and +15%
Blur: Up to 2.5px
Noise: Up to 0.1% of pixels