Dataset Versions

v2

2024-05-23 12:36pm

Generated on May 23, 2024

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
Other Formats
Choose another format.

Dataset Split

Train Set 4%
870Images
Valid Set 0%
81Images
Test Set 0%
40Images

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 640x640
Grayscale: Applied

Augmentations

Outputs per training example: 3
Flip: Horizontal
Rotation: Between -15° and +15°
Shear: ±10° Horizontal, ±10° Vertical
Brightness: Between -20% and +20%
Exposure: Between -10% and +10%
Blur: Up to 2.5px
Noise: Up to 0.5% of pixels