Dataset Versions

v3

2023-05-10 12:27am

Generated on May 10, 2023

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
Other Formats
Choose another format.

Dataset Split

Train Set %
0Images
Valid Set 100%
391Images
Test Set %
0Images

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 640x640
Auto-Adjust Contrast: Using Contrast Stretching

Augmentations

Outputs per training example: 2
Flip: Horizontal
Shear: ±5° Horizontal, ±5° Vertical
Brightness: Between -15% and +15%
Bounding Box: Brightness: Between -8% and +8%
Bounding Box: Blur: Up to 2.5px
Bounding Box: Noise: Up to 4% of pixels