Dataset Versions

v1

2024-05-22 8:17pm

Generated on May 22, 2024

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
Other Formats
Choose another format.

Dataset Split

Train Set 7%
891Images
Valid Set 1%
79Images
Test Set 0%
41Images

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 320x320
Modify Classes: 0 remapped, 1 dropped

Augmentations

Outputs per training example: 3
Rotation: Between -10° and +10°
Grayscale: Apply to 15% of images
Hue: Between -20° and +20°
Saturation: Between -25% and +25%
Brightness: Between -15% and +15%
Exposure: Between -10% and +10%

Similar Projects

See More