Dataset Versions

v2

2024-11-12 7:34am

Generated on Nov 12, 2024

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (introduced by the PASCAL VOC challenge and also used by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
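As a rough illustration of the Pascal VOC XML export, the sketch below reads one annotation with Python's standard library. The element names follow the usual Pascal VOC layout; the sample file name, class name, and coordinates are hypothetical, and `parse_voc` is an illustrative helper rather than part of any export tool.

```python
import xml.etree.ElementTree as ET

# Hypothetical annotation in the usual Pascal VOC layout (values are made up).
SAMPLE_XML = """<annotation>
  <filename>example.jpg</filename>
  <size><width>640</width><height>640</height><depth>3</depth></size>
  <object>
    <name>example-class</name>
    <bndbox><xmin>10</xmin><ymin>20</ymin><xmax>110</xmax><ymax>220</ymax></bndbox>
  </object>
</annotation>"""


def parse_voc(xml_text):
    """Return (filename, [(class_name, (xmin, ymin, xmax, ymax)), ...])."""
    root = ET.fromstring(xml_text)
    boxes = []
    for obj in root.findall("object"):
        bndbox = obj.find("bndbox")
        # Coordinates are stored as text; some exporters write them as floats.
        coords = tuple(
            int(float(bndbox.findtext(tag)))
            for tag in ("xmin", "ymin", "xmax", "ymax")
        )
        boxes.append((obj.findtext("name"), coords))
    return root.findtext("filename"), boxes


filename, boxes = parse_voc(SAMPLE_XML)
print(filename, boxes)
```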

Dataset Split

Train Set: 3% (30 images)
Valid Set: 88% (984 images)
Test Set: 9% (104 images)
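The percentages follow directly from the image counts. A small sketch of the arithmetic, using only the counts listed here (the helper name `split_percentages` is illustrative):

```python
# Image counts per split, as listed in the Dataset Split section.
split_counts = {"train": 30, "valid": 984, "test": 104}


def split_percentages(counts):
    """Return each split's share of the total, rounded to a whole percent."""
    total = sum(counts.values())
    return {name: round(100 * n / total) for name, n in counts.items()}


total_images = sum(split_counts.values())
percentages = split_percentages(split_counts)
print(total_images, percentages)
```

The three splits add up to 1,118 images in total.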

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 640x640
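A stretch resize ignores aspect ratio, so any bounding boxes have to be scaled independently along x and y. The sketch below shows that coordinate mapping only; the function name `stretch_box`, the 1280x960 source size, and the example box are assumptions for illustration, not values from this dataset.

```python
def stretch_box(box, src_size, dst_size=(640, 640)):
    """Scale an (xmin, ymin, xmax, ymax) pixel box for a stretch resize.

    Stretching does not preserve aspect ratio, so the x and y axes
    each get their own scale factor.
    """
    src_w, src_h = src_size
    dst_w, dst_h = dst_size
    scale_x = dst_w / src_w
    scale_y = dst_h / src_h
    xmin, ymin, xmax, ymax = box
    return (xmin * scale_x, ymin * scale_y, xmax * scale_x, ymax * scale_y)


# Hypothetical 1280x960 source image with a single box.
scaled = stretch_box((320, 240, 640, 480), src_size=(1280, 960))
print(scaled)
```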

Augmentations

Outputs per training example: 3
Crop: 0% Minimum Zoom, 27% Maximum Zoom
Shear: ±15° Horizontal, ±15° Vertical
Grayscale: Apply to 25% of images
Hue: Between -24° and +24°
Blur: Up to 0.9px
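To make the ranges above concrete, here is a minimal sketch that draws one set of augmentation parameters per output. It assumes each parameter is sampled uniformly and independently within its stated range, which the settings do not confirm; `sample_augmentation_params` and its key names are hypothetical and do not correspond to any particular tool's API.

```python
import random

OUTPUTS_PER_EXAMPLE = 3  # "Outputs per training example: 3"


def sample_augmentation_params(rng):
    """Draw one parameter set within the listed ranges.

    Assumption: uniform, independent sampling for every parameter.
    """
    return {
        "crop_zoom": rng.uniform(0.0, 0.27),            # 0% to 27% zoom
        "shear_horizontal_deg": rng.uniform(-15.0, 15.0),
        "shear_vertical_deg": rng.uniform(-15.0, 15.0),
        "grayscale": rng.random() < 0.25,               # about 25% of images
        "hue_shift_deg": rng.uniform(-24.0, 24.0),
        "blur_px": rng.uniform(0.0, 0.9),
    }


rng = random.Random(0)  # fixed seed so the draws are repeatable
variants = [sample_augmentation_params(rng) for _ in range(OUTPUTS_PER_EXAMPLE)]
for params in variants:
    print(params)
```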
