Dataset Versions

v7

2023-12-05 8:13pm

Generated on Dec 5, 2023

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (introduced by the PASCAL VOC challenge and also used by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
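
As a rough illustration of the Pascal VOC XML format listed above, the sketch below reads bounding boxes from one annotation using only the Python standard library. The sample annotation (file name, label, coordinates) and the helper name `parse_voc` are made up for illustration and are not taken from this dataset.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample annotation: file name, label, and coordinates are invented.
SAMPLE_XML = """<annotation>
  <filename>example.jpg</filename>
  <size><width>640</width><height>640</height><depth>3</depth></size>
  <object>
    <name>widget</name>
    <bndbox><xmin>10</xmin><ymin>20</ymin><xmax>110</xmax><ymax>220</ymax></bndbox>
  </object>
</annotation>"""


def parse_voc(xml_text):
    """Return (filename, boxes) for one Pascal VOC annotation string."""
    root = ET.fromstring(xml_text)
    boxes = []
    for obj in root.iter("object"):
        bb = obj.find("bndbox")
        boxes.append({
            "label": obj.findtext("name"),
            # Some exporters write coordinates as floats, so parse via float first.
            "xmin": int(float(bb.findtext("xmin"))),
            "ymin": int(float(bb.findtext("ymin"))),
            "xmax": int(float(bb.findtext("xmax"))),
            "ymax": int(float(bb.findtext("ymax"))),
        })
    return root.findtext("filename"), boxes


filename, boxes = parse_voc(SAMPLE_XML)
print(filename, boxes)
```

For files on disk, `ET.parse(path).getroot()` could replace `ET.fromstring`.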

Dataset Split

Train Set: 5% (30 images)
Valid Set: 1% (3 images)
Test Set: 94% (532 images)
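
The split percentages follow from the image counts, which total 565. A minimal sketch of that arithmetic, assuming each share is rounded to the nearest whole percent (the function name `split_percentages` is illustrative):

```python
def split_percentages(counts):
    """Map each split name to its share of all images, rounded to a whole percent."""
    total = sum(counts.values())
    return {name: round(100 * n / total) for name, n in counts.items()}


# Image counts as listed for this dataset version.
counts = {"train": 30, "valid": 3, "test": 532}
percentages = split_percentages(counts)
print(percentages)  # {'train': 5, 'valid': 1, 'test': 94}
```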

Preprocessing

Auto-Orient: Applied
Resize: Fill (with center crop) to 640x640
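
One common way to implement a "fill with center crop" resize is to scale the image until it covers the target square, then crop the overflow equally from both sides. The sketch below computes only the geometry. It is an assumption about how such a resize typically works, not a description of the tool's actual implementation, and `fill_center_crop_box` is a hypothetical helper name.

```python
def fill_center_crop_box(width, height, target=640):
    """Return (scaled_w, scaled_h, left, top) for a fill-style resize.

    The image is scaled so its smaller side reaches `target`, then a
    target x target window is cut from the center of the scaled image.
    """
    scale = max(target / width, target / height)
    # Guard against float rounding leaving a side one pixel short of the target.
    scaled_w = max(target, round(width * scale))
    scaled_h = max(target, round(height * scale))
    left = (scaled_w - target) // 2
    top = (scaled_h - target) // 2
    return scaled_w, scaled_h, left, top


# A 1280x720 landscape image scales to 1138x640 and loses 249 px on each side.
print(fill_center_crop_box(1280, 720))  # (1138, 640, 249, 0)
```

Because the crop is centered, objects near the edges of wide or tall source images may be cut off, which also affects their bounding boxes.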

Augmentations

Outputs per training example: 3
Crop: 0% Minimum Zoom, 50% Maximum Zoom
Rotation: Between -15° and +15°
Shear: ±15° Horizontal, ±15° Vertical
Exposure: Between -25% and +25%
Blur: Up to 3px
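
The ranges above can be read as bounds from which each augmented copy draws its own random parameters. The sketch below samples three parameter sets for one training example. Uniform sampling, the dictionary key names, and both function names are assumptions made for illustration; the source lists only the ranges.

```python
import random


def sample_augmentation(rng):
    """Draw one set of augmentation parameters within the listed ranges."""
    return {
        "zoom": rng.uniform(0.0, 0.50),          # crop: 0% to 50% zoom
        "rotation_deg": rng.uniform(-15, 15),    # rotation: -15 to +15 degrees
        "shear_x_deg": rng.uniform(-15, 15),     # horizontal shear
        "shear_y_deg": rng.uniform(-15, 15),     # vertical shear
        "exposure": rng.uniform(-0.25, 0.25),    # exposure: -25% to +25%
        "blur_px": rng.uniform(0.0, 3.0),        # blur: up to 3 px
    }


def augmentations_for_example(rng, outputs=3):
    """Return one parameter set per output generated from a training example."""
    return [sample_augmentation(rng) for _ in range(outputs)]


rng = random.Random(0)  # fixed seed so the draw is reproducible
variants = augmentations_for_example(rng)
for params in variants:
    print(params)
```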
