Dataset Versions

v7

2024-05-22 8:03pm

Generated on May 22, 2024

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
Other Formats
Choose another format.

Dataset Split

Train Set 7%
942Images
Valid Set 1%
91Images
Test Set 0%
48Images

Preprocessing

Auto-Orient: Applied
Resize: Fit within 960x960

Augmentations

Outputs per training example: 3
Crop: 0% Minimum Zoom, 28% Maximum Zoom
Shear: ±15° Horizontal, ±15° Vertical
Brightness: Between -20% and +20%
Exposure: Between -15% and +15%
Blur: Up to 2.1px
Noise: Up to 1.49% of pixels

Similar Projects

See More