Dataset Versions

v4

2023-06-05 9:39pm

Generated on Jun 5, 2023

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (originated with the PASCAL VOC challenge and also used by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.

Dataset Split

Train Set 0%
0 Images
Valid Set 29%
20 Images
Test Set 71%
48 Images
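
The split percentages follow directly from the image counts. A minimal sketch of that arithmetic, assuming the percentages are rounded to the nearest whole number (the function name `split_percentages` is hypothetical, not part of any export tool):

```python
def split_percentages(counts):
    """Return each split's share of the total as a whole-number percentage."""
    total = sum(counts.values())
    if total == 0:
        # Avoid dividing by zero for an empty dataset.
        return {name: 0 for name in counts}
    return {name: round(100 * n / total) for name, n in counts.items()}


# Image counts taken from the split listed above (68 images in total).
counts = {"train": 0, "valid": 20, "test": 48}
percentages = split_percentages(counts)
print(percentages)  # {'train': 0, 'valid': 29, 'test': 71}
```

With no images in the train set, this version contains only validation and test data.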

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 100x100
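
A stretch resize scales width and height independently, so bounding boxes have to be scaled by the same per-axis factors. The sketch below illustrates this under stated assumptions: boxes are `(xmin, ymin, xmax, ymax)` in pixels, and the helper name `stretch_box` and the 640x480 source size are hypothetical examples, not values from this dataset.

```python
TARGET_W, TARGET_H = 100, 100  # "Stretch to 100x100" from the settings above


def stretch_box(box, src_w, src_h, target_w=TARGET_W, target_h=TARGET_H):
    """Scale a pixel-space box to match an image stretched to the target size.

    The aspect ratio is not preserved: x and y use different scale factors
    whenever the source image is not square.
    """
    xmin, ymin, xmax, ymax = box
    return (
        xmin * target_w / src_w,
        ymin * target_h / src_h,
        xmax * target_w / src_w,
        ymax * target_h / src_h,
    )


# Hypothetical example: a box in a 640x480 source image.
print(stretch_box((64, 48, 320, 240), 640, 480))  # (10.0, 10.0, 50.0, 50.0)
```

Because the two axes are scaled differently, objects in non-square source images appear distorted after the resize.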

Augmentations

Outputs per training example: 3
Rotation: Between -15° and +15°
Grayscale: Apply to 8% of images
Blur: Up to 2.5px
Noise: Up to 5% of pixels
Bounding Box: Blur: Up to 2.5px
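
To make the augmentation settings concrete, here is a rough sketch of how per-output parameters could be drawn. It assumes each value is sampled uniformly within the listed range, which the page does not state, and all names (`sample_augmentation`, `plan_augmentations`, the image IDs) are hypothetical. It only samples parameters and does not modify any images.

```python
import random

OUTPUTS_PER_EXAMPLE = 3      # "Outputs per training example: 3"
MAX_ROTATION_DEG = 15.0      # rotation between -15 and +15 degrees
GRAYSCALE_PROBABILITY = 0.08 # grayscale applied to 8% of images
MAX_BLUR_PX = 2.5            # blur up to 2.5px (image and bounding box)
MAX_NOISE_FRACTION = 0.05    # noise on up to 5% of pixels


def sample_augmentation(rng):
    """Draw one set of augmentation parameters (uniform sampling is assumed)."""
    return {
        "rotation_deg": rng.uniform(-MAX_ROTATION_DEG, MAX_ROTATION_DEG),
        "grayscale": rng.random() < GRAYSCALE_PROBABILITY,
        "blur_px": rng.uniform(0.0, MAX_BLUR_PX),
        "noise_fraction": rng.uniform(0.0, MAX_NOISE_FRACTION),
        "bbox_blur_px": rng.uniform(0.0, MAX_BLUR_PX),
    }


def plan_augmentations(image_ids, seed=0):
    """Return OUTPUTS_PER_EXAMPLE parameter sets for each training image."""
    rng = random.Random(seed)  # seeded so the plan is reproducible
    return {
        image_id: [sample_augmentation(rng) for _ in range(OUTPUTS_PER_EXAMPLE)]
        for image_id in image_ids
    }


plan = plan_augmentations(["img_001", "img_002"])
for image_id, variants in plan.items():
    print(image_id, len(variants))
```

Since the setting is expressed per training example, a version with an empty train set would produce no augmented outputs under this sketch.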
