Dataset Versions

v4

2023-05-08 10:01pm

Generated on May 9, 2023

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (introduced by the PASCAL VOC challenge and also used by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
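
For readers who download the Pascal VOC XML export, the following is a minimal sketch of reading one annotation file with Python's standard library. The sample XML, the file name `example.jpg`, the class name `example_class`, and the helper `parse_voc` are all hypothetical; only the element names (`annotation`, `size`, `object`, `name`, `bndbox`, `xmin`, `ymin`, `xmax`, `ymax`) follow the usual Pascal VOC layout.

```python
import xml.etree.ElementTree as ET

# Hypothetical annotation in the usual Pascal VOC layout (not taken from this dataset).
SAMPLE_XML = """<annotation>
  <filename>example.jpg</filename>
  <size><width>80</width><height>80</height><depth>3</depth></size>
  <object>
    <name>example_class</name>
    <bndbox><xmin>10</xmin><ymin>12</ymin><xmax>40</xmax><ymax>50</ymax></bndbox>
  </object>
</annotation>"""


def parse_voc(xml_text):
    """Return the file name, image size, and labeled boxes from one VOC annotation."""
    root = ET.fromstring(xml_text)
    size = root.find("size")
    record = {
        "filename": root.findtext("filename"),
        "width": int(size.findtext("width")),
        "height": int(size.findtext("height")),
        "objects": [],
    }
    for obj in root.iter("object"):
        box = obj.find("bndbox")
        # Coordinates are sometimes written as floats, so go through float() first.
        bbox = tuple(
            int(float(box.findtext(tag))) for tag in ("xmin", "ymin", "xmax", "ymax")
        )
        record["objects"].append({"label": obj.findtext("name"), "bbox": bbox})
    return record


record = parse_voc(SAMPLE_XML)
print(record["filename"], record["objects"])
```

To read a file from disk instead of a string, `ET.parse(path).getroot()` can replace `ET.fromstring(xml_text)`.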

Dataset Split

Train Set: 9 images (8%)
Valid Set: 51 images (45%)
Test Set: 53 images (47%)
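
The percentages above follow from the image counts (113 images in total). A short sketch of the arithmetic, where the helper name `split_percentages` is illustrative only:

```python
def split_percentages(counts):
    """Convert per-split image counts into whole-number percentages of the total."""
    total = sum(counts.values())
    return {name: round(100 * n / total) for name, n in counts.items()}


counts = {"train": 9, "valid": 51, "test": 53}
percentages = split_percentages(counts)
print(percentages)  # {'train': 8, 'valid': 45, 'test': 47}
```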

Preprocessing

Auto-Orient: Applied
Isolate Objects: Applied
Resize: Stretch to 80x80
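
"Stretch" resizing maps the whole source image onto the target size without preserving aspect ratio. The sketch below illustrates the idea in pure Python on a row-major list of pixel rows; nearest-neighbor sampling is an assumption made for simplicity, since the interpolation method actually used to build this dataset is not stated, and `stretch_resize` is a hypothetical helper name.

```python
def stretch_resize(pixels, out_w=80, out_h=80):
    """Stretch a row-major image to out_w x out_h, ignoring aspect ratio.

    Uses nearest-neighbor sampling (an assumption; the real pipeline may interpolate).
    """
    in_h, in_w = len(pixels), len(pixels[0])
    return [
        [pixels[y * in_h // out_h][x * in_w // out_w] for x in range(out_w)]
        for y in range(out_h)
    ]


# A tiny 2-row by 4-column image stretched to 80x80.
tiny = [[1, 2, 3, 4],
        [5, 6, 7, 8]]
resized = stretch_resize(tiny)
print(len(resized), len(resized[0]))
```

Because the width and height are scaled independently, a non-square crop is distorted to fit the square output, which is the trade-off of stretching compared with padding or center-cropping.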

Augmentations

Outputs per training example: 3
Flip: Horizontal
Crop: 0% Minimum Zoom, 20% Maximum Zoom
Bounding Box: Crop: 0% Minimum Zoom, 99% Maximum Zoom
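
As a rough illustration of the flip and crop augmentations listed above, here is a minimal pure-Python sketch. Several things are assumptions rather than facts about how this dataset was generated: the function names, the `(xmin, ymin, xmax, ymax)` box convention, the reading of "zoom" as the fraction trimmed from each dimension, and the use of a centered crop window.

```python
import random


def flip_horizontal(pixels, boxes):
    """Mirror an image (list of pixel rows) and its (xmin, ymin, xmax, ymax) boxes."""
    width = len(pixels[0])
    flipped = [row[::-1] for row in pixels]
    # A box spanning [xmin, xmax) ends up spanning [width - xmax, width - xmin).
    flipped_boxes = [
        (width - xmax, ymin, width - xmin, ymax) for (xmin, ymin, xmax, ymax) in boxes
    ]
    return flipped, flipped_boxes


def random_zoom_crop(pixels, min_zoom=0.0, max_zoom=0.20, rng=random):
    """Crop a centered window whose sides shrink by a random zoom fraction.

    Assumption: a zoom of z keeps a (1 - z) fraction of each dimension.
    """
    height, width = len(pixels), len(pixels[0])
    zoom = rng.uniform(min_zoom, max_zoom)
    new_w = max(1, round(width * (1 - zoom)))
    new_h = max(1, round(height * (1 - zoom)))
    left = (width - new_w) // 2
    top = (height - new_h) // 2
    return [row[left:left + new_w] for row in pixels[top:top + new_h]]


image = [[1, 2, 3, 4]]
flipped, flipped_boxes = flip_horizontal(image, [(0, 0, 1, 1)])
print(flipped, flipped_boxes)
```

With the defaults, `random_zoom_crop` samples a zoom between 0% and 20%, matching the image-level crop setting; passing `max_zoom=0.99` would correspond to the bounding-box-level setting if applied to each box region separately.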