Dataset Versions

v4

2023-04-07 1:05pm

Generated on Apr 7, 2023

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
Other Formats
Choose another format.

Dataset Split

Train Set 1%
162Images
Valid Set 99%
20007Images
Test Set 0%
45Images

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 640x640
Tile: 3 rows x 3 columns
Modify Classes: 20 remapped, 0 dropped

Augmentations

Outputs per training example: 3
Flip: Horizontal
Exposure: Between -25% and +25%