Dataset Versions

v6

2023-01-23 10:12pm

Generated on Jan 23, 2023

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
Other Formats
Choose another format.

Dataset Split

Train Set 5%
21Images
Valid Set 83%
344Images
Test Set 12%
49Images

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 640x640

Augmentations

Outputs per training example: 3
Bounding Box: Brightness: Between -22% and +22%
Bounding Box: Blur: Up to 5.25px

Similar Projects

See More