Dataset Versions

v9

2023-05-17 8:25am

Generated on May 17, 2023

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
Other Formats
Choose another format.

Dataset Split

Train Set 2%
62Images
Valid Set 48%
1701Images
Test Set 50%
1794Images

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 640x640
Modify Classes: 6 remapped, 25 dropped

Augmentations

Outputs per training example: 2
Flip: Horizontal, Vertical
Rotation: Between -24° and +24°
Hue: Between -80° and +80°
Noise: Up to 5% of pixels
Bounding Box: Exposure: Between -14% and +14%
Bounding Box: Blur: Up to 1.5px

Similar Projects

See More