Dataset Versions

v2

2023-05-05 12:58am

Generated on May 4, 2023

Popular Download Formats

Pascal VOC XML: Common XML annotation format for local data munging (introduced by the PASCAL VOC challenge and also used by ImageNet).
PaliGemma: JSONL format used for fine-tuning PaliGemma, Google's open vision-language model.
CreateML JSON: JSON format used with Apple's CreateML and Turi Create tools.
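
For readers who download the Pascal VOC XML export, the following is a minimal sketch of reading one annotation with Python's standard library. The sample XML is hypothetical: the file name, class label, and box coordinates are invented for illustration, and only the common `annotation` / `size` / `object` / `bndbox` layout of the format is assumed.

```python
import xml.etree.ElementTree as ET

# Hypothetical annotation: file name, label, and coordinates are made up.
SAMPLE_VOC_XML = """<annotation>
    <filename>example.jpg</filename>
    <size>
        <width>640</width>
        <height>640</height>
        <depth>3</depth>
    </size>
    <object>
        <name>widget</name>
        <bndbox>
            <xmin>100</xmin>
            <ymin>120</ymin>
            <xmax>300</xmax>
            <ymax>360</ymax>
        </bndbox>
    </object>
</annotation>"""


def parse_voc(xml_text):
    """Return (filename, width, height, boxes) from a Pascal VOC XML string."""
    root = ET.fromstring(xml_text)
    size = root.find("size")
    width = int(size.findtext("width"))
    height = int(size.findtext("height"))
    boxes = []
    for obj in root.iter("object"):
        bndbox = obj.find("bndbox")
        boxes.append({
            "label": obj.findtext("name"),
            # Some exporters write coordinates as floats, so go through float().
            "xmin": int(float(bndbox.findtext("xmin"))),
            "ymin": int(float(bndbox.findtext("ymin"))),
            "xmax": int(float(bndbox.findtext("xmax"))),
            "ymax": int(float(bndbox.findtext("ymax"))),
        })
    return root.findtext("filename"), width, height, boxes


filename, width, height, boxes = parse_voc(SAMPLE_VOC_XML)
print(filename, width, height, boxes)
```

An image with no `object` elements yields an empty `boxes` list, which is how an unannotated (null) image would appear under this sketch.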

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 640x640
Modify Classes: 1 remapped, 0 dropped
Filter Null: Require at least 36% of images to contain annotations.
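
Two of the settings above involve simple arithmetic, sketched below under stated assumptions. `stretch_box` assumes that a stretch resize scales each axis independently, so aspect ratio is not preserved. `max_null_images` reads the Filter Null setting as "annotated images must make up at least 36% of the total"; how the tool rounds or which unannotated images it drops is not stated here. Both function names are hypothetical.

```python
TARGET_SIZE = 640            # from "Resize: Stretch to 640x640"
MIN_ANNOTATED_PERCENT = 36   # from "Filter Null"


def stretch_box(box, src_w, src_h, dst_w=TARGET_SIZE, dst_h=TARGET_SIZE):
    """Scale an (xmin, ymin, xmax, ymax) box for a stretch resize.

    Width and height are scaled by separate factors, so a box on a
    non-square source image changes shape along with the image.
    """
    sx = dst_w / src_w
    sy = dst_h / src_h
    xmin, ymin, xmax, ymax = box
    return (xmin * sx, ymin * sy, xmax * sx, ymax * sy)


def max_null_images(num_annotated, min_percent=MIN_ANNOTATED_PERCENT):
    """Largest number of unannotated images that keeps the annotated share
    at or above min_percent.

    annotated / (annotated + null) >= p / 100
        <=>  null <= annotated * (100 - p) / p
    Integer arithmetic avoids floating-point rounding at the boundary.
    """
    return (num_annotated * (100 - min_percent)) // min_percent


# A 1280x320 source image: x is halved, y is doubled.
print(stretch_box((100, 50, 200, 150), 1280, 320))  # (50.0, 100.0, 100.0, 300.0)

# With 90 annotated images, up to 160 null images keep the share at 36%.
print(max_null_images(90))  # 160
```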

Augmentations

Outputs per training example: 3
Rotation: Between -15° and +15°
Brightness: Between -25% and +25%
Exposure: Between -7% and +7%
Blur: Up to 0.5px
Noise: Up to 2% of pixels
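
As a rough illustration of what these settings imply, the sketch below draws one random parameter set per augmented output. It assumes each parameter is sampled independently and uniformly within its listed range, which the page does not state; the dictionary keys and function names are hypothetical, and no image processing is performed.

```python
import random

OUTPUTS_PER_EXAMPLE = 3  # from "Outputs per training example: 3"

# (low, high) ranges copied from the Augmentations list.
# Percentages are expressed as fractions (25% -> 0.25).
RANGES = {
    "rotation_deg": (-15.0, 15.0),
    "brightness": (-0.25, 0.25),
    "exposure": (-0.07, 0.07),
    "blur_px": (0.0, 0.5),
    "noise_fraction": (0.0, 0.02),
}


def sample_augmentations(rng, n=OUTPUTS_PER_EXAMPLE):
    """Draw n parameter sets, one per augmented copy of a training image.

    Assumption: uniform, independent sampling within each range.
    """
    return [
        {name: rng.uniform(low, high) for name, (low, high) in RANGES.items()}
        for _ in range(n)
    ]


def augmented_train_size(num_source_images, n=OUTPUTS_PER_EXAMPLE):
    """Number of training images after augmentation, at n outputs per source image."""
    return num_source_images * n


rng = random.Random(0)  # seeded so repeated runs draw the same parameters
for params in sample_augmentations(rng):
    print(params)
```

Under this reading, a training split of 100 source images would produce `augmented_train_size(100)`, that is 300, training images.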