Dataset Versions

v10

2024-05-12 7:11pm

Generated on May 12, 2024

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
Other Formats
Choose another format.

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 640x640
Filter Null: Require all images to contain annotations.

Augmentations

Outputs per training example: 3
Grayscale: Apply to 15% of images
Saturation: Between -29% and +29%
Brightness: Between -18% and +18%
Exposure: Between -10% and +10%
Blur: Up to 1.2px
Noise: Up to 0.65% of pixels