Dataset Versions

v7

2023-09-01 10:17am

Generated on Sep 1, 2023

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
Other Formats
Choose another format.

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 640x640

Augmentations

Outputs per training example: 2
Flip: Horizontal, Vertical
90° Rotate: Clockwise, Counter-Clockwise, Upside Down
Crop: 38% Minimum Zoom, 38% Maximum Zoom
Rotation: Between -45° and +45°
Shear: ±22° Horizontal, ±16° Vertical
Grayscale: Apply to 42% of images
Hue: Between -87° and +87°
Saturation: Between -49% and +49%
Brightness: Between -4% and +4%
Exposure: Between -2% and +2%
Blur: Up to 0.5px
Noise: Up to 1% of pixels
Cutout: 5 boxes with 11% size each
Mosaic: Applied