Dataset Versions

v5

2024-01-14 3:40pm

Generated on Jan 14, 2024

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (introduced by the PASCAL VOC challenge and also used by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
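
For readers who download the Pascal VOC XML export, the sketch below shows one way to read a single annotation file using only Python's standard library. The sample XML is made up for illustration and follows the usual Pascal VOC layout (`size`, `object`, `bndbox`). The file name, class name, and coordinates are assumptions, not values from this dataset.

```python
import xml.etree.ElementTree as ET

# Made-up annotation in the usual Pascal VOC layout (not taken from this dataset).
SAMPLE_VOC = """<annotation>
  <filename>example.jpg</filename>
  <size><width>640</width><height>640</height><depth>3</depth></size>
  <object>
    <name>example_class</name>
    <bndbox><xmin>100</xmin><ymin>120</ymin><xmax>300</xmax><ymax>360</ymax></bndbox>
  </object>
</annotation>"""


def parse_voc(xml_text):
    """Return image size and bounding boxes from one Pascal VOC XML document."""
    root = ET.fromstring(xml_text)
    size = root.find("size")
    boxes = []
    for obj in root.iter("object"):
        bndbox = obj.find("bndbox")
        boxes.append({
            "label": obj.findtext("name"),
            # Some exporters write coordinates as floats, so parse via float first.
            "xmin": int(float(bndbox.findtext("xmin"))),
            "ymin": int(float(bndbox.findtext("ymin"))),
            "xmax": int(float(bndbox.findtext("xmax"))),
            "ymax": int(float(bndbox.findtext("ymax"))),
        })
    return {
        "filename": root.findtext("filename"),
        "width": int(size.findtext("width")),
        "height": int(size.findtext("height")),
        "boxes": boxes,
    }


annotation = parse_voc(SAMPLE_VOC)
print(annotation["filename"], annotation["boxes"])
```

To read a file on disk instead of a string, `ET.parse(path).getroot()` can replace `ET.fromstring(xml_text)`.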

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 640x640
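
A stretch resize scales width and height independently, so the aspect ratio is not preserved and bounding boxes have to be scaled by the same two factors. The sketch below illustrates that arithmetic. The function name and the 1280x960 source size are assumptions for illustration, and this is not necessarily how the export tool implements the step.

```python
def stretch_box(box, src_w, src_h, dst_w=640, dst_h=640):
    """Scale an (xmin, ymin, xmax, ymax) box for a stretch resize.

    Each axis is scaled on its own, which is what distinguishes a stretch
    from an aspect-preserving fit or pad.
    """
    xmin, ymin, xmax, ymax = box
    return (
        xmin * dst_w / src_w,
        ymin * dst_h / src_h,
        xmax * dst_w / src_w,
        ymax * dst_h / src_h,
    )


# Hypothetical 1280x960 source image stretched to 640x640.
print(stretch_box((100, 120, 300, 360), src_w=1280, src_h=960))
# (50.0, 80.0, 150.0, 240.0)
```

Here the horizontal factor is 0.5 and the vertical factor is about 0.667, so objects in the resized image appear slightly squeezed horizontally relative to the original.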

Augmentations

Outputs per training example: 3
Flip: Horizontal, Vertical
90° Rotate: Clockwise, Counter-Clockwise, Upside Down
Crop: 0% Minimum Zoom, 46% Maximum Zoom
Rotation: Between -15° and +15°
Shear: ±15° Horizontal, ±15° Vertical
Grayscale: Apply to 25% of images
Hue: Between -180° and +180°
Saturation: Between -99% and +99%
Brightness: Between -78% and +78%
Exposure: Between -52% and +52%
Blur: Up to 10px
Noise: Up to 25% of pixels
Cutout: 25 boxes with 6% size each
Mosaic: Applied
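
The settings above can be read as ranges from which each augmented copy draws its parameters. The sketch below samples one parameter set per output, using the listed limits and three outputs per training example. Uniform sampling, the 50% flip probability, and all function and key names are assumptions made for illustration. The export tool's actual sampling scheme is not documented here.

```python
import random

OUTPUTS_PER_EXAMPLE = 3  # "Outputs per training example: 3"


def sample_augmentation(rng):
    """Draw one hypothetical parameter set within the listed limits."""
    return {
        # Assumption: each enabled flip fires with probability 0.5.
        "flip_horizontal": rng.random() < 0.5,
        "flip_vertical": rng.random() < 0.5,
        "rotate90": rng.choice(
            ["none", "clockwise", "counter-clockwise", "upside-down"]
        ),
        "crop_zoom_pct": rng.uniform(0, 46),
        "rotation_deg": rng.uniform(-15, 15),
        "shear_x_deg": rng.uniform(-15, 15),
        "shear_y_deg": rng.uniform(-15, 15),
        "grayscale": rng.random() < 0.25,  # applied to 25% of images
        "hue_deg": rng.uniform(-180, 180),
        "saturation_pct": rng.uniform(-99, 99),
        "brightness_pct": rng.uniform(-78, 78),
        "exposure_pct": rng.uniform(-52, 52),
        "blur_px": rng.uniform(0, 10),
        "noise_pct": rng.uniform(0, 25),
        # Fixed settings copied from the list above.
        "cutout_boxes": 25,
        "cutout_size_pct": 6,
        "mosaic": True,
    }


def augmentation_plan(n_source_images, seed=0):
    """Return OUTPUTS_PER_EXAMPLE parameter sets for each source image."""
    rng = random.Random(seed)
    return [
        [sample_augmentation(rng) for _ in range(OUTPUTS_PER_EXAMPLE)]
        for _ in range(n_source_images)
    ]


plan = augmentation_plan(n_source_images=4)
print(len(plan), "source images ->", sum(len(p) for p in plan), "augmented outputs")
```

With three outputs per example, a training split of N source images yields 3N augmented images under this reading.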