Dataset Versions

v4

2023-12-02 12:39am

Generated on Dec 1, 2023

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (introduced by the PASCAL VOC challenge and also used by ImageNet).
PaliGemma
JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
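
As a rough illustration of working with the Pascal VOC XML export locally, the sketch below reads one annotation with Python's standard library. The sample file name, class label, and coordinates are hypothetical, and the layout shown is the commonly used Pascal VOC structure rather than a verified copy of this dataset's files.

```python
import xml.etree.ElementTree as ET

# Hypothetical annotation in the usual Pascal VOC layout; the file name,
# class label, and coordinates are made up for illustration.
SAMPLE_XML = """
<annotation>
  <filename>example.jpg</filename>
  <size><width>640</width><height>640</height><depth>3</depth></size>
  <object>
    <name>widget</name>
    <bndbox><xmin>48</xmin><ymin>120</ymin><xmax>210</xmax><ymax>330</ymax></bndbox>
  </object>
</annotation>
"""


def parse_voc(xml_text):
    """Return (filename, [(label, xmin, ymin, xmax, ymax), ...])."""
    root = ET.fromstring(xml_text.strip())
    filename = root.findtext("filename")
    boxes = []
    for obj in root.iter("object"):
        bndbox = obj.find("bndbox")
        # Some exporters write coordinates as floats, so go through float().
        coords = tuple(
            int(float(bndbox.findtext(tag)))
            for tag in ("xmin", "ymin", "xmax", "ymax")
        )
        boxes.append((obj.findtext("name"),) + coords)
    return filename, boxes


if __name__ == "__main__":
    print(parse_voc(SAMPLE_XML))
```

To read a file on disk instead of a string, `ET.parse(path).getroot()` can replace the `ET.fromstring` call.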

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 640x640
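
A stretch resize ignores aspect ratio, so bounding boxes have to be scaled by separate horizontal and vertical factors. The sketch below shows that arithmetic only; the function name and the 1280x960 source size are hypothetical, and auto-orientation (which depends on EXIF metadata) is not covered.

```python
def stretch_box(box, src_size, dst_size=(640, 640)):
    """Scale an (xmin, ymin, xmax, ymax) box from src_size to dst_size.

    Sizes are (width, height). Because the image is stretched rather than
    letterboxed, x and y coordinates are scaled independently.
    """
    src_w, src_h = src_size
    dst_w, dst_h = dst_size
    xmin, ymin, xmax, ymax = box
    return (
        xmin * dst_w / src_w,
        ymin * dst_h / src_h,
        xmax * dst_w / src_w,
        ymax * dst_h / src_h,
    )


if __name__ == "__main__":
    # Hypothetical 1280x960 source image with a single box.
    print(stretch_box((320, 240, 640, 480), (1280, 960)))
```

A box that is 320x240 in the source becomes square after the resize, which illustrates how stretching can distort object shapes.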

Augmentations

Outputs per training example: 2
Crop: 0% Minimum Zoom, 20% Maximum Zoom
Rotation: Between -15° and +15°
Grayscale: Apply to 31% of images
Hue: Between -25° and +25°
Exposure: Between -25% and +25%
Noise: Up to 5% of pixels
Bounding Box: Rotation: Between -15° and +15°
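
The settings above can be read as ranges from which each augmented output draws its own random parameters. The sketch below samples one parameter set per output; the dictionary keys are hypothetical names, and uniform sampling is an assumption, since this page does not say how the values are actually drawn or applied.

```python
import random

OUTPUTS_PER_EXAMPLE = 2  # "Outputs per training example: 2"


def sample_augmentation(rng):
    """Draw one set of augmentation parameters from the listed ranges.

    Uniform sampling is assumed here for illustration only.
    """
    return {
        "crop_zoom": rng.uniform(0.0, 0.20),           # 0% to 20% zoom
        "rotation_deg": rng.uniform(-15.0, 15.0),      # image-level rotation
        "grayscale": rng.random() < 0.31,              # applied to ~31% of images
        "hue_deg": rng.uniform(-25.0, 25.0),
        "exposure": rng.uniform(-0.25, 0.25),
        "noise_fraction": rng.uniform(0.0, 0.05),      # up to 5% of pixels
        "bbox_rotation_deg": rng.uniform(-15.0, 15.0), # per-box rotation
    }


def augmentation_plan(rng=None):
    """Return one parameter set for each output of a training example."""
    if rng is None:
        rng = random.Random()
    return [sample_augmentation(rng) for _ in range(OUTPUTS_PER_EXAMPLE)]


if __name__ == "__main__":
    for params in augmentation_plan(random.Random(0)):
        print(params)
```

Passing a seeded `random.Random` makes the plan reproducible, which helps when comparing training runs.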