Dataset Versions

v16

2024-03-08 4:53am

Generated on Mar 7, 2024

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (originating with the PASCAL VOC challenge and also used by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
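
As a rough illustration of the Pascal VOC XML layout listed above, the sketch below reads one annotation with Python's standard library. The sample file name, class label, and box coordinates are hypothetical and do not come from this dataset; only the element names (annotation, size, object, bndbox) follow the usual VOC structure.

```python
import xml.etree.ElementTree as ET

# Hypothetical annotation: the file name, class label, and box values are
# made up for illustration and are not taken from this dataset.
SAMPLE_VOC_XML = """<annotation>
    <filename>example.jpg</filename>
    <size>
        <width>640</width>
        <height>640</height>
        <depth>3</depth>
    </size>
    <object>
        <name>example-class</name>
        <bndbox>
            <xmin>48</xmin>
            <ymin>120</ymin>
            <xmax>310</xmax>
            <ymax>402</ymax>
        </bndbox>
    </object>
</annotation>"""


def parse_voc(xml_text):
    """Return (filename, (width, height), boxes) from a Pascal VOC XML string."""
    root = ET.fromstring(xml_text)
    size = root.find("size")
    width = int(size.findtext("width"))
    height = int(size.findtext("height"))

    boxes = []
    for obj in root.iter("object"):
        bndbox = obj.find("bndbox")
        boxes.append({
            "label": obj.findtext("name"),
            # Some exporters write coordinates as floats, so go through float().
            "xmin": int(float(bndbox.findtext("xmin"))),
            "ymin": int(float(bndbox.findtext("ymin"))),
            "xmax": int(float(bndbox.findtext("xmax"))),
            "ymax": int(float(bndbox.findtext("ymax"))),
        })
    return root.findtext("filename"), (width, height), boxes


filename, size, boxes = parse_voc(SAMPLE_VOC_XML)
print(filename, size, boxes)
```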

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 640x640
Auto-Adjust Contrast: Using Histogram Equalization
Filter Null: Require at least 5% of images to contain annotations.
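
Two of the preprocessing settings above come down to simple arithmetic, sketched here under stated assumptions. The function names are hypothetical, and the Filter Null helper assumes the step only drops unannotated images until the annotated share reaches the threshold; it is not the export tool's actual implementation.

```python
TARGET_SIZE = 640            # from "Resize: Stretch to 640x640"
MIN_ANNOTATED_PERCENT = 5    # from "Filter Null: ... at least 5%"


def stretch_box(box, src_width, src_height, target=TARGET_SIZE):
    """Scale an (xmin, ymin, xmax, ymax) box to match a stretch resize.

    A stretch ignores aspect ratio, so x and y get independent scale factors.
    """
    scale_x = target / src_width
    scale_y = target / src_height
    xmin, ymin, xmax, ymax = box
    return (xmin * scale_x, ymin * scale_y, xmax * scale_x, ymax * scale_y)


def max_null_images(annotated_count, min_percent=MIN_ANNOTATED_PERCENT):
    """Largest number of unannotated images that still leaves at least
    min_percent of the dataset annotated (integer math avoids float rounding)."""
    max_total = annotated_count * 100 // min_percent
    return max_total - annotated_count


# A box in a hypothetical 1280x320 source image, stretched to 640x640.
print(stretch_box((100, 50, 300, 200), 1280, 320))
# With 5 annotated images, how many null images can be kept?
print(max_null_images(5))
```

Under these assumptions, a dataset with 5 annotated images could keep up to 95 null images, since 5 of 100 is exactly 5%.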

Augmentations

Outputs per training example: 3
Flip: Horizontal, Vertical
90° Rotate: Clockwise, Counter-Clockwise, Upside Down
Crop: 0% Minimum Zoom, 5% Maximum Zoom
Grayscale: Apply to 3% of images
Hue: Between -20° and +20°
Brightness: Between -15% and +15%
Blur: Up to 2.7px
Noise: Up to 1.73% of pixels
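
The augmentation settings above can be read as ranges from which each output draws its own random parameters. The sketch below samples one parameter set per output using those ranges. The dictionary keys, the 50% flip probability, the uniform distributions, and the inclusion of "no rotation" as an option are all assumptions made for illustration, not documented behavior of the export tool.

```python
import random

OUTPUTS_PER_EXAMPLE = 3  # from "Outputs per training example: 3"


def sample_augmentation(rng):
    """Draw one hypothetical parameter set within the listed ranges."""
    return {
        # Flip: Horizontal, Vertical (50% chance each is an assumption).
        "flip_horizontal": rng.random() < 0.5,
        "flip_vertical": rng.random() < 0.5,
        # 90° Rotate: clockwise (90), upside down (180), counter-clockwise (270);
        # 0 means no rotation and is an assumed extra option.
        "rotate_degrees": rng.choice([0, 90, 180, 270]),
        # Crop: 0% minimum zoom, 5% maximum zoom.
        "crop_zoom": rng.uniform(0.0, 0.05),
        # Grayscale: applied to 3% of images.
        "grayscale": rng.random() < 0.03,
        # Hue: between -20° and +20°.
        "hue_shift_degrees": rng.uniform(-20.0, 20.0),
        # Brightness: between -15% and +15%.
        "brightness_delta": rng.uniform(-0.15, 0.15),
        # Blur: up to 2.7px.
        "blur_px": rng.uniform(0.0, 2.7),
        # Noise: up to 1.73% of pixels.
        "noise_fraction": rng.uniform(0.0, 0.0173),
    }


def augmented_variants(rng, outputs=OUTPUTS_PER_EXAMPLE):
    """Return one parameter set per output generated from a training image."""
    return [sample_augmentation(rng) for _ in range(outputs)]


rng = random.Random(0)  # fixed seed so the sketch is repeatable
for params in augmented_variants(rng):
    print(params)
```

Because flips and 90° rotations move objects within the frame, any bounding boxes would need the same geometric transform applied; that step is omitted here.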
