Dataset Versions

v18

2023-07-24 9:01pm

Generated on Jul 25, 2023

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
Other Formats
Choose another format.

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 640x640
Grayscale: Applied
Tile: 2 rows x 2 columns
Filter Null: Require at least 50% of images to contain annotations.

Augmentations

Outputs per training example: 3
Flip: Horizontal
Crop: 0% Minimum Zoom, 20% Maximum Zoom
Rotation: Between -15° and +15°
Grayscale: Apply to 12% of images
Hue: Between -25° and +25°
Saturation: Between -25% and +25%
Brightness: Between -25% and +25%
Exposure: Between -11% and +11%
Blur: Up to 2.5px
Noise: Up to 1% of pixels
Mosaic: Applied
Bounding Box: Crop: 0% Minimum Zoom, 20% Maximum Zoom
Bounding Box: Shear: ±15° Horizontal, ±15° Vertical

Similar Projects

See More
1.1k images 1 model