Dataset Versions

v1

2023-09-05 2:49am

Generated on Sep 4, 2023

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
Other Formats
Choose another format.

Preprocessing

Auto-Orient: Applied
Isolate Objects: Applied
Static Crop: 25-75% Horizontal Region, 25-75% Vertical Region
Resize: Stretch to 640x640
Auto-Adjust Contrast: Using Histogram Equalization
Grayscale: Applied
Tile: 2 rows x 2 columns
Modify Classes: 11 remapped, 2 dropped
Filter Null: Require all images to contain annotations.

Augmentations

Outputs per training example: 3
Crop: 0% Minimum Zoom, 50% Maximum Zoom
Grayscale: Apply to 25% of images
Hue: Between -135° and +135°
Saturation: Between -57% and +57%
Brightness: Between -75% and +75%
Exposure: Between -57% and +57%
Blur: Up to 7.75px
Noise: Up to 17% of pixels
Cutout: 6 boxes with 41% size each
Mosaic: Applied
Bounding Box: Flip: Horizontal
Bounding Box: 90° Rotate: Clockwise, Counter-Clockwise, Upside Down
Bounding Box: Crop: 0% Minimum Zoom, 50% Maximum Zoom
Bounding Box: Rotation: Between -31° and +31°
Bounding Box: Shear: ±27° Horizontal, ±29° Vertical
Bounding Box: Brightness: Between -99% and +99%
Bounding Box: Exposure: Between -69% and +69%
Bounding Box: Blur: Up to 25px
Bounding Box: Noise: Up to 15% of pixels

Similar Projects

See More