Dataset Versions

v11

2022-05-28 10:02am

Generated on May 28, 2022

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
Other Formats
Choose another format.

Dataset Split

Train Set %
0Images
Valid Set 39%
1661Images
Test Set 61%
2594Images

Preprocessing

Auto-Orient: Applied
Isolate Objects: Applied
Static Crop: 25-75% Horizontal Region, 25-75% Vertical Region
Resize: Stretch to 416x416
Auto-Adjust Contrast: Using Adaptive Equalization
Tile: 2 rows x 2 columns
Modify Classes: 0 remapped, 1 dropped
Filter Null: Require all images to contain annotations.

Augmentations

Outputs per training example: 1
Flip: Horizontal, Vertical
Rotation: Between -45° and +45°
Hue: Between -125° and +125°
Saturation: Between -51% and +51%
Blur: Up to 10px
Noise: Up to 5% of pixels
Cutout: 5 boxes with 40% size each
Mosaic: Applied
Bounding Box: 90° Rotate: Clockwise, Counter-Clockwise
Bounding Box: Crop: 0% Minimum Zoom, 45% Maximum Zoom
Bounding Box: Rotation: Between -23° and +23°
Bounding Box: Brightness: Between -51% and +51%
Bounding Box: Blur: Up to 15px
Bounding Box: Noise: Up to 5% of pixels