Dataset Versions

v13

2023-11-28 4:19pm

Generated on Nov 28, 2023

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (originated with the PASCAL VOC challenge and also used by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
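
As a rough illustration of the Pascal VOC XML format listed above, the sketch below reads one annotation with Python's standard library. The sample document, file name, class name, and coordinates are all hypothetical. The helper `parse_voc` is an illustrative name, not part of any export tool.

```python
import xml.etree.ElementTree as ET

# Hypothetical annotation: the file name, label, and coordinates are made up.
SAMPLE_VOC = """<annotation>
  <filename>example.jpg</filename>
  <size><width>640</width><height>640</height><depth>3</depth></size>
  <object>
    <name>widget</name>
    <bndbox><xmin>100</xmin><ymin>120</ymin><xmax>300</xmax><ymax>360</ymax></bndbox>
  </object>
</annotation>"""


def parse_voc(xml_text):
    """Return (filename, [(label, xmin, ymin, xmax, ymax), ...])."""
    root = ET.fromstring(xml_text)
    filename = root.findtext("filename")
    boxes = []
    for obj in root.iter("object"):
        label = obj.findtext("name")
        bndbox = obj.find("bndbox")
        # Some exporters write coordinates as floats, so parse leniently.
        coords = tuple(
            int(float(bndbox.findtext(tag)))
            for tag in ("xmin", "ymin", "xmax", "ymax")
        )
        boxes.append((label, *coords))
    return filename, boxes


filename, boxes = parse_voc(SAMPLE_VOC)
print(filename, boxes)
```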

Dataset Split

Train Set 0%
3 Images
Valid Set 100%
614 Images
Test Set 0%
0 Images
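
The percentages above appear to be rounded to whole numbers, which is why a 3-image train set shows as 0%. A minimal sketch of the arithmetic, using the image counts listed above (the function name `split_percentages` is illustrative):

```python
def split_percentages(counts):
    """Return each split's share of the total image count, in percent."""
    total = sum(counts.values())
    if total == 0:
        return {name: 0.0 for name in counts}
    return {name: 100.0 * n / total for name, n in counts.items()}


# Image counts taken from the split listing above.
counts = {"train": 3, "valid": 614, "test": 0}
shares = split_percentages(counts)

for name, share in shares.items():
    print(f"{name}: {counts[name]} images, {share:.2f}% (rounds to {round(share)}%)")
```

With 617 images in total, the train share is just under half a percent, so it rounds down to 0% while the validation share rounds up to 100%.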

Preprocessing

Auto-Orient: Applied
Resize: Fit (black edges) in 640x640
Auto-Adjust Contrast: Using Contrast Stretching
Filter Null: Require at least 90% of images to contain annotations.
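
To make two of the preprocessing steps above concrete, here is a small pure-Python sketch. It assumes that "Fit (black edges)" means scaling the image to fit inside 640x640 with the aspect ratio preserved and padding the remainder, and that padding is centered. It also assumes contrast stretching is a simple min-max rescale. The export tool's exact behavior (padding placement, percentile clipping) is not stated in this listing, so treat both functions as illustrations only.

```python
def letterbox_geometry(width, height, target=640):
    """Scaled size and padding offsets for fitting an image into a square.

    Assumption: the image is centered and the unused area is filled with black.
    """
    scale = min(target / width, target / height)
    new_w, new_h = round(width * scale), round(height * scale)
    pad_x = (target - new_w) // 2
    pad_y = (target - new_h) // 2
    return new_w, new_h, pad_x, pad_y


def contrast_stretch(pixels, low=0, high=255):
    """Linearly rescale intensities so the darkest maps to low, the brightest to high.

    Assumption: plain min-max stretching with no percentile clipping.
    """
    lo, hi = min(pixels), max(pixels)
    if hi == lo:
        return list(pixels)  # flat image: nothing to stretch
    return [round((p - lo) * (high - low) / (hi - lo)) + low for p in pixels]


print(letterbox_geometry(1280, 720))
print(contrast_stretch([50, 90, 150]))
```

Any bounding boxes would need the same scale and offsets applied so they stay aligned with the resized image.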

Augmentations

Outputs per training example: 3
Flip: Horizontal, Vertical
90° Rotate: Clockwise, Counter-Clockwise, Upside Down
Brightness: Between -19% and +19%
Cutout: 5 boxes with 27% size each
Mosaic: Applied
Bounding Box: Flip: Horizontal, Vertical
Bounding Box: Crop: 0% Minimum Zoom, 20% Maximum Zoom
Bounding Box: Brightness: Between -25% and +25%
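
The augmentation settings above can be sketched in terms of how they act on box coordinates and sampled parameters. The code below is a hedged illustration, not the export tool's implementation. Boxes are assumed to be `(xmin, ymin, xmax, ymax)` in pixels with the origin at the top-left. The 27% cutout size is assumed to be a fraction of each image dimension, and brightness is assumed to be sampled uniformly as a multiplicative factor. All function names are hypothetical.

```python
import random


def hflip_box(box, width):
    """Mirror a box left to right in an image of the given width."""
    xmin, ymin, xmax, ymax = box
    return (width - xmax, ymin, width - xmin, ymax)


def vflip_box(box, height):
    """Mirror a box top to bottom in an image of the given height."""
    xmin, ymin, xmax, ymax = box
    return (xmin, height - ymax, xmax, height - ymin)


def rotate90_cw_box(box, height):
    """Rotate a box 90 degrees clockwise; a point (x, y) maps to (height - y, x)."""
    xmin, ymin, xmax, ymax = box
    return (height - ymax, xmin, height - ymin, xmax)


def sample_brightness(rng, limit=0.19):
    """Draw a brightness factor between 1 - limit and 1 + limit."""
    return 1.0 + rng.uniform(-limit, limit)


def cutout_boxes(rng, width, height, count=5, size=0.27):
    """Pick `count` cutout rectangles, each `size` of the image width and height."""
    bw, bh = round(width * size), round(height * size)
    boxes = []
    for _ in range(count):
        x = rng.randint(0, width - bw)
        y = rng.randint(0, height - bh)
        boxes.append((x, y, x + bw, y + bh))
    return boxes


rng = random.Random(0)  # fixed seed so the sketch is repeatable
box = (100, 120, 300, 360)
print(hflip_box(box, 640), vflip_box(box, 640), rotate90_cw_box(box, 640))
print(sample_brightness(rng), cutout_boxes(rng, 640, 640))
```

With "Outputs per training example: 3", each source training image would yield three augmented variants, each drawn from a random combination of settings like these.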
