Dataset Versions

v2

2023-03-04 6:48pm

Generated on Mar 4, 2023

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
Other Formats
Choose another format.

Dataset Split

Train Set %
0Images
Valid Set %
0Images
Test Set 100%
353Images

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 1280x1280
Tile: 8 rows x 8 columns
Filter Null: Require at least 98% of images to contain annotations.

Augmentations

Outputs per training example: 1
Crop: 0% Minimum Zoom, 30% Maximum Zoom
Rotation: Between -15° and +15°
Mosaic: Applied