Dataset Versions

v1

2023-10-10 2:38pm

Generated on Oct 10, 2023

Popular Download Formats

Pascal VOC XML: Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma: JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON: Format used with Apple's CreateML and Turi Create tools.
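
As a rough illustration of the Pascal VOC XML format, the sketch below reads one annotation using only Python's standard library. The sample file name, class label, and coordinates are hypothetical and are not taken from this dataset; the element names follow the usual Pascal VOC layout.

```python
import xml.etree.ElementTree as ET

# Hypothetical annotation for illustration only; the file name, label,
# and coordinates are made up and do not come from this dataset.
SAMPLE_XML = """
<annotation>
  <filename>example.jpg</filename>
  <size><width>640</width><height>640</height><depth>3</depth></size>
  <object>
    <name>example-class</name>
    <bndbox><xmin>100</xmin><ymin>120</ymin><xmax>300</xmax><ymax>360</ymax></bndbox>
  </object>
</annotation>
"""

def parse_voc(xml_text):
    """Return (filename, [(label, xmin, ymin, xmax, ymax), ...])."""
    root = ET.fromstring(xml_text.strip())
    boxes = []
    for obj in root.iter("object"):
        bndbox = obj.find("bndbox")
        # Coordinates are sometimes written as floats, so go through float().
        coords = tuple(
            int(float(bndbox.findtext(tag)))
            for tag in ("xmin", "ymin", "xmax", "ymax")
        )
        boxes.append((obj.findtext("name"), *coords))
    return root.findtext("filename"), boxes

filename, boxes = parse_voc(SAMPLE_XML)
print(filename, boxes)
```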

Dataset Split

Train Set: 2% (16 images)
Valid Set: 0% (0 images)
Test Set: 98% (911 images)
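
The percentages can be checked from the image counts. This is a small sketch that assumes each percentage is the split's share of all images, rounded to the nearest whole number; the variable names are illustrative.

```python
# Image counts per split, as listed for this dataset version.
counts = {"train": 16, "valid": 0, "test": 911}

total = sum(counts.values())

# Share of each split, rounded to a whole percent (assumed rounding rule).
percentages = {name: round(100 * n / total) for name, n in counts.items()}

print(total, percentages)
```

With only 16 training images and no validation images, most of the data sits in the test set, which is worth keeping in mind before training on this version.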

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 640x640
Tile: 4 rows x 4 columns
Filter Null: Require at least 90% of images to contain annotations.
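
To make the tiling and null-filter settings concrete, here is a minimal sketch. It assumes the tile grid divides the image evenly and that "at least 90%" bounds how many unannotated images may be kept alongside the annotated ones; the function names are hypothetical, and the order in which the export tool applies these steps is not stated here.

```python
def tile_boxes(width, height, rows, cols):
    """Return (left, top, right, bottom) pixel boxes for a rows x cols grid."""
    tile_w, tile_h = width // cols, height // rows
    return [
        (c * tile_w, r * tile_h, (c + 1) * tile_w, (r + 1) * tile_h)
        for r in range(rows)
        for c in range(cols)
    ]

def max_null_images(n_annotated, min_annotated_pct=90):
    """Largest number of unannotated images that still leaves at least
    min_annotated_pct percent of the kept images annotated (assumed rule)."""
    return n_annotated * (100 - min_annotated_pct) // min_annotated_pct

# A 640x640 image split into 4 rows x 4 columns gives 16 tiles of 160x160.
tiles = tile_boxes(640, 640, rows=4, cols=4)
print(len(tiles), tiles[0], tiles[-1])

# With 90 annotated images, up to 10 unannotated ones could be kept.
print(max_null_images(90))
```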

Augmentations

Outputs per training example: 3
Flip: Horizontal, Vertical
Bounding Box: Flip: Horizontal, Vertical
Bounding Box: Brightness: Between -56% and +56%
Bounding Box: Blur: Up to 6px
Bounding Box: Noise: Up to 5% of pixels
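
When a whole image is flipped, its box coordinates have to be mirrored as well. The sketch below shows that arithmetic for boxes stored as (xmin, ymin, xmax, ymax) in pixels; the function name and sample box are hypothetical. The "Bounding Box:" entries above presumably alter only the pixels inside each box, which this sketch does not cover.

```python
def flip_box(box, width, height, horizontal=False, vertical=False):
    """Mirror a (xmin, ymin, xmax, ymax) box for an image-level flip."""
    xmin, ymin, xmax, ymax = box
    if horizontal:
        # Left and right edges swap roles after mirroring across the vertical axis.
        xmin, xmax = width - xmax, width - xmin
    if vertical:
        # Top and bottom edges swap roles after mirroring across the horizontal axis.
        ymin, ymax = height - ymax, height - ymin
    return (xmin, ymin, xmax, ymax)

box = (100, 120, 300, 360)  # hypothetical box in a 640x640 image
print(flip_box(box, 640, 640, horizontal=True))
print(flip_box(box, 640, 640, vertical=True))
```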
