Dataset Versions

v1

2024-12-13 9:05am

Generated on Dec 13, 2024

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
Other Formats
Choose another format.

Dataset Split

Train Set %
0Images
Valid Set 98%
303Images
Test Set 2%
7Images

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 640x640
Grayscale: Applied
Auto-Adjust Contrast: Using Contrast Stretching
Filter Null: Require at least 50% of images to contain annotations.

Augmentations

No augmentations were applied.