Dataset Versions

v5

2023-01-26 9:01pm

Generated on Jan 26, 2023

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (originating with the PASCAL VOC challenge and also used by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
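
For readers who export the Pascal VOC XML format, here is a minimal sketch of reading one annotation file with the Python standard library. It assumes the usual VOC layout (`object/name` plus `bndbox` with `xmin`, `ymin`, `xmax`, `ymax`). The sample XML, the file name `example.jpg`, the label `widget`, and the helper name `parse_voc` are hypothetical and not taken from this dataset.

```python
import xml.etree.ElementTree as ET

# Hypothetical annotation, for illustration only (not from this dataset).
SAMPLE_VOC = """<annotation>
  <filename>example.jpg</filename>
  <size><width>640</width><height>480</height><depth>3</depth></size>
  <object>
    <name>widget</name>
    <bndbox><xmin>10</xmin><ymin>20</ymin><xmax>110</xmax><ymax>220</ymax></bndbox>
  </object>
</annotation>"""


def parse_voc(xml_text):
    """Return the file name and a list of labeled boxes from one VOC XML string."""
    root = ET.fromstring(xml_text)
    boxes = []
    for obj in root.iter("object"):
        bndbox = obj.find("bndbox")
        if bndbox is None:
            continue  # skip objects without a bounding box
        boxes.append({
            "label": obj.findtext("name"),
            # Some exporters write coordinates as floats, so go through float().
            "xmin": int(float(bndbox.findtext("xmin"))),
            "ymin": int(float(bndbox.findtext("ymin"))),
            "xmax": int(float(bndbox.findtext("xmax"))),
            "ymax": int(float(bndbox.findtext("ymax"))),
        })
    return {"filename": root.findtext("filename"), "boxes": boxes}


if __name__ == "__main__":
    print(parse_voc(SAMPLE_VOC))
```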

Dataset Split

Train Set 0%
0 Images
Valid Set 0%
0 Images
Test Set 100%
1099 Images

Preprocessing

Auto-Adjust Contrast: Using Adaptive Equalization
Grayscale: Applied
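
The two preprocessing steps can be approximated in plain Python as a rough sketch. Grayscale conversion here uses the common BT.601 luma weights, which is an assumption about how the export was produced. The contrast step below is a global histogram equalization, a simpler stand-in for the adaptive equalization named above (which equalizes per image tile); it is not the same algorithm. The function names are hypothetical.

```python
def to_grayscale(rgb):
    """Convert rows of (r, g, b) tuples (0-255) to rows of gray ints.

    Assumes BT.601 luma weights; the tool's actual weights are not stated.
    """
    return [
        [min(255, round(0.299 * r + 0.587 * g + 0.114 * b)) for (r, g, b) in row]
        for row in rgb
    ]


def equalize_global(gray):
    """Global histogram equalization (a stand-in for adaptive equalization)."""
    flat = [p for row in gray for p in row]
    hist = [0] * 256
    for p in flat:
        hist[p] += 1

    # Cumulative distribution of pixel intensities.
    cdf, running = [], 0
    for count in hist:
        running += count
        cdf.append(running)

    n = len(flat)
    cdf_min = next(c for c in cdf if c > 0)
    if n == cdf_min:
        # Constant image: nothing to stretch.
        return [row[:] for row in gray]

    # Map each intensity so the darkest present value goes to 0, the brightest to 255.
    lut = [max(0, round((c - cdf_min) * 255 / (n - cdf_min))) for c in cdf]
    return [[lut[p] for p in row] for row in gray]


if __name__ == "__main__":
    gray = to_grayscale([[(50, 50, 50), (100, 100, 100)],
                         [(100, 100, 100), (150, 150, 150)]])
    print(equalize_global(gray))
```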

Augmentations

Outputs per training example: 3
Grayscale: Apply to 100% of images
Cutout: 1 box with 15% size
Bounding Box: Noise: Up to 5% of pixels
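
As an illustration of the cutout and bounding-box noise settings, the following sketch produces three augmented copies of one grayscale image using only the standard library. Several details are assumptions, not documented behavior of the export tool: the cutout box side is taken as 15% of each image dimension, the noise is salt-and-pepper, the noisy fraction is drawn uniformly between 0 and 5% of the pixels inside the box, and boxes are `(xmin, ymin, xmax, ymax)` with exclusive maxima. All function names are hypothetical.

```python
import random


def cutout(img, size_frac=0.15, rng=random):
    """Black out one rectangle whose sides are size_frac of the image sides."""
    h, w = len(img), len(img[0])
    bh, bw = max(1, round(h * size_frac)), max(1, round(w * size_frac))
    top = rng.randrange(0, h - bh + 1)
    left = rng.randrange(0, w - bw + 1)
    out = [row[:] for row in img]  # copy so the input is not modified
    for y in range(top, top + bh):
        for x in range(left, left + bw):
            out[y][x] = 0
    return out


def bbox_noise(img, box, max_frac=0.05, rng=random):
    """Set up to max_frac of the pixels inside box to 0 or 255."""
    xmin, ymin, xmax, ymax = box
    out = [row[:] for row in img]
    coords = [(y, x) for y in range(ymin, ymax) for x in range(xmin, xmax)]
    k = int(len(coords) * rng.uniform(0, max_frac))
    for y, x in rng.sample(coords, k):
        out[y][x] = rng.choice((0, 255))
    return out


def augment(img, box, outputs=3, seed=0):
    """Return `outputs` augmented copies (cutout, then in-box noise)."""
    rng = random.Random(seed)
    return [bbox_noise(cutout(img, rng=rng), box, rng=rng) for _ in range(outputs)]


if __name__ == "__main__":
    image = [[128] * 20 for _ in range(20)]
    copies = augment(image, box=(0, 0, 10, 10))
    print(len(copies), "augmented copies")
```

Seeding the generator keeps the augmented copies reproducible between runs, which helps when comparing experiments.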