Dataset Versions

v2

2023-07-09 3:25am

Generated on Jul 8, 2023

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
Other Formats
Choose another format.

Dataset Split

Train Set 2%
3Images
Valid Set 47%
59Images
Test Set 50%
63Images

Preprocessing

Resize: Stretch to 640x640
Grayscale: Applied

Augmentations

Outputs per training example: 3
Grayscale: Apply to 25% of images
Bounding Box: Shear: ±15° Horizontal, ±15° Vertical
Bounding Box: Blur: Up to 2.5px

Similar Projects

See More
1.5k images 1 model
684 images 1 model