Dataset Versions

v16

2024-05-11 2:00am

Generated on May 11, 2024

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (introduced by the PASCAL VOC challenge and also used by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
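As a rough illustration of the Pascal VOC XML format listed above, the sketch below reads one annotation file with Python's standard library. The sample document, file name, and class name are hypothetical and are not taken from this dataset; only the element layout (object, name, bndbox, xmin/ymin/xmax/ymax) follows the usual Pascal VOC convention.

```python
import xml.etree.ElementTree as ET

# Hypothetical annotation in the usual Pascal VOC layout (not from this dataset).
SAMPLE_XML = """<annotation>
  <filename>example.jpg</filename>
  <size><width>832</width><height>624</height><depth>3</depth></size>
  <object>
    <name>example_class</name>
    <bndbox><xmin>10</xmin><ymin>20</ymin><xmax>110</xmax><ymax>220</ymax></bndbox>
  </object>
</annotation>"""


def parse_voc(xml_text):
    """Return (filename, [(class_name, xmin, ymin, xmax, ymax), ...])."""
    root = ET.fromstring(xml_text)
    boxes = []
    for obj in root.iter("object"):
        bndbox = obj.find("bndbox")
        # Some exporters write coordinates as floats, so go through float first.
        coords = tuple(
            int(float(bndbox.findtext(tag)))
            for tag in ("xmin", "ymin", "xmax", "ymax")
        )
        boxes.append((obj.findtext("name"),) + coords)
    return root.findtext("filename"), boxes


filename, boxes = parse_voc(SAMPLE_XML)
print(filename, boxes)
```

For files on disk, ET.parse(path).getroot() can replace ET.fromstring.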

Preprocessing

Isolate Objects: Applied
Resize: Fit within 832x832
Modify Classes: 0 remapped, 2 dropped
Filter by Tag: 0 required, 1 dropped
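The "Fit within 832x832" resize can be sketched as a single uniform scale that keeps the aspect ratio and adds no padding. This is a minimal sketch, not the export tool's actual code: the function name fit_within and the allow_upscale flag are hypothetical, and whether images smaller than the limit are enlarged is an assumption.

```python
def fit_within(width, height, max_w=832, max_h=832, allow_upscale=True):
    """Scale (width, height) uniformly so the result fits inside max_w x max_h."""
    # The tighter of the two ratios decides the scale, so neither side overflows.
    scale = min(max_w / width, max_h / height)
    if not allow_upscale:
        # Assumption: optionally leave images that already fit untouched.
        scale = min(scale, 1.0)
    return max(1, round(width * scale)), max(1, round(height * scale))


print(fit_within(1664, 1248))                     # landscape source
print(fit_within(1000, 2000))                     # portrait source
print(fit_within(400, 300, allow_upscale=False))  # already fits
```

Bounding box coordinates would need to be multiplied by the same scale factor to stay aligned with the resized image.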

Augmentations

Outputs per training example: 2
Saturation: Between -3% and +3%
Brightness: Between -22% and +22%
Exposure: Between -1% and +1%
Mosaic: Applied
Bounding Box: Blur: Up to 0.5px
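To make the ranges above concrete, the following sketch draws one set of augmentation parameters for each of the 2 outputs per training example. It only samples parameter values and does not touch any image. Uniform sampling is an assumption (the page does not state the distribution), and the names RANGES and sample_augmentations are hypothetical.

```python
import random

# Ranges copied from the settings listed above.
RANGES = {
    "saturation_pct": (-3.0, 3.0),
    "brightness_pct": (-22.0, 22.0),
    "exposure_pct": (-1.0, 1.0),
    "bbox_blur_px": (0.0, 0.5),
}
OUTPUTS_PER_EXAMPLE = 2


def sample_augmentations(seed=None):
    """Return one parameter dict per output for a single training example."""
    rng = random.Random(seed)  # a seed makes the draw repeatable
    return [
        # Assumption: each value is drawn uniformly from its range.
        {name: rng.uniform(lo, hi) for name, (lo, hi) in RANGES.items()}
        for _ in range(OUTPUTS_PER_EXAMPLE)
    ]


for params in sample_augmentations(seed=0):
    print(params)
```

Mosaic is left out of the sketch because it combines several images rather than adjusting a single value.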