Dataset Versions

v4

augmented_data

Generated on Jun 23, 2022

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
Other Formats
Choose another format.

Dataset Split

Train Set 6%
600Images
Valid Set 1%
100Images
Test Set 93%
9700Images

Preprocessing

Modify Classes: 1 remapped, 0 dropped

Augmentations

Outputs per training example: 3
Brightness: Between -25% and +25%
Bounding Box: Rotation: Between -45° and +45°
Bounding Box: Shear: ±30° Horizontal, ±18° Vertical
Bounding Box: Noise: Up to 5% of pixels

Similar Projects

See More