Dataset Versions

v4

2023-06-17 5:01pm

Generated on Jun 17, 2023

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (pioneered by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
Other Formats
Choose another format.

Dataset Split

Train Set %
0Images
Valid Set 100%
400Images
Test Set %
0Images

Preprocessing

Auto-Orient: Applied
Static Crop: 37-59% Horizontal Region, 31-64% Vertical Region
Resize: Stretch to 640x640
Auto-Adjust Contrast: Using Adaptive Equalization
Tile: 2 rows x 2 columns

Augmentations

Outputs per training example: 3
90° Rotate: Clockwise, Counter-Clockwise
Shear: ±15° Horizontal, ±15° Vertical
Exposure: Between -40% and +40%
Bounding Box: Shear: ±37° Horizontal, ±6° Vertical