Dataset Versions

v2

2022-12-23 10:10am

Generated on Dec 23, 2022

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (introduced by the PASCAL VOC challenge and also used by ImageNet).
PaliGemma
JSONL format used for fine-tuning PaliGemma, Google's open vision-language model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.
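
For readers who download the Pascal VOC XML export, the following is a minimal sketch of reading bounding boxes with Python's standard library. The sample annotation, the class name `widget`, and the helper `parse_voc` are hypothetical and only illustrate the usual Pascal VOC layout; they are not taken from this dataset.

```python
import xml.etree.ElementTree as ET

# Hypothetical annotation written in the usual Pascal VOC layout.
SAMPLE_XML = """
<annotation>
  <filename>example.jpg</filename>
  <size><width>640</width><height>640</height></size>
  <object>
    <name>widget</name>
    <bndbox><xmin>48</xmin><ymin>120</ymin><xmax>210</xmax><ymax>305</ymax></bndbox>
  </object>
</annotation>
"""


def parse_voc(xml_text):
    """Return a list of (class_name, xmin, ymin, xmax, ymax) tuples."""
    root = ET.fromstring(xml_text.strip())
    boxes = []
    for obj in root.iter("object"):
        bndbox = obj.find("bndbox")
        # Some exporters write coordinates as floats, so parse via float first.
        coords = tuple(
            int(float(bndbox.findtext(tag)))
            for tag in ("xmin", "ymin", "xmax", "ymax")
        )
        boxes.append((obj.findtext("name"), *coords))
    return boxes


boxes = parse_voc(SAMPLE_XML)
```

In practice the same function could be applied to the text of each `.xml` file in the export, one file per image.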

Preprocessing

Auto-Orient: Applied
Isolate Objects: Applied
Static Crop: 25-75% Horizontal Region, 25-75% Vertical Region
Resize: Stretch to 640x640
Auto-Adjust Contrast: Using Adaptive Equalization
Grayscale: Applied
Tile: 2 rows x 2 columns
Modify Classes: 0 remapped, 0 dropped
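
The geometry behind the Static Crop and Tile settings above can be sketched in a few lines. This is an illustration only: the helper names `static_crop_box` and `tile_boxes` are hypothetical, the 1920x1080 source size is an example value, and the sketch assumes tiling is applied to the 640x640 resized image, which the listing itself does not state.

```python
def static_crop_box(width, height, h_range=(0.25, 0.75), v_range=(0.25, 0.75)):
    """Pixel box (left, top, right, bottom) for a percentage-based crop."""
    left = round(width * h_range[0])
    right = round(width * h_range[1])
    top = round(height * v_range[0])
    bottom = round(height * v_range[1])
    return (left, top, right, bottom)


def tile_boxes(width, height, rows=2, cols=2):
    """Split an image into rows x cols tiles, listed row by row."""
    boxes = []
    for r in range(rows):
        for c in range(cols):
            left = c * width // cols
            right = (c + 1) * width // cols
            top = r * height // rows
            bottom = (r + 1) * height // rows
            boxes.append((left, top, right, bottom))
    return boxes


# Example: crop the central region of a hypothetical 1920x1080 source image.
crop = static_crop_box(1920, 1080)

# After stretching the crop to 640x640, a 2x2 tiling gives four 320x320 tiles.
tiles = tile_boxes(640, 640, rows=2, cols=2)
```

Under these assumptions each source image would yield four tiles, so the number of exported images may be larger than the number of source images.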

Augmentations

Outputs per training example: 1
Flip: Horizontal
90° Rotate: Clockwise, Counter-Clockwise
Shear: ±15° Horizontal, ±15° Vertical
Grayscale: Apply to 100% of images
Hue: Between -78° and +78°
Brightness: Between -41% and +41%
Exposure: Between -49% and +49%
Noise: Up to 5% of pixels
Mosaic: Applied
Bounding Box: Flip: Horizontal, Vertical
Bounding Box: Exposure: Between -25% and +25%
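
To make the ranges above concrete, here is a rough sketch of how one set of image-level augmentation parameters might be drawn, together with the coordinate update a horizontal image flip implies for a box. The function names, the 50% flip probability, the inclusion of "no rotation" as a choice, and uniform sampling are all assumptions made for illustration; the listing does not say how the tool samples these values.

```python
import random


def sample_augmentation(rng):
    """Draw one set of image-level parameters within the listed ranges."""
    return {
        "flip_horizontal": rng.random() < 0.5,       # assumed 50% chance
        "rotate_degrees": rng.choice([0, 90, -90]),  # assumed: 0 means no rotation
        "shear_x_degrees": rng.uniform(-15, 15),
        "shear_y_degrees": rng.uniform(-15, 15),
        "hue_shift_degrees": rng.uniform(-78, 78),
        "brightness_delta": rng.uniform(-0.41, 0.41),
        "exposure_delta": rng.uniform(-0.49, 0.49),
    }


def flip_box_horizontal(box, image_width):
    """Mirror an (xmin, ymin, xmax, ymax) box across the vertical center line."""
    xmin, ymin, xmax, ymax = box
    return (image_width - xmax, ymin, image_width - xmin, ymax)


rng = random.Random(0)  # fixed seed so the sketch is repeatable
params = sample_augmentation(rng)
flipped = flip_box_horizontal((48, 120, 210, 305), image_width=640)
```

Note that the "Bounding Box" entries in the listing describe augmentations applied to the pixels inside each box, which is a different operation from the image-level flip sketched here.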