Browse » Benchmark » Classification Benchmark
Top Classification Benchmark Datasets
Roboflow hosts the most popular computer and machine vision benchmarking and transfer learning datasets. Datasets in this category include MNIST, Fashion-MNIST, and more.
251 images of playing cricket, football & baseball.
Original Dataset from Kaggle - Bikram Saha
This dataset contains 252 images of playing cricket, football, and baseball. (1) cricket
- 95
images (2) football
- 77
images (3) baseball
- 79
images
This is a dataset for image classification in sports. This model will help to identify if the sport or activity occurring in the image or video feed is, or most closely resembles, cricket
, football
, or baseball
.
The raw image versions (v1
or v5
) of the dataset can be downloaded, or the entire dataset can be cloned, to your own project for image classification, or to label the figures in the images for object detection, instance or semantic segmentation, etc.
Nike, Adidas and Converse Shoes Dataset for Classification
This dataset was obtained from Kaggle: https://www.kaggle.com/datasets/die9origephit/nike-adidas-and-converse-imaged/
Dataset Collection Methodology:
"The dataset was obtained downloading images from Google images
. The images with a .webp
format were transformed into .jpg
images. The obtained images were randomly shuffled and resized so that all the images had a resolution of 240x240 pixels
. Then, they were split into train
and test
datasets and saved."
Versions:
- v1:
original_raw-images
: the original images without Preprocessing or Augmentation applied, other than Auto-Orient to remove EXIF data. These images are in the original train/test split from Kaggle:237 images in each train set
and38 images in each test set
- v2:
original_trainTestSplit-augmented3x
: the original train/test split, augmented with 3x image generation. This version was not trained with Roboflow Train. - v3:
original_trainTestSplit-augmented5x
: the original train/test split, augmented with 5x image generation. This version was not trained with Roboflow Train. - v4:
rawImages_70-20-10split
: the original images without Preprocessing or Augmentation applied, other than Auto-Orient to remove EXIF data. Dataset splies were modified to a70% train
,20% valid
,10%
test train/valid/test split- NOTE: 70%/20%/10% split:
576 images in train set
,166 images in valid set
,83 images in test set
- NOTE: 70%/20%/10% split:
- v5:
70-20-10split-augmented3x
: modified to a70% train
,20% valid
,10%
test train/valid/test split, augmented with 3x image generation. This version was trained with Roboflow Train. - v6:
70-20-10split-augmented5x
: modified to a70% train
,20% valid
,10%
test train/valid/test split, augmented with 5x image generation. This version was trained with Roboflow Train.
Indoor Scene Recognition
From the official dataset page: Indoor scene recognition is a challenging open problem in high level vision. Most scene recognition models that work well for outdoor scenes perform poorly in the indoor domain. The main difficulty is that while some indoor scenes (e.g. corridors) can be well characterized by global spatial properties, others (e.g., bookstores) are better characterized by the objects they contain. More generally, to address the indoor scenes recognition problem we need a model that can exploit local and global discriminative information.
Database
The database contains 67 Indoor categories
... The number of images varies across categories, but there are at least 100 images per category. All images are in jpg format. The images provided here are for research purposes only.
Paper
A. Quattoni, and A.Torralba. Recognizing Indoor Scenes. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2009.
Acknowledgments
Thanks to Aude Oliva for helping to create the database of indoor scenes.
Funding for this research was provided by NSF Career award (IIS 0747120)
THE MNIST DATABASE of handwritten digits
Authors:
- Yann LeCun, Courant Institute, NYU
- Corinna Cortes, Google Labs, New York
- Christopher J.C. Burges, Microsoft Research, Redmond
Dataset Obtained From: http://yann.lecun.com/exdb/mnist/
All images were sized 28x28 in the original dataset
The MNIST database of handwritten digits, available from this page, has a training set of 60,000 examples, and a test set of 10,000 examples. It is a subset of a larger set available from NIST. The digits have been size-normalized and centered in a fixed-size image.
It is a good database for people who want to try learning techniques and pattern recognition methods on real-world data while spending minimal efforts on preprocessing and formatting.
Version 1 (original-images_trainSetSplitBy80_20):
- Original, raw images, with the
train
set split to provide 80% of its images to the training set and 20% of its images to the validation set - Trained from Roboflow Classification Model's ImageNet training checkpoint
Version 2 (original-images_ModifiedClasses_trainSetSplitBy80_20):
- Original, raw images, with the
train
set split to provide 80% of its images to the training set and 20% of its images to the validation set - Modify Classes, a Roboflow preprocessing feature, was employed to change class names from
0
,1
,2
,3
,4
,5
,6
,7
,8
,9
toone
,two
,three
,four
,five
,six
,seven
,eight
,nine
- Trained from the Roboflow Classification Model's ImageNet training checkpoint
Version 3 (original-images_Original-MNIST-Splits):
- Original images, with the original splits for MNIST:
train
(86% of images - 60,000 images) set andtest
(14% of images - 10,000 images) set only. - This version was not trained
Citation:
@article{lecun2010mnist,
title={MNIST handwritten digit database},
author={LeCun, Yann and Cortes, Corinna and Burges, CJ},
journal={ATT Labs [Online]. Available: http://yann.lecun.com/exdb/mnist},
volume={2},
year={2010}
}
CIFAR-10
The CIFAR-10 and CIFAR-100 are labeled subsets of the 80 million tiny images dataset. They were collected by Alex Krizhevsky, Vinod Nair, and Geoffrey Hinton.
- More info on CIFAR-10: https://www.cs.toronto.edu/~kriz/cifar.html
- TensorFlow listing of the dataset: https://www.tensorflow.org/datasets/catalog/cifar10
- GitHub repo for converting CIFAR-10
tarball
files topng
format: https://github.com/knjcode/cifar2png
All images were sized 32x32 in the original dataset
The CIFAR-10
dataset consists of 60,000 32x32 colour images in 10 classes
, with 6,000 images per class. There are 50,000
training images and 10,000 test
images [in the original dataset].
The dataset is divided into five training batches and one test batch, each with 10,000 images. The test
batch contains exactly 1,000 randomly-selected images from each class. The training batches contain the remaining images in random order, but some training batches may contain more images from one class than another. Between them, the training batches contain exactly 5,000 images from each class.
Here are the classes in the dataset, as well as 10 random images from each:
The classes are completely mutually exclusive. There is no overlap between automobiles
and trucks
. Automobile
includes sedans, SUVs, things of that sort. Truck
includes only big trucks. Neither includes pickup trucks.
Version 1 (original-images_Original-CIFAR10-Splits):
- Original images, with the original splits for CIFAR-10:
train
(83.33% of images - 50,000 images) set andtest
(16.67% of images - 10,000 images) set only. - This version was not trained
Version 3 (original-images_trainSetSplitBy80_20):
- Original, raw images, with the
train
set split to provide 80% of its images to the training set (approximately 40,000 images) and 20% of its images to the validation set (approximately 10,000 images) - https://blog.roboflow.com/train-test-split/
Citation:
@TECHREPORT{Krizhevsky09learningmultiple,
author = {Alex Krizhevsky},
title = {Learning multiple layers of features from tiny images},
institution = {},
year = {2009}
}
Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms
Authors:
- Han Xiao, Kashif Rasul and Roland Vollgraf
- https://arxiv.org/abs/1708.07747
Dataset Obtained From: https://github.com/zalandoresearch/fashion-mnist
All images were sized 28x28 in the original dataset
Fashion-MNIST
is a dataset of Zalando's article images—consisting of a training set of 60,000 examples and a test set of 10,000 examples. Each example is a 28x28 grayscale image, associated with a label from 10 classes. We intend Fashion-MNIST
to serve as a direct drop-in replacement for the original MNIST dataset for benchmarking machine learning algorithms. It shares the same image size and structure of training and testing splits.
Here's an example of how the data looks (each class takes three-rows):
Version 1 (original-images_Original-FashionMNIST-Splits):
- Original images, with the original splits for MNIST:
train
(86% of images - 60,000 images) set andtest
(14% of images - 10,000 images) set only. - This version was not trained
Version 3 (original-images_trainSetSplitBy80_20):
- Original, raw images, with the
train
set split to provide 80% of its images to the training set and 20% of its images to the validation set - https://blog.roboflow.com/train-test-split/
Citation:
@online{xiao2017/online,
author = {Han Xiao and Kashif Rasul and Roland Vollgraf},
title = {Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms},
date = {2017-08-28},
year = {2017},
eprintclass = {cs.LG},
eprinttype = {arXiv},
eprint = {cs.LG/1708.07747},
}