Browse » Benchmark » Classification Benchmark

Top Classification Benchmark Datasets

Roboflow hosts the most popular computer and machine vision benchmarking and transfer learning datasets. Datasets in this category include MNIST, Fashion-MNIST, and more.

251 images of playing cricket, football & baseball.

Original Dataset from Kaggle - Bikram Saha

This dataset contains 252 images of playing cricket, football, and baseball. (1) cricket - 95 images (2) football - 77 images (3) baseball - 79 images

This is a dataset for image classification in sports. This model will help to identify if the sport or activity occurring in the image or video feed is, or most closely resembles, cricket, football, or baseball.

The raw image versions (v1 or v5) of the dataset can be downloaded, or the entire dataset can be cloned, to your own project for image classification, or to label the figures in the images for object detection, instance or semantic segmentation, etc.

Nike, Adidas and Converse Shoes Dataset for Classification

This dataset was obtained from Kaggle: https://www.kaggle.com/datasets/die9origephit/nike-adidas-and-converse-imaged/

Dataset Collection Methodology:

"The dataset was obtained downloading images from Google images. The images with a .webp format were transformed into .jpg images. The obtained images were randomly shuffled and resized so that all the images had a resolution of 240x240 pixels. Then, they were split into train and test datasets and saved."

Versions:

  • v1: original_raw-images: the original images without Preprocessing or Augmentation applied, other than Auto-Orient to remove EXIF data. These images are in the original train/test split from Kaggle: 237 images in each train set and 38 images in each test set
  • v2: original_trainTestSplit-augmented3x: the original train/test split, augmented with 3x image generation. This version was not trained with Roboflow Train.
  • v3: original_trainTestSplit-augmented5x: the original train/test split, augmented with 5x image generation. This version was not trained with Roboflow Train.
  • v4: rawImages_70-20-10split: the original images without Preprocessing or Augmentation applied, other than Auto-Orient to remove EXIF data. Dataset splies were modified to a 70% train, 20% valid, 10% test train/valid/test split
    • NOTE: 70%/20%/10% split: 576 images in train set, 166 images in valid set, 83 images in test set
  • v5: 70-20-10split-augmented3x: modified to a 70% train, 20% valid, 10% test train/valid/test split, augmented with 3x image generation. This version was trained with Roboflow Train.
  • v6: 70-20-10split-augmented5x: modified to a 70% train, 20% valid, 10% test train/valid/test split, augmented with 5x image generation. This version was trained with Roboflow Train.

Indoor Scene Recognition

Examples of Images From the official dataset page: Indoor scene recognition is a challenging open problem in high level vision. Most scene recognition models that work well for outdoor scenes perform poorly in the indoor domain. The main difficulty is that while some indoor scenes (e.g. corridors) can be well characterized by global spatial properties, others (e.g., bookstores) are better characterized by the objects they contain. More generally, to address the indoor scenes recognition problem we need a model that can exploit local and global discriminative information.

Database

The database contains 67 Indoor categories ... The number of images varies across categories, but there are at least 100 images per category. All images are in jpg format. The images provided here are for research purposes only.

Paper

A. Quattoni, and A.Torralba. Recognizing Indoor Scenes. IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2009.

Acknowledgments

Thanks to Aude Oliva for helping to create the database of indoor scenes.
                            Funding for this research was provided by NSF Career award (IIS 0747120)
                            
MNIST
4

THE MNIST DATABASE of handwritten digits

Authors:

  • Yann LeCun, Courant Institute, NYU
  • Corinna Cortes, Google Labs, New York
  • Christopher J.C. Burges, Microsoft Research, Redmond

Dataset Obtained From: http://yann.lecun.com/exdb/mnist/

All images were sized 28x28 in the original dataset

The MNIST database of handwritten digits, available from this page, has a training set of 60,000 examples, and a test set of 10,000 examples. It is a subset of a larger set available from NIST. The digits have been size-normalized and centered in a fixed-size image.

It is a good database for people who want to try learning techniques and pattern recognition methods on real-world data while spending minimal efforts on preprocessing and formatting.

Version 1 (original-images_trainSetSplitBy80_20):

  • Original, raw images, with the train set split to provide 80% of its images to the training set and 20% of its images to the validation set
  • Trained from Roboflow Classification Model's ImageNet training checkpoint

Version 2 (original-images_ModifiedClasses_trainSetSplitBy80_20):

  • Original, raw images, with the train set split to provide 80% of its images to the training set and 20% of its images to the validation set
  • Modify Classes, a Roboflow preprocessing feature, was employed to change class names from 0, 1, 2, 3, 4, 5, 6, 7, 8, 9 to one, two, three, four, five, six, seven, eight, nine
  • Trained from the Roboflow Classification Model's ImageNet training checkpoint

Version 3 (original-images_Original-MNIST-Splits):

  • Original images, with the original splits for MNIST: train (86% of images - 60,000 images) set and test (14% of images - 10,000 images) set only.
  • This version was not trained

Citation:

@article{lecun2010mnist,
                              title={MNIST handwritten digit database},
                              author={LeCun, Yann and Cortes, Corinna and Burges, CJ},
                              journal={ATT Labs [Online]. Available: http://yann.lecun.com/exdb/mnist},
                              volume={2},
                              year={2010}
                            }
                            

CIFAR-10

The CIFAR-10 and CIFAR-100 are labeled subsets of the 80 million tiny images dataset. They were collected by Alex Krizhevsky, Vinod Nair, and Geoffrey Hinton.

All images were sized 32x32 in the original dataset

The CIFAR-10 dataset consists of 60,000 32x32 colour images in 10 classes, with 6,000 images per class. There are 50,000 training images and 10,000 test images [in the original dataset].

The dataset is divided into five training batches and one test batch, each with 10,000 images. The test batch contains exactly 1,000 randomly-selected images from each class. The training batches contain the remaining images in random order, but some training batches may contain more images from one class than another. Between them, the training batches contain exactly 5,000 images from each class.

Here are the classes in the dataset, as well as 10 random images from each: Visualized CIFAR-10 Dataset Subset

The classes are completely mutually exclusive. There is no overlap between automobiles and trucks. Automobile includes sedans, SUVs, things of that sort. Truck includes only big trucks. Neither includes pickup trucks.

Version 1 (original-images_Original-CIFAR10-Splits):

  • Original images, with the original splits for CIFAR-10: train (83.33% of images - 50,000 images) set and test (16.67% of images - 10,000 images) set only.
  • This version was not trained

Version 3 (original-images_trainSetSplitBy80_20):

  • Original, raw images, with the train set split to provide 80% of its images to the training set (approximately 40,000 images) and 20% of its images to the validation set (approximately 10,000 images)
  • https://blog.roboflow.com/train-test-split/ Train/Valid/Test Split Rebalancing

Citation:

@TECHREPORT{Krizhevsky09learningmultiple,
                                author = {Alex Krizhevsky},
                                title = {Learning multiple layers of features from tiny images},
                                institution = {},
                                year = {2009}
                            }
                            

Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms

Authors:

Dataset Obtained From: https://github.com/zalandoresearch/fashion-mnist

All images were sized 28x28 in the original dataset

Fashion-MNIST is a dataset of Zalando's article images—consisting of a training set of 60,000 examples and a test set of 10,000 examples. Each example is a 28x28 grayscale image, associated with a label from 10 classes. We intend Fashion-MNIST to serve as a direct drop-in replacement for the original MNIST dataset for benchmarking machine learning algorithms. It shares the same image size and structure of training and testing splits.

Here's an example of how the data looks (each class takes three-rows): Visualized Fashion MNIST dataset

Version 1 (original-images_Original-FashionMNIST-Splits):

  • Original images, with the original splits for MNIST: train (86% of images - 60,000 images) set and test (14% of images - 10,000 images) set only.
  • This version was not trained

Version 3 (original-images_trainSetSplitBy80_20):

Citation:

@online{xiao2017/online,
                              author       = {Han Xiao and Kashif Rasul and Roland Vollgraf},
                              title        = {Fashion-MNIST: a Novel Image Dataset for Benchmarking Machine Learning Algorithms},
                              date         = {2017-08-28},
                              year         = {2017},
                              eprintclass  = {cs.LG},
                              eprinttype   = {arXiv},
                              eprint       = {cs.LG/1708.07747},
                            }