aicook Computer Vision Project
Updated 3 years ago
16k
563
Metrics
Background Information
This dataset was curated and annotated by - Karel Cornelis.
The original dataset (v1) is composed of 516 images of various ingredients inside a fridge. The project was created as part of a groupwork for a postgraduate applied AI at Erasmus Brussels - we made an object detection model to identify ingredients in a fridge.
From the recipe dataset we used (which is a subset of the recipe1M dataset) we distilled the top50 ingredients and used 30 of those to randomly fill our fridge.
Read this blog post to learn more about the model production process: How I Used Computer Vision to Make Sense of My Fridge
Watch this video to see the model in action: AICook
The dataset is available under the MIT License.
Getting Started
You can download this dataset for use within your own project, fork it into a workspace on Roboflow to create your own model, or test one of the trained versions within the app.
Dataset Versions
Version 1 (v1) - 516 images (original-images)
- Preprocessing: Auto-Orient
- Augmentations: No augmentations applied
- Training Metrics: This version of the dataset was not trained
Version 2 (v2) - 3,050 images (aicook-augmented-trainFromCOCO)
- Preprocessing: Auto-Orient, Resize (Stretch to 416x416)
- Augmentations:
- Outputs per training example: 8 Rotation: Between -3° and +3° Exposure: Between -20% and +20% Blur: Up to 3px Noise: Up to 5% of pixels Cutout: 12 boxes with 10% size each
- Training Metrics: Trained from the COCO Checkpoint in Public Models ("transfer learning") on Roboflow
- mAP = 97.6%, precision = 86.9%, recall = 98.5%
Version 3 (v3) - 3,050 images (aicook-augmented-trainFromScratch)
- Preprocessing: Auto-Orient, Resize (Stretch to 416x416)
- Augmentations:
- Outputs per training example: 8 Rotation: Between -3° and +3° Exposure: Between -20% and +20% Blur: Up to 3px Noise: Up to 5% of pixels Cutout: 12 boxes with 10% size each
- Training Metrics: Trained from "scratch" (no transfer learning employed) on Roboflow
- mAP = 97.9%, precision = 79.6%, recall = 98.6%
Version 4 (v4) - 3,050 images images (aicook-augmented)
- Preprocessing: Auto-Orient, Resize (Stretch to 416x416)
- Augmentations:
- Outputs per training example: 8 Rotation: Between -3° and +3° Exposure: Between -20% and +20% Blur: Up to 3px Noise: Up to 5% of pixels Cutout: 12 boxes with 10% size each
- Training Metrics: This version of the dataset was not trained
Karel Cornelis - LinkedIn
Use This Trained Model
Try it in your browser, or deploy via our Hosted Inference API and other deployment methods.
Build Computer Vision Applications Faster with Supervision
Visualize and process your model results with our reusable computer vision tools.
Cite This Project
If you use this dataset in a research paper, please cite it using the following BibTeX:
@misc{
aicook-lcv4d_dataset,
title = { aicook Dataset },
type = { Open Source Dataset },
author = { Karel Cornelis },
howpublished = { \url{ https://universe.roboflow.com/karel-cornelis-q2qqg/aicook-lcv4d } },
url = { https://universe.roboflow.com/karel-cornelis-q2qqg/aicook-lcv4d },
journal = { Roboflow Universe },
publisher = { Roboflow },
year = { 2021 },
month = { sep },
note = { visited on 2024-11-21 },
}