Website Screenshots Computer Vision Project
The Roboflow Website Screenshots dataset is a synthetically generated dataset composed of screenshots from over 1000 of the world's top websites. They have been automatically annotated to label the following classes:
button - navigation links, tabs, etc. heading - text that was enclosed in <h1> to <h6> tags. link - inline, textual <a> tags. label - text labeling form fields. text - all other text. image - <img>, <svg>, or <video> tags, and icons. iframe - ads and 3rd party content.
Example
This is an example image and annotation from the dataset:
Usage
Annotated screenshots are very useful in Robotic Process Automation. But they can be expensive to label. This dataset would cost over $4000 for humans to label on popular labeling services. We hope this dataset provides a good starting point for your project. Try it with a model from our model library.
Collecting Custom Data
Roboflow is happy to provide a custom screenshots dataset to meet your particular needs. We can crawl public or internal web applications. Just reach out and we'll be happy to provide a quote!
About Roboflow
Roboflow makes managing, preprocessing, augmenting, and versioning datasets for computer vision seamless.
Developers reduce 50% of their boilerplate code when using Roboflow's workflow, save training time, and increase model reproducibility.
Trained Model API
This project has a trained model available that you can try in your browser and use to get predictions via our Hosted Inference API and other deployment methods.
Cite This Project
If you use this dataset in a research paper, please cite it using the following BibTeX:
@misc{
website-screenshots-ibe6t_dataset,
title = { Website Screenshots Dataset },
type = { Open Source Dataset },
author = { Roboflow Public },
howpublished = { \url{ https://universe.roboflow.com/roboflow-public/website-screenshots-ibe6t } },
url = { https://universe.roboflow.com/roboflow-public/website-screenshots-ibe6t },
journal = { Roboflow Universe },
publisher = { Roboflow },
year = { 2021 },
month = { dec },
note = { visited on 2024-05-16 },
}
Connect Your Model With Program Logic
Find utilities and guides to help you start using the Website Screenshots project in your project.