Website Screenshots Computer Vision Project

About This Dataset

The Roboflow Website Screenshots dataset is a synthetically generated dataset composed of screenshots from over 1,000 of the world's top websites. The screenshots have been automatically annotated with the following classes:

  • button - navigation links, tabs, etc.
  • heading - text that was enclosed in <h1> to <h6> tags.
  • link - inline, textual <a> tags.
  • label - text labeling form fields.
  • text - all other text.
  • image - <img>, <svg>, or <video> tags, and icons.
  • iframe - ads and 3rd party content.
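
As a rough illustration of how the tag-based class definitions above could be applied, the sketch below maps an HTML tag name to a class name. It is not the pipeline used to generate the dataset: `class_for_tag` is a hypothetical helper, the `iframe` rule is an assumption based on the class name, and classes that depend on context rather than on the tag alone (button, label) are deliberately left out.

```python
# Hypothetical helper for illustration only; not the dataset's actual
# annotation code. It covers only the classes whose definitions name tags.

HEADING_TAGS = {"h1", "h2", "h3", "h4", "h5", "h6"}
IMAGE_TAGS = {"img", "svg", "video"}


def class_for_tag(tag: str) -> str:
    """Return the dataset class name suggested by an HTML tag name."""
    tag = tag.lower()
    if tag in HEADING_TAGS:
        return "heading"
    if tag in IMAGE_TAGS:
        return "image"
    if tag == "a":
        return "link"
    if tag == "iframe":
        # Assumption: the class name implies <iframe> elements.
        return "iframe"
    # Everything else falls back to the catch-all text class.
    return "text"


if __name__ == "__main__":
    for name in ("H2", "a", "svg", "iframe", "p"):
        print(name, "->", class_for_tag(name))
```

A lookup like this only captures the definitions that name tags explicitly. Deciding whether an `<a>` element is an inline link or a navigation button would need layout or context information that a tag name does not carry.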

Example

This is an example image and annotation from the dataset:
Wikipedia Screenshot

Usage

Annotated screenshots are very useful in Robotic Process Automation (RPA), but they can be expensive to label: this dataset would cost over $4,000 for humans to label on popular labeling services. We hope it provides a good starting point for your project. Try it with a model from our model library.

Collecting Custom Data

Roboflow is happy to provide a custom screenshots dataset to meet your particular needs. We can crawl public or internal web applications. Just reach out and we'll be happy to provide a quote!

About Roboflow

Roboflow makes managing, preprocessing, augmenting, and versioning datasets for computer vision seamless.
Developers using Roboflow's workflow cut their boilerplate code by 50%, save training time, and increase model reproducibility.

Trained Model API

This project has a trained model available that you can try in your browser and use to get predictions via our Hosted Inference API and other deployment methods.
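
Once predictions come back from an inference endpoint, they usually need a small amount of post-processing. The sketch below assumes the response is JSON with a `predictions` list whose entries carry center-based `x`, `y`, `width`, and `height` values plus `class` and `confidence`; check these field names against the actual API output before relying on them. `to_corner_boxes` and `SAMPLE_RESPONSE` are hypothetical, and the sample values are made up for illustration, not real model output.

```python
import json
from typing import Dict, List

# Made-up sample payload in the ASSUMED response shape (not real model output).
SAMPLE_RESPONSE = json.loads("""
{
  "predictions": [
    {"x": 100, "y": 50, "width": 80, "height": 20,
     "class": "button", "confidence": 0.9},
    {"x": 300, "y": 200, "width": 200, "height": 100,
     "class": "image", "confidence": 0.3}
  ]
}
""")


def to_corner_boxes(response: Dict, min_confidence: float = 0.5) -> List[Dict]:
    """Filter predictions by confidence and convert center boxes to corners."""
    boxes = []
    for pred in response.get("predictions", []):
        if pred["confidence"] < min_confidence:
            continue  # drop low-confidence detections
        half_w = pred["width"] / 2
        half_h = pred["height"] / 2
        boxes.append({
            "class": pred["class"],
            "confidence": pred["confidence"],
            "x_min": pred["x"] - half_w,
            "y_min": pred["y"] - half_h,
            "x_max": pred["x"] + half_w,
            "y_max": pred["y"] + half_h,
        })
    return boxes


if __name__ == "__main__":
    for box in to_corner_boxes(SAMPLE_RESPONSE):
        print(box)
```

Corner coordinates are convenient for cropping elements out of a screenshot or for driving click targets in an automation script. The confidence threshold of 0.5 is an arbitrary default for this sketch.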

Cite this Project

If you use this dataset in a research paper, please cite it using the following BibTeX:

@misc{ website-screenshots_dataset,
    title = { Website Screenshots Dataset },
    type = { Open Source Dataset },
    author = { Brad Dwyer },
    howpublished = { \url{ https://universe.roboflow.com/roboflow-gw7yv/website-screenshots } },
    url = { https://universe.roboflow.com/roboflow-gw7yv/website-screenshots },
    journal = { Roboflow Universe },
    publisher = { Roboflow },
    year = { 2022 },
    month = { aug },
    note = { visited on 2023-02-01 },
}

Source

Brad Dwyer

Maintainer

Roboflow

Last Updated

5 months ago

Project Type

Object Detection

Subject

elements

Classes

button, field, heading, iframe, image, label, link, text

License

MIT