Documents

Datasets for applying computer vision to images of documents, invoices, papers, contracts, screenshots, text, signatures, PDFs, JPEGs, PNGs, and more. Commonly used with optical character recognition (OCR) to translate text into usable data.

Guide to extract document structure
https://blog.roboflow.com/using-computer-vision-extract-document-structure/

Document processing case study
https://roboflow.com/case-study/column

Research for object detection with PDFs
https://vtechworks.lib.vt.edu/bitstream/handle/10919/109979/ObjectDetectionReport.pdf?sequence=16&isAllowed=y

The BIB_Detection image dataset contains over one thousand images of athletes running in races, with annotations for every bib. This dataset and its already-trained object detection model are a good starting point for anyone building a computer vision project around a race. The data could also be used to augment an existing dataset for detecting numbers or characters more generally.

About This Dataset

The Roboflow Website Screenshots dataset is a synthetically generated dataset composed of screenshots from over 1000 of the world's top websites. They have been automatically annotated to label the following classes:

  • button - navigation links, tabs, etc.
  • heading - text that was enclosed in <h1> to <h6> tags.
  • link - inline, textual <a> tags.
  • label - text labeling form fields.
  • text - all other text.
  • image - <img>, <svg>, or <video> tags, and icons.
  • iframe - ads and 3rd party content.
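The tag-based classes above can be illustrated with a minimal sketch. It assumes a simple one-to-one mapping from HTML tag name to class name; this is an illustration only, not the generator Roboflow used to build the dataset. The names `TAG_TO_CLASS`, `TagClassCounter`, and `count_classes` are hypothetical. The `button` and `text` classes are left out because telling a navigation link from an inline link, or finding "all other text", needs more context than the tag name alone.

```python
from html.parser import HTMLParser

# Hypothetical mapping from HTML tag names to the dataset's class names.
# Only classes that the list above ties to specific tags are included.
TAG_TO_CLASS = {
    **{f"h{level}": "heading" for level in range(1, 7)},  # <h1> to <h6>
    "a": "link",
    "label": "label",
    "img": "image",
    "svg": "image",
    "video": "image",
    "iframe": "iframe",
}


class TagClassCounter(HTMLParser):
    """Counts how many opening tags fall into each dataset class."""

    def __init__(self):
        super().__init__()
        self.counts = {}

    def handle_starttag(self, tag, attrs):
        # HTMLParser lowercases tag names before calling this method.
        cls = TAG_TO_CLASS.get(tag)
        if cls is not None:
            self.counts[cls] = self.counts.get(cls, 0) + 1


def count_classes(html: str) -> dict:
    """Return a {class_name: count} dict for the given HTML string."""
    parser = TagClassCounter()
    parser.feed(html)
    parser.close()
    return parser.counts


if __name__ == "__main__":
    page = '<h1>Title</h1><p>See <a href="#">this</a></p><img src="a.png">'
    print(count_classes(page))
```

A real annotation pipeline would also need each element's rendered bounding box, which requires a browser engine rather than a plain HTML parser.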

Example

This is an example image and annotation from the dataset:
Wikipedia Screenshot

Usage

Annotated screenshots are very useful in Robotic Process Automation, but they can be expensive to label. This dataset would cost over $4,000 for humans to label on popular labeling services. We hope this dataset provides a good starting point for your project. Try it with a model from our model library.

Collecting Custom Data

Roboflow is happy to provide a custom screenshots dataset to meet your particular needs. We can crawl public or internal web applications. Just reach out and we'll send you a quote!

About Roboflow

Roboflow makes managing, preprocessing, augmenting, and versioning datasets for computer vision seamless.
Developers cut their boilerplate code by 50% when using Roboflow's workflow, save training time, and increase model reproducibility.
