Top Documents Datasets

Datasets related to using computer vision with images of documents, invoices, papers, contracts, screenshots, text, signatures, pdfs, jpegs, pngs, and more. Commonly used with optical character recognition (OCR) to translate text into usable data.

Guide to extract document structure
https://blog.roboflow.com/using-computer-vision-extract-document-structure/

Document processing case study
https://roboflow.com/case-study/column

Research for object detection with PDFs
https://vtechworks.lib.vt.edu/bitstream/handle/10919/109979/ObjectDetectionReport.pdf?sequence=16&isAllowed=y

1682 images1 model
Updated 10 months ago
4
3866 images1 model
Updated 2 years ago
1
883 images2 models
Updated 10 months ago
4
116 images1 model
Updated 2 months ago
2
1798 images1 model
Updated 10 months ago
4
6063 images1 model
Updated 2 years ago
5
13
368 images1 model
Updated 2 years ago
30
1206 images1 model
Updated 3 years ago
5
12040 images1 model
Updated 2 years ago
21
6076 images1 model
Updated 2 years ago
2
1325 images2 models
Updated 2 years ago
4
1719 images1 model
Updated 10 months ago
1
142 images2 models
Updated 2 years ago
1
1483 images2 models
Updated 3 years ago
1
1237 images1 model
Updated 3 months ago
2
4588 images2 models
Updated 3 years ago
701 images1 model
Updated 10 months ago
3
829 images1 model
Updated 10 months ago
1
8973 images1 model
Updated 3 years ago
508 images
Updated 10 months ago
2000 images
Updated 10 months ago
1
4174 images
Updated 3 years ago
100 images
Updated 2 years ago
1
327 images
Updated 2 years ago
4418 images
Updated 3 years ago
3893 images
Updated 10 months ago
3393 images
Updated 3 years ago
14244 images
Updated 3 years ago
350 images
Updated 2 years ago
2
13033 images
Updated 10 months ago
1
3915 images
Updated 3 years ago
1
4018 images
Updated 3 years ago
6301 images
Updated 3 years ago
7793 images
Updated 3 years ago
18463 images
Updated 3 years ago