Top Multimodal Image Datasets and Models

Roboflow hosts free public multimodal datasets covering a wide range of areas, from pallet load manifest to LaTeX OCR. Below we have curated a few multimodal datasets and models that you can use for your next vision project. You can download or fork these datasets for use in building your own multimodal models, you can also use the search bar above to search for datasets that meet your needs.

Build and share your datasets and models using Roboflow.

ChartQA Dataset

20818 images | 1 exports | Last updated 13 days ago

ChartQA Dataset

LaTeX OCR Dataset

23939 images | 2 exports | Last updated 2 months ago

LaTeX OCR Dataset

Pallet Load Manifest Dataset

170 images | 8 exports | Last updated 2 months ago

Pallet Load Manifest Dataset

TLID Dataset

136 images | 5 exports | Last updated 6 months ago

TLID Dataset