Open Poetry Vision Computer Vision Project


The Open Poetry Vision dataset is a synthetic dataset created by Roboflow for OCR tasks.

Each image combines a random image from the Open Images Dataset with text primarily sampled from Gwern's GPT-2 Poetry project. Every image contains between 1 and 5 strings, rendered in a variety of fonts and colors and randomly positioned on a 512x512 canvas. The classes correspond to the font of the text.
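The sampling step described above can be sketched roughly as follows. This is not Roboflow's actual generator: the function name `sample_layout`, the fixed box size, and the short font list are assumptions made for illustration, and rendering the text onto the background image is left out.

```python
import random

CANVAS = 512  # canvas side length in pixels, from the description above

# A handful of the dataset's font classes, used here as sample labels.
FONTS = ["Arial", "Courier", "Futura", "Georgia", "Helvetica"]


def sample_layout(lines, rng, box_w=200, box_h=40):
    """Pick 1 to 5 strings and give each a font, a color, and a position.

    box_w and box_h are hypothetical fixed text-box sizes; a real generator
    would measure the rendered text instead.
    """
    annotations = []
    for _ in range(rng.randint(1, 5)):  # randint is inclusive on both ends
        x = rng.randint(0, CANVAS - box_w)
        y = rng.randint(0, CANVAS - box_h)
        annotations.append({
            "text": rng.choice(lines),
            "label": rng.choice(FONTS),  # the class is the font name
            "color": tuple(rng.randint(0, 255) for _ in range(3)),  # RGB
            "bbox": (x, y, x + box_w, y + box_h),  # x_min, y_min, x_max, y_max
        })
    return annotations


if __name__ == "__main__":
    rng = random.Random(0)
    for ann in sample_layout(["a line of verse", "another line"], rng):
        print(ann)
```

Each returned entry pairs a bounding box with a font label, which is the shape of annotation an object detection dataset like this one needs.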

Use Cases

A common OCR workflow uses a neural network to isolate text regions, which are then passed to traditional optical character recognition software. This dataset could make a good starting point for an OCR project such as business card parsing or automated processing of paper forms.
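As a minimal sketch of that two-stage workflow, the code below crops each detected region and hands it to a recognizer. The `detections` list stands in for a text detector's output and `recognize` for an OCR engine; both are placeholders chosen for this example, not real library APIs, and the image is a plain list of pixel rows to keep the sketch dependency-free.

```python
def crop(image, bbox):
    """Return the sub-image inside bbox = (x_min, y_min, x_max, y_max).

    `image` is a non-empty list of pixel rows; the box is clamped to the
    image bounds so a slightly oversized detection does not fail.
    """
    height, width = len(image), len(image[0])
    x0, y0, x1, y1 = bbox
    x0, x1 = max(0, x0), min(width, x1)
    y0, y1 = max(0, y0), min(height, y1)
    return [row[x0:x1] for row in image[y0:y1]]


def read_text(image, detections, recognize):
    """Crop every detected text region and pass it to an OCR callable."""
    return [recognize(crop(image, det["bbox"])) for det in detections]


if __name__ == "__main__":
    # A tiny 4x6 "image" and one fake detection, just to exercise the flow.
    image = [[r * 10 + c for c in range(6)] for r in range(4)]
    detections = [{"bbox": (1, 1, 3, 3)}]
    # The stand-in recognizer only reports the size of the crop it received.
    print(read_text(image, detections, lambda region: (len(region[0]), len(region))))
```

In a real project the detector would be a model trained on a dataset like this one, and `recognize` would wrap whichever OCR software you already use.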

Alternatively, you could try your hand at using this as a neural font identification dataset. NVIDIA, among others, has had success with this task.

Using this Dataset

Use the fork button to copy this dataset to your own Roboflow account. You can then export it with new preprocessing settings (perhaps resized to your model's expected input size or converted to grayscale) or with additional augmentations to help your model generalize better. This particular dataset is well suited to Roboflow's new advanced Bounding Box Only Augmentations.
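Roboflow applies resizing for you on export, but the arithmetic behind it is simple: when an image is resized, its bounding boxes scale by the same factor. The sketch below assumes square images and uses 416 as a purely hypothetical target size; `scale_bbox` is a name invented for this example.

```python
def scale_bbox(bbox, src_size=512, dst_size=416):
    """Scale (x_min, y_min, x_max, y_max) from a square source image
    to a square destination image, rounding to whole pixels."""
    factor = dst_size / src_size
    return tuple(round(v * factor) for v in bbox)


if __name__ == "__main__":
    print(scale_bbox((0, 128, 256, 512)))  # (0, 104, 208, 416)
```

Non-square resizes would need separate horizontal and vertical factors.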

Version 5 of this dataset (classes_all_text-raw-images) has all classes remapped to the single label "text". This was accomplished by using Modify Classes as a preprocessing step.

Version 6 of this dataset (classes_all_text-augmented-FAST) has all classes remapped to the label "text" and was trained with Roboflow's Fast Model.

Version 7 of this dataset (classes_all_text-augmented-ACCURATE) has all classes remapped to the label "text" and was trained with Roboflow's Accurate Model.
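The class remapping used in these versions amounts to replacing every font label with one shared label. The snippet below illustrates the effect only; it is not how Modify Classes is implemented, and the annotation format (dictionaries with `label` and `bbox` keys) is an assumption for this example.

```python
def remap_to_text(annotations, target="text"):
    """Return copies of the annotations with every label set to `target`.

    The input list and its dictionaries are left unmodified.
    """
    return [{**ann, "label": target} for ann in annotations]


if __name__ == "__main__":
    annotations = [
        {"label": "Arial", "bbox": (0, 0, 10, 10)},
        {"label": "Futura", "bbox": (5, 5, 20, 20)},
    ]
    for ann in remap_to_text(annotations):
        print(ann["label"], ann["bbox"])
```

Collapsing the font classes this way turns the font identification problem into plain text detection, which is what the OCR workflow above needs.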

About Roboflow

Roboflow makes managing, preprocessing, augmenting, and versioning datasets for computer vision seamless.

Developers using Roboflow's workflow reduce their code by 50%, automate annotation quality assurance, save training time, and increase model reproducibility.

Cite this Project

If you use this dataset in a research paper, please cite it using the following BibTeX:

@misc{ open-poetry-vision_dataset,
    title = { Open Poetry Vision Dataset },
    type = { Open Source Dataset },
    author = { Brad Dwyer },
    howpublished = { \url{ } },
    url = { },
    journal = { Roboflow Universe },
    publisher = { Roboflow },
    year = { 2022 },
    month = { apr },
    note = { visited on 2022-10-04 },
}

Author

Brad Dwyer

Last Updated

6 months ago

Project Type

Object Detection

Classes

American Typewriter, Andale Mono, Apple Chancery, Arial, Avenir, Baskerville, Big Caslon, Bradley Hand, Brush Script MT, Chalkboard, Comic Sans MS, Copperplate, Courier, Didot, Futura, Geneva, Georgia, Gill Sans, Helvetica, Herculanum, Impact, Kefa, Lucida Grande, Luminari, Marker Felt, Menlo, Monaco, Noteworthy, Optima, PT Sans, PT Serif, Palatino, Papyrus, Phosphate, Rockwell, SF Pro, SignPainter, Skia, Snell Roundhand, Tahoma, Times New Roman, Trebuchet MS, Verdana

License

CC BY 4.0