Open Poetry Vision Computer Vision Project

Images

2000 images

Overview

The Open Poetry Vision dataset is a synthetic dataset created by Roboflow for OCR tasks.

Each image combines a random image from the Open Images Dataset with text primarily sampled from Gwern's GPT-2 Poetry project. Each image contains between 1 and 5 strings, rendered in a variety of fonts and colors and randomly positioned on a 512x512 canvas. The classes correspond to the font of the text.
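The layout rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the generator Roboflow used: `make_layout`, the fixed `char_w` and `line_h` metrics, and the five-font subset are hypothetical stand-ins, and no pixels are actually rendered.

```python
import random

CANVAS = 512  # canvas side in pixels, as stated in the overview
FONTS = ["Arial", "Courier", "Georgia", "Helvetica", "Verdana"]  # small subset of the classes


def make_layout(lines, rng, char_w=9, line_h=20):
    """Return annotation dicts for one synthetic image.

    char_w and line_h are hypothetical stand-ins for real font metrics.
    """
    annotations = []
    for _ in range(rng.randint(1, 5)):  # between 1 and 5 strings per image
        text = rng.choice(lines)
        width = min(len(text) * char_w, CANVAS)  # crude width estimate
        height = line_h
        # Pick a top-left corner so the whole box stays on the canvas.
        x = rng.randint(0, CANVAS - width)
        y = rng.randint(0, CANVAS - height)
        annotations.append({
            "class": rng.choice(FONTS),  # the label is the font name
            "text": text,
            "color": tuple(rng.randint(0, 255) for _ in range(3)),
            "bbox": (x, y, width, height),
        })
    return annotations


layout = make_layout(["a rose", "by any other name"], random.Random(0))
```

Each returned dict pairs a font label with a box, which mirrors how an object detection annotation for this dataset would be structured.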

Use Cases

A common OCR workflow is to use a neural network to isolate text for input into traditional optical character recognition software. This dataset could make a good starting point for an OCR project like business card parsing or automated paper form-processing.
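The hand-off between the detector and the OCR engine in that workflow might look like the sketch below. It assumes each prediction is a dict with a center point (`x`, `y`), a `width`, a `height`, and a `confidence`; that shape, the helper names, and the `pad` and `min_confidence` defaults are assumptions for illustration. The OCR call itself is left out.

```python
def to_crop_box(pred, canvas=512, pad=2):
    """Convert a center-format box to a clamped (left, top, right, bottom) tuple."""
    half_w = pred["width"] / 2
    half_h = pred["height"] / 2
    left = max(0, int(pred["x"] - half_w) - pad)
    top = max(0, int(pred["y"] - half_h) - pad)
    right = min(canvas, int(pred["x"] + half_w) + pad)
    bottom = min(canvas, int(pred["y"] + half_h) + pad)
    return left, top, right, bottom


def crops_in_reading_order(predictions, min_confidence=0.5):
    """Drop weak detections and sort the remaining crops top-to-bottom, then left-to-right."""
    kept = [p for p in predictions if p["confidence"] >= min_confidence]
    boxes = [to_crop_box(p) for p in kept]
    return sorted(boxes, key=lambda b: (b[1], b[0]))


predictions = [
    {"x": 100, "y": 50, "width": 40, "height": 20, "confidence": 0.9},
    {"x": 505, "y": 10, "width": 30, "height": 30, "confidence": 0.8},
    {"x": 300, "y": 300, "width": 50, "height": 20, "confidence": 0.2},
]
crops = crops_in_reading_order(predictions)
```

Each resulting tuple can be passed to an image library's crop function, and the cropped region then goes to the OCR software of your choice.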

Alternatively, you could try your hand at using this as a neural font identification dataset. Nvidia, among others, has had success with this task.

Using this Dataset

Use the fork button to copy this dataset to your own Roboflow account and export it with new preprocessing settings (perhaps resized for your model's desired format or converted to grayscale), or additional augmentations to make your model generalize better. This particular dataset would be very well suited for Roboflow's new advanced Bounding Box Only Augmentations.
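To illustrate what a bounding-box-only augmentation means, here is a minimal pure-Python sketch that converts only the pixels inside each box to grayscale and leaves the background untouched. It is not Roboflow's implementation: the function name, the nested-list image representation, and the `(x, y, width, height)` box format are assumptions made for the example.

```python
def grayscale_inside_boxes(image, boxes):
    """Return a copy of image with pixels inside each (x, y, w, h) box made gray.

    image is a list of rows, each row a list of (r, g, b) tuples.
    """
    out = [list(row) for row in image]  # copy so the input stays unchanged
    for x, y, w, h in boxes:
        for row in range(y, min(y + h, len(out))):
            for col in range(x, min(x + w, len(out[row]))):
                r, g, b = out[row][col]
                # Standard luma weights for converting RGB to gray.
                lum = round(0.299 * r + 0.587 * g + 0.114 * b)
                out[row][col] = (lum, lum, lum)
    return out


red = [[(255, 0, 0) for _ in range(4)] for _ in range(4)]
augmented = grayscale_inside_boxes(red, [(1, 1, 2, 2)])
```

Because only the labeled regions change, the model sees more variation in the text itself while the surrounding photo stays as it was.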

Version 5 of this dataset (classes_all_text-raw-images) has all classes remapped to be labeled as "text." This was accomplished by using Modify Classes as a preprocessing step.

Version 6 of this dataset (classes_all_text-augmented-FAST) has all classes remapped to be labeled as "text" and was trained with Roboflow's Fast Model.

Version 7 of this dataset (classes_all_text-augmented-ACCURATE) has all classes remapped to be labeled as "text" and was trained with Roboflow's Accurate Model.
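The class remapping used in versions 5 through 7 amounts to replacing every font label with a single label. If you work with exported annotations outside Roboflow, the same idea can be sketched as follows; `remap_classes` and the dict-based annotation format are hypothetical and do not describe how Modify Classes is implemented.

```python
def remap_classes(annotations, mapping=None, default="text"):
    """Return copies of the annotations with class names remapped.

    Classes missing from mapping fall back to default, so an empty
    mapping collapses every font class into "text".
    """
    mapping = mapping or {}
    return [{**ann, "class": mapping.get(ann["class"], default)} for ann in annotations]


annotations = [
    {"class": "Arial", "bbox": (10, 20, 90, 20)},
    {"class": "Papyrus", "bbox": (200, 300, 120, 20)},
]
text_only = remap_classes(annotations)
```

Collapsing the labels this way turns the font identification problem into a plain text detection problem, which is usually the better fit for the OCR workflow described earlier.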

About Roboflow

Roboflow makes managing, preprocessing, augmenting, and versioning datasets for computer vision seamless.

Developers using Roboflow's workflow cut their code by 50%, automate annotation quality assurance, save training time, and increase model reproducibility.

Trained Model API

This project has a trained model available that you can try in your browser and use to get predictions via our Hosted Inference API and other deployment methods.

Cite This Project

If you use this dataset in a research paper, please cite it using the following BibTeX:

@misc{open-poetry-vision_dataset,
    title = {Open Poetry Vision Dataset},
    type = {Open Source Dataset},
    author = {Brad Dwyer},
    howpublished = {\url{https://universe.roboflow.com/roboflow-gw7yv/open-poetry-vision}},
    url = {https://universe.roboflow.com/roboflow-gw7yv/open-poetry-vision},
    journal = {Roboflow Universe},
    publisher = {Roboflow},
    year = {2022},
    month = {apr},
    note = {visited on 2024-03-19},
}

Connect Your Model With Program Logic

Find utilities and guides to help you start using the Open Poetry Vision project in your own application.

Source: Brad Dwyer

Maintainer: Roboflow

Last Updated: 2 years ago

Project Type: Object Detection

Subject: text

Views: 541

Views in previous 30 days: 8

Downloads: 27

Downloads in previous 30 days: 4

License: CC BY 4.0

Classes

American Typewriter, Andale Mono, Apple Chancery, Arial, Avenir, Baskerville, Big Caslon, Bradley Hand, Brush Script MT, Chalkboard, Comic Sans MS, Copperplate, Courier, Didot, Futura, Geneva, Georgia, Gill Sans, Helvetica, Herculanum, Impact, Kefa, Lucida Grande, Luminari, Marker Felt, Menlo, Monaco, Noteworthy, Optima, PT Sans, PT Serif, Palatino, Papyrus, Phosphate, Rockwell, SF Pro, SignPainter, Skia, Snell Roundhand, Tahoma, Times New Roman, Trebuchet MS, Verdana