Open Poetry Vision Computer Vision Project

Brad Dwyer

Updated 3 years ago

593

views

31

downloads
Classes (43)
American Typewriter
Andale Mono
Apple Chancery
Arial
Avenir
Baskerville
Big Caslon
Bradley Hand
Brush Script MT
Chalkboard
Comic Sans MS
Copperplate
Courier
Didot
Futura
Geneva
Georgia
Gill Sans
Helvetica
Herculanum
Impact
Kefa
Lucida Grande
Luminari
Marker Felt
Menlo
Monaco
Noteworthy
Optima
PT Sans
PT Serif
Palatino
Papyrus
Phosphate
Rockwell
SF Pro
SignPainter
Skia
Snell Roundhand
Tahoma
Times New Roman
Trebuchet MS
Verdana

Metrics

Try This Model
Drop an image or
Description

Overview

The Open Poetry Vision dataset is a synthetic dataset created by Roboflow for OCR tasks.

It combines a random image from the Open Images Dataset with text primarily sampled from Gwern's GPT-2 Poetry project. Each image in the dataset contains between 1 and 5 strings in a variety of fonts and colors randomly positioned in the 512x512 canvas. The classes correspond to the font of the text.

Example Image: Example Image

Use Cases

A common OCR workflow is to use a neural network to isolate text for input into traditional optical character recognition software. This dataset could make a good starting point for an OCR project like business card parsing or automated paper form-processing.

Alternatively, you could try your hand using this as a neural font identification dataset. Nvidia, amongst others, have had success with this task.

Using this Dataset

Use the fork button to copy this dataset to your own Roboflow account and export it with new preprocessing settings (perhaps resized for your model's desired format or converted to grayscale), or additional augmentations to make your model generalize better. This particular dataset would be very well suited for Roboflow's new advanced Bounding Box Only Augmentations.

Version 5 of this dataset (classes_all_text-raw-images) has all classes remapped to be labeled as "text." This was accomplished by using Modify Classes as a preprocessing step.

Version 6 of this dataset (classes_all_text-augmented-FAST) has all classes remapped to be labeled as "text." and was trained with Roboflow's Fast Model.

Version 7 of this dataset (classes_all_text-augmented-ACCURATE) has all classes remapped to be labeled as "text." and was trained with Roboflow's Accurate Model.

About Roboflow

Roboflow makes managing, preprocessing, augmenting, and versioning datasets for computer vision seamless.

Developers reduce 50% of their code when using Roboflow's workflow, automate annotation quality assurance, save training time, and increase model reproducibility.

Roboflow Workmark

Use This Trained Model

Try it in your browser, or deploy via our Hosted Inference API and other deployment methods.

Supervision

Build Computer Vision Applications Faster with Supervision

Visualize and process your model results with our reusable computer vision tools.

Cite This Project

LICENSE
CC BY 4.0

If you use this dataset in a research paper, please cite it using the following BibTeX:

                        @misc{
                            open-poetry-vision_dataset,
                            title = { Open Poetry Vision Dataset },
                            type = { Open Source Dataset },
                            author = { Brad Dwyer },
                            howpublished = { \url{ https://universe.roboflow.com/roboflow-gw7yv/open-poetry-vision } },
                            url = { https://universe.roboflow.com/roboflow-gw7yv/open-poetry-vision },
                            journal = { Roboflow Universe },
                            publisher = { Roboflow },
                            year = { 2022 },
                            month = { apr },
                            note = { visited on 2024-11-21 },
                            }