Speech_bubbles Computer Vision Project

San Jose

Updated a year ago

546

views

17

downloads
Classes (7)
General_speech
blast_sound
hit_sound
narration speech
people_sound
thought_speech
wind_sound
Description

Here are a few use cases for this project:

  1. Comic Book and Graphic Novel Analysis: By identifying different speech and sound bubbles, users can create a searchable database of comics or graphic novels, facilitating research and analysis of storytelling methods, themes, or author styles over time.

  2. Accessible Comic Reading for Visually Impaired: Develop an application that reads aloud the contents of speech bubbles in a specific order, allowing visually impaired users to enjoy and understand the story, dialogue, and sound effects in comics or graphic novels.

  3. Automated Video Subtitling for Comic-based Content: Utilize the Speech_bubbles model to convert comic book panels into video frames and automatically generate subtitles or captions based on the identified speech and sound effects, making comic-based content more accessible to individuals who are deaf or hard of hearing.

  4. Comic Book Translation and Localization: Automatically detect speech bubbles in scanned comic book images and extract their contents for text translation, assisting publishers in translating and localizing comics for different language markets more efficiently.

  5. Comic Book Metadata Enrichment: Enhance digital libraries or online comic collection platforms by automatically detecting speech types and generating metadata tags, making comics more discoverable and organized for readers, researchers, and collectors.

Supervision

Build Computer Vision Applications Faster with Supervision

Visualize and process your model results with our reusable computer vision tools.

Cite This Project

LICENSE
CC BY 4.0

If you use this dataset in a research paper, please cite it using the following BibTeX:

                        @misc{
                            speech_bubbles_dataset,
                            title = { Speech_bubbles Dataset },
                            type = { Open Source Dataset },
                            author = { San Jose },
                            howpublished = { \url{ https://universe.roboflow.com/san-jose/speech_bubbles } },
                            url = { https://universe.roboflow.com/san-jose/speech_bubbles },
                            journal = { Roboflow Universe },
                            publisher = { Roboflow },
                            year = { 2023 },
                            month = { apr },
                            note = { visited on 2024-09-26 },
                            }
                        
                    

Similar Projects

See More