Dataset Versions

v3

release-640-filtered

Generated on Sep 14, 2022

Popular Download Formats

Pascal VOC XML
Common XML annotation format for local data munging (originating with the PASCAL VOC challenge and also used by ImageNet).
PaliGemma
PaliGemma JSONL format used for fine-tuning PaliGemma, Google's open multimodal vision model.
CreateML JSON
CreateML JSON format is used with Apple's CreateML and Turi Create tools.

Dataset Split

Train Set 70%
8472 Images
Valid Set 20%
2359 Images
Test Set 10%
1209 Images
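
The 70/20/10 figures are rounded. As a quick check, the exact shares can be recomputed from the image counts listed above. This is a minimal sketch; the dictionary and variable names are illustrative and not part of any Roboflow API.

```python
# Image counts per split, copied from the Dataset Split listing.
split_counts = {"train": 8472, "valid": 2359, "test": 1209}

total = sum(split_counts.values())

# Share of each split as a percentage, rounded to one decimal place.
split_percent = {
    name: round(100 * count / total, 1) for name, count in split_counts.items()
}

for name, pct in split_percent.items():
    print(f"{name}: {split_counts[name]} of {total} images ({pct}%)")
```

The three counts add up to 12040 images, and the computed shares round to the 70%, 20% and 10% shown on the page.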

Preprocessing

Auto-Orient: Applied
Resize: Stretch to 640x640
Modify Classes: 0 remapped, 27 dropped
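
"Stretch to 640x640" scales width and height independently, so the aspect ratio of the source image is not preserved and box coordinates are scaled by different factors on each axis. The sketch below illustrates that arithmetic. The helper name `stretch_box` and the 1280x2560 source size are hypothetical (the original image dimensions are not given on this page), and the box uses the center-based `x`, `y`, `width`, `height` layout seen in the raw data further down.

```python
def stretch_box(box, src_w, src_h, dst_w=640, dst_h=640):
    """Scale a center-format box from a src_w x src_h image to dst_w x dst_h.

    Each axis gets its own scale factor, which is what a stretch resize implies.
    """
    sx = dst_w / src_w
    sy = dst_h / src_h
    return {
        "x": box["x"] * sx,
        "y": box["y"] * sy,
        "width": box["width"] * sx,
        "height": box["height"] * sy,
    }


# Hypothetical source image (1280 wide, 2560 tall) and a box centered in it.
original = {"x": 640, "y": 1280, "width": 320, "height": 400}
stretched = stretch_box(original, src_w=1280, src_h=2560)
print(stretched)
```

In this example the horizontal factor is 0.5 and the vertical factor is 0.25, so a 320x400 box becomes 160x100: tall pages are squashed vertically more than horizontally.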

Augmentations

No augmentations were applied.


Annotations

Group: paper-parts

Classes (layers)

page number: 1
reference text: 1

Unused Classes

abstract heading
abstract text
algorithm
author
chapter
claim
claim number
comittee
corollary
corollary number
date
definition
definition number
degree
equation
equation number
example
example caption
figure
figure caption
figure title
footnote
lemma
lemma number
list of content heading
list of content text
paragraph
proposition
proposition number
reference title
scheme
scheme caption
section
subsection
subsubsection
table
table caption
table of contents text
table of contents title
table title
theorem
theorem number
title
university
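
The list above appears to be the classes defined for this dataset version that have no box on this image: 44 unused classes plus the 2 used ones match the 46 entries in the `classes` map of the raw data. Under that assumption, the list can be derived with a set difference. The sketch uses a small illustrative subset of the class names rather than all 46.

```python
# Illustrative subset of the classes defined in the dataset version.
version_classes = {"page number", "reference text", "paragraph", "figure", "equation"}

# Labels of the boxes drawn on this image (from the annotation data).
boxes = [{"label": "reference text"}, {"label": "page number"}]

used = {box["label"] for box in boxes}

# Classes that exist in the version but do not appear on this image.
unused = sorted(version_classes - used)
print(unused)
```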

Tags

No Tags Applied

Attributes

14919_103.jpg

640x640
0.41MP

Updated Sep 14, 2022, 6:16 PM GMT+00:00

Generated by Roboflow

Training Set

Transforms

Auto-Orient: Applied
Resize: Stretch to 640x640
Modify Classes: Applied

Raw Data

{
    "camera": "Generated by Roboflow",
    "classes": {
        "date": 133,
        "abstract text": 104,
        "chapter": 642,
        "paragraph": 15070,
        "corollary number": 27,
        "subsection": 1977,
        "theorem number": 48,
        "scheme": 46,
        "university": 140,
        "lemma number": 81,
        "lemma": 96,
        "section": 1716,
        "title": 175,
        "comittee": 136,
        "figure caption": 2820,
        "example": 42,
        "reference title": 128,
        "proposition": 83,
        "table of contents title": 97,
        "claim": 10,
        "definition": 39,
        "page number": 11833,
        "table": 1185,
        "algorithm": 43,
        "example caption": 31,
        "table caption": 953,
        "figure": 3106,
        "list of content heading": 165,
        "subsubsection": 1013,
        "author": 162,
        "table of contents text": 227,
        "equation": 3136,
        "scheme caption": 47,
        "degree": 147,
        "reference text": 994,
        "equation number": 1696,
        "footnote": 781,
        "table title": 23,
        "figure title": 23,
        "corollary": 27,
        "theorem": 49,
        "definition number": 37,
        "proposition number": 83,
        "abstract heading": 91,
        "claim number": 9,
        "list of content text": 325
    },
    "datasets": [
        "XceQSkgv4WOoOmaebwcK"
    ],
    "destination": "4aec0bf692333cc4dbde685593a3b8bf",
    "height": 640,
    "id": "wXLKEzwGMXtOEfLqAWho",
    "label": [],
    "labels": [],
    "name": "14919_103.jpg",
    "numSteps": 3,
    "owner": "pwYAXv9BTpqLyFfgQoPZ",
    "preprocessing": [
        "auto-orient",
        "resize:[\"Stretch to\",640,640]",
        "remap:[\"d3fecd7b44e7d3ef15a19e6edbc0a4b2\"]"
    ],
    "preprocessingParsed": [
        {
            "name": "Auto-Orient",
            "value": "Applied"
        },
        {
            "name": "Resize",
            "value": "Stretch to 640x640"
        },
        {
            "name": "Modify Classes",
            "value": "Applied"
        }
    ],
    "source": "wXLKEzwGMXtOEfLqAWho",
    "split": "train",
    "split.XceQSkgv4WOoOmaebwcK.3": "train",
    "status": "generated",
    "transforms": "[\n    \"auto-orient\",\n    \"resize:[\\\"Stretch to\\\",640,640]\",\n    \"remap:[\\\"d3fecd7b44e7d3ef15a19e6edbc0a4b2\\\"]\"\n]",
    "updated": {
        "_seconds": 1663179375,
        "_nanoseconds": 78000000
    },
    "updatedDate": "Sep 14, 2022",
    "updatedTime": "6:16PM",
    "updatedTimezone": "+00:00",
    "versions": [
        "XceQSkgv4WOoOmaebwcK/3"
    ],
    "width": 640
}
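
Two fields in the record above are encoded in a way that is easy to misread: `transforms` is a JSON string that itself contains a JSON list, whose items carry JSON-encoded arguments after a colon, and `updated` stores the time as epoch seconds plus nanoseconds. The following sketch decodes both using only the standard library. The `record` dictionary copies just those two fields, and the variable names are illustrative.

```python
import json
from datetime import datetime, timedelta, timezone

# The two fields of interest, copied from the raw data record.
record = {
    "transforms": "[\n    \"auto-orient\",\n    \"resize:[\\\"Stretch to\\\",640,640]\",\n    \"remap:[\\\"d3fecd7b44e7d3ef15a19e6edbc0a4b2\\\"]\"\n]",
    "updated": {"_seconds": 1663179375, "_nanoseconds": 78000000},
}

# First decode: the outer string becomes a list of "name" or "name:args" strings.
raw_steps = json.loads(record["transforms"])

# Second decode: the part after the first colon is itself JSON.
steps = []
for raw in raw_steps:
    name, sep, args = raw.partition(":")
    steps.append((name, json.loads(args) if sep else None))

# Epoch seconds plus nanoseconds, converted to a timezone-aware UTC datetime.
stamp = record["updated"]
updated = datetime.fromtimestamp(stamp["_seconds"], tz=timezone.utc) + timedelta(
    microseconds=stamp["_nanoseconds"] // 1000
)

print(steps)
print(updated.isoformat())
```

The decoded timestamp is 2022-09-14 18:16:15 UTC, which agrees with the `updatedDate` and `updatedTime` fields (Sep 14, 2022, 6:16PM, +00:00).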
{
    "boxes": [
        {
            "label": "reference text",
            "x": 318.5,
            "y": 313.5,
            "width": 508,
            "height": 532.5
        },
        {
            "label": "page number",
            "x": 319.5,
            "y": 590.5,
            "width": 17,
            "height": 10.5
        }
    ],
    "height": 640,
    "key": "14919_103_jpg.rf.0f0069219990f80cd3918dc7753488d7.jpg",
    "width": 640
}
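
The boxes above appear to store `x` and `y` as the box center rather than the top-left corner: read as a top-left corner, the "reference text" box (x 318.5, width 508) would extend past the 640-pixel frame. Formats such as Pascal VOC XML use corner coordinates (`xmin`, `ymin`, `xmax`, `ymax`) instead. The sketch below converts between the two under that center-based assumption. The helper name `center_to_corners` is illustrative, and rounding to integer pixels is left to the caller.

```python
def center_to_corners(box):
    """Convert a center-format box to (xmin, ymin, xmax, ymax)."""
    half_w = box["width"] / 2
    half_h = box["height"] / 2
    return (
        box["x"] - half_w,
        box["y"] - half_h,
        box["x"] + half_w,
        box["y"] + half_h,
    )


# The two boxes from the annotation record above.
boxes = [
    {"label": "reference text", "x": 318.5, "y": 313.5, "width": 508, "height": 532.5},
    {"label": "page number", "x": 319.5, "y": 590.5, "width": 17, "height": 10.5},
]

corners = {box["label"]: center_to_corners(box) for box in boxes}

for label, (xmin, ymin, xmax, ymax) in corners.items():
    print(f"{label}: xmin={xmin} ymin={ymin} xmax={xmax} ymax={ymax}")
```

Both converted boxes fall inside the 640x640 frame, which supports reading `x` and `y` as center coordinates.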
