Comics Datasets Framework: Mix of Comics datasets for detection   benchmarking

Emanuele Vivoli; Irene Campaioli; Mariateresa Nardoni; Niccol\`o; Biondi; Marco Bertini; Dimosthenis Karatzas

arXiv:2407.03540·cs.CV·July 8, 2024

Comics Datasets Framework: Mix of Comics datasets for detection benchmarking

Emanuele Vivoli, Irene Campaioli, Mariateresa Nardoni, Niccol\`o, Biondi, Marco Bertini, Dimosthenis Karatzas

PDF

Open Access 2 Repos

TL;DR

This paper introduces the Comics Datasets Framework to standardize annotations, incorporate diverse comic styles, and benchmark detection models, addressing key challenges in comic object detection research.

Contribution

It standardizes dataset annotations, creates Comics100 to balance manga representation, and provides a comprehensive benchmarking framework for detection architectures.

Findings

01

Standardized annotations across datasets

02

Curated Comics100 dataset with diverse styles

03

Benchmark results for multiple detection models

Abstract

Comics, as a medium, uniquely combine text and images in styles often distinct from real-world visuals. For the past three decades, computational research on comics has evolved from basic object detection to more sophisticated tasks. However, the field faces persistent challenges such as small datasets, inconsistent annotations, inaccessible model weights, and results that cannot be directly compared due to varying train/test splits and metrics. To address these issues, we aim to standardize annotations across datasets, introduce a variety of comic styles into the datasets, and establish benchmark results with clear, replicable settings. Our proposed Comics Datasets Framework standardizes dataset annotations into a common format and addresses the overrepresentation of manga by introducing Comics100, a curated collection of 100 books from the Digital Comics Museum, annotated for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComics and Graphic Narratives · Mental Health via Writing