Learning Gaussian Data Augmentation in Feature Space for One-shot Object Detection in Manga
Takara Taniguchi, Ryosuke Furuta

TL;DR
This paper introduces a Gaussian feature space augmentation technique for one-shot manga character detection, effectively handling pose variation and class imbalance, and outperforming traditional image space augmentation methods.
Contribution
It proposes a novel feature space data augmentation method using learned Gaussian noise variance to enhance one-shot object detection in manga.
Findings
Improved detection accuracy for seen and unseen classes.
Outperforms image space augmentation methods.
Effective handling of pose variation and class imbalance.
Abstract
We tackle one-shot object detection in Japanese Manga. The rising global popularity of Japanese manga has made the object detection of character faces increasingly important, with potential applications such as automatic colorization. However, obtaining sufficient data for training conventional object detectors is challenging due to copyright restrictions. Additionally, new characters appear every time a new volume of manga is released, making it impractical to re-train object detectors each time to detect these new characters. Therefore, one-shot object detection, where only a single query (reference) image is required to detect a new character, is an essential task in the manga industry. One challenge with one-shot object detection in manga is the large variation in the poses and facial expressions of characters in target images, despite having only one query image as a reference.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEducational Systems and Policies · Innovation in Digital Healthcare Systems · Edcuational Technology Systems
