Semantic Segmentation for Compound figures

Weixin Jiang; Eric Schwenker; Maria Chan; Oliver Cossairt

arXiv:1912.07142·cs.CV·February 5, 2021·1 cites

Semantic Segmentation for Compound figures

Weixin Jiang, Eric Schwenker, Maria Chan, Oliver Cossairt

PDF

Open Access

TL;DR

This paper introduces a semantic segmentation method for decomposing compound figures in scientific literature into individual subfigures, facilitating better information retrieval and analysis.

Contribution

It presents an anchor-based detection algorithm that combines global layout and local features to accurately identify master images within compound figures.

Findings

01

Effective separation of compound figures demonstrated on a labeled dataset.

02

Improved accuracy over baseline methods in master image detection.

03

Facilitates linking subfigures with caption information.

Abstract

Scientific literature contains large volumes of unstructured data,with over 30\% of figures constructed as a combination of multiple images, these compound figures cannot be analyzed directly with existing information retrieval tools. In this paper, we propose a semantic segmentation approach for compound figure separation, decomposing the compound figures into "master images". Each master image is one part of a compound figure governed by a subfigure label (typically "(a), (b), (c), etc"). In this way, the separated subfigures can be easily associated with the description information in the caption. In particular, we propose an anchor-based master image detection algorithm, which leverages the correlation between master images and subfigure labels and locates the master images in a two-step manner. First, a subfigure label detector is built to extract the global layout information of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques · Natural Language Processing Techniques · Advanced Image and Video Retrieval Techniques