Plot2Spectra: an Automatic Spectra Extraction Tool
Weixin Jiang, Eric Schwenker, Trevor Spreadbury, Kai Li, Maria K.Y. Chan, Oliver Cossairt

TL;DR
Plot2Spectra is an automated tool that extracts spectral data from graph images, enabling large-scale analysis and machine learning applications in materials science.
Contribution
The paper introduces a novel two-stage framework combining axis detection and semantic segmentation for automatic spectral data extraction from plot images.
Findings
High accuracy in axis and plot line detection.
Effective in extracting data from diverse spectroscopy graphs.
Accelerates data collection for materials research.
Abstract
Spectroscopic techniques such as X-ray absorption near edge structure (XANES) and Raman spectroscopy play an important role in characterizing materials. In the scientific literature, XANES and Raman data are usually plotted as line graphs, a format that suits a human reader. However, such graphs are not conducive to direct programmatic analysis because automatic extraction tools are lacking. In this paper, we develop a plot digitizer, named Plot2Spectra, that automatically extracts data points from spectroscopy graph images, making large-scale data acquisition and analysis possible. Specifically, the plot digitizer is a two-stage framework. In the first stage, axis alignment, we adopt an anchor-free detector to detect the plot region and then refine the detected bounding boxes with…
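To make the goal of the digitizer concrete, the following is a minimal sketch of the final conversion step that any plot digitizer needs: mapping pixel coordinates of a detected curve to data values using two calibrated ticks per axis. It is not the authors' implementation. The function names (`make_axis_mapper`, `pixels_to_spectrum`), the assumption of linear axes, and the example tick values are all hypothetical, and the curve pixels are assumed to come from an earlier detection or segmentation step that is not shown.

```python
def make_axis_mapper(pixel_a, value_a, pixel_b, value_b):
    """Return a function mapping a pixel coordinate to a data value.

    Assumes a linear axis calibrated by two ticks (hypothetical helper).
    """
    if pixel_a == pixel_b:
        raise ValueError("calibration ticks must sit at distinct pixel positions")

    def to_value(pixel):
        # Linear interpolation (or extrapolation) between the two ticks.
        return value_a + (pixel - pixel_a) * (value_b - value_a) / (pixel_b - pixel_a)

    return to_value


def pixels_to_spectrum(curve_pixels, x_ticks, y_ticks):
    """Convert (column, row) curve pixels to (x, y) data points.

    x_ticks and y_ticks each hold two (pixel, value) calibration pairs.
    Image rows grow downward, which the y calibration absorbs naturally.
    """
    to_x = make_axis_mapper(*x_ticks[0], *x_ticks[1])
    to_y = make_axis_mapper(*y_ticks[0], *y_ticks[1])
    points = [(to_x(col), to_y(row)) for col, row in curve_pixels]
    return sorted(points)  # order by x so the result reads as a spectrum


# Illustrative values only: an energy axis in eV and a normalized intensity axis.
x_ticks = ((100, 7100.0), (500, 7200.0))   # column 100 -> 7100 eV, column 500 -> 7200 eV
y_ticks = ((400, 0.0), (100, 1.5))         # row 400 -> 0.0, row 100 -> 1.5
curve_pixels = [(500, 100), (100, 400), (300, 250)]

spectrum = pixels_to_spectrum(curve_pixels, x_ticks, y_ticks)
for energy, intensity in spectrum:
    print(f"{energy:.1f}\t{intensity:.3f}")
```

Calibrating each axis from two ticks keeps the mapping independent of image size and handles the inverted row direction without a special case. A logarithmic axis would need the interpolation done in log space instead.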
Taxonomy
Topics: Spectroscopy and Chemometric Analyses · Machine Learning in Materials Science · Remote-Sensing Image Classification
