Scalable multicomponent spectral analysis for high-throughput data annotation
Rui Patrick Xian, Ralph Ernstorfer, Philipp Michael Pelz

TL;DR
This paper introduces a scalable, parallelized spectral analysis method for high-throughput data annotation, enabling efficient and accurate analysis of large spectroscopic datasets in materials science.
Contribution
The authors develop a MapReduce-compatible, parallel spectral fitting approach that scales linearly with spectral components, suitable for high-performance computing environments.
Findings
Linear computational scaling demonstrated on experimental datasets
Efficient high-quality data annotation achieved
Applicable to various spectroscopic techniques
Abstract
Orchestrating parametric fitting of multicomponent spectra at scale is an essential yet underappreciated task in high-throughput quantification of materials and chemical composition. To automate the annotation process for spectroscopic and diffraction data collected in counts of hundreds to thousands, we present a systematic approach compatible with high-performance computing infrastructures using the MapReduce model and task-based parallelization. We implement the approach in software and demonstrate linear computational scaling with respect to spectral components using multidimensional experimental materials characterization datasets from photoemission spectroscopy and powder electron diffraction as benchmarks. Our approach enables efficient generation of high-quality data annotation and online spectral analysis and is applicable to a variety of analytical techniques in materials…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Spectroscopy and Chemometric Analyses · Electrochemical Analysis and Applications
