Linking the Dynamic PicoProbe Analytical Electron-Optical Beam Line / Microscope to Supercomputers
Alexander Brace, Rafael Vescovi, Ryan Chard, Nickolaus D. Saint,, Arvind Ramanathan, Nestor J. Zaluzec, Ian Foster

TL;DR
This paper presents a software architecture that enables high-volume data transfer from the Dynamic PicoProbe electron-optical microscope to supercomputers, supporting advanced data analysis and reinterrogation for scientific discovery.
Contribution
It introduces a scalable infrastructure for large-scale data transfer and analysis, integrating machine learning and metadata management for high-throughput scientific workflows.
Findings
Supports transfer of hundreds of GB per day
Enables advanced machine learning analysis
Facilitates reanalysis of past experimental data
Abstract
The Dynamic PicoProbe at Argonne National Laboratory is undergoing upgrades that will enable it to produce up to 100s of GB of data per day. While this data is highly important for both fundamental science and industrial applications, there is currently limited on-site infrastructure to handle these high-volume data streams. We address this problem by providing a software architecture capable of supporting large-scale data transfers to the neighboring supercomputers at the Argonne Leadership Computing Facility. To prepare for future scientific workflows, we implement two instructive use cases for hyperspectral and spatiotemporal datasets, which include: (i) off-site data transfer, (ii) machine learning/artificial intelligence and traditional data analysis approaches, and (iii) automatic metadata extraction and cataloging of experimental results. This infrastructure supports expected…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Research Data Management Practices · Distributed and Parallel Computing Systems
