fairDMS: Rapid Model Training by Data and Model Reuse
Ahsan Ali, Hemant Sharma, Rajkumar Kettimuthu, Peter Kenesei, Dennis, Trujillo, Antonino Miceli, Ian Foster, Ryan Coffee, Jana Thayer, Zhengchun, Liu

TL;DR
fairDMS introduces a rapid model training framework that leverages data and model reuse to significantly accelerate data labeling, training, and model updating processes for high-throughput scientific instruments.
Contribution
It presents a novel architecture for organizing data and models enabling fast retrieval and fine-tuning, addressing ML performance degradation in high-rate data environments.
Findings
Achieves up to 100x data labeling speedup
Provides 200x faster training times
Enables 92x faster end-to-end model updates
Abstract
Extracting actionable information rapidly from data produced by instruments such as the Linac Coherent Light Source (LCLS-II) and Advanced Photon Source Upgrade (APS-U) is becoming ever more challenging due to high (up to TB/s) data rates. Conventional physics-based information retrieval methods are hard-pressed to detect interesting events fast enough to enable timely focusing on a rare event or correction of an error. Machine learning~(ML) methods that learn cheap surrogate classifiers present a promising alternative, but can fail catastrophically when changes in instrument or sample result in degradation in ML performance. To overcome such difficulties, we present a new data storage and ML model training architecture designed to organize large volumes of data and models so that when model degradation is detected, prior models and/or data can be queried rapidly and a more suitable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsScientific Computing and Data Management · Machine Learning and Data Classification
