Dark solitons in Bose-Einstein condensates: a dataset for many-body physics research
Amilson R. Fritsch, Shangjie Guo, Sophia M. Koh, I. B. Spielman,, Justyna P. Zwolak

TL;DR
This paper introduces a large, labeled dataset of Bose-Einstein condensate images with solitonic excitations, enabling machine learning applications in many-body physics research.
Contribution
It provides a comprehensive dataset with manual and ML-based labels, along with a physics-informed analysis framework for studying nonlinear many-body phenomena.
Findings
Over 16,000 experimental images included
Approximately 33% of images are manually labeled
Remaining images are automatically labeled using SolDet
Abstract
We establish a dataset of over experimental images of Bose--Einstein condensates containing solitonic excitations to enable machine learning (ML) for many-body physics research. About of this dataset has manually assigned and carefully curated labels. The remainder is automatically labeled using SolDet -- an implementation of a physics-informed ML data analysis framework -- consisting of a convolutional-neural-network-based classifier and OD as well as a statistically motivated physics-informed classifier and a quality metric. This technical note constitutes the definitive reference of the dataset, providing an opportunity for the data science community to develop more sophisticated analysis tools, to further understand nonlinear many-body physics, and even advance cold atom experiments.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCold Atom Physics and Bose-Einstein Condensates · Scientific Computing and Data Management
