Algorithms and Statistical Models for Scientific Discovery in the Petabyte Era
Brian Nord, Andrew J. Connolly, Jamie Kinney, Jeremy Kubica, Gautaum, Narayan, Joshua E. G. Peek, Chad Schafer, Erik J. Tollerud, Camille Avestruz,, G. Jogesh Babu, Simon Birrer, Douglas Burke, Jo\~ao Caldeira, Douglas A., Caldwell, Joleen K. Carlberg, Yen-Chi Chen

TL;DR
This paper discusses the challenges and opportunities of using advanced algorithms and statistical models, including AI, for scientific discovery in astronomy's large, complex datasets, emphasizing collaboration, ethics, and methodological evolution.
Contribution
It provides a comprehensive framework and recommendations for developing and applying algorithms and statistical models to facilitate scientific discovery in the petabyte data era.
Findings
Identifies key challenges in large-scale data analysis.
Proposes new paradigms for collaboration and education.
Highlights ethical considerations in algorithm development.
Abstract
The field of astronomy has arrived at a turning point in terms of size and complexity of both datasets and scientific collaboration. Commensurately, algorithms and statistical models have begun to adapt --- e.g., via the onset of artificial intelligence --- which itself presents new challenges and opportunities for growth. This white paper aims to offer guidance and ideas for how we can evolve our technical and collaborative frameworks to promote efficient algorithmic development and take advantage of opportunities for scientific discovery in the petabyte era. We discuss challenges for discovery in large and complex data sets; challenges and requirements for the next stage of development of statistical methodologies and algorithmic tool sets; how we might change our paradigms of collaboration and education; and the ethical implications of scientists' contributions to widely applicable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
