Plausible Adversarial Attacks on Direct Parameter Inference Models in   Astrophysics

Benjamin Horowitz; Peter Melchior

arXiv:2211.14788·astro-ph.CO·November 29, 2022

Plausible Adversarial Attacks on Direct Parameter Inference Models in Astrophysics

Benjamin Horowitz, Peter Melchior

PDF

Open Access

TL;DR

This paper investigates how small adversarial perturbations can mislead astrophysical inference models, revealing their vulnerability and raising concerns about their reliability in detecting new physics.

Contribution

It demonstrates that adversarial attacks can deceive cosmological parameter inference networks, highlighting the need for robustness in astrophysical machine learning models.

Findings

01

Adversarial perturbations can cause false detection of new physics.

02

Inference networks are vulnerable to small systematic biases.

03

Standard estimators remain insensitive to these adversarial attacks.

Abstract

In this abstract we explore the possibility of introducing biases in physical parameter inference models from adversarial-type attacks. In particular, we inject small amplitude systematics into inputs to a mixture density networks tasked with inferring cosmological parameters from observed data. The systematics are constructed analogously to white-box adversarial attacks. We find that the analysis network can be tricked into spurious detection of new physics in cases where standard cosmological estimators would be insensitive. This calls into question the robustness of such networks and their utility for reliably detecting new physics.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGamma-ray bursts and supernovae