Interpreting automatic AGN classifiers with saliency maps
T. Peruzzi, M. Pasquato, S. Ciroi, M. Berton, P. Marziani, E. Nardini

TL;DR
This paper demonstrates how interpretability tools like saliency maps can elucidate the decision-making process of machine learning classifiers for AGN spectral classification, linking model features to physical spectral lines.
Contribution
The study applies ML interpretability techniques to AGN classification, showing how saliency maps reveal physically meaningful spectral features influencing model decisions.
Findings
Saliency maps highlight spectral regions affecting AGN type predictions.
t-SNE visualization shows clear separation of AGN types in spectral space.
Model decisions align with known physical spectral features.
Abstract
The classification of the optical spectra of active galactic nuclei (AGN) into different types is well founded on AGN physics, but it involves some degree of human oversight and cannot be reliably scaled to large data sets. Machine learning (ML) tackles such a classification problem in a fast and reproducible way, but is often perceived as a black box. However, ML interpretability and explainability are active research areas in computer science, increasingly providing us with tools to alleviate this issue. We applied ML interpretability tools to a classifier trained to predict AGN type from spectra, to demonstrate the use of such tools in this context. We trained a support-vector machine on 3346 high-quality, low redshift AGN spectra from SDSS DR15 with an existing reliable classification as type 1, type 2, or intermediate type. On a selection of test-set spectra, we computed the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
