Opening the black box of neural nets: case studies in stop/top discrimination
Thomas Roxlo, Matthew Reece

TL;DR
This paper introduces techniques to interpret neural networks by visualizing their decision boundaries and identifying key input features, demonstrated through case studies on stop particle discrimination in high-energy physics.
Contribution
It presents a novel method for exploring neural network functionality and extracting human-readable approximations, applied to particle physics classification tasks.
Findings
Neurons strongly correlate with known physics variables like $m_{T2}^{ ext{ll}}$.
Networks identify event features such as missing transverse momentum and angular correlations.
Interpretation tools reveal how neural networks differentiate signal from background.
Abstract
We introduce techniques for exploring the functionality of a neural network and extracting simple, human-readable approximations to its performance. By performing gradient ascent on the input space of the network, we are able to produce large populations of artificial events which strongly excite a given classifier. By studying the populations of these events, we then directly produce what are essentially contour maps of the network's classification function. Combined with a suite of tools for identifying the input dimensions deemed most important by the network, we can utilize these maps to efficiently interpret the dominant criteria by which the network makes its classification. As a test case, we study networks trained to discriminate supersymmetric stop production in the dilepton channel from Standard Model backgrounds. In the case of a heavy stop decaying to a light neutralino,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Physics and Python Applications · Particle physics theoretical and experimental studies · Scientific Computing and Data Management
