Mixture-of-Experts Graph Transformers for Interpretable Particle Collision Detection
Donatella Genovese, Alessandro Sgroi, Alessio Devoto, Samuel, Valentine, Lennox Wood, Cristiano Sebastiani, Stefano Giagu, Monica, D'Onofrio, Simone Scardapane

TL;DR
This paper introduces a Graph Transformer with Mixture-of-Experts layers that enhances interpretability and accuracy in particle collision event classification, linking model decisions to physics features.
Contribution
It presents a novel model combining Graph Transformers and Mixture-of-Experts for interpretable high-energy physics data analysis, improving transparency without sacrificing performance.
Findings
Achieves competitive classification accuracy on simulated collider data.
Provides interpretable insights aligned with physics principles.
Demonstrates potential for trustworthy AI in particle physics.
Abstract
The Large Hadron Collider at CERN produces immense volumes of complex data from high-energy particle collisions, demanding sophisticated analytical techniques for effective interpretation. Neural Networks, including Graph Neural Networks, have shown promise in tasks such as event classification and object identification by representing collisions as graphs. However, while Graph Neural Networks excel in predictive accuracy, their "black box" nature often limits their interpretability, making it difficult to trust their decision-making processes. In this paper, we propose a novel approach that combines a Graph Transformer model with Mixture-of-Expert layers to achieve high predictive performance while embedding interpretability into the architecture. By leveraging attention maps and expert specialization, the model offers insights into its internal decision-making, linking predictions to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · COVID-19 diagnosis using AI · Anomaly Detection Techniques and Applications
MethodsAttention Is All You Need · Laplacian EigenMap · Absolute Position Encodings · Softmax · Linear Layer · Adam · Residual Connection · Dropout · Multi-Head Attention · Position-Wise Feed-Forward Layer
