Mixture-of-Experts Graph Transformers for Interpretable Particle   Collision Detection

Donatella Genovese; Alessandro Sgroi; Alessio Devoto; Samuel; Valentine; Lennox Wood; Cristiano Sebastiani; Stefano Giagu; Monica; D'Onofrio; Simone Scardapane

arXiv:2501.03432·cs.LG·January 9, 2025

Mixture-of-Experts Graph Transformers for Interpretable Particle Collision Detection

Donatella Genovese, Alessandro Sgroi, Alessio Devoto, Samuel, Valentine, Lennox Wood, Cristiano Sebastiani, Stefano Giagu, Monica, D'Onofrio, Simone Scardapane

PDF

Open Access 1 Repo

TL;DR

This paper introduces a Graph Transformer with Mixture-of-Experts layers that enhances interpretability and accuracy in particle collision event classification, linking model decisions to physics features.

Contribution

It presents a novel model combining Graph Transformers and Mixture-of-Experts for interpretable high-energy physics data analysis, improving transparency without sacrificing performance.

Findings

01

Achieves competitive classification accuracy on simulated collider data.

02

Provides interpretable insights aligned with physics principles.

03

Demonstrates potential for trustworthy AI in particle physics.

Abstract

The Large Hadron Collider at CERN produces immense volumes of complex data from high-energy particle collisions, demanding sophisticated analytical techniques for effective interpretation. Neural Networks, including Graph Neural Networks, have shown promise in tasks such as event classification and object identification by representing collisions as graphs. However, while Graph Neural Networks excel in predictive accuracy, their "black box" nature often limits their interpretability, making it difficult to trust their decision-making processes. In this paper, we propose a novel approach that combines a Graph Transformer model with Mixture-of-Expert layers to achieve high predictive performance while embedding interpretability into the architecture. By leveraging attention maps and expert specialization, the model offers insights into its internal decision-making, linking predictions to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

DonatellaGenovese/Mixture-of-Expert-Graph-Transformer
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · COVID-19 diagnosis using AI · Anomaly Detection Techniques and Applications

MethodsAttention Is All You Need · Laplacian EigenMap · Absolute Position Encodings · Softmax · Linear Layer · Adam · Residual Connection · Dropout · Multi-Head Attention · Position-Wise Feed-Forward Layer