Consistent Feature Construction with Constrained Genetic Programming for Experimental Physics
No\"elie Cherrier, Jean-Philippe Poli, Maxime Defurne, Franck, Sabati\'e

TL;DR
This paper introduces a novel, interpretable feature construction method using constrained genetic programming with dimensional consistency, significantly improving classification accuracy in high energy physics datasets.
Contribution
It combines genetic programming with physics-aware grammars to automatically construct interpretable features with units, validated by physics experts.
Findings
Constructed features improve classification accuracy.
Features are physically interpretable and validated by experts.
First method to incorporate units in genetic programming for physics
Abstract
A good feature representation is a determinant factor to achieve high performance for many machine learning algorithms in terms of classification. This is especially true for techniques that do not build complex internal representations of data (e.g. decision trees, in contrast to deep neural networks). To transform the feature space, feature construction techniques build new high-level features from the original ones. Among these techniques, Genetic Programming is a good candidate to provide interpretable features required for data analysis in high energy physics. Classically, original features or higher-level features based on physics first principles are used as inputs for training. However, physicists would benefit from an automatic and interpretable feature construction for the classification of particle collision events. Our main contribution consists in combining different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsInterpretability
