Epicure: Distilling Sequence Model Predictions into Patterns
Miltiadis Allamanis, Earl T. Barr

TL;DR
Epicure is a method that distills sequence model predictions into high-precision patterns, improving the accuracy of name predictions and anomaly detection in high entropy sequence tasks.
Contribution
It introduces Epicure, a novel approach that maps sequence model predictions into a lattice of general patterns, enhancing prediction accuracy over traditional methods.
Findings
Epicure improves name prediction accuracy by 61% at 10% false alarm rate.
It outperforms standard model predictions in detecting anomalous names.
Epicure effectively captures rare sequences through pattern distillation.
Abstract
Most machine learning models predict a probability distribution over concrete outputs and struggle to accurately predict names over high entropy sequence distributions. Here, we explore finding abstract, high-precision patterns intrinsic to these predictions in order to make abstract predictions that usefully capture rare sequences. In this short paper, we present Epicure, a method that distils the predictions of a sequence model, such as the output of beam search, into simple patterns. Epicure maps a model's predictions into a lattice that represents increasingly more general patterns that subsume the concrete model predictions. On the tasks of predicting a descriptive name of a function given the source code of its body and detecting anomalous names given a function, we show that Epicure yields accurate naming patterns that match the ground truth more often compared to just the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Anomaly Detection Techniques and Applications · Advanced Text Analysis Techniques
