Boolformer: Symbolic Regression of Logic Functions with Transformers
St\'ephane d'Ascoli, Arthur Renard, Vassilis Papadopoulos, Samy Bengio, Josh Susskind, Emmanuel Abb\'e

TL;DR
Boolformer is a Transformer-based model capable of symbolic regression of Boolean functions, effectively predicting formulas from full or partial data, and demonstrating potential as an interpretable alternative in real-world applications.
Contribution
Introduces Boolformer, a novel Transformer model for symbolic Boolean function regression, capable of handling incomplete data and outperforming genetic algorithms in speed.
Findings
Successfully predicts formulas from full truth tables.
Performs well with noisy or incomplete data.
Achieves significant speedup over genetic algorithms.
Abstract
We introduce Boolformer, a Transformer-based model trained to perform end-to-end symbolic regression of Boolean functions. First, we show that it can predict compact formulas for complex functions not seen during training, given their full truth table. Then, we demonstrate that even with incomplete or noisy observations, Boolformer is still able to find good approximate expressions. We evaluate Boolformer on a broad set of real-world binary classification datasets, demonstrating its potential as an interpretable alternative to classic machine learning methods. Finally, we apply it to the widespread task of modeling the dynamics of gene regulatory networks and show through a benchmark that Boolformer is competitive with state-of-the-art genetic algorithms, with a speedup of several orders of magnitude. Our code and models are available publicly.
Peer Reviews
Decision·Submitted to ICLR 2024
- The paper is well written and easy to follow. - Careful description of the generation of synthetic data, and a good analysis of the possible bias included. - Empirical evaluation establishes the effectiveness of the approach. - A good section of limitation addressing some of my concerns (e.g., not being able to deal with large formulas) which would otherwise go to the weakness below.
- While there are some engineering for the embedder, the rest of the approach seems quite standard and straightforward (which is not necessarily a bad thing). - It might not be that surprising that Boolformer is faster on GRNs tasks. After all, it has been trained for a long time and the training data could have covered what it needed in these tasks. I am curious, however, is there a similar comparison of efficiency in the noiseless regime?
- Well written and structured - Clear motivation - Authors will open source the implementation
- It would be good to have more examples of real-world usages of the method. Speed (inference) superiority is great, but maybe speed is not even a concern in the domains that the technique is intended to be applied. - It would be good to have some analysis on which type of tasks is solvable by this method vs. others. - Does not generalize to larger formulas.
- A new problem setting in which logical expressions are symbolically regressed by Transformer. - Experimental results on several real-world applications as well as on the generated logical equation data are reported. - The results using PMLB database shows accuracy comparable to Random Forest and logistic regression, and furthermore, the learned models are expected to provide excellent explanatory properties. - Using GRNs, Boolformer is shown to have both excellent accuracy and speed.
- The problem statement in the introduction does not match the solution. In Section 1, citing (Abbe et al., 2022), the authors point out that Transformer learns complex models in terms of the Fourier spectrum, resulting in poor generalization performance when samples are insufficient. As a contribution, Section 1.1 claims that it is robust to noisy and incomplete observations. However, the Boolformer proposed in this paper is a relatively natural application of the Transformer, and there is no r
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Algorithms and Applications · Gene Regulatory Network Analysis · Metaheuristic Optimization Algorithms Research
MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Byte Pair Encoding · Softmax · Dense Connections · Position-Wise Feed-Forward Layer · Absolute Position Encodings · Residual Connection · Adam
