Orthogonal Gradient Boosting for Simpler Additive Rule Ensembles
Fan Yang, Pierre Le Bodic, Michael Kamp, Mario Boley

TL;DR
This paper introduces an orthogonal gradient boosting method that improves the interpretability and accuracy of rule ensembles by selecting more general, shorter rules through a novel objective function.
Contribution
It proposes a new objective function for gradient boosting that better guides rule selection towards simpler, more interpretable rules without sacrificing accuracy.
Findings
Significantly improves the interpretability/accuracy trade-off in rule ensembles.
Efficient incremental computation of objective values for related rule conditions.
Demonstrates effectiveness across various prediction tasks.
Abstract
Gradient boosting of prediction rules is an efficient approach to learn potentially interpretable yet accurate probabilistic models. However, actual interpretability requires to limit the number and size of the generated rules, and existing boosting variants are not designed for this purpose. Though corrective boosting refits all rule weights in each iteration to minimise prediction risk, the included rule conditions tend to be sub-optimal, because commonly used objective functions fail to anticipate this refitting. Here, we address this issue by a new objective function that measures the angle between the risk gradient vector and the projection of the condition output vector onto the orthogonal complement of the already selected conditions. This approach correctly approximate the ideal update of adding the risk gradient itself to the model and favours the inclusion of more general and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Medical Image Segmentation Techniques · Advanced Numerical Analysis Techniques
