Neural Attention Forests: Transformer-Based Forest Improvement
Andrei V. Konstantinov, Lev V. Utkin, Alexey A. Lukashin, Vladimir A., Muliukha

TL;DR
The paper introduces Neural Attention Forests (NAF), a novel transformer-based method that integrates neural attention mechanisms into random forests for improved regression and classification on tabular data.
Contribution
It presents a new neural attention mechanism integrated into random forests, trained end-to-end, enhancing prediction accuracy with a transformer-like architecture.
Findings
Demonstrates improved performance on real datasets.
Provides publicly available code for the method.
Shows neural attention enhances forest predictions.
Abstract
A new approach called NAF (the Neural Attention Forest) for solving regression and classification tasks under tabular training data is proposed. The main idea behind the proposed NAF model is to introduce the attention mechanism into the random forest by assigning attention weights calculated by neural networks of a specific form to data in leaves of decision trees and to the random forest itself in the framework of the Nadaraya-Watson kernel regression. In contrast to the available models like the attention-based random forest, the attention weights and the Nadaraya-Watson regression are represented in the form of neural networks whose weights can be regarded as trainable parameters. The first part of neural networks with shared weights is trained for all trees and computes attention weights of data in leaves. The second part aggregates outputs of the tree networks and aims to minimize…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Machine Learning and Data Classification · Face and Expression Recognition
