Soft Hoeffding Tree: A Transparent and Differentiable Model on Data Streams
Kirsten K\"obschall, Lisa Hartung, Stefan Kramer

TL;DR
The paper introduces soft Hoeffding trees (SoHoT), a differentiable, transparent model for data streams that combines the strengths of Hoeffding trees and soft trees, enabling integration into deep learning workflows.
Contribution
It presents a novel differentiable and transparent data stream model, combining Hoeffding trees' extensibility with soft trees' differentiability, and introduces a new gating function for split regulation.
Findings
SoHoT outperforms standard Hoeffding trees in class probability estimation.
SoHoT maintains transparency better than soft trees with minimal performance loss.
Hyperparameter tuning allows trading off transparency and accuracy.
Abstract
We propose soft Hoeffding trees (SoHoT) as a new differentiable and transparent model for possibly infinite and changing data streams. Stream mining algorithms such as Hoeffding trees grow based on the incoming data stream, but they currently lack the adaptability of end-to-end deep learning systems. End-to-end learning can be desirable if a feature representation is learned by a neural network and used in a tree, or if the outputs of trees are further processed in a deep learning model or workflow. Different from Hoeffding trees, soft trees can be integrated into such systems due to their differentiability, but are neither transparent nor explainable. Our novel model combines the extensibility and transparency of Hoeffding trees with the differentiability of soft trees. We introduce a new gating function to regulate the balance between univariate and multivariate splits in the tree.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Stream Mining Techniques · Advanced Database Systems and Queries · Data Mining Algorithms and Applications
