Using dynamical quantization to perform split attempts in online tree   regressors

Saulo Martiello Mastelini; Andre Carlos Ponce de Leon Ferreira de; Carvalho

arXiv:2012.00083·cs.LG·December 4, 2020

Using dynamical quantization to perform split attempts in online tree regressors

Saulo Martiello Mastelini, Andre Carlos Ponce de Leon Ferreira de, Carvalho

PDF

TL;DR

This paper introduces the Quantization Observer, a hashing-based method for efficiently evaluating split points in online regression trees, reducing computational costs and memory usage while maintaining accuracy.

Contribution

The paper presents QO, a novel online split evaluation method for numerical features that is simple, efficient, and easily integrable into existing incremental decision trees.

Findings

01

QO achieves $O(1)$ monitoring cost per instance.

02

QO provides accurate split point suggestions with less memory and processing time.

03

QO outperforms previous methods in experimental evaluations.

Abstract

A central aspect of online decision tree solutions is evaluating the incoming data and enabling model growth. For such, trees much deal with different kinds of input features and partition them to learn from the data. Numerical features are no exception, and they pose additional challenges compared to other kinds of features, as there is no trivial strategy to choose the best point to make a split decision. The problem is even more challenging in regression tasks because both the features and the target are continuous. Typical online solutions evaluate and store all the points monitored between split attempts, which goes against the constraints posed in real-time applications. In this paper, we introduce the Quantization Observer (QO), a simple yet effective hashing-based algorithm to monitor and evaluate split point candidates in numerical features for online tree regressors. QO can be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.