Stability of decision trees and logistic regression
Nino Arsov, Martin Pavlovski, Ljupco Kocarev

TL;DR
This paper investigates the stability of decision trees and logistic regression, deriving new stability notions, analyzing their dependence on model parameters, and proposing a framework for empirical stability measurement.
Contribution
It introduces new stability concepts for decision trees and logistic regression, provides theoretical bounds, and develops a novel empirical stability measurement framework.
Findings
Stability of decision trees depends on the number of leaves.
Logistic regression stability depends on the Hessian's smallest eigenvalue.
Logistic regression is generally not a stable learning algorithm.
Abstract
Decision trees and logistic regression are one of the most popular and well-known machine learning algorithms, frequently used to solve a variety of real-world problems. Stability of learning algorithms is a powerful tool to analyze their performance and sensitivity and subsequently allow researchers to draw reliable conclusions. The stability of these two algorithms has remained obscure. To that end, in this paper, we derive two stability notions for decision trees and logistic regression: hypothesis and pointwise hypothesis stability. Additionally, we derive these notions for L2-regularized logistic regression and confirm existing findings that it is uniformly stable. We show that the stability of decision trees depends on the number of leaves in the tree, i.e., its depth, while for logistic regression, it depends on the smallest eigenvalue of the Hessian matrix of the cross-entropy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical and Computational Modeling · Neural Networks and Applications · Bayesian Modeling and Causal Inference
MethodsLogistic Regression
