A Hybrid Tsallis-Polarization Impurity Measure for Decision Trees: Theoretical Foundations and Empirical Evaluation
Edouard Lansiaux, Idriss Jairi, Hayfa Zgaya-Biau

TL;DR
This paper introduces a new hybrid impurity measure for decision trees, combining Tsallis entropy and polarization, offering a balance of theoretical soundness and computational efficiency, with strong empirical performance.
Contribution
The paper presents the Integrated Tsallis Combination (ITC), a novel hybrid impurity measure with proven theoretical properties and competitive empirical results, advancing decision tree impurity measures.
Findings
ITC variants achieve 88.38-89.16% accuracy on benchmark datasets.
Simple parametric measures like Tsallis 0.5 reach 91.17% accuracy.
No significant performance difference among top impurity measures.
Abstract
We introduce the Integrated Tsallis Combination (ITC), a hybrid impurity measure for decision tree learning that combines normalized Tsallis entropy with an exponential polarization component. While many existing measures sacrifice theoretical soundness for computational efficiency or vice versa, ITC provides a mathematically principled framework that balances both aspects. The core innovation lies in the complementarity between Tsallis entropy's information-theoretic foundations and the polarization component's sensitivity to distributional asymmetry. We establish key theoretical properties-concavity under explicit parameter conditions, proper boundary conditions, and connections to classical measures-and provide a rigorous justification for the hybridization strategy. Through an extensive comparative evaluation on seven benchmark datasets comparing 23 impurity measures with five-fold…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStatistical Mechanics and Entropy · Clusterin in disease pathology · Benford’s Law and Fraud Detection
