Efficient decision tree training with new data structure for secure multi-party computation
Koki Hamada, Dai Ikarashi, Ryo Kikuchi, Koji Chida

TL;DR
This paper introduces a new data structure for secure multi-party decision tree training that reduces the number of comparisons from exponential to linear in the tree height, improving efficiency and practicality.
Contribution
The authors develop a novel secure data structure that enables efficient decision tree training in MPC without dummy rows, reducing comparisons from exponential to linear in tree height.
Findings
Training a decision tree with height 5 takes 33 seconds on 100,000 rows.
The new protocol requires only O(hmn log n) comparisons, improving over previous exponential methods.
Implementation demonstrates practical feasibility for large datasets.
Abstract
We propose a secure multi-party computation (MPC) protocol that constructs a secret-shared decision tree for a given secret-shared dataset. The previous MPC-based decision tree training protocol (Abspoel et al. 2021) requires comparisons, being exponential in the tree height and with and being the number of rows and that of attributes in the dataset, respectively. The cause of the exponential number of comparisons in is that the decision tree training algorithm is based on the divide-and-conquer paradigm, where dummy rows are added after each split in order to hide the number of rows in the dataset. We resolve this issue via secure data structure that enables us to compute an aggregate value for every group while hiding the grouping information. By using this data structure, we can train a decision tree without adding dummy rows while hiding the size of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Complexity and Algorithms in Graphs
