Towards Scalable Meta-Learning of near-optimal Interpretable Models via Synthetic Model Generations

Kyaw Hpone Myint; Zhe Wu; Alexandre G.R. Day; Giri Iyengar

arXiv:2511.04000·cs.LG·November 7, 2025

Towards Scalable Meta-Learning of near-optimal Interpretable Models via Synthetic Model Generations

Kyaw Hpone Myint, Zhe Wu, Alexandre G.R. Day, Giri Iyengar

PDF

Open Access

TL;DR

This paper presents a scalable meta-learning approach for decision trees by synthetically generating near-optimal models, reducing computational costs while maintaining high performance in high-stakes applications.

Contribution

It introduces a novel synthetic data generation method for meta-learning decision trees, enabling scalable and efficient training without relying on real-world data.

Findings

01

Performance comparable to real-data pre-training

02

Significant reduction in computational costs

03

Enhanced flexibility in data generation

Abstract

Decision trees are widely used in high-stakes fields like finance and healthcare due to their interpretability. This work introduces an efficient, scalable method for generating synthetic pre-training data to enable meta-learning of decision trees. Our approach samples near-optimal decision trees synthetically, creating large-scale, realistic datasets. Using the MetaTree transformer architecture, we demonstrate that this method achieves performance comparable to pre-training on real-world data or with computationally expensive optimal decision trees. This strategy significantly reduces computational costs, enhances data generation flexibility, and paves the way for scalable and efficient meta-learning of interpretable decision tree models.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Imbalanced Data Classification Techniques · Financial Distress and Bankruptcy Prediction