Feature-Budgeted Random Forest
Feng Nan, Joseph Wang, Venkatesh Saligrama

TL;DR
This paper introduces a novel random forest algorithm designed to minimize prediction error within a specified feature acquisition budget, balancing accuracy and cost during prediction.
Contribution
The paper presents a new random forest method that explicitly accounts for feature acquisition costs and provides theoretical guarantees on cost efficiency.
Findings
Achieves superior accuracy-cost trade-offs on benchmark datasets.
Provides near-optimal guarantees for feature acquisition costs.
Outperforms existing prediction-time algorithms in experiments.
Abstract
We seek decision rules for prediction-time cost reduction, where complete data is available for training, but during prediction-time, each feature can only be acquired for an additional cost. We propose a novel random forest algorithm to minimize prediction error for a user-specified {\it average} feature acquisition budget. While random forests yield strong generalization performance, they do not explicitly account for feature costs and furthermore require low correlation among trees, which amplifies costs. Our random forest grows trees with low acquisition cost and high strength based on greedy minimax cost-weighted-impurity splits. Theoretically, we establish near-optimal acquisition cost guarantees for our algorithm. Empirically, on a number of benchmark datasets we demonstrate superior accuracy-cost curves against state-of-the-art prediction-time algorithms.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Face and Expression Recognition · Anomaly Detection Techniques and Applications
