Cross Feature Selection to Eliminate Spurious Interactions and Single Feature Dominance Explainable Boosting Machines
Shree Charran R, Sandipan Das Mahapatra

TL;DR
This paper introduces a novel cross-feature selection approach for Explainable Boosting Machines to eliminate spurious interactions and feature dominance, enhancing interpretability and predictive accuracy.
Contribution
It proposes a multi-step feature selection and ensemble technique that improves EBM interpretability and performance by addressing feature interaction issues.
Findings
Outperforms vanilla EBM on benchmark datasets
Enhances interpretability and feature stability
Reduces single feature dominance in predictions
Abstract
Interpretability is a crucial aspect of machine learning models that enables humans to understand and trust the decision-making process of these models. In many real-world applications, the interpretability of models is essential for legal, ethical, and practical reasons. For instance, in the banking domain, interpretability is critical for lenders and borrowers to understand the reasoning behind the acceptance or rejection of loan applications as per fair lending laws. However, achieving interpretability in machine learning models is challenging, especially for complex high-performance models. Hence Explainable Boosting Machines (EBMs) have been gaining popularity due to their interpretable and high-performance nature in various prediction tasks. However, these models can suffer from issues such as spurious interactions with redundant features and single-feature dominance across all…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Stock Market Forecasting Methods · Imbalanced Data Classification Techniques
Methodsenergy-based model · Feature Selection
