Governance by Evidence: Regulated Predictors in Decision-Tree Models
Alexios Veskoukis, Dimitris Kalles

TL;DR
This paper analyzes decision-tree models to identify predictors that fall under privacy regulations, highlighting the prevalence of regulated data types across industries and informing privacy-preserving practices.
Contribution
It introduces a systematic approach to categorize decision-tree predictors based on privacy laws, revealing industry-specific patterns and temporal trends in regulated data usage.
Findings
Many predictors are in regulated categories, especially in healthcare.
Significant industry differences in the use of regulated data.
Temporal patterns show changes in predictor usage over time.
Abstract
Decision-tree methods are widely used on structured tabular data and are valued for interpretability across many sectors. However, published studies often list the predictors they use (for example age, diagnosis codes, location). Privacy laws increasingly regulate such data types. We use published decision-tree papers as a proxy for real-world use of legally governed data. We compile a corpus of decision-tree studies and assign each reported predictor to a regulated data category (for example health data, biometric identifiers, children's data, financial attributes, location traces, and government IDs). We then link each category to specific excerpts in European Union and United States privacy laws. We find that many reported predictors fall into regulated categories, with the largest shares in healthcare and clear differences across industries. We analyze prevalence, industry…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Privacy, Security, and Data Protection · Ethics and Social Impacts of AI
