Predicting and Explaining Behavioral Data with Structured Feature Space Decomposition
Peter G Fennell, Zhiya Zuo, Kristina Lerman

TL;DR
This paper introduces S3D, a statistical method inspired by decision trees, for modeling and explaining human behavioral data by selecting important features, partitioning data into homogeneous groups, and providing interpretable predictions.
Contribution
The paper presents S3D, a novel structured sum-of-squares decomposition approach that improves interpretability and predictive accuracy in behavioral data modeling.
Findings
S3D achieves comparable prediction accuracy to state-of-the-art methods.
S3D provides interpretable models revealing key behavioral factors.
The method effectively identifies subgroups within heterogeneous populations.
Abstract
Modeling human behavioral data is challenging due to its scale, sparseness (few observations per individual), heterogeneity (differently behaving individuals), and class imbalance (few observations of the outcome of interest). An additional challenge is learning an interpretable model that not only accurately predicts outcomes, but also identifies important factors associated with a given behavior. To address these challenges, we describe a statistical approach to modeling behavioral data called the structured sum-of-squares decomposition (S3D). The algorithm, which is inspired by decision trees, selects important features that collectively explain the variation of the outcome, quantifies correlations between the features, and partitions the subspace of important features into smaller, more homogeneous blocks that correspond to similarly-behaving subgroups within the population. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Sentiment Analysis and Opinion Mining · Topic Modeling
