Mutual Information Scoring: Increasing Interpretability in Categorical Clustering Tasks with Applications to Child Welfare Data
Pranav Sankhe, Seventy F. Hall, Melanie Sage, Maria Y. Rodriquez,, Varun Chandola, Kenneth Joseph

TL;DR
This paper introduces a new categorical clustering method using mutual information scoring to improve interpretability of foster youth data, revealing biases and guiding future research to support vulnerable youth.
Contribution
It presents a novel clustering and summarization approach that uncovers biases in foster care data and suggests directions for further qualitative investigation.
Findings
Identified systemic biases in foster youth administrative data.
Enhanced interpretability of categorical clustering results.
Provided insights for targeted interventions and future research.
Abstract
Youth in the American foster care system are significantly more likely than their peers to face a number of negative life outcomes, from homelessness to incarceration. Administrative data on these youth have the potential to provide insights that can help identify ways to improve their path towards a better life. However, such data also suffer from a variety of biases, from missing data to reflections of systemic inequality. The present work proposes a novel, prescriptive approach to using these data to provide insights about both data biases and the systems and youth they track. Specifically, we develop a novel categorical clustering and cluster summarization methodology that allows us to gain insights into subtle biases in existing data on foster youth, and to provide insight into where further (often qualitative) research is needed to identify potential ways of assisting youth.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHomelessness and Social Issues · Food Security and Health in Diverse Populations · Urban, Neighborhood, and Segregation Studies
