"Not in My Backyard": LLMs Uncover Online and Offline Social Biases Against Homelessness
Jonathan A. Karr Jr., Benjamin F. Herbst, Matthew L. Sisk, Xueyun Li, Ting Hua, Matthew Hauenstein, Georgina Curto, Nitesh V. Chawla

TL;DR
This paper investigates online and offline social biases against homelessness, demonstrating that data quantity significantly impacts classifier performance and revealing prevalent negative biases, especially on Reddit, with implications for policy and social perception.
Contribution
It introduces a new multi-domain dataset and shows how GPT-4.1-generated pseudo-labels improve bias detection models, emphasizing data quantity over model size for effective bias classification.
Findings
Negative bias against PEH is widespread online and offline.
GPT-4.1 pseudo-labeling enhances classifier performance.
Small models can approach GPT-4.1 accuracy with sufficient data.
Abstract
Homelessness is a persistent social challenge, impacting millions worldwide. Over 876,000 people experienced homelessness (PEH) in the U.S. in 2025. Social bias is a significant barrier to alleviation, shaping public perception and influencing policymaking. Given that online textual media and offline city council discourse reflect and influence part of public opinion, it provides valuable insights to identify and track social biases against PEH. We present a new, manually-annotated multi-domain dataset compiled from Reddit, X (formerly Twitter), news articles, and city council meeting minutes across ten U.S. cities. Our 16-category multi-label taxonomy creates a challenging long-tail classification problem: some categories appear in less than 1% of samples, while others exceed 70%. We find that small human-annotated datasets (1,702 samples) are insufficient for training effective…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHomelessness and Social Issues
