An Investigation of Linguistic Biases in LLM-Based Recommendations
Nitin Venkateswaran, Jason Ang, Deep Adhikari, Tarun Krishna Dasari

TL;DR
This study examines how linguistic biases in large language models influence restaurant and product recommendations across different dialects, revealing sensitivity to Indian English and Code-Switching in various models.
Contribution
It provides a detailed analysis of dialect-specific biases in LLM recommendations using rigorous statistical methods and diverse datasets.
Findings
Models show dialect sensitivity in restaurant recommendations, especially with Indian English and Code-Switching.
Product recommendations vary by dialect, with larger models favoring Indian English and Code-Switching prompts.
No consistent trend in model size effects; biases depend on dialect and category.
Abstract
We investigate linguistic biases in LLM-based restaurant and product recommendations given prompts varying across Southern American English (AE), Indian English (IE), and Code-Switched Hindi-English dialects, using the Yelp Open dataset (Yelp Inc., 2023) and Walmart product reviews dataset (PromptCloud,2020). We add lists of restaurant and product names balanced by cuisine type and product category to the prompts given to the LLM, and we zero-shot prompt the LLMs in a cold-start setting to select the top-20 restaurant and product recommendations from these lists for each of the dialect-varied prompts. We prompt LLMs using different list samples across 20 seeds for better generalization, and aggregate per cuisine-type and per category response counts for each seed, question/prompt, and LLM model. We run mixed-effects regression models for each model family and topic (restaurant/product)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
