Understanding the Capabilities and Limitations of Large Language Models   for Cultural Commonsense

Siqi Shen; Lajanugen Logeswaran; Moontae Lee; Honglak Lee; Soujanya; Poria; Rada Mihalcea

arXiv:2405.04655·cs.CL·May 9, 2024·2 cites

Understanding the Capabilities and Limitations of Large Language Models for Cultural Commonsense

Siqi Shen, Lajanugen Logeswaran, Moontae Lee, Honglak Lee, Soujanya, Poria, Rada Mihalcea

PDF

Open Access 1 Video

TL;DR

This paper evaluates large language models' ability to understand cultural commonsense, revealing significant biases and performance variations across different cultures and query languages.

Contribution

It provides a comprehensive analysis of LLMs' cultural commonsense understanding, highlighting biases and influencing factors, and offers insights for developing culturally aware models.

Findings

01

LLMs show performance disparities across cultures

02

Cultural context affects general commonsense capabilities

03

Query language impacts LLMs' cultural task performance

Abstract

Large language models (LLMs) have demonstrated substantial commonsense understanding through numerous benchmark evaluations. However, their understanding of cultural commonsense remains largely unexamined. In this paper, we conduct a comprehensive examination of the capabilities and limitations of several state-of-the-art LLMs in the context of cultural commonsense tasks. Using several general and cultural commonsense benchmarks, we find that (1) LLMs have a significant discrepancy in performance when tested on culture-specific commonsense knowledge for different cultures; (2) LLMs' general commonsense capability is affected by cultural context; and (3) The language used to query the LLMs can impact their performance on cultural-related tasks. Our study points to the inherent bias in the cultural understanding of LLMs and provides insights that can help develop culturally aware language…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Understanding the Capabilities and Limitations of Large Language Models for Cultural Commonsense· underline

Taxonomy

TopicsNatural Language Processing Techniques

MethodsAttentive Walk-Aggregating Graph Neural Network