Large Language Models are Easily Confused: A Quantitative Metric,   Security Implications and Typological Analysis

Yiyi Chen; Qiongxiu Li; Russa Biswas; Johannes Bjerva

arXiv:2410.13237·cs.CL·February 11, 2025

Large Language Models are Easily Confused: A Quantitative Metric, Security Implications and Typological Analysis

Yiyi Chen, Qiongxiu Li, Russa Biswas, Johannes Bjerva

PDF

Open Access 1 Video

TL;DR

This paper introduces a new metric to quantify language confusion in large language models, analyzes its patterns, and explores security implications, especially in multilingual contexts, revealing linguistic regularities and vulnerabilities.

Contribution

It presents a novel Language Confusion Entropy metric, links language confusion to security issues, and uses linguistic typology for interpretability and improved LLM alignment.

Findings

01

Language Confusion Entropy effectively measures confusion in LLMs.

02

Patterns of language confusion are linked to linguistic typology.

03

Language confusion impacts security, especially in multilingual embedding attacks.

Abstract

Language Confusion is a phenomenon where Large Language Models (LLMs) generate text that is neither in the desired language, nor in a contextually appropriate language. This phenomenon presents a critical challenge in text generation by LLMs, often appearing as erratic and unpredictable behavior. We hypothesize that there are linguistic regularities to this inherent vulnerability in LLMs and shed light on patterns of language confusion across LLMs. We introduce a novel metric, Language Confusion Entropy, designed to directly measure and quantify this confusion, based on language distributions informed by linguistic typology and lexical variation. Comprehensive comparisons with the Language Confusion Benchmark (Marchisio et al., 2024) confirm the effectiveness of our metric, revealing patterns of language confusion across LLMs. We further link language confusion to LLM security, and find…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Large Language Models are Easily Confused: A Quantitative Metric, Security Implications and Typological Analysis· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Text Readability and Simplification