Simulating Meaning, Nevermore! Introducing ICR: A Semiotic-Hermeneutic Metric for Evaluating Meaning in LLM Text Summaries
Natalie Perez, Sreyoshi Bhaduri, Aman Chadha

TL;DR
This paper introduces ICR, a semiotic-hermeneutic metric for evaluating the nuanced and context-dependent meaning in LLM-generated summaries, highlighting the limitations of traditional lexical similarity metrics.
Contribution
The paper presents the ICR metric, a novel qualitative evaluation method integrating semiotics and hermeneutics to better assess semantic accuracy in LLM outputs.
Findings
LLMs excel at linguistic similarity but struggle with contextual meaning.
Performance of LLMs varies across datasets and models.
Larger datasets improve semantic evaluation but do not fully resolve interpretive gaps.
Abstract
Meaning in human language is relational, context dependent, and emergent, arising from dynamic systems of signs rather than fixed word-concept mappings. In computational settings, this semiotic and interpretive complexity complicates the generation and evaluation of meaning. This article proposes an interdisciplinary framework for studying meaning in large language model (LLM) generated language by integrating semiotics and hermeneutics with qualitative research methods. We review prior scholarship on meaning and machines, examining how linguistic signs are transformed into vectorized representations in static and contextualized embedding models, and identify gaps between statistical approximation and human interpretive meaning. We then introduce the Inductive Conceptual Rating (ICR) metric, a qualitative evaluation approach grounded in inductive content analysis and reflexive thematic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational and Text Analysis Methods · Topic Modeling · Language and cultural evolution
