# Computational hermeneutics: evaluating generative AI as a cultural technology

**Authors:** Cody Kommers, Ruth Ahnert, Maria Antoniak, Emmanouil Benetos, Steve Benford, Mercedes Bunz, Baptiste Caramiaux, Shauna Concannon, Martin Disley, James Dobson, Yali Du, Edgar Duéñez-Guzmán, Kerry Francksen, Evelyn Gius, Jonathan W. Y. Gray, Ryan Heuser, Sarah Immel, Richard Jean So, Sang Leigh, Dalaki Livingston, Hoyt Long, Meredith Martin, Georgia Meyer, Daniela Mihai, Ashley Noel-Hirst, Kirsten Ostherr, Deven Parker, Yipeng Qin, Jessica Ratcliff, Emily Robinson, Karina Rodriguez, Adam Sobey, Ted Underwood, Aditya Vashistha, Matthew Wilkens, Youyou Wu, Yuan Zheng, Drew Hemment

PMC · DOI: 10.3389/frai.2026.1753041 · Frontiers in Artificial Intelligence · 2026-02-26

## TL;DR

This paper introduces computational hermeneutics as a new way to evaluate generative AI by focusing on meaning and context rather than just accuracy.

## Contribution

The paper proposes computational hermeneutics as a novel framework for evaluating generative AI systems through interpretive cultural challenges.

## Key findings

- GenAI systems function as 'context machines' that inherently address interpretive challenges like situatedness, plurality, and ambiguity.
- The paper introduces three principles for hermeneutic evaluation: iterative benchmarks, inclusion of people, and measuring cultural context.
- Computational hermeneutics shifts AI evaluation from standardized accuracy to contextual meaning.

## Abstract

Generative AI (GenAI) systems are increasingly recognized as cultural technologies, yet current evaluation frameworks often treat culture as a variable to be measured rather than fundamental to the system's operation. Drawing on hermeneutic theory from the humanities, we argue that GenAI systems function as "context machines" that must inherently address three interpretive challenges: situatedness (meaning only emerges in context), plurality (multiple valid interpretations coexist), and ambiguity (interpretations naturally conflict). We present computational hermeneutics as an emerging framework offering an interpretive account of what GenAI systems do, and how they might do it better. We offer three principles for hermeneutic evaluation—that benchmarks should be iterative, not one-off; include people, not just machines; and measure cultural context, not just model output. This perspective offers a nascent paradigm for designing and evaluating contemporary AI systems: shifting from standardized questions about accuracy to contextual ones about meaning.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12979437/full.md

## References

117 references — full list in the complete paper: https://tomesphere.com/paper/PMC12979437/full.md

---
Source: https://tomesphere.com/paper/PMC12979437