Building and Measuring Trust between Large Language Models
Maarten Buyl, Yousra Fettach, Guillaume Bied, Tijl De Bie

TL;DR
This paper investigates how trust between large language models can be built and measured, comparing explicit psychological questionnaires with implicit behavioral indicators, revealing complex and sometimes contradictory trust dynamics.
Contribution
It introduces methods to build trust in LLMs through rapport, scripting, and prompt adaptation, and compares explicit and implicit trust measures in multi-agent interactions.
Findings
Explicit trust measures are often negatively correlated with implicit trust indicators.
Implicit measures like susceptibility to persuasion may better reflect trust between LLMs.
Context-specific trust assessments are more informative than direct questionnaires.
Abstract
As large language models (LLMs) increasingly interact with each other, most notably in multi-agent setups, we may expect (and hope) that `trust' relationships develop between them, mirroring trust relationships between human colleagues, friends, or partners. Yet, though prior work has shown LLMs to be capable of identifying emotional connections and recognizing reciprocity in trust games, little remains known about (i) how different strategies to build trust compare, (ii) how such trust can be measured implicitly, and (iii) how this relates to explicit measures of trust. We study these questions by relating implicit measures of trust, i.e. susceptibility to persuasion and propensity to collaborate financially, with explicit measures of trust, i.e. a dyadic trust questionnaire well-established in psychology. We build trust in three ways: by building rapport dynamically, by starting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
