A Need for Trust in Conversational Interface Research
Justin Edwards, Elaheh Sanoubari

TL;DR
This paper highlights the importance of trust in conversational interfaces, reviews various understandings and measurement approaches, and calls for clearer definitions to advance research in the field.
Contribution
It provides a comprehensive overview of trust concepts and measurement methods in conversational interaction research, emphasizing the need for standardized definitions.
Findings
Trust is recognized as critical in conversational interactions.
Current understanding of trust varies across studies.
Measurement approaches for trust are diverse and lack standardization.
Abstract
Across several branches of conversational interaction research including interactions with social robots, embodied agents, and conversational assistants, users have identified trust as a critical part of those interactions. Nevertheless, there is little agreement on what trust means within these sort of interactions or how trust can be measured. In this paper, we explore some of the dimensions of trust as it has been understood in previous work and we outline some of the ways trust has been measured in the hopes of furthering discussion of the concept across the field.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A Need for Trust in Conversational Interface Research
Justin Edwards
University College DublinIreland
and
Elaheh Sanoubari
University of ManitobaCanada
(2019)
Abstract.
Across several branches of conversational interaction research including interactions with social robots, embodied agents, and conversational assistants, users have identified trust as a critical part of those interactions. Nevertheless, there is little agreement on what trust means within these sort of interactions or how trust can be measured. In this paper, we explore some of the dimensions of trust as it has been understood in previous work and we outline some of the ways trust has been measured in the hopes of furthering discussion of the concept across the field.
trust, design, conversational agents, Human-robot interaction, Human-agent interaction
††journalyear: 2019††conference: 1st International Conference on Conversational User Interfaces; August 22–23, 2019; Dublin, Ireland††booktitle: 1st International Conference on Conversational User Interfaces (CUI 2019), August 22–23, 2019, Dublin, Ireland††price: 15.00††doi: 10.1145/3342775.3342809††isbn: 978-1-4503-7187-2/19/08††ccs: Human-centered computing HCI theory, concepts and models††ccs: Human-centered computing Interaction design theory, concepts and paradigms††ccs: Human-centered computing User centered design
1. Introduction
Trust is an important dimension in how people use any technology and development or extinction of it plays a large role in the ultimate adoption of the technology (Young et al., 2009). Trust is specifically important in scenarios where people need to accept information provided by an agent and follow its suggestions to benefit from it (Freedy et al., 2007). Designing such interfaces with trust in mind can help maintain effective relationship with agents.
One example of these interfaces is conversational user interfaces (CUIs) which can include text-based dialogue systems, voice-based conversational assistants, embodied virtual agents, and social robots. While CUIs have become increasingly prevalent in the last several years, numerous user studies have revealed that people have concerns about trusting these interfaces (Clark et al., 2019; Olson and Kemery, 2019; Cowan et al., 2017). Nevertheless, there has been little discussion in the community on how to define trust in these interactions, how to measure trust, or how designers can make interfaces more trustworthy (Torre et al., 2018a).
Users can have a range of issues with CUIs that fall under the umbrella of trust, yet have fundamental differences. For example, a person may have mistrust in a robot because they think it is faulty and not computationally capable of doing a task (Salem et al., 2015), or they may not trust it because they think it is malicious and trying deceiving them (Short et al., 2010). We argue that it is important to differentiate different aspects of trust and what shapes them as only by understanding and operationalizing trust will it be possible to meet user needs when designing CUIs. In this paper, we outline some of the possible dimensions to consider when defining trust for CUIs and explore some avenues for measuring trust in conversational interactions.
2. Defining Trust
The first step in designing for trust is understanding and operationalizing what is meant when CUI users mention the term. While other topics in human-computer interaction (HCI) have attempted to map the meaning and role of trust in their field (e.g. automation (Lee and See, 2004) and e-commerce (Egger, 2000)) this has not yet occurred in CUI interactions. As interactions with these interfaces are frequently modeled on human interactions both by designers and implicitly by users (Cowan et al., 2017), there may be reason to believe that trust in this context has a social or relational purpose. This conceptualization of trust, what has been operationalized as a willingness to become vulnerable to others in the social sciences (Rousseau et al., 1998), has been applied in HCI previously in the context of information exchanges on websites (McKnight et al., 2002).
CUI user studies have revealed different meanings of trust however. One study of intelligent personal assistant (IPA) users found many users referring to issues around trust in the CUI’s intents, for example regarding identity and privacy protections (Clark et al., 2019). Here, users were not concerned with disclosure of sensitive information because of a social uneasiness, but instead because they were unsure how their data would be treated and what data was being collected in the first place. This notion of trust has been previously explored in the context of media consumers level of trust in media sources (Kiousis, 2001) and may help to frame this version of trust in HCI.
Other user studies have revealed a conceptualization of trust that relates instead to a belief in the abilities and competency of the interface (Luger and Sellen, 2016). This definition of trust more closely resembles the concept of credibility or believability which has been well studied in the context of websites and news media credibility (Flanagin and Metzger, 2007; Fogg and Tseng, 1999).
Furthermore, trust or lack of it can be analyzed by where it is originated and how it is shaped. That is, in human-computer interaction trust can be synthesized by traits in both the computer and the human. For example, different people can have different attitudes towards the similar technologies which can be shaped by factor such as cultural differences (Haring et al., 2014), personality traits (Salem et al., 2015), context/task (Wang et al., 2010), familiarity (de Visser et al., 2016), etc.
Clearly, these definitions of trust are heterogeneous and the single concept of trust may have multifaceted meanings for people interacting with CUIs. A better understanding of these dimensions of trust is needed as CUI research moves forward.
3. Measuring trust
Because of the multifaceted nature of trust, it is clear that a mixed-methods approach would be necessary in measuring different dimensions of the construct. Some of these methods might include behavioural measures, subjective questionnaires, analysis of body language and linguistic choices, and physiological measures.
Previous assessments of trust have taken varied approaches according to the way the concept was operationalized. This sort of operationalizing of trust has been done in automation research to better capture the concept through emprically supported factors (Hoff and Bashir, 2015). Much work in interactions literature has been done on the Big Five personality traits (John et al., 1999), in which trustworthiness is considered a component of agreeableness and thus a stable aspect of personality. Work looking at trust this way has used Likert-scale questionnaires to gauge people’s ratings of those traits in embodied conversational agents (Celiktutan and Gunes, 2017).
Other work has viewed trust as a trait that is acquired through behavioural experience and thusly measured trust though people’s choices in an economic game like the classic prisoner’s dilemma (Torre et al., 2018b). Additionally, measures of interpersonal trust in human-human dialogues and measures of functional kinds of trust for e-commerce systems have been correlated or been proposed to be studied alongside physiological measures like heart rate (Mitkidis et al., 2015) and eye movements (Riegelsberger et al., 2003). Methods involving laboratory experiments are of course costly to utilize, so analysis of existing or easily collected data are likewise important. In some studies of trust as an interpersonal construct in human-human dialogues, trust has been measured through corpus analysis of linguistic features (Scissors et al., 2009) and nonverbal cues (Lee et al., 2013). Analyses like these may be particularly useful in quickly measuring trust for conversational agents deployed over the web like commercial IPAs and chatbots. Combining subjective, behavioral, and physiological measures of trust not only enrich the mixed methods approach proposed here for studying trust, but they may also, in combination, allow for creation of robust validated measures which are lacking in conversational HCI in general (Clark et al., 2018).
4. Conclusion
Trust has been recognized as an integral dimension in how people request and use information from machines, how people physically interact with machines, and how people work alongside machines. While trust has been explored in different areas of HCI and social sciences in previous work, the growing research areas that involve CUIs need definitions and measurements for understanding the construct in conversational contexts. Because these interactions are fundamentally modeled on human-human interactions and applied to computers, they must be informed by research both in social science and HCI. Further design of CUIs requires our community to engage deeply with social constructs like trust in order to better understand user behaviour and facilitate adoption of these technologies.
Acknowledgements.
This research was supported by Science Foundation Ireland (SFI) award number: 13/RC/2106 ADAPT. The authors would like to thank Ilaria Torre, Leigh Clark, Benjamin Cowan, and all attendees of Measuring and Designing Trust in Human-Agent Interaction at HAI 2018 for valuable discussion of this topic.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1)
- 2Celiktutan and Gunes (2017) Oya Celiktutan and Hatice Gunes. 2017. Automatic Prediction of Impressions in Time and across Varying Context: Personality, Attractiveness and Likeability. IEEE Transactions on Affective Computing 8, 1 (Jan. 2017), 29–42. https://doi.org/10.1109/TAFFC.2015.2513401 · doi ↗
- 3Clark et al . (2018) Leigh Clark, Phillip Doyle, Diego Garaialde, Emer Gilmartin, Stephan Schlögl, Jens Edlund, Matthew Aylett, João Cabral, Cosmin Munteanu, and Benjamin Cowan. 2018. The State of Speech in HCI: Trends, Themes and Challenges. ar Xiv:1810.06828 [cs] (Oct. 2018). http://arxiv.org/abs/1810.06828 ar Xiv: 1810.06828.
- 4Clark et al . (2019) Leigh Clark, Cosmin Munteanu, Vincent Wade, Benjamin R. Cowan, Nadia Pantidi, Orla Cooney, Philip Doyle, Diego Garaialde, Justin Edwards, Brendan Spillane, Emer Gilmartin, and Christine Murad. 2019. What Makes a Good Conversation?: Challenges in Designing Truly Conversational Agents. In Proceedings of the 2019 CHI Conference on Human Factors in Computing Systems - CHI ’19 . ACM Press, Glasgow, Scotland Uk, 1–12. https://doi.org/10.1145/3290605.3300705 · doi ↗
- 5Cowan et al . (2017) Benjamin R Cowan, Nadia Pantidi, David Coyle, Kellie Morrissey, Peter Clarke, Sara Al-Shehri, David Earley, and Natasha Bandeira. 2017. What can i help you with?: infrequent users’ experiences of intelligent personal assistants. In Proceedings of the 19th International Conference on Human-Computer Interaction with Mobile Devices and Services . ACM, 43.
- 6de Visser et al . (2016) Ewart J de Visser, Samuel S Monfort, Ryan Mc Kendrick, Melissa AB Smith, Patrick E Mc Knight, Frank Krueger, and Raja Parasuraman. 2016. Almost human: Anthropomorphism increases trust resilience in cognitive agents. Journal of Experimental Psychology: Applied 22, 3 (2016), 331.
- 7Egger (2000) Florian N Egger. 2000. Trust me, I’m an online vendor: towards a model of trust for e-commerce system design. In CHI’00 extended abstracts on Human factors in computing systems . ACM, 101–102.
- 8Flanagin and Metzger (2007) Andrew J. Flanagin and Miriam J. Metzger. 2007. The role of site features, user attributes, and information verification behaviors on the perceived credibility of web-based information. New Media & Society 9, 2 (April 2007), 319–342. https://doi.org/10.1177/1461444807075015 · doi ↗
