# Towards automatic estimation of conversation floors within F-formations

**Authors:** Chirag Raman, Hayley Hung

arXiv: 1907.10384 · 2020-09-30

## TL;DR

This paper investigates how to automatically estimate multiple conversation floors within F-formations by analyzing simultaneous speaker turns, revealing that multiple floors often exist and that larger groups tend to have shorter simultaneous turns.

## Contribution

It introduces a metric based on simultaneous speaker turns to infer multiple conversation floors within F-formations, unifying spatial and temporal perspectives.

## Key findings

- Multiple conversation floors can exist within an F-formation.
- Larger groups tend to have shorter durations of simultaneous speaking turns.

## Abstract

The detection of free-standing conversing groups has received significant attention in recent years. In the absence of a formal definition, most studies operationalize the notion of a conversation group either through a spatial or a temporal lens. Spatially, the most commonly used representation is the F-formation, defined by social scientists as the configuration in which people arrange themselves to sustain an interaction. However, the use of this representation is often accompanied with the simplifying assumption that a single conversation occurs within an F-formation. Temporally, various categories have been used to organize conversational units; these include, among others, turn, topic, and floor. Some of these concepts are hard to define objectively by themselves. The present work constitutes an initial exploration into unifying these perspectives by primarily posing the question: can we use the observation of simultaneous speaker turns to infer whether multiple conversation floors exist within an F-formation? We motivate a metric for the existence of distinct conversation floors based on simultaneous speaker turns, and provide an analysis using this metric to characterize conversations across F-formations of varying cardinality. We contribute two key findings: firstly, at the average speaking turn duration of about two seconds for humans, there is evidence for the existence of multiple floors within an F-formation; and secondly, an increase in the cardinality of an F-formation correlates with a decrease in duration of simultaneous speaking turns.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1907.10384/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/1907.10384/full.md

## References

38 references — full list in the complete paper: https://tomesphere.com/paper/1907.10384/full.md

---
Source: https://tomesphere.com/paper/1907.10384