Natural interaction with traffic control cameras through multimodal interfaces
Marco Grazioso, Alessandro Sebastian Podda, Silvio Barra, Francesco, Cutugno

TL;DR
This paper presents a multimodal interface using natural language and gestures for traffic control cameras, enhancing interaction speed and naturalness in surveillance scenarios to improve urban safety and security.
Contribution
It introduces a meta user interface leveraging the Put That There paradigm, combining voice and gesture controls via Kinect 2 for efficient traffic surveillance management.
Findings
Effective multimodal interaction with traffic cameras demonstrated
Improved speed and naturalness in user interactions
Enhanced situational awareness in control room scenarios
Abstract
Human-Computer Interfaces have always played a fundamental role in usability and commands' interpretability of the modern software systems. With the explosion of the Artificial Intelligence concept, such interfaces have begun to fill the gap between the user and the system itself, further evolving in Adaptive User Interfaces (AUI). Meta Interfaces are a further step towards the user, and they aim at supporting the human activities in an ambient interactive space; in such a way, the user can control the surrounding space and interact with it. This work aims at proposing a meta user interface that exploits the Put That There paradigm to enable the user to fast interaction by employing natural language and gestures. The application scenario is a video surveillance control room, in which the speed of actions and reactions is fundamental for urban safety and driver and pedestrian security.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
