Feedback Driven Multi Stereo Vision System for Real-Time Event Analysis
Mohamed Benkedadra, Matei Mancas, Sidi Ahmed Mahmoudi

TL;DR
This paper presents a real-time 3D stereo vision system that fuses multiple cameras for robust scene understanding, enabling applications like event recognition and subject tracking in complex environments.
Contribution
It introduces a novel multi-camera stereo vision pipeline with feedback mechanisms for adaptive scene analysis in interactive systems.
Findings
Effective multi-camera fusion for scene reconstruction
Preliminary success in event recognition and tracking
Roadmap for production deployment
Abstract
2D cameras are often used in interactive systems. Other systems like gaming consoles provide more powerful 3D cameras for short range depth sensing. Overall, these cameras are not reliable in large, complex environments. In this work, we propose a 3D stereo vision based pipeline for interactive systems, that is able to handle both ordinary and sensitive applications, through robust scene understanding. We explore the fusion of multiple 3D cameras to do full scene reconstruction, which allows for preforming a wide range of tasks, like event recognition, subject tracking, and notification. Using possible feedback approaches, the system can receive data from the subjects present in the environment, to learn to make better decisions, or to adapt to completely new environments. Throughout the paper, we introduce the pipeline and explain our preliminary experimentation and results. Finally,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
