Perceive What Matters: Relevance-Driven Scheduling for Multimodal Streaming Perception
Dingcheng Huang, Xiaotong Zhang, Kamal Youcef-Toumi

TL;DR
This paper introduces a lightweight, relevance-driven perception scheduling framework for multimodal streaming perception in human-robot collaboration, reducing latency and improving key perception metrics by leveraging scene context and previous outputs.
Contribution
It presents a novel real-time perception scheduling method that efficiently allocates computational resources based on scene relevance, addressing redundancy and latency issues in multimodal perception pipelines.
Findings
Reduces computational latency by up to 27.52%.
Achieves a 72.73% improvement in MMPose activation recall.
Attains up to 98% keyframe accuracy.
Abstract
In modern human-robot collaboration (HRC) applications, multiple perception modules jointly extract visual, auditory, and contextual cues to achieve comprehensive scene understanding, enabling the robot to provide appropriate assistance to human agents intelligently. While executing multiple perception modules on a frame-by-frame basis enhances perception quality in offline settings, it inevitably accumulates latency, leading to a substantial decline in system performance in streaming perception scenarios. Recent work in scene understanding, termed Relevance, has established a solid foundation for developing efficient methodologies in HRC. However, modern perception pipelines still face challenges related to information redundancy and suboptimal allocation of computational resources. Drawing inspiration from the Relevance concept and the information sparsity in HRC events, we propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Social Robot Interaction and HRI · Robot Manipulation and Learning
