Edge-Efficient Two-Stream Multimodal Architecture for Non-Intrusive Bathroom Fall Detection
Haitian Wang, Yiren Wang, Xinyu Wang, Sheldon Fung, Atif Mansoor

TL;DR
This paper introduces a low-power, two-stream multimodal architecture for non-intrusive bathroom fall detection using radar and vibration data, achieving high accuracy and real-time performance on edge devices.
Contribution
It proposes a novel two-stream model with cross-conditioned fusion that explicitly encodes causal links between motion and impact, improving fall detection accuracy and efficiency.
Findings
Achieved 96.1% accuracy and 88.0% recall on the benchmark dataset.
Reduced inference latency from 35.9 ms to 15.8 ms on Raspberry Pi 4B.
Lowered energy consumption per window from 14200 mJ to 10750 mJ.
Abstract
Falls in wet bathroom environments are a major safety risk for seniors living alone. Recent work has shown that mmWave-only, vibration-only, and existing multimodal schemes, such as vibration-triggered radar activation, early feature concatenation, and decision-level score fusion, can support privacy-preserving, non-intrusive fall detection. However, these designs still treat motion and impact as loosely coupled streams, depending on coarse temporal alignment and amplitude thresholds, and do not explicitly encode the causal link between radar-observed collapse and floor impact or address timing drift, object drop confounders, and latency and energy constraints on low-power edge devices. To this end, we propose a two-stream architecture that encodes radar signals with a Motion--Mamba branch for long-range motion patterns and processes floor vibration with an Impact--Griffin branch that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
