THOR: Thermal-guided Hand-Object Reasoning via Adaptive Vision Sampling
Soroush Shahi, Farzad Shahabi, Rama Nabulsi, Glenn Fernandes, Aggelos Katsaggelos, Nabil Alshurafa

TL;DR
THOR is a real-time adaptive sampling method using thermal cues to efficiently monitor hand activities with wearable cameras, significantly reducing data while maintaining high recognition accuracy.
Contribution
It introduces a thermal-guided adaptive RGB frame sampling technique that localizes hand-object interactions and adjusts sampling rates based on activity transitions.
Findings
Captures all activity segments using only 3% of RGB data.
Achieves 95% F1-score in activity recognition, comparable to full video.
Validated on in-the-wild study with 14 participants and extensive datasets.
Abstract
Wearable cameras are increasingly used as an observational and interventional tool for human behaviors by providing detailed visual data of hand-related activities. This data can be leveraged to facilitate memory recall for logging of behavior or timely interventions aimed at improving health. However, continuous processing of RGB images from these cameras consumes significant power impacting battery lifetime, generates a large volume of unnecessary video data for post-processing, raises privacy concerns, and requires substantial computational resources for real-time analysis. We introduce THOR, a real-time adaptive spatio-temporal RGB frame sampling method that leverages thermal sensing to capture hand-object patches and classify them in real-time. We use low-resolution thermal camera data to identify moments when a person switches from one hand-related activity to another, and adjust…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Emotion and Mood Recognition · Context-Aware Activity Recognition Systems
