Accoustate: Auto-annotation of IMU-generated Activity Signatures under Smart Infrastructure
Soumyajit Chatterjee, Arun Singh, Bivas Mitra, and Sandip Chakraborty

TL;DR
Accoustate introduces an edge-based, privacy-preserving method for auto-annotating IMU data in smart infrastructure using acoustic signatures and cross-modal HAR models, achieving high accuracy in multi-occupant scenarios.
Contribution
It presents a novel acoustic-based auto-annotation framework for IMU data that handles simultaneous activities and reduces reliance on video data and manual labeling.
Findings
Achieves 82.59% accuracy in a workshop environment.
Achieves 98.32% accuracy in a kitchen environment.
Effectively distinguishes overlapping activities of two individuals.
Abstract
Human activities within smart infrastructures generate a vast amount of IMU data from the wearables worn by individuals. Many existing studies rely on such sensory data for human activity recognition (HAR); however, one of the major bottlenecks is their reliance on pre-annotated or labeled data. Manual human-driven annotations are neither scalable nor efficient, whereas existing auto-annotation techniques heavily depend on video signatures. Still, video-based auto-annotation needs high computation resources and has privacy concerns when the data from a personal space, like a smart-home, is transferred to the cloud. This paper exploits the acoustic signatures generated from human activities to label the wearables' IMU data at the edge, thus mitigating resource requirement and data privacy concerns. We utilize acoustic-based pre-trained HAR models for cross-modal labeling of the IMU data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Context-Aware Activity Recognition Systems · Speech and Audio Processing
