Autonomous Human Activity Classification from Ego-vision Camera and Accelerometer Data
Yantao Lu, Senem Velipasalar

TL;DR
This paper introduces a novel autonomous method combining ego-vision camera and IMU data with capsule networks and genetic algorithms to improve fine-grained human activity classification accuracy.
Contribution
It presents a new approach that integrates egocentric video and IMU data using capsule networks and genetic algorithms for autonomous network parameter tuning.
Findings
Achieved 86.6% accuracy for 9-label classification.
Achieved 77.2% accuracy for 26-label classification.
Combining both modalities outperforms single-modality approaches.
Abstract
There has been significant amount of research work on human activity classification relying either on Inertial Measurement Unit (IMU) data or data from static cameras providing a third-person view. Using only IMU data limits the variety and complexity of the activities that can be detected. For instance, the sitting activity can be detected by IMU data, but it cannot be determined whether the subject has sat on a chair or a sofa, or where the subject is. To perform fine-grained activity classification from egocentric videos, and to distinguish between activities that cannot be differentiated by only IMU data, we present an autonomous and robust method using data from both ego-vision cameras and IMUs. In contrast to convolutional neural network-based approaches, we propose to employ capsule networks to obtain features from egocentric video data. Moreover, Convolutional Long Short Term…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHuman Pose and Action Recognition · Video Surveillance and Tracking Methods · Context-Aware Activity Recognition Systems
