IMUGPT 2.0: Language-Based Cross Modality Transfer for Sensor-Based Human Activity Recognition
Zikang Leng, Amitrajit Bhattacharjee, Hrudhai Rajasekhar, Lizhe Zhang, Elizabeth Bruda, Hyeokhyen Kwon, Thomas Plötz

TL;DR
This paper evaluates language-based cross modality transfer for human activity recognition, introduces enhancements to IMUGPT for practical use, and demonstrates significant reductions in data generation effort.
Contribution
It provides a large-scale evaluation of language-driven cross modality transfer and introduces motion filtering and diversity metrics to improve IMUGPT's practical applicability.
Findings
Diversity metrics reduce virtual IMU data generation effort by at least 50%.
Language-based transfer is effective for sensor-based human activity recognition.
Enhanced IMUGPT enables more practical and efficient HAR applications.
Abstract
One of the primary challenges in the field of human activity recognition (HAR) is the lack of large labeled datasets. This hinders the development of robust and generalizable models. Recently, cross modality transfer approaches have been explored that can alleviate the problem of data scarcity. These approaches convert existing datasets from a source modality, such as video, to a target modality (IMU). With the emergence of generative AI models such as large language models (LLMs) and text-driven motion synthesis models, language has also become a promising source data modality, as shown in proofs of concept such as IMUGPT. In this work, we conduct a large-scale evaluation of language-based cross modality transfer to determine its effectiveness for HAR. Based on this study, we introduce two new extensions for IMUGPT that enhance its use for practical HAR application scenarios: a…
Taxonomy
Topics: Context-Aware Activity Recognition Systems · Human Pose and Action Recognition
Methods: Sparse Evolutionary Training
