Large Language Models Memorize Sensor Datasets! Implications on Human   Activity Recognition Research

Harish Haresamudram; Hrudhai Rajasekhar; Nikhil Murlidhar Shanbhogue,; Thomas Ploetz

arXiv:2406.05900·cs.LG·June 11, 2024

Large Language Models Memorize Sensor Datasets! Implications on Human Activity Recognition Research

Harish Haresamudram, Hrudhai Rajasekhar, Nikhil Murlidhar Shanbhogue,, Thomas Ploetz

PDF

Open Access

TL;DR

This paper reveals that large language models have likely memorized wearable sensor datasets used in human activity recognition benchmarks, raising concerns about the validity of previous evaluation results.

Contribution

It demonstrates that LLMs can memorize sensor data from HAR datasets, highlighting the need to reconsider evaluation methods in sensor-based activity recognition research.

Findings

01

LLMs show significant data memorization from HAR datasets

02

GPT-4 can reproduce sensor data blocks from benchmark datasets

03

Memorization may compromise the validity of LLM-based HAR evaluations

Abstract

The astonishing success of Large Language Models (LLMs) in Natural Language Processing (NLP) has spurred their use in many application domains beyond text analysis, including wearable sensor-based Human Activity Recognition (HAR). In such scenarios, often sensor data are directly fed into an LLM along with text instructions for the model to perform activity classification. Seemingly remarkable results have been reported for such LLM-based HAR systems when they are evaluated on standard benchmarks from the field. Yet, we argue, care has to be taken when evaluating LLM-based HAR systems in such a traditional way. Most contemporary LLMs are trained on virtually the entire (accessible) internet -- potentially including standard HAR datasets. With that, it is not unlikely that LLMs actually had access to the test data used in such benchmark experiments.The resulting contamination of training…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsContext-Aware Activity Recognition Systems

MethodsAttention Is All You Need · Softmax · Layer Normalization · Linear Layer · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection · Multi-Head Attention · Position-Wise Feed-Forward Layer