RAG-HAR: Retrieval Augmented Generation-based Human Activity Recognition
Nirhoshan Sivaroopan, Hansi Karunarathna, Chamara Madarasingha, Anura Jayasumana, Kanchana Thilakarathna

TL;DR
RAG-HAR is a training-free, retrieval-augmented framework using large language models for human activity recognition, achieving state-of-the-art results without dataset-specific training.
Contribution
It introduces a novel retrieval-based approach leveraging LLMs and prompt optimization for accurate HAR without training or fine-tuning.
Findings
Achieves state-of-the-art performance on six HAR benchmarks.
Operates without model training or fine-tuning.
Capable of recognizing unseen human activities.
Abstract
Human Activity Recognition (HAR) underpins applications in healthcare, rehabilitation, fitness tracking, and smart environments, yet existing deep learning approaches demand dataset-specific training, large labeled corpora, and significant computational resources.We introduce RAG-HAR, a training-free retrieval-augmented framework that leverages large language models (LLMs) for HAR. RAG-HAR computes lightweight statistical descriptors, retrieves semantically similar samples from a vector database, and uses this contextual evidence to make LLM-based activity identification. We further enhance RAG-HAR by first applying prompt optimization and introducing an LLM-based activity descriptor that generates context-enriched vector databases for delivering accurate and highly relevant contextual information. Along with these mechanisms, RAG-HAR achieves state-of-the-art performance across six…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
