TL;DR
ZARA is a training-free, knowledge- and retrieval-augmented framework that enables robust motion time-series reasoning using LLMs, without retraining, by grounding signals in verifiable natural language priors.
Contribution
It introduces ZARA, a novel framework that combines knowledge distillation and retrieval to enable training-free, grounded reasoning on motion sensor data with LLMs.
Findings
ZARA outperforms existing methods on eight benchmarks.
It generalizes well to unseen subjects and datasets.
It demonstrates strong transferability across sensor domains.
Abstract
Motion sensor time-series are central to Human Activity Recognition (HAR), yet conventional approaches are constrained to fixed activity sets and typically require costly parameter retraining to adapt to new behaviors. While Large Language Models (LLMs) offer promising open-set reasoning capabilities, applying them directly to numerical time-series often leads to hallucinations and weak grounding. To address this challenge, we propose ZARA (Zero-training Activity Reasoning Agents), a knowledge- and retrieval-augmented agentic framework for motion time-series reasoning in a training-free inference setting. Rather than relying on black-box projections, ZARA distills reference data into a statistically grounded textual knowledge base that transforms implicit signal patterns into verifiable natural-language priors. Guided by retrieved evidence, ZARA iteratively selects discriminative cues…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
