Emergent Symbolic Structure in Health Foundation Models: Extraction, Alignment, and Cross-Modal Transfer

Gajendra Katuwal; Advait Koparkar; Salar Abbaspourazad; Anshuman Mishra; Sarvesh Kirthivasan

arXiv:2605.07407·cs.LG·May 11, 2026

TL;DR

This paper introduces a post-training framework to interpret, align, and transfer health foundation model embeddings across modalities, revealing a shared symbolic structure that preserves physiological information.

Contribution

The authors propose a post-training method that decomposes frozen embeddings into interpretable directions (symbols) and uses those symbols to align embedding spaces, enabling cross-modal transfer without retraining.
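
The decompose-then-align idea can be illustrated with a toy sketch. This page does not say which decomposition or alignment procedure the authors use, so everything here is an assumption: PCA stands in for symbol extraction, a least-squares linear map stands in for alignment, the data are synthetic, and the names `extract_directions`, `symbol_scores`, and `fit_linear_map` are hypothetical. It also assumes paired samples, meaning the same rows in both embedding matrices come from the same subject.

```python
import numpy as np

def extract_directions(emb, k):
    """Top-k principal directions of the centered embeddings.
    PCA is only a stand-in for the paper's symbol extraction."""
    centered = emb - emb.mean(axis=0)
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    return vt[:k]  # shape (k, d), rows are orthonormal

def symbol_scores(emb, directions):
    """Project centered embeddings onto the extracted directions."""
    return (emb - emb.mean(axis=0)) @ directions.T  # shape (n, k)

def fit_linear_map(src, tgt):
    """Least-squares W minimizing ||src @ W - tgt||_F (assumed alignment step)."""
    w, *_ = np.linalg.lstsq(src, tgt, rcond=None)
    return w

# Synthetic data: two "modalities" driven by the same k latent factors.
rng = np.random.default_rng(0)
n, d, k = 500, 16, 4
latent = rng.normal(size=(n, k))
emb_a = latent @ rng.normal(size=(k, d)) + 0.01 * rng.normal(size=(n, d))
emb_b = latent @ rng.normal(size=(k, d)) + 0.01 * rng.normal(size=(n, d))

# Both models stay frozen: only their output embeddings are used.
dirs_a = extract_directions(emb_a, k)
dirs_b = extract_directions(emb_b, k)
scores_a = symbol_scores(emb_a, dirs_a)
scores_b = symbol_scores(emb_b, dirs_b)

# Map modality A's symbol scores into modality B's symbol space.
w = fit_linear_map(scores_a, scores_b)
aligned_a = scores_a @ w
rel_err = np.linalg.norm(aligned_a - scores_b) / np.linalg.norm(scores_b)
print(f"relative alignment error: {rel_err:.4f}")
```

Because both synthetic embedding sets are generated from the same low-dimensional factors, a small linear map between their score spaces is enough to align them here. Real embeddings need not behave this way, and the sketch says nothing about the retention figures reported for the paper's method.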

Findings

01. Symbols associate with health conditions and physiological attributes.
02. Cross-modal transfer retains over 95% of in-domain performance.
03. Alignment recovers a shared low-dimensional physiological subspace.

Abstract

Health foundation models (FMs) learn useful representations from wearable sensors, but interpreting what they encode and transferring that knowledge across modalities after training remain difficult. We present a post-training framework that decomposes frozen embeddings into interpretable directions, referred to as symbols, and uses these symbols to align the embedding spaces without retraining. We evaluate the framework on three FMs for photoplethysmography (PPG) and accelerometer data, independently pretrained on ~20M minutes of unlabeled data from ~172K participants and analyzed on a held-out cohort of 30K subjects. We find that the extracted symbols associate selectively with health conditions and physiological attributes, and that these associations are partially shared across modalities and architectures. Cross-modal transfer via symbols retains more than 95% of in-domain performance, is…
