# Profile-aided distillation framework for personalized sleep analysis with compact models using LLM-guided synthetic data

**Authors:** Huimin Zheng, Xingxing Ai, Xueyan Liu, Xiaofen Xing, Xiangmin Xu

PMC · DOI: 10.3389/fphys.2025.1678364 · Frontiers in Physiology · 2026-01-05

## TL;DR

This paper introduces a framework for personalized sleep analysis on edge devices using synthetic data and model distillation to overcome data and hardware limitations.

## Contribution

A novel framework combining synthetic data generation and profile-aided model distillation for efficient, personalized sleep analysis on edge devices.

## Key findings

- Synthetic data generation improves model training despite limited real-world physiological data.
- Distilled models perform comparably to large language models while meeting edge device constraints.
- The framework enables real-time, personalized sleep analysis on resource-limited hardware.

## Abstract

Enabling personalized sleep analysis and interaction directly on edge devices is crucial for providing real-time health insights and tailored guidance. However, this goal remains challenging due to the scarcity of high-quality physiological data and the computational constraints of edge hardware.

We propose a framework for personalized sleep analysis on edge devices that addresses two key obstacles: limited publicly available physiological datasets and the restricted capacity of compact models. To mitigate data scarcity, we introduce a Physiologically-Constrained Adaptive Hierarchical Copula approach, which leverages large language model–guided optimization to synthesize diverse and realistic physiological signals. To enhance personalized inference on resource-limited models, we further develop Profile-Aided Distillation of Expert Inference with MoE LoRA, which integrates user-specific profile information to improve the performance of edge-deployed models.

Extensive experiments on both public and in-house datasets show that the distilled models achieve performance comparable to state-of-the-art large language models, while operating efficiently within the computational and memory constraints of edge devices.

These results demonstrate that the proposed framework offers a practical and effective solution for enabling personalized sleep analysis and user interaction in resource-constrained environments, bridging the gap between high-performance modeling and real-time, on-device healthcare applications.

## Full-text entities

- **Diseases:** insomnia (MESH:D007319), COVID-19 (MESH:D000086382), Diabetic (MESH:D003920), sleep irregularities (MESH:D008599), XL (MESH:D000080345), PC (MESH:D015324), sleep disorders (MESH:D012893), apnea (MESH:D001049), sleep apnea (MESH:D012891), autonomic (MESH:D001342), LLMs (MESH:D007806), fatigue (MESH:D005221)
- **Chemicals:** CoT (-), PA (MESH:D011478), melatonin (MESH:D008550), alcohol (MESH:D000438), API 2 (MESH:C111224)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** RK3588 — Homo sapiens (Human), Transformed cell line (CVCL_9N36)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12812568/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12812568/full.md

## References

44 references — full list in the complete paper: https://tomesphere.com/paper/PMC12812568/full.md

---
Source: https://tomesphere.com/paper/PMC12812568