Mixed-Precision Information Bottlenecks for On-Device Trait-State Disentanglement in Bipolar Agitation Detection
Joydeep Chandra

TL;DR
This paper introduces MP-IB, a mixed-precision framework for disentangling trait and state voice biomarkers in bipolar agitation detection, enabling efficient, real-time on-device monitoring with high accuracy and privacy.
Contribution
The novel MP-IB approach uses mixed-precision quantization as an information bottleneck to separate stable traits from volatile states in voice data.
Findings
Achieves rho=0.117 in agitation detection, outperforming baselines.
Zero-shot transfer to CREMA-D yields AUC=0.817.
End-to-end latency of 23.4 ms on small devices.
Abstract
Continuous monitoring of bipolar disorder agitation via voice biomarkers requires disentangling stable speaker traits from volatile affective states on resource-constrained edge devices. We introduce MP-IB, the first framework to treat mixed-precision quantization as an information bottleneck for clinical trait-state separation. The core insight is that numerical precision itself controls capacity: an FP16 trait head (1,024 bits) encodes speaker identity, while an INT4 state head (128 bits) captures agitation, yielding 8x information asymmetry without adversarial training. We augment this with Dynamic Precision Scheduling and Multi-Scale Temporal Fusion. On Bridge2AI-Voice (N=833, 4 sessions/participant, strict speaker-independent CV), MP-IB achieves rho = 0.117 (95\% CI: [0.089, 0.145], p=0.003 vs. chance), outperforming 94M-parameter WavLM-Adapter with in-domain SSL continuation (rho…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
