LaScA: Language-Conditioned Scalable Modelling of Affective Dynamics
Kosmas Pinitas, Ilias Maglogiannis

TL;DR
This paper introduces LaScA, a framework that uses language models to semantically condition affective dynamics prediction, combining interpretability with improved accuracy in human-centered AI.
Contribution
It presents a novel, interpretable affect modelling approach that leverages language models as semantic context conditioners over handcrafted features.
Findings
Consistent accuracy improvements over baselines on Aff-Wild2 and SEWA datasets.
Semantic conditioning maintains interpretability while enhancing predictive performance.
Framework offers a transparent alternative to end-to-end black-box affect models.
Abstract
Predicting affect in unconstrained environments remains a fundamental challenge in human-centered AI. While deep neural embeddings dominate contemporary approaches, they often lack interpretability and limit expert-driven refinement. We propose a novel framework that uses Language Models (LMs) as semantic context conditioners over handcrafted affect descriptors to model changes in Valence and Arousal. Our approach begins with interpretable facial geometry and acoustic features derived from structured domain knowledge. These features are transformed into symbolic natural-language descriptions encoding their affective implications. A pretrained LM processes these descriptions to generate semantic context embeddings that act as high-level priors over affective dynamics. Unlike end-to-end black-box pipelines, our framework preserves feature transparency while leveraging the contextual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
