TL;DR
This study systematically evaluates how context, explicit moral knowledge, and model size affect Schwartz value detection in political texts, revealing nuanced impacts on different model types and value categories.
Contribution
It provides a comprehensive comparison of context and knowledge integration methods across various models, highlighting that larger models and more context do not always lead to better performance.
Findings
Full-document context improves supervised models by 3.8-4.8 macro-F1 points.
Retrieved moral knowledge consistently benefits models, especially with early fusion.
Scaling models does not guarantee performance gains, and simple early fusion often outperforms complex methods.
Abstract
Detecting Schwartz values in political text is difficult because implicit cues often depend on surrounding arguments and fine-grained distinctions between neighboring values. We study when context and explicit moral knowledge help sentence-level value detection. Using the ValuesML/Touch{\'e} ValueEval format, we compare sentence, window, and full-document inputs; no-RAG and retrieval-augmented settings with a curated moral knowledge base; supervised DeBERTa-v3-base/large encoders; and zero-shot LLMs from 12B to 123B parameters. The results show that more context is not uniformly better: full-document context improves supervised DeBERTa encoders by 3.8--4.8 macro-F1 points over sentence-only input, but does not consistently help zero-shot LLMs. Retrieved moral knowledge is more consistently useful in matched comparisons, improving each tested model family and context condition under…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
