Are Lexicon-Based Tools Still the Gold Standard for Valence Analysis in Low-Resource Flemish?
Ratna Kandala, Katie Hoemann

TL;DR
This study evaluates the effectiveness of traditional lexicon-based tools versus Dutch-specific LLMs in capturing emotional valence in spontaneous Flemish language, finding current models insufficient and highlighting the need for culturally tailored solutions.
Contribution
It provides an empirical comparison of LLMs and traditional tools for valence analysis in low-resource Flemish, emphasizing the necessity for culturally adapted models.
Findings
Dutch LLMs underperform compared to traditional tools in real-world narratives.
Traditional lexicon-based tools like LIWC and Pattern remain more accurate for valence detection.
Highlighting the need for developing culturally and linguistically tailored models for emotion analysis.
Abstract
Understanding the nuances in everyday language is pivotal for advancements in computational linguistics & emotions research. Traditional lexicon-based tools such as LIWC and Pattern have long served as foundational instruments in this domain. LIWC is the most extensively validated word count based text analysis tool in the social sciences and Pattern is an open source Python library offering functionalities for NLP. However, everyday language is inherently spontaneous, richly expressive, & deeply context dependent. To explore the capabilities of LLMs in capturing the valences of daily narratives in Flemish, we first conducted a study involving approximately 25,000 textual responses from 102 Dutch-speaking participants. Each participant provided narratives prompted by the question, "What is happening right now and how do you feel about it?", accompanied by self-assessed valence ratings…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing · Sentiment Analysis and Opinion Mining · Computational and Text Analysis Methods
