DECO-MWE: building a linguistic resource of Korean multiword expressions for feature-based sentiment analysis

Jaeho Han; Changhoe Hwang; Seongyong Choi; Gwanghoon Yoo; Eric Laporte; Jeesun Nam

arXiv:2605.10295·cs.CL·May 12, 2026

DECO-MWE: building a linguistic resource of Korean multiword expressions for feature-based sentiment analysis

Jaeho Han, Changhoe Hwang, Seongyong Choi, Gwanghoon Yoo, Eric Laporte, Jeesun Nam

PDF

TL;DR

This paper presents DECO-MWE, a linguistic resource of Korean multiword expressions for sentiment analysis, built using finite-state transducers and tested on cosmetics reviews, achieving high retrieval performance.

Contribution

It introduces a finite-state methodology for constructing a Korean MWE lexicon tailored for feature-based sentiment analysis, covering various MWE categories.

Findings

01

Achieved 0.806 f-measure in MWE retrieval on test corpus.

02

Constructed a sizeable polarity MWE lexicon for FBSA.

03

Demonstrated the effectiveness of finite-state transducers in modeling MWEs.

Abstract

This paper aims to construct a linguistic resource of Korean Multiword Expressions for Feature-Based Sentiment Analysis (FBSA): DECO-MWE. Dealing with multiword expressions (MWEs) has been a critical issue in FBSA since many constructs reveal lexical idiosyncrasy. To construct linguistic resources of sentiment MWEs efficiently, we utilize the Local Grammar Graph (LGG) methodology: DECO-MWE is formalized as a Finite-State Transducer that represents lexical-syntactic restrictions on MWEs. In this study, we built a corpus of cosmetics review texts, which show particularly frequent occurrences of MWEs. Based on an empirical examination of the corpus, four types of MWEs have been distinguished. The DECO-MWE thus covers the following four categories: Standard Polarity MWEs (SMWEs), Domain-Dependent Polarity MWEs (DMWEs), Compound Named Entity MWEs (EMWEs) and Compound Feature MWEs (FMWEs).…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.