DECO-MWE: building a linguistic resource of Korean multiword expressions for feature-based sentiment analysis
Jaeho Han, Changhoe Hwang, Seongyong Choi, Gwanghoon Yoo, Eric Laporte, Jeesun Nam

TL;DR
This paper presents DECO-MWE, a linguistic resource of Korean multiword expressions for sentiment analysis, built using finite-state transducers and tested on cosmetics reviews, achieving high retrieval performance.
Contribution
It introduces a finite-state methodology for constructing a Korean MWE lexicon tailored for feature-based sentiment analysis, covering various MWE categories.
Findings
Achieved 0.806 f-measure in MWE retrieval on test corpus.
Constructed a sizeable polarity MWE lexicon for FBSA.
Demonstrated the effectiveness of finite-state transducers in modeling MWEs.
Abstract
This paper aims to construct a linguistic resource of Korean Multiword Expressions for Feature-Based Sentiment Analysis (FBSA): DECO-MWE. Dealing with multiword expressions (MWEs) has been a critical issue in FBSA since many constructs reveal lexical idiosyncrasy. To construct linguistic resources of sentiment MWEs efficiently, we utilize the Local Grammar Graph (LGG) methodology: DECO-MWE is formalized as a Finite-State Transducer that represents lexical-syntactic restrictions on MWEs. In this study, we built a corpus of cosmetics review texts, which show particularly frequent occurrences of MWEs. Based on an empirical examination of the corpus, four types of MWEs have been distinguished. The DECO-MWE thus covers the following four categories: Standard Polarity MWEs (SMWEs), Domain-Dependent Polarity MWEs (DMWEs), Compound Named Entity MWEs (EMWEs) and Compound Feature MWEs (FMWEs).…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
