Mixing syntagmatic and paradigmatic information for concept detection

Louis Chartrand; Mohamed Bouguessa

arXiv:1904.04461·cs.CL·May 27, 2019·1 cites

Mixing syntagmatic and paradigmatic information for concept detection

Louis Chartrand, Mohamed Bouguessa

PDF

Open Access

TL;DR

This paper improves concept detection in textual data by combining syntagmatic and paradigmatic information through a novel topic model based on word embeddings, leading to better performance and flexibility.

Contribution

It introduces a topic model that integrates paradigmatic relations via word embeddings, enhancing concept detection accuracy over traditional methods.

Findings

01

Significant increase in concept detection performance.

02

Enhanced flexibility in expressing target concepts.

03

Effective integration of syntagmatic and paradigmatic information.

Abstract

In the last decades, philosophers have begun using empirical data for conceptual analysis, but corpus-based conceptual analysis has so far failed to develop, in part because of the absence of reliable methods to automatically detect concepts in textual data. Previous attempts have shown that topic models can constitute efficient concept detection heuristics, but while they leverage the syntagmatic relations in a corpus, they fail to exploit paradigmatic relations, and thus probably fail to model concepts accurately. In this article, we show that using a topic model that models concepts on a space of word embeddings (Hu and Tsujii, 2016) can lead to significant increases in concept detection performance, as well as enable the target concept to be expressed in more flexible ways using word vectors.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Advanced Text Analysis Techniques · Natural Language Processing Techniques