Mined Semantic Analysis: A New Concept Space Model for Semantic Representation of Textual Data
Walid Shalaby, Wlodek Zadrozny

TL;DR
Mined Semantic Analysis (MSA) introduces an unsupervised concept space model that leverages implicit concept relations from encyclopedic corpora to improve semantic representation and relatedness measurement of textual data.
Contribution
MSA is the first to mine implicit concept relations from encyclopedic graphs, enhancing semantic representations and establishing statistical significance in relatedness evaluations.
Findings
MSA achieves competitive performance on semantic relatedness benchmarks.
The study demonstrates the statistical insignificance of performance differences among top methods.
MSA offers interpretable and simple semantic representations.
Abstract
Mined Semantic Analysis (MSA) is a novel concept space model which employs unsupervised learning to generate semantic representations of text. MSA represents textual structures (terms, phrases, documents) as a Bag of Concepts (BoC) where concepts are derived from concept rich encyclopedic corpora. Traditional concept space models exploit only target corpus content to construct the concept space. MSA, alternatively, uncovers implicit relations between concepts by mining for their associations (e.g., mining Wikipedia's "See also" link graph). We evaluate MSA's performance on benchmark datasets for measuring semantic relatedness of words and sentences. Empirical results show competitive performance of MSA compared to prior state-of-the-art methods. Additionally, we introduce the first analytical study to examine statistical significance of results reported by different semantic relatedness…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
