EmbNum: Semantic labeling for numerical values with deep metric learning
Phuc Nguyen, Khai Nguyen, Ryutaro Ichise, Hideaki Takeda

TL;DR
EmbNum is a neural embedding model that effectively assigns semantic labels to numerical attributes by learning distribution-agnostic representations, outperforming existing methods in accuracy and efficiency.
Contribution
The paper introduces EmbNum, a novel deep metric learning approach for semantic labeling of numerical data without assuming specific data distributions.
Findings
EmbNum significantly outperforms state-of-the-art methods in accuracy.
EmbNum is more efficient in semantic labeling tasks.
The model works well on diverse datasets like City Data and Open Data.
Abstract
Semantic labeling for numerical values is a task of assigning semantic labels to unknown numerical attributes. The semantic labels could be numerical properties in ontologies, instances in knowledge bases, or labeled data that are manually annotated by domain experts. In this paper, we refer to semantic labeling as a retrieval setting where the label of an unknown attribute is assigned by the label of the most relevant attribute in labeled data. One of the greatest challenges is that an unknown attribute rarely has the same set of values with the similar one in the labeled data. To overcome the issue, statistical interpretation of value distribution is taken into account. However, the existing studies assume a specific form of distribution. It is not appropriate in particular to apply open data where there is no knowledge of data in advance. To address these problems, we propose a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Advanced Graph Neural Networks · Machine Learning in Healthcare
