# Improving Distantly-supervised Entity Typing with Compact Latent Space   Clustering

**Authors:** Bo Chen, Xiaotao Gu, Yufeng Hu, Siliang Tang, Guoping Hu, Yueting, Zhuang, Xiang Ren

arXiv: 1904.06475 · 2019-04-16

## TL;DR

This paper introduces a novel regularization method called Compact Latent Space Clustering (CLSC) for distantly supervised entity typing, which improves label inference and embedding quality by clustering similar mentions, leading to better classification accuracy.

## Contribution

The paper proposes CLSC, a new approach that dynamically constructs mention similarity graphs and uses label propagation to enhance distantly supervised entity typing.

## Key findings

- CLSC outperforms existing methods on standard benchmarks.
- It effectively utilizes noisy data through label propagation.
- The method improves mention embedding clustering and classification accuracy.

## Abstract

Recently, distant supervision has gained great success on Fine-grained Entity Typing (FET). Despite its efficiency in reducing manual labeling efforts, it also brings the challenge of dealing with false entity type labels, as distant supervision assigns labels in a context agnostic manner. Existing works alleviated this issue with partial-label loss, but usually suffer from confirmation bias, which means the classifier fit a pseudo data distribution given by itself. In this work, we propose to regularize distantly supervised models with Compact Latent Space Clustering (CLSC) to bypass this problem and effectively utilize noisy data yet. Our proposed method first dynamically constructs a similarity graph of different entity mentions; infer the labels of noisy instances via label propagation. Based on the inferred labels, mention embeddings are updated accordingly to encourage entity mentions with close semantics to form a compact cluster in the embedding space,thus leading to better classification performance. Extensive experiments on standard benchmarks show that our CLSC model consistently outperforms state-of-the-art distantly supervised entity typing systems by a significant margin.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.06475/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/1904.06475/full.md

## References

36 references — full list in the complete paper: https://tomesphere.com/paper/1904.06475/full.md

---
Source: https://tomesphere.com/paper/1904.06475