Halo: Learning Semantics-Aware Representations for Cross-Lingual   Information Extraction

Hongyuan Mei; Sheng Zhang; Kevin Duh; Benjamin Van Durme

arXiv:1805.08271·cs.CL·May 23, 2018·1 cites

Halo: Learning Semantics-Aware Representations for Cross-Lingual Information Extraction

Hongyuan Mei, Sheng Zhang, Kevin Duh, Benjamin Van Durme

PDF

Open Access

TL;DR

Halo is a training method that enhances cross-lingual information extraction by making neural models learn semantics-aware representations, improving robustness and generalization without extra parameters.

Contribution

The paper introduces Halo, a novel training approach that enforces local semantic consistency in neural representations for improved cross-lingual extraction.

Findings

01

Improves robustness of models in low-resource scenarios

02

Enhances generalization in high-resource settings

03

Does not increase model complexity or parameters

Abstract

Cross-lingual information extraction (CLIE) is an important and challenging task, especially in low resource scenarios. To tackle this challenge, we propose a training method, called Halo, which enforces the local region of each hidden state of a neural model to only generate target tokens with the same semantic structure tag. This simple but powerful technique enables a neural model to learn semantics-aware representations that are robust to noise, without introducing any extra parameter, thus yielding better generalization in both high and low resource settings.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications