Multilingual Entity and Relation Extraction from Unified to Language-specific Training
Zixiang Wang, Jian Yang, Tongliang Li, Jiaheng Liu, Ying Mo, Jiaqi, Bai, Longtao He, Zhoujun Li

TL;DR
This paper introduces a two-stage multilingual training framework, mERE, that reduces language interference in entity and relation extraction, improving performance across multiple languages by combining universal and language-specific features.
Contribution
The paper proposes a novel two-stage training method with a joint model, mERE, that effectively mitigates language interference and enhances multilingual entity and relation extraction performance.
Findings
Outperforms monolingual and multilingual baselines
Lightweight and effective for relational triple extraction
Easily transferable to other backbone models
Abstract
Entity and relation extraction is a key task in information extraction, where the output can be used for downstream NLP tasks. Existing approaches for entity and relation extraction tasks mainly focus on the English corpora and ignore other languages. Thus, it is critical to improving performance in a multilingual setting. Meanwhile, multilingual training is usually used to boost cross-lingual performance by transferring knowledge from languages (e.g., high-resource) to other (e.g., low-resource) languages. However, language interference usually exists in multilingual tasks as the model parameters are shared among all languages. In this paper, we propose a two-stage multilingual training method and a joint model called Multilingual Entity and Relation Extraction framework (mERE) to mitigate language interference across languages. Specifically, we randomly concatenate sentences in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Data Quality and Management
