Contrastive Triple Extraction with Generative Transformer

Hongbin Ye; Ningyu Zhang; Shumin Deng; Mosha Chen; Chuanqi Tan; Fei; Huang; Huajun Chen

arXiv:2009.06207·cs.CL·January 26, 2023·25 cites

Contrastive Triple Extraction with Generative Transformer

Hongbin Ye, Ningyu Zhang, Shumin Deng, Mosha Chen, Chuanqi Tan, Fei, Huang, Huajun Chen

PDF

Open Access 1 Video

TL;DR

This paper presents a novel contrastive generative transformer model for triple extraction that improves faithfulness and performance in information extraction tasks, addressing limitations of previous sequence generation methods.

Contribution

The paper introduces a contrastive training mechanism and new attention and calibration techniques to enhance the accuracy and faithfulness of triple extraction with a generative transformer.

Findings

01

Outperforms baseline models on NYT, WebNLG, and MIE datasets.

02

Achieves higher precision and recall in triple extraction.

03

Demonstrates effectiveness of contrastive training in sequence generation.

Abstract

Triple extraction is an essential task in information extraction for natural language processing and knowledge graph construction. In this paper, we revisit the end-to-end triple extraction task for sequence generation. Since generative triple extraction may struggle to capture long-term dependencies and generate unfaithful triples, we introduce a novel model, contrastive triple extraction with a generative transformer. Specifically, we introduce a single shared transformer module for encoder-decoder-based generation. To generate faithful results, we propose a novel triplet contrastive training object. Moreover, we introduce two mechanisms to further improve model performance (i.e., batch-wise dynamic attention-masking and triple-wise calibration). Experimental results on three datasets (i.e., NYT, WebNLG, and MIE) show that our approach achieves better performance than that of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Contrastive Triple Extraction with Generative Transformer· underline

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Multimodal Machine Learning Applications