WR-ONE2SET: Towards Well-Calibrated Keyphrase Generation

Binbin Xie; Xiangpeng Wei; Baosong Yang; Huan Lin; Jun Xie; Xiaoli; Wang; Min Zhang; Jinsong Su

arXiv:2211.06862·cs.CL·February 17, 2023·1 cites

WR-ONE2SET: Towards Well-Calibrated Keyphrase Generation

Binbin Xie, Xiangpeng Wei, Baosong Yang, Huan Lin, Jun Xie, Xiaoli, Wang, Min Zhang, Jinsong Su

PDF

Open Access 1 Repo

TL;DR

This paper introduces WR-ONE2SET, a novel approach to keyphrase generation that reduces calibration errors by adaptively weighting training instances and re-assigning targets, leading to more accurate and reliable keyphrase predictions.

Contribution

It proposes WR-ONE2SET, an extension of ONE2SET, with adaptive weighting and target re-assignment mechanisms to improve calibration and reduce over-estimation of no-keyphrase tokens.

Findings

01

Significantly reduces calibration errors in keyphrase generation.

02

Improves the accuracy of keyphrase predictions across datasets.

03

Demonstrates the effectiveness and generality of the proposed methods.

Abstract

Keyphrase generation aims to automatically generate short phrases summarizing an input document. The recently emerged ONE2SET paradigm (Ye et al., 2021) generates keyphrases as a set and has achieved competitive performance. Nevertheless, we observe serious calibration errors outputted by ONE2SET, especially in the over-estimation of $\emptyset$ token (means "no corresponding keyphrase"). In this paper, we deeply analyze this limitation and identify two main reasons behind: 1) the parallel generation has to introduce excessive $\emptyset$ as padding tokens into training instances; and 2) the training mechanism assigning target to each slot is unstable and further aggravates the $\emptyset$ token over-estimation. To make the model well-calibrated, we propose WR-ONE2SET which extends ONE2SET with an adaptive instance-level cost Weighting strategy and a target Re-assignment…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

deeplearnxmu/wr-one2set
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Text Analysis Techniques