WR-ONE2SET: Towards Well-Calibrated Keyphrase Generation
Binbin Xie, Xiangpeng Wei, Baosong Yang, Huan Lin, Jun Xie, Xiaoli, Wang, Min Zhang, Jinsong Su

TL;DR
This paper introduces WR-ONE2SET, a novel approach to keyphrase generation that reduces calibration errors by adaptively weighting training instances and re-assigning targets, leading to more accurate and reliable keyphrase predictions.
Contribution
It proposes WR-ONE2SET, an extension of ONE2SET, with adaptive weighting and target re-assignment mechanisms to improve calibration and reduce over-estimation of no-keyphrase tokens.
Findings
Significantly reduces calibration errors in keyphrase generation.
Improves the accuracy of keyphrase predictions across datasets.
Demonstrates the effectiveness and generality of the proposed methods.
Abstract
Keyphrase generation aims to automatically generate short phrases summarizing an input document. The recently emerged ONE2SET paradigm (Ye et al., 2021) generates keyphrases as a set and has achieved competitive performance. Nevertheless, we observe serious calibration errors outputted by ONE2SET, especially in the over-estimation of token (means "no corresponding keyphrase"). In this paper, we deeply analyze this limitation and identify two main reasons behind: 1) the parallel generation has to introduce excessive as padding tokens into training instances; and 2) the training mechanism assigning target to each slot is unstable and further aggravates the token over-estimation. To make the model well-calibrated, we propose WR-ONE2SET which extends ONE2SET with an adaptive instance-level cost Weighting strategy and a target Re-assignment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Text Analysis Techniques
