Enhancing Keyphrase Extraction from Microblogs using Human Reading Time
Yingyi Zhang, Chengzhi Zhang

TL;DR
This paper introduces novel neural network models that incorporate human reading time, measured via eye fixation durations, to improve keyphrase extraction from microblogs, demonstrating superior performance over baseline models.
Contribution
It is the first to leverage human reading time as a feature in neural keyphrase extraction models for microblogs, using eye-tracking data to enhance model effectiveness.
Findings
Proposed models outperform baseline models in keyphrase extraction accuracy.
Using eye fixation durations improves the identification of salient words.
Human reading time as an external feature enhances model performance.
Abstract
The premise of manual keyphrase annotation is to read the corresponding content of an annotated object. Intuitively, when we read, more important words will occupy a longer reading time. Hence, by leveraging human reading time, we can find the salient words in the corresponding content. However, previous studies on keyphrase extraction ignore human reading features. In this article, we aim to leverage human reading time to extract keyphrases from microblog posts. There are two main tasks in this study. One is to determine how to measure the time spent by a human on reading a word. We use eye fixation durations extracted from an open source eye-tracking corpus (OSEC). Moreover, we propose strategies to make eye fixation duration more effective on keyphrase extraction. The other task is to determine how to integrate human reading time into keyphrase extraction models. We propose two novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
