Utilizing Out-Domain Datasets to Enhance Multi-Task Citation Analysis
Dominique Mercier, Syed Tahseen Raza Rizvi, Vikas Rajashekar, Sheraz, Ahmed, Andreas Dengel

TL;DR
This paper proposes an end-to-end multi-task model that leverages out-domain datasets to improve citation sentiment and intent analysis, addressing data scarcity issues and enhancing performance across domain-specific and cross-domain tasks.
Contribution
It introduces a novel multi-task model utilizing out-domain data and explores different data scheduling methods for improved citation analysis.
Findings
Sequential data scheduling benefits domain-specific tasks.
Shuffled data feeding enhances cross-domain performance.
Out-domain datasets effectively mitigate data scarcity.
Abstract
Citations are generally analyzed using only quantitative measures while excluding qualitative aspects such as sentiment and intent. However, qualitative aspects provide deeper insights into the impact of a scientific research artifact and make it possible to focus on relevant literature free from bias associated with quantitative aspects. Therefore, it is possible to rank and categorize papers based on their sentiment and intent. For this purpose, larger citation sentiment datasets are required. However, from a time and cost perspective, curating a large citation sentiment dataset is a challenging task. Particularly, citation sentiment analysis suffers from both data scarcity and tremendous costs for dataset annotation. To overcome the bottleneck of data scarcity in the citation analysis domain we explore the impact of out-domain data during training to enhance the model performance.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Sentiment Analysis and Opinion Mining · Advanced Graph Neural Networks
