Discourse-Aware Scientific Paper Recommendation via QA-Style Summarization and Multi-Level Contrastive Learning
Shenghua Wang, Zhen Yin

TL;DR
This paper introduces OMRC-MR, a hierarchical, discourse-aware framework for scientific paper recommendation that leverages QA-style summarization and multi-level contrastive learning to improve relevance and interpretability.
Contribution
It proposes a novel structured summarization and contrastive learning approach that enhances content-based paper recommendation with better semantic alignment and interpretability.
Findings
Outperforms state-of-the-art baselines in Precision@10 and Recall@10.
QA-style summarization yields more coherent and factually complete representations.
Demonstrates effectiveness on multiple datasets including a new Sci-OMRC dataset.
Abstract
The rapid growth of open-access (OA) publications has intensified the challenge of identifying relevant scientific papers. Due to privacy constraints and limited access to user interaction data, recent efforts have shifted toward content-based recommendation, which relies solely on textual information. However, existing models typically treat papers as unstructured text, neglecting their discourse organization and thereby limiting semantic completeness and interpretability. To address these limitations, we propose OMRC-MR, a hierarchical framework that integrates QA-style OMRC (Objective, Method, Result, Conclusion) summarization, multi-level contrastive learning, and structure-aware re-ranking for scholarly recommendation. The QA-style summarization module converts raw papers into structured and discourse-consistent representations, while multi-level contrastive objectives align…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Information Retrieval and Search Behavior
