Focus-Driven Contrastive Learniang for Medical Question Summarization
Ming Zhang, Shuai Dou, Ziyang Wang, Yunfang Wu

TL;DR
This paper introduces a focus-driven contrastive learning framework for medical question summarization, significantly improving sentence representations and summary quality over traditional models.
Contribution
The paper proposes a novel contrastive learning approach that leverages question focus to generate hard negatives, enhancing sentence-level understanding in medical question summarization.
Findings
Achieves state-of-the-art results on three medical datasets.
Gains of 5.33, 12.85, and 3.81 points over BART baseline.
Better sentence representations and question focus capture.
Abstract
Automatic medical question summarization can significantly help the system to understand consumer health questions and retrieve correct answers. The Seq2Seq model based on maximum likelihood estimation (MLE) has been applied in this task, which faces two general problems: the model can not capture well question focus and and the traditional MLE strategy lacks the ability to understand sentence-level semantics. To alleviate these problems, we propose a novel question focus-driven contrastive learning framework (QFCL). Specially, we propose an easy and effective approach to generate hard negative samples based on the question focus, and exploit contrastive learning at both encoder and decoder to obtain better sentence level representations. On three medical benchmark datasets, our proposed model achieves new state-of-the-art results, and obtains a performance gain of 5.33, 12.85 and 3.81…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Text and Document Classification Technologies
MethodsAttention Is All You Need · Linear Layer · Tanh Activation · Sigmoid Activation · Long Short-Term Memory · Layer Normalization · Softmax · Adam · Multi-Head Attention · Dense Connections
