Focus-Driven Contrastive Learning for Medical Question Summarization

Ming Zhang; Shuai Dou; Ziyang Wang; Yunfang Wu

arXiv:2209.00484 · cs.CL · February 15, 2023 · 1 citation

TL;DR

This paper introduces a question focus-driven contrastive learning framework for medical question summarization, improving sentence-level representations and summary quality over MLE-trained Seq2Seq baselines such as BART.

Contribution

The paper proposes a novel contrastive learning approach that leverages question focus to generate hard negatives, enhancing sentence-level understanding in medical question summarization.

Findings

01. Achieves state-of-the-art results on three medical benchmark datasets.

02. Gains of 5.33, 12.85, and 3.81 points over the BART baseline.

03. Produces better sentence representations and captures the question focus more reliably.

Abstract

Automatic medical question summarization can significantly help the system to understand consumer health questions and retrieve correct answers. The Seq2Seq model based on maximum likelihood estimation (MLE) has been applied to this task, but it faces two general problems: the model cannot capture the question focus well, and the traditional MLE strategy lacks the ability to understand sentence-level semantics. To alleviate these problems, we propose a novel question focus-driven contrastive learning framework (QFCL). Specifically, we propose an easy and effective approach to generate hard negative samples based on the question focus, and exploit contrastive learning at both the encoder and the decoder to obtain better sentence-level representations. On three medical benchmark datasets, our proposed model achieves new state-of-the-art results, and obtains a performance gain of 5.33, 12.85 and 3.81…
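
To make the role of hard negatives concrete, here is a minimal sketch of a generic InfoNCE-style contrastive loss over sentence vectors. It is not taken from the paper or its repository: the abstract does not give the exact QFCL objective, so the loss form, the cosine similarity, the `temperature` value, the function names, and the toy vectors are all assumptions chosen for illustration. The sketch only shows why a negative that lies close to the anchor (for example, a question whose focus has been altered) yields a larger loss, and therefore a stronger training signal, than an unrelated one.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def info_nce_loss(anchor, positive, negatives, temperature=0.1):
    """Generic InfoNCE loss: -log softmax of the positive among all candidates.

    Assumption: this is a textbook contrastive objective, not necessarily
    the exact loss used by QFCL.
    """
    pos_logit = cosine(anchor, positive) / temperature
    logits = [pos_logit] + [cosine(anchor, n) / temperature for n in negatives]
    # Log-sum-exp with max subtraction for numerical stability.
    max_logit = max(logits)
    log_denominator = max_logit + math.log(
        sum(math.exp(x - max_logit) for x in logits)
    )
    return log_denominator - pos_logit


# Toy sentence vectors (hypothetical values, for illustration only).
anchor = [1.0, 0.0, 0.2]          # representation of the question
positive = [0.9, 0.1, 0.3]        # representation of its reference summary
easy_negative = [-1.0, 0.2, 0.0]  # unrelated sentence, far from the anchor
hard_negative = [0.8, 0.5, 0.1]   # near-duplicate with a different focus

loss_easy = info_nce_loss(anchor, positive, [easy_negative])
loss_hard = info_nce_loss(anchor, positive, [hard_negative])
print(f"easy negative loss: {loss_easy:.6f}")
print(f"hard negative loss: {loss_hard:.6f}")
```

With an easy negative the loss is already near zero, so the model learns little from it; the hard negative leaves a noticeably larger loss. In a framework like the one described, a loss of this kind could be applied to pooled encoder states and to pooled decoder states separately, alongside the usual MLE objective.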

Code & Models

Repositories

zhangming880102/qfcl (PyTorch, official)

Taxonomy

Topics: Topic Modeling · Text and Document Classification Technologies

Methods: Attention Is All You Need · Linear Layer · Tanh Activation · Sigmoid Activation · Long Short-Term Memory · Layer Normalization · Softmax · Adam · Multi-Head Attention · Dense Connections