Clinical Reading Comprehension with Encoder-Decoder Models Enhanced by   Direct Preference Optimization

Md Sultan Al Nahian; Ramakanth Kavuluru

arXiv:2407.14000·cs.IR·July 22, 2024

Clinical Reading Comprehension with Encoder-Decoder Models Enhanced by Direct Preference Optimization

Md Sultan Al Nahian, Ramakanth Kavuluru

PDF

Open Access

TL;DR

This paper enhances clinical reading comprehension by combining encoder-decoder models with direct preference optimization, achieving significant improvements in radiology question answering and pioneering the use of DPO without human-labeled data.

Contribution

It introduces a novel application of DPO to encoder-decoder models for clinical reading comprehension, improving state-of-the-art performance and demonstrating DPO's effectiveness without human preference data.

Findings

01

Achieved 12-15 F1 point improvements on RadQA.

02

First to apply DPO to reading comprehension tasks.

03

Demonstrated DPO's effectiveness without human-labeled preferences.

Abstract

Extractive question answering over clinical text is a crucial need to help deal with the deluge of clinical text generated in hospitals. While encoder models (e.g., BERT) have been popular for this reading comprehension task, recently encoder-decoder models (e.g., T5) are on the rise. There is also the emergence of preference optimization techniques to align decoder-only LLMs with human preferences. In this paper, we combine encoder-decoder models with the direct preference optimization (DPO) method to improve over prior state of the art for the RadQA radiology question answering task by 12-15 F1 points. To the best of our knowledge, this effort is the first to show that DPO method also works for reading comprehension via novel heuristics to generate preference data without human inputs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsIntelligent Tutoring Systems and Adaptive Learning · Text Readability and Simplification · Machine Learning in Healthcare

MethodsDirect Preference Optimization · ALIGN