Pre-trained Language Model for Biomedical Question Answering

Wonjin Yoon; Jinhyuk Lee; Donghyeon Kim; Minbyul Jeong; Jaewoo Kang

arXiv:1909.08229 · cs.CL · September 19, 2019 · 5 cites

TL;DR

This paper evaluates BioBERT, a pre-trained biomedical language model, on factoid, list, and yes/no biomedical questions. It reports the best performance in the 7th BioASQ Challenge (Task 7b, Phase B) and shows that results depend on appropriate pre-/post-processing of questions, passages, and answers.

Contribution

It applies BioBERT to biomedical QA, using almost the same model structure for every question type, and shows that it is effective when paired with appropriate pre-/post-processing strategies.

Findings

01. BioBERT outperforms previous state-of-the-art models in biomedical question answering.

02. Pre-training on SQuAD or SQuAD 2.0 enhances BioBERT's performance.

03. Appropriate pre-/post-processing of questions, passages, and answers improves answer accuracy.

Abstract

The recent success of question answering systems is largely attributed to pre-trained language models. However, as language models are mostly pre-trained on general domain corpora such as Wikipedia, they often have difficulty in understanding biomedical questions. In this paper, we investigate the performance of BioBERT, a pre-trained biomedical language model, in answering biomedical questions including factoid, list, and yes/no type questions. BioBERT uses almost the same structure across various question types and achieved the best performance in the 7th BioASQ Challenge (Task 7b, Phase B). BioBERT pre-trained on SQuAD or SQuAD 2.0 easily outperformed previous state-of-the-art models. BioBERT obtains the best performance when it uses the appropriate pre-/post-processing strategies for questions, passages, and answers.
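
As an illustration of what answer post-processing can look like for an extractive model, the following is a minimal sketch and not the authors' implementation. It assumes the model emits one start score and one end score per passage token, scores each candidate span by the product of its start and end probabilities, returns the top-ranked spans for factoid questions, and keeps every span above a probability threshold for list questions. The function names, the maximum span length, the cutoff of five answers, and the threshold value are all hypothetical choices made for this example.

```python
import math
from typing import List, Tuple


def softmax(scores: List[float]) -> List[float]:
    """Convert raw scores to probabilities (shifted by the max for stability)."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def candidate_spans(tokens: List[str], start_logits: List[float],
                    end_logits: List[float],
                    max_len: int = 5) -> List[Tuple[str, float]]:
    """Score every span of at most max_len tokens by P(start) * P(end),
    sorted from most to least probable."""
    p_start = softmax(start_logits)
    p_end = softmax(end_logits)
    spans = []
    for i in range(len(tokens)):
        for j in range(i, min(i + max_len, len(tokens))):
            spans.append((" ".join(tokens[i:j + 1]), p_start[i] * p_end[j]))
    spans.sort(key=lambda s: s[1], reverse=True)
    return spans


def factoid_answers(spans: List[Tuple[str, float]], k: int = 5) -> List[str]:
    """Factoid (assumed rule): return the k best distinct spans in ranked order."""
    seen, out = set(), []
    for text, _ in spans:
        if text.lower() not in seen:
            seen.add(text.lower())
            out.append(text)
        if len(out) == k:
            break
    return out


def list_answers(spans: List[Tuple[str, float]],
                 threshold: float = 0.1) -> List[str]:
    """List (assumed rule): return every distinct span scoring at least threshold."""
    seen, out = set(), []
    for text, score in spans:
        if score >= threshold and text.lower() not in seen:
            seen.add(text.lower())
            out.append(text)
    return out


# Toy passage with made-up scores that peak on the token "BRCA1".
tokens = ["mutations", "in", "BRCA1", "cause", "cancer"]
start_logits = [0.0, 0.0, 5.0, 0.0, 0.0]
end_logits = [0.0, 0.0, 5.0, 0.0, 0.0]

spans = candidate_spans(tokens, start_logits, end_logits)
print(factoid_answers(spans))
print(list_answers(spans))
```

In this sketch the same span scores serve both question types and only the final selection rule differs, which is one way a single model structure could be shared across question types. Yes/no questions would need a classification output rather than span selection and are not covered here.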

Taxonomy

Topics: Topic Modeling · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies