Optimal path for Biomedical Text Summarization Using Pointer GPT
Hyunkyung Han, Jaesik Choi

TL;DR
This paper introduces Pointer-GPT, a modified transformer model with pointer networks for biomedical text summarization, improving factual accuracy and preservation of original content over traditional GPT models.
Contribution
The paper presents a novel integration of pointer networks into GPT to enhance biomedical text summarization accuracy and content preservation.
Findings
Pointer-GPT outperforms original GPT in ROUGE scores.
The model better preserves core information in summaries.
Potential to improve electronic medical record systems.
Abstract
Biomedical text summarization is a critical tool that enables clinicians to effectively ascertain patient status. Traditionally, text summarization has been accomplished with transformer models, which are capable of compressing long documents into brief summaries. However, transformer models are known to be among the most challenging natural language processing (NLP) tasks. Specifically, GPT models have a tendency to generate factual errors, lack context, and oversimplify words. To address these limitations, we replaced the attention mechanism in the GPT model with a pointer network. This modification was designed to preserve the core values of the original text during the summarization process. The effectiveness of the Pointer-GPT model was evaluated using the ROUGE score. The results demonstrated that Pointer-GPT outperformed the original GPT model. These findings suggest that pointer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Biomedical Text Mining and Ontologies · Natural Language Processing Techniques
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Linear Layer · Byte Pair Encoding · Multi-Head Attention · Cosine Annealing · Linear Warmup With Cosine Annealing · Dense Connections · Residual Connection · Attention Dropout
