Optimizing Automatic Summarization of Long Clinical Records Using   Dynamic Context Extension:Testing and Evaluation of the NBCE Method

Guoqing Zhang; Keita Fukuyama; Kazumasa Kishimoto; and Tomohiro Kuroda

arXiv:2411.08586·cs.AI·November 15, 2024

Optimizing Automatic Summarization of Long Clinical Records Using Dynamic Context Extension:Testing and Evaluation of the NBCE Method

Guoqing Zhang, Keita Fukuyama, Kazumasa Kishimoto, and Tomohiro Kuroda

PDF

Open Access

TL;DR

This paper introduces a novel method for summarizing long clinical records using a 7B language model enhanced with a Native Bayes Context Extend and a new decoding mechanism, achieving high-quality summaries within limited context windows.

Contribution

The paper presents a new approach combining Native Bayes Context Extend and a redesigned decoding process to improve long input summarization with smaller models.

Findings

01

Achieved near-parity with Google's 175B Gemini on ROUGE-L metrics.

02

Enhanced summarization quality with a 7B model using the proposed methods.

03

Demonstrated resource-efficient performance in clinical record summarization.

Abstract

Summarizing patient clinical notes is vital for reducing documentation burdens. Current manual summarization makes medical staff struggle. We propose an automatic method using LLMs, but long inputs cause LLMs to lose context, reducing output quality especially in small size model. We used a 7B model, open-calm-7b, enhanced with Native Bayes Context Extend and a redesigned decoding mechanism to reference one sentence at a time, keeping inputs within context windows, 2048 tokens. Our improved model achieved near parity with Google's over 175B Gemini on ROUGE-L metrics with 200 samples, indicating strong performance using less resources, enhancing automated EMR summarization feasibility.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare · Biomedical Text Mining and Ontologies · Topic Modeling