Sectioning of Biomedical Abstracts: A Sequence of Sequence Classification Task
Mehmet Efruz Karabulut, K. Vijay-Shanker

TL;DR
This paper investigates deep learning models for classifying sections of biomedical abstracts, focusing on the SSN-4 model's performance, generalization, and the impact of task-specific word embeddings.
Contribution
It evaluates the SSN-4 model's effectiveness, explores component trade-offs, and examines generalization to new datasets in biomedical abstract section classification.
Findings
SSN-4 model performs well on RCT dataset
Model does not generalize effectively beyond RCT data
Task-specific word embeddings can influence performance
Abstract
Rapid growth of the biomedical literature has led to many advances in the biomedical text mining field. Among the vast amount of information, biomedical article abstracts are the easily accessible sources. However, the number of the structured abstracts, describing the rhetorical sections with one of Background, Objective, Method, Result and Conclusion categories is still not considerable. Exploration of valuable information in the biomedical abstracts can be expedited with the improvements in the sequential sentence classification task. Deep learning based models has great performance/potential in achieving significant results in this task. However, they can often be overly complex and overfit to specific data. In this project, we study a state-of-the-art deep learning model, which we called SSN-4 model here. We investigate different components of the SSN-4 model to study the trade-off…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Advanced Text Analysis Techniques
