A Self-supervised Approach for Semantic Indexing in the Context of COVID-19 Pandemic
Nima Ebadi, Peyman Najafirad

TL;DR
This paper introduces a self-supervised transformer-based model for automatic semantic indexing of COVID-19 literature, outperforming existing models and effectively handling rapidly evolving pandemic-related concepts.
Contribution
The study presents a novel self-supervised semantic indexing approach tailored for pandemic crises, demonstrating superior performance on COVID-19 literature compared to prior models.
Findings
Outperforms BioASQ Task 8a models by micro-F1 of 0.1
Achieves higher LCA-F score by 0.08 on average
Effective in detecting pandemic-specific supplementary concepts
Abstract
The pandemic has accelerated the pace at which COVID-19 scientific papers are published. In addition, the process of manually assigning semantic indexes to these papers by experts is even more time-consuming and overwhelming in the current health crisis. Therefore, there is an urgent need for automatic semantic indexing models which can effectively scale-up to newly introduced concepts and rapidly evolving distributions of the hyperfocused related literature. In this research, we present a novel semantic indexing approach based on the state-of-the-art self-supervised representation learning and transformer encoding exclusively suitable for pandemic crises. We present a case study on a novel dataset that is based on COVID-19 papers published and manually indexed in PubMed. Our study shows that our self-supervised model outperforms the best performing models of BioASQ Task 8a by micro-F1…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Sentiment Analysis and Opinion Mining · Advanced Text Analysis Techniques
