Automating Categorization of Scientific Texts with In-Context Learning and Prompt-Chaining in Large Language Models
Gautam Kishore Shahi, Oliver Hummel

TL;DR
This paper evaluates large language models' ability to classify scientific texts using prompt engineering techniques, showing prompt chaining improves accuracy especially for hierarchical categories, but struggles with fine-grained classification.
Contribution
It systematically assesses the effectiveness of in-context learning and prompt chaining for scientific text classification using LLMs and hierarchical taxonomies.
Findings
Prompt chaining outperforms pure in-context learning in classification accuracy.
LLMs with prompt chaining surpass state-of-the-art models for domain and subject prediction.
Classifying research topics at the third hierarchical level remains challenging, with about 50% accuracy.
Abstract
The relentless expansion of scientific literature presents significant challenges for navigation and knowledge discovery. Within Research Information Retrieval, established tasks such as text summarization and classification remain crucial for enabling researchers and practitioners to effectively navigate this vast landscape, so that efforts have increasingly been focused on developing advanced research information systems. These systems aim not only to provide standard keyword-based search functionalities but also to incorporate capabilities for automatic content categorization within knowledge-intensive organizations across academia and industry. This study systematically evaluates the performance of off-the-shelf Large Language Models (LLMs) in analyzing scientific texts according to a given classification scheme. We utilized the hierarchical ORKG taxonomy as a classification…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
