A Method for Discovering and Extracting Author Contributions Information from Scientific Biomedical Publications
Dominika Tkaczyk, Andrew Collins, Joeran Beel

TL;DR
This paper presents a method to automatically identify and extract author contribution roles from biomedical publications' 'Authors' contributions' sections using clustering, OpenIE, and supervised learning, aiding in assessment of scientific achievements.
Contribution
It introduces a novel approach combining clustering, OpenIE, and supervised learning to automatically extract author roles from natural language in biomedical papers.
Findings
Achieved 0.71 precision in role extraction
Discovered common roles like experimenting, analysis, and study design
Built a training set for supervised extraction from semi-automated clustering
Abstract
Creating scientific publications is a complex process, typically composed of a number of different activities, such as designing the experiments, data preparation, programming software and writing and editing the manuscript. The information about the contributions of individual authors of a paper is important in the context of assessing authors' scientific achievements. Some publications in biomedical disciplines contain a description of authors' roles in the form of a short section written in natural language, typically entitled "Authors' contributions". In this paper, we present an analysis of roles commonly appearing in the content of these sections, and propose an algorithm for automatic extraction of authors' roles from natural language text in scientific publications. During the first part of the study, we used clustering techniques, as well as Open Information Extraction…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Topic Modeling · Advanced Text Analysis Techniques
