The way we cite: common metadata used across disciplines for defining bibliographic references
Erika Alves dos Santos, Silvio Peroni, Marcos Luiz Mucheroni

TL;DR
This paper investigates citation practices across disciplines, analyzing metadata used in references to identify common patterns and issues in standardization and completeness, which impact reference clarity and extraction.
Contribution
It provides a comprehensive analysis of metadata used in bibliographic references across multiple disciplines, highlighting common practices and deficiencies.
Findings
Articles, books, and proceedings are the most cited entities.
Certain references lack essential metadata for easy identification.
Metadata usage varies significantly across disciplines.
Abstract
Current citation practices observed in articles are very noisy, confusing, and not standardised, making identifying the cited works problematic for hu-mans and any reference extraction software. In this work, we want to investigate such citation practices for referencing different types of entities and, in particular, to understand the most used metadata in bibliographic refer-ences. We identified 36 types of cited entities (the most cited ones were articles, books, and proceeding papers) within the 34,140 bibliographic references extracted from a vast set of journal articles on 27 different subject ar-eas. The analysis of such bibliographic references, grouped by the particular type of cited entities, enabled us to highlight the most used metadata for de-fining bibliographic references across the subject areas. However, we also noticed that, in some cases, bibliographic references did…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Natural Language Processing Techniques · Biomedical Text Mining and Ontologies
