The Medical Metaphors Corpus (MCC)
Anna Sofia Lippolis, Andrea Giovanni Nuzzolese, Aldo Gangemi

TL;DR
The Medical Metaphors Corpus (MCC) is a new annotated dataset of 792 scientific metaphors in medical and biological texts, designed to advance computational metaphor detection and understanding in specialized domains.
Contribution
This paper introduces MCC, the first annotated resource for scientific metaphors in medical and biological texts, with diverse sources and graded metaphoricity scores, facilitating domain-specific metaphor research.
Findings
State-of-the-art models perform modestly on scientific metaphors.
MCC enables benchmarking and development of better metaphor detection tools.
The dataset supports applications in communication and generation systems.
Abstract
Metaphor is a fundamental cognitive mechanism that shapes scientific understanding, enabling the communication of complex concepts while potentially constraining paradigmatic thinking. Despite the prevalence of figurative language in scientific discourse, existing metaphor detection resources primarily focus on general-domain text, leaving a critical gap for domain-specific applications. In this paper, we present the Medical Metaphors Corpus (MCC), a comprehensive dataset of 792 annotated scientific conceptual metaphors spanning medical and biological domains. MCC aggregates metaphorical expressions from diverse sources including peer-reviewed literature, news media, social media discourse, and crowdsourced contributions, providing both binary and graded metaphoricity judgments validated through human annotation. Each instance includes source-target conceptual mappings and perceived…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLanguage, Metaphor, and Cognition · Action Observation and Synchronization · Neurobiology of Language and Bilingualism
