From raw affiliations to organization identifiers

Myrto Kallipoliti; Serafeim Chatzopoulos; Miriam Baglioni; Eleni Adamidi; Paris Koloveas; Thanasis Vergoulis

arXiv:2505.07577·cs.DL·May 14, 2025

From raw affiliations to organization identifiers

Myrto Kallipoliti, Serafeim Chatzopoulos, Miriam Baglioni, Eleni Adamidi, Paris Koloveas, Thanasis Vergoulis

PDF

1 Repo

TL;DR

This paper introduces AffRo, a novel method for accurate affiliation matching in scholarly metadata, addressing complex affiliation strings with advanced parsing and disambiguation, supported by a new curated dataset for benchmarking.

Contribution

The paper presents AffRo, a new approach for affiliation matching that handles complex strings, and introduces AffRoDB, a curated dataset for systematic evaluation.

Findings

01

AffRo outperforms existing methods in accuracy.

02

AffRoDB enables robust benchmarking of affiliation algorithms.

03

The approach effectively disambiguates multiple organizations in affiliation strings.

Abstract

Accurate affiliation matching, which links affiliation strings to standardized organization identifiers, is critical for improving research metadata quality, facilitating comprehensive bibliometric analyses, and supporting data interoperability across scholarly knowledge bases. Existing approaches fail to handle the complexity of affiliation strings that often include mentions of multiple organizations or extraneous information. In this paper, we present AffRo, a novel approach designed to address these challenges, leveraging advanced parsing and disambiguation techniques. We also introduce AffRoDB, an expert-curated dataset to systematically evaluate affiliation matching algorithms, ensuring robust benchmarking. Results demonstrate the effectiveness of AffRp in accurately identifying organizations from complex affiliation strings.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mkallipo/affiliation-matching
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.