Automatic Metadata Generation using Associative Networks
Marko A. Rodriguez, Johan Bollen, Herbert Van de Sompel

TL;DR
This paper presents a novel, content-analysis-independent method for automatic metadata generation that propagates metadata through associative networks built from existing metadata, improving efficiency and applicability across media types.
Contribution
It introduces an associative network-based approach for metadata propagation, avoiding costly content analysis and enabling broad application across resource media.
Findings
Effective metadata propagation demonstrated on bibliographic datasets
Method is computationally inexpensive and media-agnostic
Outperforms traditional content-based metadata generation techniques
Abstract
In spite of its tremendous value, metadata is generally sparse and incomplete, thereby hampering the effectiveness of digital information services. Many of the existing mechanisms for the automated creation of metadata rely primarily on content analysis which can be costly and inefficient. The automatic metadata generation system proposed in this article leverages resource relationships generated from existing metadata as a medium for propagation from metadata-rich to metadata-poor resources. Because of its independence from content analysis, it can be applied to a wide variety of resource media types and is shown to be computationally inexpensive. The proposed method operates through two distinct phases. Occurrence and co-occurrence algorithms first generate an associative network of repository resources leveraging existing repository metadata. Second, using the associative network as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
