PDF articles metadata harvester

Leon Andretti Abdillah

arXiv:1301.6591·cs.DL·August 1, 2013

PDF articles metadata harvester

Leon Andretti Abdillah

PDF

Open Access

TL;DR

This paper discusses methods for extracting and embedding metadata in PDF scientific articles using XMP to improve consistency and accessibility of journal information.

Contribution

It introduces a metadata harvester approach utilizing XMP for scientific PDFs, enhancing metadata extraction and embedding processes.

Findings

01

Proposes a new PDF metadata harvesting method using XMP.

02

Demonstrates improved metadata consistency in scientific PDFs.

03

Highlights the importance of embedded metadata for scientific dissemination.

Abstract

Scientific journals are very important in recording the finding from researchers around the world. The recent media to disseminate scientific journals is PDF. On scheme to find the scientific journals over the internet is via metadata. Metadata stores information about article summary. Embedding metadata into PDF of scientific article will grant the consistency of metadata readness. Harvesting the metadata from scientific journal is very interesting field at the moment. This paper will discuss about scientific journal metadata harvesters involving XMP.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsWeb Data Mining and Analysis · Semantic Web and Ontologies · Data Quality and Management