PDF articles metadata harvester
Leon Andretti Abdillah

TL;DR
This paper discusses methods for extracting and embedding metadata in PDF scientific articles using XMP to improve consistency and accessibility of journal information.
Contribution
It introduces a metadata harvester approach utilizing XMP for scientific PDFs, enhancing metadata extraction and embedding processes.
Findings
Proposes a new PDF metadata harvesting method using XMP.
Demonstrates improved metadata consistency in scientific PDFs.
Highlights the importance of embedded metadata for scientific dissemination.
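The embedding side can be illustrated with a minimal sketch that builds an XMP (Extensible Metadata Platform) packet holding Dublin Core fields for an article. This is not the paper's implementation: the function name `build_xmp_packet` and the choice of fields are assumptions made for illustration. Attaching the packet to a PDF as its metadata stream is left out, since that step needs a PDF library.

```python
import xml.etree.ElementTree as ET

# Namespace URIs used by XMP packets carrying Dublin Core fields.
X = "adobe:ns:meta/"
RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DC = "http://purl.org/dc/elements/1.1/"
XML_LANG = "{http://www.w3.org/XML/1998/namespace}lang"


def build_xmp_packet(title, creators):
    """Hypothetical helper: return an XMP packet (bytes) with dc:title and dc:creator."""
    # Register readable prefixes so the output uses x:, rdf: and dc:.
    ET.register_namespace("x", X)
    ET.register_namespace("rdf", RDF)
    ET.register_namespace("dc", DC)

    xmpmeta = ET.Element(f"{{{X}}}xmpmeta")
    rdf = ET.SubElement(xmpmeta, f"{{{RDF}}}RDF")
    desc = ET.SubElement(rdf, f"{{{RDF}}}Description", {f"{{{RDF}}}about": ""})

    # dc:title is a language-alternative array (rdf:Alt).
    title_alt = ET.SubElement(ET.SubElement(desc, f"{{{DC}}}title"), f"{{{RDF}}}Alt")
    title_li = ET.SubElement(title_alt, f"{{{RDF}}}li", {XML_LANG: "x-default"})
    title_li.text = title

    # dc:creator is an ordered array (rdf:Seq), one item per author.
    creator_seq = ET.SubElement(ET.SubElement(desc, f"{{{DC}}}creator"), f"{{{RDF}}}Seq")
    for name in creators:
        ET.SubElement(creator_seq, f"{{{RDF}}}li").text = name

    body = ET.tostring(xmpmeta, encoding="unicode")
    # Wrap the XML in the xpacket processing instructions that delimit a packet.
    packet = (
        '<?xpacket begin="\ufeff" id="W5M0MpCehiHzreSzNTczkc9d"?>\n'
        + body
        + '\n<?xpacket end="w"?>'
    )
    return packet.encode("utf-8")
```

Calling `build_xmp_packet("PDF articles metadata harvester", ["Leon Andretti Abdillah"])` yields a byte string that a PDF library could store as the document's metadata stream.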
Abstract
Scientific journals play an important role in recording the findings of researchers around the world. The current medium for disseminating scientific journals is PDF. One scheme for finding scientific journals on the internet is via metadata. Metadata stores summary information about an article. Embedding metadata into the PDF of a scientific article ensures that the metadata remains consistently readable. Harvesting metadata from scientific journals is currently an active field. This paper discusses scientific journal metadata harvesters involving XMP.
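As a rough illustration of harvesting, the sketch below locates an XMP packet in the raw bytes of a PDF and reads Dublin Core fields from it. It is not the paper's harvester: the helper names `extract_xmp_packet` and `harvest_dublin_core` are hypothetical, and the sketch assumes the XMP stream is stored uncompressed in the file, so the packet markers are visible in the raw bytes.

```python
import re
import xml.etree.ElementTree as ET

RDF = "http://www.w3.org/1999/02/22-rdf-syntax-ns#"
DC = "http://purl.org/dc/elements/1.1/"

# An XMP packet is delimited by <?xpacket begin=...?> and <?xpacket end=...?>.
XPACKET = re.compile(
    rb"<\?xpacket begin=.*?\?>(.*?)<\?xpacket end=.*?\?>", re.DOTALL
)


def extract_xmp_packet(pdf_bytes):
    """Return the XML inside the first XMP packet, or None if none is found.

    Assumption: the metadata stream is not compressed or encrypted.
    """
    match = XPACKET.search(pdf_bytes)
    return match.group(1).strip() if match else None


def harvest_dublin_core(xmp_xml, fields=("title", "creator", "description", "subject")):
    """Collect the rdf:li values of selected Dublin Core fields into a dict."""
    root = ET.fromstring(xmp_xml)
    record = {}
    for field in fields:
        values = []
        for element in root.iter(f"{{{DC}}}{field}"):
            for item in element.iter(f"{{{RDF}}}li"):
                if item.text and item.text.strip():
                    values.append(item.text.strip())
        if values:
            record[field] = values
    return record


def harvest_pdf(path):
    """Read a PDF from disk and return its Dublin Core record (empty if no XMP)."""
    with open(path, "rb") as handle:
        packet = extract_xmp_packet(handle.read())
    return harvest_dublin_core(packet) if packet else {}
```

A harvester built this way could call `harvest_pdf` on each article file and collect the resulting dictionaries into an index. PDFs that compress their metadata stream would need a PDF library to decode the stream first.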
Taxonomy
Topics: Web Data Mining and Analysis · Semantic Web and Ontologies · Data Quality and Management
