TL;DR
This paper presents a methodology for monitoring Open Access trends at a national level using open data sources and machine learning, demonstrated through a case study of France's research publications.
Contribution
The paper introduces a flexible, open-source methodology for tracking Open Access trends using web data and machine learning, applicable beyond France.
Findings
French OA rate ranged from 39% to 42% between 2013-2017.
OA rate varies by publication type, publisher, and discipline.
Methodology can be adapted for other countries and contexts.
Abstract
After the launch of multiple plans for Open Science, there is now a need for an accurate method or tool to monitor the Open Science trends and in particular Open Access (OA) trends. We address this requirement with a methodology that we developed and tested for France, but that could be extended to other countries. Only open data and information available on the Web are used, leveraging as much as we can large-scale systems such as Unpaywall, HAL (the main open repository in France, part of the CNRS), ORCID and IDRef (referential for French Higher Education and Research). We used rule-based and machine learning techniques to enrich the metadata of the publications. We estimate that the overall OA rate for French affiliated publications ranges from 39% to 42% between 2013 and 2017. The trend is slightly up, except for the last year, but we gather evidence that shows this is a consequence…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
