Check Your Data Freedom: A Taxonomy to Assess Life Science Database Openness
Melanie Dulong De Rosnay

TL;DR
This paper introduces a taxonomy of legal and technical restrictions on life science databases, helping to assess and improve data openness for better reuse and innovation.
Contribution
It develops a comprehensive taxonomy of restrictions and a checklist to evaluate and enhance data openness in life science databases.
Findings
Most terms of use are not harmonized or clear.
A small set of restrictions can identify open databases.
A checklist aids curators in increasing data openness.
Abstract
Molecular biology data are subject to terms of use that vary widely between databases and curating institutions. This research presents a taxonomy of contractual and technical restrictions applicable to databases in life science. It builds upon research led by Science Commons demonstrating why open data and the freedom to integrate facilitate innovation and how this openness can be achieved. The taxonomy describes technical and legal restrictions applicable to life science databases, and its metadata have been used to assess terms of use of databases hosted by Life Science Resource Name (LSRN) Schema. While a few public domain policies are standardized, most terms of use are not harmonized, difficult to understand and impose controls that prevent others from effectively reusing data. Identifying a small number of restrictions allows one to quickly appreciate which databases are open. A…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
