Show me the data!

My submitted HowOpenIsIt? comments

October 8th, 2012 | Posted by rmounce in Panton Fellowship updates

I just submitted some comments to SPARC / PLOS / OASPA’s request for public comment on their new HowOpenIsIt? material here. If you haven’t done so yourself, the deadline is TODAY 5pm (EST).

Below are the comments I submitted: a mixture of praise for remembering to include machine-readability, concern over some possible interpretations, and practical points on providing hyperlinks or URLs for all the CC licenses mentioned:


* I heartily support & commend that Machine Readability takes pride of place within this guide to Open Access. This freedom was there from the start in the Budapest declaration: “…crawl them for indexing, pass them as data to software, or use them for any other lawful purpose…” but in recent years it has often been neglected by some and, worse, actively restricted by some subscription-based publishers in their contractual agreements. Yet it is one of the most important freedoms that Open Access needs to enable. It has been estimated that over 50 million academic articles have been published, and the volume of publications is increasing rapidly year on year. The only rational way we’ll be able to make full use of all this research, both NOW and in the future, is if we are allowed to use machines to help us make sense of this vast and growing literature.

* I am slightly worried that the statement on machine readability for Open Access could still give publishers a barrier with which to protect their content from mining: “…through a community standard API or protocol” perhaps leaves too much to interpretation. The API provided could be a poor one, inflexible and not sufficiently cutting-edge for the research required. I think there is no need for a clause on how machines might be allowed to access Open Access research if it is published CC-BY as mentioned under Reuse Rights. The guide need only require that the format in which the work is published (PDF, HTML, XML or other) is sufficiently machine-interpretable and not DRM-protected.

* I support that the guide itself is licensed under CC-BY-NC-ND to prevent derivative or modified works, and so avoid interoperability problems. This is in line with both W3C and IETF practices.

* May I suggest the paper version of this guide (if there is to be one) be printed with full URLs to the CC-BY-NC-ND, CC-BY, & CC-BY-NC licenses mentioned in the guide. Likewise, the electronic/digital version should have clickable hyperlinks to further explain these abbreviations.

* I think the guide should make it clearer that the label ‘Open Access’ should only be applied to content that has the full top-line suite of rights. Anything less than this in any of the categories is nearly, but not quite, Open Access. There are other terms available for such less Open content, like ‘free access’, ‘public access’ and ‘less-restricted access’, that can be applied, in some form or combination, to the set of rights in between ‘Open Access’ and ‘Closed Access’. This guide should reaffirm that only the full suite of Open rights makes a work Open Access.

* However, I do wonder if the question of who holds copyright (author or publisher) is somewhat irrelevant to Open Access? I certainly support that authors retain copyright to their own content, but in instances where the publisher has taken the copyright and the work is in all other respects fulfilling the other qualities of Open Access – is this not Open Access? Surely then the Copyright column is just a special case subset of the Reuse Rights column? The issue of who holds copyright is something important but separate to Open Access in my opinion.

* Ditto for ‘Author Posting’: this duplicates what is given in the Reuse Rights column, just as a special case for the author. This section is usefully distinct in grey not-quite-Open Access cases, but for Open Access it merely restates that *anyone* has the right to reuse/repost.

At some point I also intend to comment on BMC’s Open Data & Open Bibliography RFP, but the deadline for that is much later and I have LOTS of work to do in the meantime, so that’ll have to wait for a bit…