Scholarly Communication

Just published: A Comparative Study of e-Journal Archiving Solutions

I am pleased to announce that the JISC-funded report A Comparative Study of e-Journal Archiving Solutions has just been published and is now available to download as a pdf from the JISC Collections website. It has been a great pleasure to work with Julia Chruszcz, Maggie Jones and Terry Morrow on this study over the last few months.

The report is the result of a call by the JISC, issued in January 2008, for a Comparative Study of e-Journal Archiving Solutions. The Invitation to Tender asked for a report that “will be published for wide use by institutions to inform policies and investment in e-journal archiving solutions.” The ITT also stated that the report should “also inform negotiations undertaken by JISC Collections and NESLi2 when seeking publishers’ compliance to deposit content with at least one e-journal archiving solution.”

The report contains chapters covering: Approaches to e-journal preservation, Publisher licensing and legal deposit, Comparisons of Six Current e-Journal Archiving Programmes (LOCKSS, CLOCKSS, Portico, the KB e-depot, OCLC’s Electronic Collections Online, and the British Library’s e-journal Digital Archive), Practical experience of e-journal archiving solutions, Evaluation of four common scenarios/trigger events, and Criteria for judging relevance and value of new archiving initiatives. There are two appendices on Publisher Participation in different programmes.

The report has the following recommendations:

  1. When negotiating NESLi2 agreements, JISC’s negotiators should take the initiative by specifying archiving requirements, including a short-list of approved archiving solutions.
  2. To help quantify the insurance risk and the necessary appropriate investment, bodies representing publishers and other trade organisations should gather and share statistical information on the likelihood of the trigger events outlined in this report.
  3. Post cancellation access conditions should be defined in the licensing agreement between libraries and publishers. Publishers should be strongly encouraged to cooperate with one or more external e-journal archiving solutions as well as provide their own post-cancellation service (at minimal cost).
  4. The publisher (or subscription agent) should state their policy on perpetual access under the four scenarios described in section 9.
  5. When titles are sold on to other publishers, the Transfer Code of Practice (see section 9.3.) should be followed.
  6. Archiving service providers and publishers should work together to develop standard cross-industry definitions of trigger events and protocols on the conditions for release of archived content. Project Transfer is a potential exemplar. The ground rules for any post-trigger event negotiation should be clear and transparent and established  in advance.
  7. Archive service providers must provide greater clarity on coverage details, including not only publishers and titles, but also the years and issues included in the archive.
  8. Using the scenarios outlined in this report, libraries should carry out a risk assessment on the impact of loss of access to e-journals by their institution, and a cost/benefit analysis, in order to judge the value and relevance of the archiving solutions on offer.
  9. Relevant UK bodies and institutions should use whatever influence they can bring to bear to ensure that archiving solutions cover publishers and titles of particular value to UK libraries.
  10. The findings of this study should be reviewed and updated at regular intervals to reflect continuing developments in the field of e-journal archiving and preservation.

Its publication comes hot on the heels of two related studies  the Portico/Ithaka e-journal archiving survey of US Library Directors  and the JISC-funded UK LOCKSS Pilot Programme Evaluation Report. A further blog entry will follow!

just published: Research Data Preservation Costs Report

I have posted two previous entries to the blog in March and January detailing progress with the JISC-funded research data preservation costs study. I am pleased to report that the online executive summary and full report (pdf file) titled “Keeping Research Data Safe: a cost model and guidance for UK Universities” is now published and can be downloaded from the JISC website.

It has been an very intensive piece of work over four months and I am extremely grateful to the many colleagues who contributed and made this possible. We have uncovered a lot of valuable data and approaches and hope this can be built on by future studies and implementation and testing. We have attempted to “show our workings” as far as possible to facilitate this so  the text of the report is accompanied by extensive appendices.

We have made 10 recommendations on future work and implementation. For further information see the Executive Summary online.

The report iteself has chapters covering the Introduction, Methodology, Benefits of Research Data Preservation, Describing the Cost Framework and its Use, Key Cost Variables and Units,the Activity Model and Resources Template, Overviews of the Case Studies, Issues Universities Need to Consider, Different Service Models and Structures, Conclusions and Recommendations. There are also four detailed case studies covering the Universities of Cambridge, King’s College London, Southampton, and the Archaeology Data Service (University of York).

Although focused on the UK and UK universities in particular, it should be of interest to anyone involved with research data or interested generally in the costs of digital preservation.

 

Comments and Feedback welcome!

OR2008 - Presentations available

 

The Open Repositories conference (OR2008) repository is available at http://pubs.or08.ecs.soton.ac.uk/ as a permanent record of the conference activities.

The repository contains papers, presentations and poster artwork for 144 different conference contributions from the main conference sessions (Interoperability, Legal, Models, Architectures & Frameworks, National Perspectives, Scientific Repositories, Social Networking, Sustainability, Usage, Web 2.0), the Poster session, User Group sessions (DSpace, EPrints, Fedora), Birds of a Feather sessions, the Repository Managers session and the ORE Information day.

My powerpoint presentation from the Plenary keynote for the Fedora International Users’ Meeting is also available there. Titled “Keeping alert: issues to know today for long-term digital preservation with repositories” it focussed on research data and sustainability. It drew heavily from the forthcoming JISC Research Data Preservation Costs study and the draft final report titled “Keeping Research Data Safe: A Cost Model and Guidance for UK Universities”. It concludes by outlining tentative findings and implications for repositories from that report.

Academic Libraries Unleashed

There is an excellent supplement on academic libraries today in the Guardian produced jointly with JISC. I would highly recommend it to international and UK colleagues who want a quick overview of latest developments in UK academic libraries.

You can also read the supplement online.

The supplement includes articles and overviews under the headings:

  • Colleges, universities and the digital challenge
  • Learning spaces
  • Library 2.0
  • New business models
  • Digitisation
  • The New User
  • and last but by no means least [new] Librarians.

UK Budget: Pricing Public Sector Information report

Buried deep in the small print of the UK Government budget statement today was the following interesting item:

“The Office of Fair Trading’s (OFT) market study into the commercial use of public information highlighted important issues around access to public sector information for commercial or other re-use. The Government commissioned Cambridge University to analyse the pricing of this information. This analysis is published alongside Budget 2008. The Government will look closely at public sector information held by trading funds to distinguish more clearly what is required by Government for public tasks and ensure that this information next Spending Review the Government will ensure that information collected for public
purposes is priced so that the need for access is balanced with ensuring that customers pay a fair contribution to the cost of collecting this information in the long term. These issues will be considered in conjunction with the assessment of trading funds.”

This report with the rather catchy title “Models of Public Sector Information Provision via Trading Funds” by  Prof David Newbery, Prof Lionel Bently, and Rufus Pollock from Cambridge University was published today and can be downloaded at http://www.berr.gov.uk/files/file45136.pdf.

For those interested in the context there has been a long-running debate over pricing of data from some government agencies. The Guardian hosts a “Free Our Data” campaign blog which has a commentary on the Cambridge report and associated issues.

Portico agreement with the National Library of the Netherlands

A recent announcement by Portico and the Koninklijke Bibliotheek, the National Library of the Netherlands (the KB), notes that they have reached an agreement for an off-line copy of the Portico e-journal archive,  to be held for safekeeping by the KB.

Placing a Portico-owned copy of the archive, in a secure access- and climate-controlled facility operated by the KB is one component of the replication strategy Portico is implementing to ensure the safety and security of its e-journal archive.

The announcement is of interest on several levels. It demonstrates in a very practical way the commitment by Portico and the KB to the concept of a “Safe Places Network”; and also implementation by Portico of the concept of replication of core electronic materials for international scholarship needing to have an international element - something one can also see in the archiving policies of Elsevier and the Internet Archive.

Google to host research datasets

The Wired Blog gives advanced notice that the domain, http://research.google.com, will soon provide a home for terabytes of open-source scientific datasets. The storage will be free to scientists and access to the data will be free for all. The project, known as Palimpsest, missed its original launch date this week, but will debut soon. It is suggested that Palimsest will fill a major need for scientists who want to openly share their data, and will allow public access to an unprecedented amount of data. For example, two planned datasets are all 120 terabytes of Hubble Space Telescope data and the images from the 10th century manuscript the Archimedes Palimpsest.

Those with long memories (hopefully prevalent amongst digital preservationists!) will also remember the Google/ Nasa memorandum of understanding signed in September 2005 that “outlines plans for cooperation on a variety of areas, including large-scale data management, massively distributed computing, bio-info-nano convergence, and encouragement of the entrepreneurial space industry” so perhaps we should expect more major announcements along these lines from Google and NASA in months to come.

NSF DataNet call

A major new development in the USA. The US National Science Foundation’s Office of Cyberinfrastructure (OCI) has released a new call for proposals for “Sustainable Digital Data Preservation and Access Network Partners (DataNet).” The DataNet seeks to foster the development of new types of organizations that “integrate library and archival sciences, cyberinfrastructure, computer and information sciences, and domain science expertise .” Up to $100,000,000 plus indirect costs is available in this program over a five year period, with the possibility of a five year renewal; it is anticipated that there will be around five grantees, with no single award exceeding $20,000,000. Funding is expected to ramp down for each project in successive years, encouraging the development of sustaining strategies. Although U.S. academic and not-for-profit organizations must be the lead submitters, commercial partners are encouraged. Preliminary proposals must be submitted by 7th January 2008 and full proposals by 21st March 2008. Full details of the call are available from here.