Science and Industry

Digital Preservation at the Science Museum Group

Science-Museum

Between April and August 2019, staff at Charles Beagrie Ltd had the privilege of working on a digital preservation consultancy for the Science Museums Group, one of the UK’s key groups of national museums.

With five museums and a collections centre around the UK, the Science Museum Group consists of the Science Museum in London, National Railway Museum in York, National Science and Media Museum in Bradford, Locomotion in County Durham, the Science and Industry Museum in Manchester, and the National Collections Centre in Wroughton.

Over several decades, the Group’s digitization programme has built up significant digital collections and the museums are increasingly having to manage born-digital content, both internally generated and acquired.

SMG recognised that digital preservation had become business critical and was committed to implementing best practice for managing its collection and other digital assets. It commissioned a digital preservation consultancy from us to:

  • Advise on SMG’s production of a Digital Asset Register.
  • Assess levels of digital preservation for archive collection and corporate information at the National  Railway Museum (NRM) prior to applying for archive service accreditation for NRM.
  • Produce a digital preservation gap analysis across SMG as a whole to highlight key risks, opportunities, and priorities for the organisation. This would be used by SMG to develop its digital preservation strategy and the business case for resourcing it.

Our work was completed in August 2019. It has been gratifying subsequently to see the progress that has been made by SMG:

It has been great to see SMG join the DPC and increase the membership representation from the museum sector. We are sure that like other members, it will benefit from the sharing of experience, skills, and expertise that this brings.

The new digital preservation manager post is also a key opportunity to make a difference within a major national organisation. Our experience of working with SMG staff was extremely positive and the work environment is inspiring. Do consider applying if the job is of interest. The closing date for applications is 12th October.

 

 

 

 

New “What To Keep” research data report published by Jisc

What to Keep

“What To Keep?” a new Jisc research data report by Charles Beagrie Ltd has just been published by Jisc. You can access the full report directly at: https://repository.jisc.ac.uk/7262/

What to keep in terms of research data has been a recognised issue for some time but research data management and in particular appraisal and selection (i.e. “what to keep and why”) has become a more significant focus in recent years as volumes and diversity of data have grown, and as the available infrastructure for ‘keeping’ has become more diverse.

The purpose of the What to Keep report is to provide new insights that will be useful to institutions, research funders, researchers, publishers, and Jisc on what research data to keep and why, the current position, and suggestions for improvement.

The analysis of emerging themes and mappings is available as a set of tables. Seven mini case studies illustrate in more detail the approaches and rationale for what to keep for different repositories, stakeholders and disciplinary areas.

The report provides insights on how what to keep decisions can be guided and supported, and the ten study recommendations and the potential implementations for them, provide practical suggestions for future development.

What to Keep Recommendations

Research Data: What to Keep?

Charles Beagrie Ltd has started a new research data study for Jisc and UK institutions.

Jisc is working to develop shared infrastructure, influence policy and provide guidance to support institutions with the growing need for robust research data management. There is a wide-range of needs and existing provision for creation, collection, storage and preservation, and reuse of data within UK Higher Education.

What research data should be kept?

Researchers, data curators and policy makers all need to answer the question, what research data should be kept? We can’t keep it all, because that would be too expensive and time-consuming. However, we have to keep data that is irreplaceable and unique in its value for future research; to enable it to be reused and validated: to enable peer review to be informed; and to enable there to be trust in research findings. Types of data needing to be retained vary and may include related materials such as software and documentation. But how much and what is enough? Obviously, there is no single answer to that: it depends on many factors, but what are those factors, and how should we weight them? These remain difficult and open questions, but this year Jisc is working with us to take a step toward answering them.

How can we identify what to keep?

We are setting out to explore, what actually is the optimal data to keep from research projects conducted at UK institutions? Over the course of the rest of 2018, our project will work with a small number of research areas to find out. What conditions, such as openness or timescales, might be ideal? We will consult the views of researchers (as data creators and data users), research funders, ethics professionals, archivists, research data managers, peer reviewers, other research users, and others on these questions. We will dig into the reasons for their views, and into whether research data is currently kept in line with those views, or not.

Why are we carrying out this investigation now?

This work comes at a critical time in the evolution of research data management and sharing. At the policy level, the recommendations from the UK Open Research Data Taskforce are expected shortly. These may take into account both the recommendations to Government of the 2017 report by Dame Wendy Hall and Jérôme Pesenti into the future of the UK artificial intelligence industry and the recent Government announcements around this, where research data can be a key input into AI tools. The availability of research data is also a matter of concern to those interested in research integrity and reproducibility. Relevant infrastructure investments include both the Jisc research data shared service and the increasing activity around the European Open Science Cloud.

Both policy and infrastructure investments need better information about the extent and nature of the research data that needs to be kept, under what conditions, and for how long. Our 2018 project will not provide all this information, but it will explore current practices and take the next step.

Presentation on the Value and Impact of Social Science Data Archives and the CESSDA SaW Toolkit

A set of 38 slides now on slideshare used for the Focus Group Cost-Benefit Funding Advocacy Program (Task 4.6) session at the CESSDA Saw Workshop in The Hague 16/17 June 2016.

This was an interactive focus group repeated over two parallel sessions.  It was aimed at European social science data archive staff with responsibility for bidding for funding or promotion and advocacy of the archive to key stakeholders.  The presentation covers some of the key ideas on how the CESSDA Saw funding advocacy toolkit will be structured, its components, and key facts and approaches it will include.

We expect the cost-benefit funding advocacy toolkit under development to support the negotiation with ministries and funding organisations across Europe.

The results of the toolkit user requirements survey with responses from 24 European social science archives were presented and discussed, together with suggested approaches and content for the toolkit. 22 people attended the two sessions overall, representing a mix of countries at different stages on the development path for social science archives (none, new/emerging, mature). There was strong interest and support for the emerging toolkit together with open discussion of how it can be applied in the specific political and administrative context of different European countries.

The slide set presented here is an extended version including a number of hidden background/ reference slides not used in the presentation. The focus group is one of a series guiding further development of the toolkit and its adoption being given to either: (a) social science data archive staff or (b) their key stakeholders (senior management in their universities, research councils and academies, funding ministries, national statistics offices, research users and depositors).

CESSDA is the Consortium of European Social Science Data Archives. The CESSDA SaW project “Strengthening and widening the European infrastructure for social science data archives” is funded by the European Commission as part of its Horizon2020 programme.

Digital Preservation Handbook Update February 2016

Originally published in 2001 as a paper edition, ‘Preservation and Management of Digital Materials: a Handbook’ was the first attempt in the UK to synthesise the diverse and burgeoning sources of advice on digital preservation.  Demand was so great that in 2002, a free online edition of the Handbook was published by the newly established Digital Preservation Coalition.

After more than a decade, in which digital preservation has been transformed, the Handbook remains among the most heavily used area of the DPC website.

Funders and organisations are collaborating on re-designing, expanding and updating the Handbook so it can continue to grow as a major open-access resource for digital preservation. The DPC and Charles Beagrie Ltd have been engaged on a major re-working of the Digital Preservation Handbook for release as a new edition over 2015/2016. The National Archives (our Gold Sponsor) working together with other stakeholders including Jisc, the British Library, and The Archives and Records Association (our Silver Sponsors), and the National Records of Scotland (our Bronze Sponsor) is supporting the Digital Preservation Coalition in updating and revamping the Handbook. Many individuals and organisations are also contributing to this work through book sprints, peer review, project and advisory boards.

The revision, guided by the user feedback and consultation (see Report on the Preparatory User Consultation on the 2nd Edition of the Digital Preservation Handbook), is modular and being undertaken over a two year period to March 2016.

We have provided updates at regular intervals to inform the community on progress with the project and with this final February update we are delighted to announce a number of key developments.

 

Publication Schedule

The 2nd edition of the Handbook had a partial “soft launch” in October 2015 and approximately 2/3rds is online and publicity accessible at http://www.dpconline.org/advice/preservationhandbook

This partial release will be further enhanced by additional functionality when a new platform for the website focused on ‘responsive design’ is brought on stream by the DPC in 2016. This will provide an updated design and improved user experience on mobile and tablet devices, compared to the current site templates that are optimised for viewing on a desktop screen. We will also add the facility to generate PDFs. In the interim some functionality and content will remain “works in progress” but the community have gained early access to a significant new resource.

The remaining 14 sections to complete the Handbook have now been written, edited and are in peer review (see Handbook contents page for coming soon sections). We are aiming to complete this work and revise content for publication by the end of March 2016. The Handbook is now live so we will need to close and update section by section for these 14 remaining updates, hopefully in the final week of March and/or early April 2016. Watch this space for future announcements!

NRS joins funding group

The Digital Preservation Coalition was delighted to announce this month that The National Records of Scotland (NRS) had come on board as a ‘Bronze Sponsor’ for the eagerly anticipated second edition of the ‘Digital Preservation Handbook’. As of February 2016, with the addition of the NRS we have raised 93% of estimated funding required for the Handbook revision. We have prioritised content creation, scaled back some events, and adjusted budgets to ensure completion within a very tight funding profile.

Slideshare from Handbook Workshop at DCDC15

A workshop on the Digital Preservation Handbook was run at the DCDC15 conference in early October. Powerpoint slides from the Handbook presentation are now available on Slideshare. They provide a detailed overview of the new edition Handbook and work in progress. To date, there have been over 2,000 views of the slides.

European Bioinformatics Institute economic impact slideshare

A short set of 4 powerpoint slides summarising the findings on the economic impact of the European Bioinformatics Institute with extensive accompanying slides notes, all CC-BY licensed, have been placed on Slideshare.

The European Bioinformatics Institute (EMBL- EBI), located on the Wellcome Genome Campus in Hinxton, UK, manages public life-science data on a very large scale, making a rich resource of information freely available to the global life science community. EMBL-EBI is one of a handful of organisations in the world involved in global efforts to exchange information, set standards, develop new methods, and curate complex genome information.

We published a full report this week with the results of a quantitative and qualitative study of the Institute, examining the value and impact of its work. Our focus is the economic impact and can be seen as complementary to traditional academic measures, such as citation counts.

The summary slides show the quantitative economic approaches used included: estimates of access and use value, contingent valuation using stated preference techniques, an activity-costing approach to estimating the efficiency impacts of EMBL-EBI data and services, and a macro-economic approach that seeks to explore the impacts of EMBL-EBI use on returns to investment in research. These approaches allowed us to develop a picture, beginning with estimates of minimum direct values for the EMBL-EBI’s user community and moving progressively toward approaches that measure wider social and economic value.

New report: The Value and Impact of the European Bioinformatics Institute

We are pleased to announce a new report: The Value and Impact of the European Bioinformatics Institute.

In 2015, Charles Beagrie Ltd  was commissioned by the European Bioinformatics Institute (EMBL-EBI), to study and analyse its economic and social impact.

The EMBL- EBI, located on the Wellcome Genome Campus in Hinxton, near Cambridge in the UK, manages public life science data on a very large scale, making a rich resource of genome information freely available to the global life science community.

The full report published today presents the results of the quantitative and qualitative study of the Institute, examining the value and impact of its work. The report highlights key findings, including that EMBL-EBI data and services made commercial and academic R&D significantly more efficient. This benefit to users and their funders is estimated, at a minimum, to be worth £1 billion per annum worldwide – equivalent to more than 20 times the direct operational cost of EMBL-EBI.

A press release with further information is available on the EMBL-EBI website at http://www.ebi.ac.uk/about/news/press-releases/value-and-impact-of-the-european-bioinformatics-institute

The Full Report is available online in printable format at http://www.beagrie.com/EBI-impact-report.pdf

A short Executive Summary version of the report is available online in printable format at http://www.beagrie.com/EBI-impact-summary.pdf

12 slideshares for Xmas: 20 years in digital preservation

I have just posted the final instalment of a personal selection of 12 presentations drawn from events and topics over the last 20 years in digital preservation, which I hope will be of interest.

They are taken from events on four different continents including the first iPres conference and cover themes such as personal archiving, research data management, e-journals, the digital preservation lifecycle model, national and institutional strategies and collaboration, costs/benefit/economic impacts of digital preservation, the establishment of the Digital Preservation Coalition, and the development of the online Digital Preservation Handbook. I hope there will be something in there for everyone.

There are accompanying blog narratives which set the presentations into context and the powerpoint presentations themselves on Slideshare. Details and web links to them are as follows:

2014 – The Value and Impact of Research Data Infrastructure (economic impact), presentation to the Preservation and Archiving Special Interest Group (PASIG), Karlsruhe Germany    slides     narrative

2013 – Maintaining a Vision: how mandates and strategies are changing with digital content (changes and responses), keynote presentation to Screening the Future conference, London UK slides     narrative

2010 – Keeping Research Data Safe (digital preservation costs and benefits), presentation to KB Experts Workshop on Digital Preservation Costs, The Hague Netherlands          slides     narrative

2007 – Digital Preservation: Setting the Course for a Decade of Change (evolution or revolution?), keynote presentation to the Belgian Association for Documentation (ABD-BVD), Brussels Belgium              slides     narrative

2005 – Digital Preservation and Curation Summing up + Next Steps (setting curation and research agenda for2005-2015), conclusions to Warwick II Workshop, Warwick UK             slides     narrative

2005 – Plenty of Room at the Bottom? Personal Digital Libraries and Collections, keynote presentation to European Conference on Research and Advanced Technology for Digital Libraries (ECDL), Vienna Austria   slides     narrative

2004 – eScience and Digital Preservation, presentation to Association for Information Science and Technology (ASIST) conference, Rhode Island USA                  slides     narrative

2004 –  The JISC Continuing Access and Digital Preservation Strategy 2002-5(covering UK Higher Education sector and partners), presentation to the JISC-CNI conference, Brighton UK slides  narrative

2004 –Digital Preservation, e-journals and e-prints, presentation at private workshop 1st iPres conference, Beijing China                 slides     narrative

2004  –  The Digital Preservation Coalition (DPC), Its History, Programme, Rationale ,and Structure, set of 4 linked presentations to DPC Forum, London UK              slides     narrative

2001 – Preservation Management of Digital Materials (the Digital Preservation Handbook) presentation to Digital Preservation Workshop/State Library, Melbourne Australia         slides     narrative

1998 – Preserving Digital Collections: current methods and research (digital preservation lifecycle model), presentation to the Society of Archivists annual conference, Sheffield UK             slides     narrative

This is a baker’s dozen as there is a also bonus presentation from 2015 on slideshare covering the latest work on The Digital Preservation Handbook (new edition for full release in March 2016).

The background and narrative blog for this personal selection of presentations is also available.

SlideShare: The Value and Impact of Research Data Infrastructure

This slideshare, The Value and Impact of Research Data Infrastructure, was given at the Preservation and Archiving Special Interest Group (PASIG) meeting in September 2014 held at Karlsruhe, Germany. It is the final instalment of 12 presentations I have selected to mark 20 years in Digital Preservation. It demonstrates the value of preservation and re-use of research data.

Between 2011 and 2014, Charles Beagrie Ltd and John Houghton completed three major studies on the economic value and impact of the Archaeology Data Service, the British Atmospheric Data Centre, and the Economic and Social Research Data Service, and a synthesis of the three studies. In these studies, we developed and refined qualitative and quantitative methodologies to measure the value and impact of research data and associated services and tools.

This combination of methods has broken new ground in approaches to assessing the value and impact of major research data services and provided a strong evidence base and compelling outcomes.  In a recent review of the international state of the art as regards the relationships between large-scale science facilities and innovation performance, our work was one of 3 studies highlighted to UK Department of Business, Innovation and Skills as being particularly good examples of ‘good practice’ in the measurement of economic impacts.

The presentation focuses on these studies, with the study of the Archaeology Data Service given as a detailed example. It has a UK Focus but the research and lessons are international. These studies are also three of the few quantitative studies of the value and impact of digital preservation currently available.

A fourth study on the value and impact of the EMBL European Bioinformatics Institute has since been completed by Charles Beagrie Ltd and John Houghton and should be available in 2016.

SlideShare: How institutional mandates and preservation strategies are changing with digital content

This slideshare, Maintaining a Vision: how mandates and strategies are changing with digital content, is one I like and is a keynote given to the 2013 Screening the Future conference in London.

It is the penultimate of 12 presentations I have selected to mark 20 years in Digital Preservation. The final one to come will be published in December 2015.

My brief  for this conference keynote was to focus on how  institutional responses to collection and preservation mandates are realized and stretched by the digital…do existing institutions just ‘go digital’ but otherwise claim ‘business as usual’ [or not]?

The Talk had an AV focus given the nature of the conference but I think the messages will be of broad interest. It was in three parts:

The Changes: covering how digital content (including AV content) has changed the nature of typical collections across sectors; how it has shifted the scale of available content; and how content has fragmented and the number of content creators proliferated.

The Responses: covering how we have seen in response the growth of cross-sectoral preservation exchange (different sectoral membership of the DPC; Technology Watch Reports; the national coalitions worldwide such as nestor, NCDD, NDSA, etc); the development of shared services and outsourcing (e.g. digital preservation services in the cloud); and in some cases a range of cross-sector mergers (particularly of national archives and national libraries).

Conclusions:

What is changing? We are seeing multi-media permeating sectoral boundaries; greater shared interests and convergence of interests across different sectors; and a massive shift in the scale and management of digital media.

The responses?  We are seeing new alliances and partnerships; digital preservation exchange across sectors; some mergers and partnerships across established boundaries; and more shared services and outsourcing.

Finally, if you want to know the answer to the question “When was the beginning of the Digital Age” posed in previous posts, the answer is here in slide 8:

 

Next »