Digital Curation

12 slideshares for Xmas: 20 years in digital preservation

I have just posted the final instalment of a personal selection of 12 presentations drawn from events and topics over the last 20 years in digital preservation, which I hope will be of interest.

They are taken from events on four different continents including the first iPres conference and cover themes such as personal archiving, research data management, e-journals, the digital preservation lifecycle model, national and institutional strategies and collaboration, costs/benefit/economic impacts of digital preservation, the establishment of the Digital Preservation Coalition, and the development of the online Digital Preservation Handbook. I hope there will be something in there for everyone.

Accompanying blog narratives set the presentations in context, and the PowerPoint presentations themselves are on Slideshare. Details and web links to them are as follows:

2014 – The Value and Impact of Research Data Infrastructure (economic impact), presentation to the Preservation and Archiving Special Interest Group (PASIG), Karlsruhe, Germany (slides, narrative)

2013 – Maintaining a Vision: how mandates and strategies are changing with digital content (changes and responses), keynote presentation to Screening the Future conference, London, UK (slides, narrative)

2010 – Keeping Research Data Safe (digital preservation costs and benefits), presentation to KB Experts Workshop on Digital Preservation Costs, The Hague, Netherlands (slides, narrative)

2007 – Digital Preservation: Setting the Course for a Decade of Change (evolution or revolution?), keynote presentation to the Belgian Association for Documentation (ABD-BVD), Brussels, Belgium (slides, narrative)

2005 – Digital Preservation and Curation Summing up + Next Steps (setting curation and research agenda for 2005-2015), conclusions to Warwick II Workshop, Warwick, UK (slides, narrative)

2005 – Plenty of Room at the Bottom? Personal Digital Libraries and Collections, keynote presentation to European Conference on Research and Advanced Technology for Digital Libraries (ECDL), Vienna, Austria (slides, narrative)

2004 – eScience and Digital Preservation, presentation to Association for Information Science and Technology (ASIST) conference, Rhode Island, USA (slides, narrative)

2004 – The JISC Continuing Access and Digital Preservation Strategy 2002-5 (covering UK Higher Education sector and partners), presentation to the JISC-CNI conference, Brighton, UK (slides, narrative)

2004 – Digital Preservation, e-journals and e-prints, presentation at a private workshop at the 1st iPres conference, Beijing, China (slides, narrative)

2004 – The Digital Preservation Coalition (DPC), Its History, Programme, Rationale, and Structure, set of 4 linked presentations to DPC Forum, London, UK (slides, narrative)

2001 – Preservation Management of Digital Materials (the Digital Preservation Handbook), presentation to Digital Preservation Workshop/State Library, Melbourne, Australia (slides, narrative)

1998 – Preserving Digital Collections: current methods and research (digital preservation lifecycle model), presentation to the Society of Archivists annual conference, Sheffield, UK (slides, narrative)

This is a baker’s dozen, as there is also a bonus presentation from 2015 on Slideshare covering the latest work on The Digital Preservation Handbook (new edition for full release in March 2016).

The background and narrative blog for this personal selection of presentations is also available.

SlideShare: The Value and Impact of Research Data Infrastructure

This slideshare, The Value and Impact of Research Data Infrastructure, was given at the Preservation and Archiving Special Interest Group (PASIG) meeting in September 2014 held at Karlsruhe, Germany. It is the final instalment of 12 presentations I have selected to mark 20 years in Digital Preservation. It demonstrates the value of preservation and re-use of research data.

Between 2011 and 2014, Charles Beagrie Ltd and John Houghton completed three major studies on the economic value and impact of the Archaeology Data Service, the British Atmospheric Data Centre, and the Economic and Social Research Data Service, and a synthesis of the three studies. In these studies, we developed and refined qualitative and quantitative methodologies to measure the value and impact of research data and associated services and tools.

This combination of methods has broken new ground in approaches to assessing the value and impact of major research data services and provided a strong evidence base and compelling outcomes. In a recent review of the international state of the art as regards the relationships between large-scale science facilities and innovation performance, our work was one of three studies highlighted to the UK Department for Business, Innovation and Skills as particularly good examples of ‘good practice’ in the measurement of economic impacts.

The presentation focuses on these studies, with the study of the Archaeology Data Service given as a detailed example. It has a UK focus but the research and lessons are international. These studies are also three of the few quantitative studies of the value and impact of digital preservation currently available.

A fourth study on the value and impact of the EMBL European Bioinformatics Institute has since been completed by Charles Beagrie Ltd and John Houghton and should be available in 2016.

New Resources page on Charles Beagrie Website

We have produced a new resources page on our website describing all the outputs we have produced which are publicly available on open access to students and practitioners interested in our work. Areas described include Cost/Benefit, Impact, Technology Watch, Digital Preservation Policies and Strategies, Conference Presentations, and other digital preservation resources. These are linked either to outputs on our website or on the websites of clients and partners. An extract of the page is shown below.

Keeping Research Data Safe (KRDS)

Keeping Research Data Safe (KRDS), a workshop presentation from 2010 available now on Slideshare, is the tenth of 12 presentations I have selected to mark 20 years in Digital Preservation. The remaining two will be published at monthly intervals over November and December 2015.

This presentation was given as part of the KB Experts Workshop on Digital Preservation Costs, held at The Hague in the Netherlands in 2010.

Although very small in terms of budget, the KRDS projects were terrific examples of collaboration to achieve influential results, and of the pleasure and value of working with colleagues from many disparate fields and organisations. I’ve selected it as an example of doing great things on small budgets if you have the right people, and for its influence on subsequent work both by me (e.g. impact studies) and on the field generally. For me, in terms of personal follow-up and later projects, the costs element of KRDS has been less important than the benefits side, which has led to a series of projects on impact with John Houghton (more on this in the final Slideshare in December).

The KB requested a briefing document on each cost model presented at the workshop in the form of responses to their set questions. I have reproduced mine for the KRDS presentation below – it captures lots of interesting context for the slides. I have added links to the KRDS Factsheet and KRDS costs data survey to it.

THE KEEPING RESEARCH DATA SAFE MODEL

Outline:

1. General presentation of the cost model

What is the purpose of the cost model? – The KRDS model aims to support the costing of digital preservation of research datasets and assessment of the benefits of preservation. A significant proportion of its work is also focussed on identification of preservation cost data sources and methods which could support any model. It is currently primarily a set of tools and methods to construct a localised model rather than a pre-developed generic costing tool. Further information on findings from the KRDS projects is available in the KRDS Factsheet.

Who are the users? – The primary audience is research organisations in the UK but organisations in other countries and sectors can adopt parts of the model and its methodologies.

What preservation strategies does it handle? – It can accommodate any preservation strategy or service strategy (e.g. outsourcing or shared services as well as preservation in-house).

What is the target data? – Research data from the sciences, social sciences, or arts and humanities.

What time perspective does it cover? – Any time period.

2. What method is the cost model based on?

What reference is the model based on? – The model uses OAIS with extensions and adaptations by the project team.

What financial principles is it based on? – It is modelled to adopt the Transparent Approach to Costing (TRAC), a full economic costs (FEC) model approved by UK research funders and universities.

Which costing approach have you adopted? – We use an activity-based costing approach supported by a Benefits Taxonomy for assessing benefits.

What implementation have you chosen? – N/A

3. Which challenges do you currently see in relation to cost modelling?

Special issues – General cost model challenges? –

Primarily a lack of good quality preservation cost data from a range of different types of archive and data types (see our KRDS costs data survey) which can be used to underpin and develop models.

Secondly, an excessive focus on costs (rather than cost/benefits) and also sometimes too limited a focus on the costs of preservation strategies rather than preservation service costs as a whole.

Occasional over-reliance on research project or start-up cost data which will not be representative of operational preservation costs.

The degree of confidence that can be placed in results from cost models. How reliable is any cost prediction for a model and how does that change over time or other variables?

4. What are the opportunities for standardisation of cost models and collaboration between projects?

Possible standardisation and alignment of cost models? – I think cost models always need to be tailored to some degree to different audiences/sectors and prospects for standardisation and alignment may be variable. Some areas e.g. digital storage costs may be more promising than others.

Collaboration? – I can see beneficial opportunities for both formal and informal partnerships between projects and organisations. There may be opportunities for European and international collaboration.

5. What are your initial comments and feedback on the draft decision tree appended below?

A decision tree could start much earlier and involve different decisions on the cost model itself e.g. scope of activities, level of detail, and sources of data.

6. Please provide a short one paragraph biography for yourself

Neil Beagrie is director of consultancy at Charles Beagrie and principal investigator for the JISC Keeping Research Data Safe project which has investigated the costs and benefits of digital preservation for research data. He is an experienced senior consultant and an internationally recognised expert with extensive experience in information management, digital preservation, and developing access to digital collections.

Digital Curation and Preservation: Defining the Research Agenda for the Next Decade [2005-2015]. How did we do?

The Warwick 3 Workshop: Digital Preservation and Curation Summing up + Next Steps, available now on Slideshare, is the eighth of 12 presentations I have selected to mark 20 years in Digital Preservation. The remainder will be published at monthly intervals over 2015.

I have chosen it as it briefly allows us to look back at aspirations and achievements in Digital Preservation over a 20-year period, from the very first (and seminal) Warwick 1 workshop held in 1995 to today. The first Warwick workshop considered the Long Term Preservation of Electronic Materials and a UK response to the final report of the RLG/CPA Task Force on Digital Archiving. Two further Warwick workshops followed in 1999 and 2005 to review progress and set a forward agenda.

The two-day workshop that took place over 7 – 8 November 2005 at the University of Warwick aimed for the first time to address digital preservation issues for both scientific data and cultural heritage and to map out a future research agenda for them. Sponsored by JISC, the Digital Curation Centre (DCC), the British Library and the Council for the Central Laboratory of the Research Councils (CCLRC), the invitation-only event drew a wide range of national and international experts to explore the current state of play with a view to shaping future strategy. The slides are from my summing up and conclusions at the workshop close.

Part of my conclusions (slides 12-13) outlined the recommendations of the previous Warwick workshop held in 1999 and reviewed the progress that had been made in implementing them over the subsequent five years, with a very subjective level of achievement from √ (some) to √ √ √ (good) as follows:

Raise awareness

√ √ √ DPC advocacy, EU council, UNESCO, CODATA, ICSTI, NSF, RCUK

Encourage cross-sectoral communication

√ √ Established Digital Preservation Coalition 2001 – now 27 members

Develop guidelines

√ √ Preservation Management Handbook, Curation Manual, Cornell tutorial

Preservation Centre/Network of centres

√ √ Digital Curation Centre, British Library, The National Archives

Certification criteria

RLG/NARA checklist (TRAC)

Checklist to determine complexity and cost

JISC 04/04 funding programme (LIFE project, assessment tool project)

New research – emulation, dynamic data

Camileon project, JISC 04/04 programme, DCC research agenda

So how have we done 10 years further on? Overall, OK I think, with the caveat that progress in digital preservation can take a long time. Perhaps I would raise the achievement levels if doing this exercise again in 2015 for “Encourage cross-sectoral communication”, “Checklist to determine complexity and cost”, and “New research”. However, I would probably move “Raise awareness” down one level. The others would probably be about the same. How about you?

20 years in DP: eScience and Digital Preservation 2004

eScience and Digital Preservation, a presentation to the Association for Information Science and Technology (ASIST) conference, November 2004, Rhode Island USA, available now on Slideshare, is the sixth of 12 presentations I’ve selected to mark 20 years in Digital Preservation. The remainder will be published at monthly intervals over 2015.

It is closely related to the previous slideshare for May on the Jisc continuing access and digital preservation strategy but focuses just on the science component.

This is one I wasn’t able to present in person but it was kindly delivered by Gail Hodge.

My brief for the presentation was “thoughts or citations you have for the impact of e-science, particularly the GRID, on information management, particularly archiving, preservation and long-term access.”

It is a short presentation of 15 slides covering collection-based science, the Grid, data publishing, and the background and rationale for the Digital Curation Centre (just launched two weeks before in the UK).

It is a snapshot in time and of key issues in 2004 – interesting to contrast with what one would write 10 years on and ponder on progress made.

20 years in DP: The JISC Continuing Access and Digital Preservation Strategy 2002-5

The JISC Continuing Access and Digital Preservation Strategy 2002-5, a presentation to the 2004 JISC-CNI conference, Brighton UK, available now on Slideshare, is the fifth of 12 presentations I’ve selected to mark 20 years in Digital Preservation. The remainder will be published at monthly intervals over 2015 (however, due to the sheer volume of work over May this year, including the EBI Impact Survey and the 2nd Handbook sprint, two monthly selections are appearing together this time!).

For those outside the UK, an important piece of context is that Jisc’s role as a national body for digital infrastructure and content on behalf of UK universities and colleges gave the Strategy considerable influence at the time, not just within HE but in other sectors through partnership activities.

This presentation from 2004 is important largely for the legacy of the Strategy that helped establish bodies such as the Digital Preservation Coalition and the Digital Curation Centre, which still have a major influence today.

The presentation sets out the context and rationale for the Strategy including the predicted growth of electronic publications, scientific data, and data curation. The implications of that growth were seen as:

  • Core funding for institutions would not grow in line with information growth;
  • A need for more automation and tools;
  • A need for new shared services and information infrastructure;
  • A significant need for R&D and investment to prepare for this.

Therefore the objectives of the Strategy were to:

  • Serve as an advocacy document to secure additional funding of £6m over 3 years (2002-5) for new programmes in electronic records management and digital preservation;
  • Justify the accompanying implementation plan;
  • Provide a longer-term framework and rationale for activity extending beyond 2005.

Fortunately, activity in these areas did continue beyond 2005 under a series of very able Jisc programme directors and managers.

Reflections on the Digital Preservation Handbook Book Sprint 28-29 October 2014

What a terrific couple of days! We completed a two-day book sprint in London last week focussing on developing new content for the first release of the next edition of the Digital Preservation Handbook that is being funded by The National Archives, the British Library, and Jisc. Really pleased with the outputs and progress we made.

A group of 11 people met up over two days to progress sections of the content for the new “Technical Solutions and Tools” chapter of the Handbook (as identified in the Draft Outline of the 2nd Edition of the Digital Preservation Handbook). The participants were Matthew Addis (Arkivum), Neil Beagrie (Charles Beagrie Ltd), Stephanie Davidson (West Yorkshire Archive Service), Michael Day (British Library), Matt Faber (Jisc), Chris Fryer (Parliamentary Archives), Anna Henry (the Tate Gallery), William Kilbride (DPC), Ed Pinsent (ULCC), Virginia Power (Jisc), and Susan Thomas (Bodleian Library Oxford). Accommodation for the sprint was kindly provided by Jisc in their central London offices via the good offices of Neil Grindley.

We have completed draft sections for:

  • Tools (including guidance on Tool Registries)
  • Media and Storage
  • File Formats
  • Digital Forensics

In addition a content outline was agreed for the “Getting Started” sub-section of the Introduction.  Alongside this work, other sections including the Background, How to Use the Handbook, Definitions and Concepts, Acronyms and Initials, and References have been partially revised as we went.

The revision has been guided by the user feedback and consultation (see Report on the Preparatory User Consultation on the 2nd Edition of the Digital Preservation Handbook): in short, to keep the Handbook text practical, concise, and accessible, with more detail available in the case studies and further reading.

This was the first book sprint for all bar one of the participants. We learnt a lot about the strengths and weaknesses of “Booktype”, the open source software we used, which had been developed to help support this type of activity, eventually settling on using it in parallel with collaborative text tools such as Google Docs to get the best from each approach. A two-day book sprint was very intense but few could have spared more time away from the workplace, and as one participant said, a tight deadline helped everyone focus on the tasks in hand.

At the end of the sprint the challenge was set to make the new content available within 3 months. We hope that sufficient additional sections to create a critical mass, potentially the complete “Technical Solutions and Tools” chapter of the Handbook, can be readied, transferred to the DPC website, and reviewed for release in the New Year.

Survey results and the contents outline for the new edition of the Digital Preservation Handbook just published

A big thank-you from Neil Beagrie and William Kilbride to everyone who contributed to the recent audience research survey or who  commented on the potential contents outline for the new edition of the Digital Preservation Handbook.

Following that work, the DPC and Charles Beagrie Ltd are delighted to announce the release of two important documents which will form the foundations of the new edition of the DPC Digital Preservation Handbook: the results of a major survey into audience needs, and the first full outline of content.

‘We are very keen to make sure that the new edition of the handbook fits with people’s actual needs so we were very encouraged by the substantial response to the consultation document which we sent out before summer’ explained Neil Beagrie who is editor and lead author of the new edition of the handbook. ‘We estimate that the digital preservation community represented on the JiscMail list numbers around 1500 people in total: and there were 285 responses to the survey.’

‘It is a very large sample of the community but it’s also re-assuringly diverse. There’s a strong representation from higher education and public sector agencies but there’s also a sizeable group from industry, from charities as well as museums and community interest groups. When asked if they would use the handbook, not a single respondent said no.’

‘The survey has directly informed the contents of the new handbook’, explained William Kilbride, Executive Director of the DPC. ‘We started with an idea of the gaps and the many parts that had become outdated since the original handbook was published. So we invited users to tell us what they wanted and how they wanted it – both in terms of content and presentation. The project team has responded thoughtfully to these requests so I am confident that the resulting list of content is tailored to people’s needs. But we remain open to suggestions and comments.’

‘This will help ensure that the handbook remains relevant for many years to come.’

The two documents are available as follows:

Trending: The Value and Impact of Data Sharing and Curation

A colleague has pointed out that our synthesis report for Jisc on the Value and Impact of Data Sharing and Curation has had over 3,900 downloads since April 2014. You can see the stats and access the report here on the Jisc Repository.

It is great to see that there is a very high level of interest in the topic and report. I’m not sure how that figure compares with other reports, but if you have done work for Jisc you should now be able to search or browse the Jisc repository and see the download stats for your own work. Potentially, access to the Jisc repository stats is going to be very useful for those involved in REF or needing to demonstrate their impact to their institutions and other stakeholders.
