Digital Curation
Archived Posts from this Category
Archived Posts from this Category
The US Institute of Museum and Library Services (IMLS) has recently awarded 38 Laura Bush 21st Century Librarian Program grants totalling $22,623,984.
Amongst the awards list I was struck by the following:
University of Illinois at Urbana-Champaign – Champaign, IL : Project Title: “Data Curation Education in Research Centers (DCERC)”
Award Amount: $988,543; Matching: $179,822
The University of Illinois Urbana-Champaign Graduate School of Library and Information Science, the University of Tennessee School of Information Sciences, and the National Center for Atmospheric Research have partnered to establish Data Curation Education in Research Centers (DCERC). DCERC will develop a model, including a field experience in a data intensive scientific environment, for educating LIS master’s and doctoral students in data curation. It will implement a graduate research and education program to address the need for professionals with scientific expertise who can manage and curate large digital data collections. Six doctoral students will benefit from this project.
Purdue University – West Lafayette, IN: Project Title: “Understanding Curation through the use of Data Curation Profiles”
Award Amount: $187,242; Matching: $104,868
Purdue University will create a series of workshops to expand the expertise of academic librarians about data curation issues. The needs of researchers and data producers are changing radically because of the disruptive effects of technology on research and its dissemination. This continuing education program will teach an estimated 370 librarians to be more effective data curators.
University of North Carolina at Chapel Hill – Chapel Hill, NC: Project Title: “Workforce Issues in Library and Information Science 3 (WILIS 3): Sustaining the Career Tracking Model through Data sharing”
Award Amount: $298,385; Matching: $85,637
The School of Information and Library Science, the Institute on Aging, and the Howard Odum Institute for Research in Social Science at the University of North Carolina at Chapel Hill will collaborate to document the process of data archiving and sharing. The major aims of the WILIS 3 project are to create publicly accessible de-identified datasets; to develop an interactive program-specific data system to enable library and information science programs to explore their own data and benchmark with other programs; and to produce a data archiving toolkit for use by other researchers.
And in UK/European Library and Information schools we have…….???
0 comments Neil | Digital Curation, Libraries and Archives, Universities
Zoe Locke, Lead Technologist at the UK Technology Strategy Board has made an interesting post Impact of Data to their blog requesting any information on the economic impact of research data sharing. Extract as follows:
“I am currently in Manchester attending a JISC workshop on Managing Research Data…
Yesterday, there was an interesting keynote speech from the Director of the Digital Curation Centre (DCC). However, I noted that ‘Impact’ was the 3rd reason for why researchers should care about data curation. I asked about the meaning of impact. In the context of the talk, impact was about whether or not the research for which the data was used got published (and had an effect on the researcher’s career). The DCC focuses on transferring knowledge on curation into and around the higher education sector so this seems like an appropriate definition of impact. However, given the potential socio-economic impact of research and resultant data, not to mention the business opportunities it could create (though we don’t really know where or what these are, let alone how big they might be), I can’t help feeling that we need to widen the definition to stimulate greater sharing and exploitation of data. If businesses could generate wealth or increase the quality of life with this data then surely it would be easier for anyone to justify footing the bill for curation…
Does anyone out there have any specific case studies of money being made or saved through the exploitation of research data (specifically that data generated in a different organisation to the one exploiting it)?”
You will need to register with the Connect Network to post a reply to Zoe direct but I am happy to forward any examples readers may add as comments to this posting on the Charles Beagrie blog.
0 comments Neil | Digital Curation, Science and Industry, Universities, e-Research
I am pleased to announce that the final report for Keeping Research Data Safe 2 (KRDS2) is now available from the JISC website. This KRDS2 study report presents the results of a survey of available cost information, validation and further development of the KRDS activity cost model, and a new taxonomy to help assess benefits alongside costs.
KRDS2 has delivered the following:
• A survey of cost information for digital preservation, collating and making available 13 survey responses for different cost datasets;
• The KRDS activity model has been reviewed and its presentation and usability enhanced;
• Cost information for four organisations (the Archaeology Data Service; National Digital Archive of Datasets; UK Data Archive; and University of Oxford) has been analysed in depth and presented in case studies;
• A benefits framework has been produced and illustrated with two benefit case studies from the National Crystallography Service at Southampton University and the UK Data Archive at the University of Essex.
One of the key findings on the long-term costs of digital preservation for research data was that the cost of archiving activities (archival storage and preservation planning and actions) is consistently a very small proportion of the overall costs and significantly lower than the costs of acquisition/ingest or access activities for all the case studies in KRDS2. As an example the respective activity staff costs for the Archaeology Data Service are Access (c.31%), Outreach/Acquisition/Ingest (c.55%), Archiving (c.15%).This confirms and supports a preliminary finding in KRDS1.
A range of supplementary materials in support of this report have also been made available on the KRDS project website. This includes the ULCC Excel Cost Spreadsheet for the NDAD service together with a Guide to Interpreting and Using the NDAD Cost Spreadsheet. The NDAD Cost Spreadsheet has previously been used as an exercise in digital preservation training events and may be particularly useful in training covering digital preservation costs. The accompanying Guide provides guidance to those wishing to understand and experiment with the spreadsheet.
0 comments Neil | Charles Beagrie Ltd, Digital Curation, Digital Preservation, Libraries and Archives, Science and Industry, Universities, e-Research
During the May meeting of the National Science Board, National Science Foundation (NSF) officials announced a change in the implementation of the existing policy on sharing research data. In particular, on or around October, 2010, NSF is planning to require that all proposals include a data management plan in the form of a two-page supplementary document. The research community will be informed of the specifics of the anticipated changes and the agency’s expectations.
The changes are designed to address trends and needs in the modern era of data-driven science. Ed Seidel, acting assistant director for NSF’s Mathematical and Physical Sciences directorate acknowledged that each discipline has its own culture about data-sharing, and said that NSF wants to avoid a one-size-fits-all approach to the issue. But for all disciplines, the data management plans will be subject to peer review, and the new approach will allow flexibility at the directorate and division levels to tailor implementation as appropriate.
Full details can be found in the NSF press release.
Data Management Plans are also required by a growing number of research funders in the UK. The Digital Curation Centre provides a useful overview of current UK funder requirements for data management and sharing plans and a Data Management Plan Content Checklist.
0 comments Neil | Digital Curation, Science and Industry, Universities, e-Research
0 comments Neil | Digital Curation, Libraries and Archives, Scholarly Communication, Science and Industry, Universities, e-Research
I am pleased to announce that the findings from the Keeping Research Data Safe 2 (“KRDS2) survey of digital preservation cost information are now available on the KRDS2 project webpage.
One of the core aims of the KRDS2 project was to identify potential sources of cost information for preservation of digital research data and to conduct a survey of them. Between September and November 2009 we made an open invitation via email lists and the project blog and project webpage for others to contact us and contribute to the data survey if they had research datasets and associated cost information that they believe may be of interest to the study.
13 survey responses were received: 11 of these were from UK-based collections, and 2 were from mainland Europe. Two further potential contributions from the USA were unfortunately not available in time to be included.
The responses covered a broad area of research including the arts and humanities, social sciences, and physical and biological sciences and research data archives or cultural heritage collections. Each survey response is approximately 6-8 pages in length.
A summary analysis plus individual completed responses to the data survey that provide more detail, are available.
We have also made the revised versions of the KRDS2 activity model available to download.
We aim to release the KRDS2 report via JISC in March following peer review and final editing. Further supplementary materials from KRDS2 will also be placed on the project webpage in March.
You will also notice that we have recently undertaken a major website re-design and made additions, should you wish to browse other information on the web site.
3 comments Neil | Digital Curation, Digital Preservation, Libraries and Archives, Science and Industry, Uncategorized, Universities, e-Research
An important editorial has just appeared online in the February issue of The American Naturalist.
To promote the preservation and fuller use of data, The American Naturalist, Evolution, the Journal of Evolutionary Biology, Molecular Ecology, Heredity, and other key journals in evolution and ecology will soon introduce a new data archiving policy. The policy has been enacted by the Executive Councils of the societies owning or sponsoring the journals. For example, the policy of The American Naturalist will state:
This journal requires, as a condition for publication, that data supporting the results in the paper should be archived in an appropriate public archive, such as GenBank, TreeBASE, Dryad, or the Knowledge Network for Biocomplexity. Data are important products of the scientific enterprise, and they should be preserved and usable for decades in the future. Authors may elect to have the data publicly available at time of publication, or, if the technology of the archive allows, may opt to embargo access to the data for a period up to a year after publication. Exceptions may be granted at the discretion of the editor, especially for sensitive information such as human subject data or the location of endangered species.
This policy will be introduced approximately a year from now, after a period when authors are encouraged to voluntarily place their data in a public archive. Data that have an established standard repository, such as DNA sequences, should continue to be archived in the appropriate repository, such as GenBank. For more idiosyncratic data, the data can be placed in a more flexible digital data library such as the National Science Foundation–sponsored Dryad Archive.
Authors of the editorial, Michael C. Whitlock, Mark A. McPeek, Mark D. Rausher, Loren Rieseberg, and Allen J. Moore present the case for the importance of data archiving in science. This is the first of several coordinated editorials soon to appear in major journals.
1 comment Neil | Digital Curation, Digital Preservation, Libraries and Archives, Scholarly Communication, Universities
We are starting up and partnering in a number of new and interesting consultancy projects which run into 2010 as follows:
Dryad is an emerging digital repository for supplementary data underlying published works in ecology, evolution, and related fields being developed by a consortium of the National Evolutionary Synthesis Center (NESCent) in the US and relevant scientific societies and academic journals. Its goals are to:
The strategic priorities for Dryad emerged from a May 2007 workshop on “Data Preservation, Sharing, and Discovery: Challenges for Small Science in the Digital Era“, at which a variety of stakeholder journals and societies were represented.
I am pleased to announce that Charles Beagrie Limited will be working with the Dryad project team to develop a business plan and sustainability for the Dryad repository. Neil Beagrie and Julia Chruszcz will lead the consultancy with research support from Peter Williams. Further information on Dryad, the partners and the latest developments can be found on the Dryad website.
I2S2 - The Infrastructure for Integration in Structural Sciences (I2S2) Project is funded under the Research Data Management Infrastructure strand of the JISC’s Managing Research Data Programme, with a duration of 18 months (Oct 2009 to March 2011). It will identify requirements for a data-driven research infrastructure in “Structural Science”, focussing on the domain of Chemistry, but with a view towards inter-disciplinary application.
Two research data management pilots will examine the business processes of research, and highlight the benefits of an integrated approach. Both pilots will address traversing administrative boundaries between institutions to national facilities in addition to issues of scale (local laboratory to national facilities, DIAMOND synchrotron and ISIS respectively).
A key component of the infrastructure will be a harmonised Integrated Information Model to include all stages of the Data Life Cycle. A “before and after” cost-benefit analysis will be performed using the Keeping Research Data Safe (KRDS2) model, which will be extended to address specific requirements in I2S2. We are looking forward to working with UKOLN (University of Bath and DCC), The Universities of Southampton and Cambridge, and the Science and Technology Facilities Council (STFC) in the project.
0 comments Neil | Charles Beagrie Ltd, Digital Curation, Scholarly Communication, Science and Industry, Universities, e-Research
The completed response from the eCrystals repository at the University of Southampton to the KRDS2 Survey has been added to the project webpage.
So far around 12 organisations from the UK and internationally have responded to the Survey. The eCystals response has been added to the project webpage as an exemplar for those still considering a response and for anyone interested in the information the Survey will contain.
Further information on the KRDS2 Survey are contained in an earlier blog posting on the Survey and on the project webpage. KRDS2 invite you to contribute to the Survey if you have research datasets and associated cost information that you feel may be of interest to the study.
We anticipate that no organisation will have complete information on costs but most will have cost information in some areas. The aim of the survey is to compile an overview of what preservation cost information is collected.
The Survey proforma is available to download as an Acrobat form (requires Adobe Reader 8+ installed) or a Word form (requires Microsoft Word installed). The Survey proforma is available as a single main questionnaire or alternatively if you have multiple cost datasets you can complete a separate organisational cover sheet and multiple collection details as required. It should take less than 30 minutes to complete and KRDS2 is seeking responses (to info@beagrie.com) by the end of October 2009.
0 comments Neil | Charles Beagrie Ltd, Digital Curation, Universities
The Keeping Research Data Safe2 project (KRDS2) commenced on 31 March 2009 and will complete in December 2009. The project is identifying long-lived datasets for the purpose of cost analysis (including social sciences and humanities research) and is building on the work of the first “Keeping Research Data Safe” study completed in 2008.
We are currently undertaking detailed analysis of available cost information from 3 of our project partners and aim to develop guidance for how cost metrics can be captured and applied in future from this.
In addition we have now added a survey proforma to the project website to help us identify other research data collections with information on preservation costs and issues. We invite you to contribute to the data survey if you have research datasets and associated cost information that you feel may be of interest to the study.
We anticipate that no organisation will have complete information on costs but most will have cost information in some areas. The aim of the survey is to compile an overview of what preservation cost information is collected.
The Survey proforma is available to download as an Acrobat form (requires Adobe Reader 8+ installed) or a Word form (requires Microsoft Word installed). It should take less than 30 minutes to complete and we are seeking responses (to info@beagrie.com) by the end of October 2009. The Survey proforma is available as a single main questionnaire or alternatively if you have multiple cost datasets you can complete a separate organisational cover sheet and multiple collection details as required. Please do not hesitate to contact us at info@beagrie.com if you have any difficulty or questions.
1 comment Neil | Charles Beagrie Ltd, Digital Curation, Digital Preservation, Libraries and Archives, Universities, e-Research