[open-science] Conference on Preservation of Digital Objects + Data Preservation Tutorial
Townsend J.H.
J.Townsend at soton.ac.uk
Wed Jun 12 18:38:33 UTC 2013
Forwarded below 1) Data Preservation Tutorial 2) Conference on Preservation of Digital Objects
Jack
Web & Sustainability University of Southampton
@JackTownsend_ <https://twitter.com/JackTownsend_> jack at jacktownsend.net<mailto:jack at jacktownsend.net>
+44 7934 508 661
Begin forwarded message:
From: Angela Dappert <angela at dpconline.org<mailto:angela at dpconline.org>>
Subject: [codata_international] From Preserving Data to Preserving Research. Registration open for tutorial at TPDL 2013 22 Sep 2013
Date: 12 June 2013 15:24:40 BST
To: CODATA_International <codata_international at kbx7.de<mailto:codata_international at kbx7.de>>
Reply-To: CODATA_International <codata_international at kbx7.de<mailto:codata_international at kbx7.de>>
======== Apologies for cross-posting =======
Tutorial to be held in connection with TPDL 2013, 22 September 2013, Valletta, Malta http://www.tpdl2013.info/
From Preserving Data to Preserving Research:
Curation of Process and Context
ABSTRACT
In the domain of eScience, investigations are increasingly collaborative. Most scientific and engineering domains benefit from building on the outputs of other research: by sharing information to reason over and data to incorporate in the modeling task at hand. This raises the need for preserving and sharing entire eScience workflows and processes for later reuse. We need to define which information is to be collected, create means to preserve it and approaches to enable and validate the re-execution of a preserved process. This includes and goes beyond preserving the data used in the experiments, as the process underlying its creation and use is essential.
The TIMBUS project and Wf4Ever project team up for this half-day tutorial to provide an introduction to the problem domain and discuss solutions for the curation of eScience processes.
1. TUTORIAL LEVEL
Introductory level
2. DURATION
Half-day
3. OUTLINE OF THE CONTENT
The tutorial will cover the following topics:
Introduction to Process and Context Preservation: The introduction will motivate the need for process and context preservation, illustrate how this task is difficult in an evolving domain, and introduce a use case for the rest of the tutorial to illustrate approaches and tools.
Data Citation: Data forms the basis of the results of many research publications, and thus needs to be referenced with the same accuracy as bibliographic data. Only if data can be identified with high precision can it be reused, validated, verified and reproduced. Citing a specific data set is however not trivial - it exists in a vast plurality of specifications and instances, can potentially be huge in size, and its location might change. We will provide an overview over existing approaches to overcoming these challenges. Further, we will present the issue of creating data citations of data held in databases, especially of dynamic data sets where data is added or updated on a regular basis.
Re-usability and traceability of workflows and processes: The processes creating and interpreting data are complex objects. Curating and preserving them requires special effort, as they are dynamic, and highly dependent on software, configuration, hardware, and other aspects. We will discuss these issues in detail, and provide an introduction to two complementary approaches.
The first approach is based on the concept of Research Objects, which adopts a workflow-centric approach and thereby aims at facilitating the reuse and reproducibility. It allows packaging the data and the methods as one Research Object to share and cite it, and thus enable publishers to grant access to the actual data and methods that contribute to the findings reported in scholarly articles.
A second approach focuses on describing and preserving a process and the context it is embedded in. The artifacts that may need to be captured range from data, software and accompanying documentation, to legal and human resource aspects. Some of this information can be automatically extracted from an existing process, and tools for this will be presented. Ways to archive the process and to perform preservation actions on the process environment, such as recreating a controlled execution environment or migration of software components, are presented. Finally, the challenge of evaluating the re-execution of a preserved process is discussed, addressing means of establishing its authenticity.
4. INTENDED AUDIENCE
The tutorial is targeted at researchers, publishers and curators in eScience disciplines who want to learn about methods of ensuring the long-term availability of experiments forming the basis of scientific research.
5. EXPECTED LEARNING OUTCOMES
The tutorial participants will become understand
· Motivations and challenges of process preservation
· Motivations, stakeholders and challenges of making data citable
· How Data is Cited Today: OECD [1] report on data citability, Google search of data sets, requirements, guidelines, metadata, locators and identifiers, approaches to naming schemes and properties.
· Available technologies for identifiers: Archival Resource Key (ARK), Digital Object Identifiers (DOI), Extensible Resource Identifier (XRI), HANDLE, Life Science ID (LSID), Object Identifiers (OID), Persistent Uniform Resource Locators (PURL), URI/URN/URL, Universally Unique Identifier (UUID)
· Approaches and Initiatives for citing data: CODATA, Data Cite, OpenAire, challenges and opportunities: granularity, scalability, complexity and evolving data sets current research questions
· Ontologies needed to capture research objects: Core Ontology of the RO family of vocabularies, workflow centric ROs, provenance traces, life cycle of research objects.
· Wf4Ever Toolkit / technological infrastructure for the preservation and efficient retrieval and reuse of scientific workflows: software architecture, functionalities, software interfaces to functionalities, reference implementation as services and clients:
~ Collect, manage and preserve aggregations of scientific workflows and related objects and annotations
~ Workflow sharing through a social website
~ Execution of workflows
~ Testing completeness, execution, repeatability and other desired quality features
~ Testing the ability of a Research Object to achieve its original purpose after changes to its resources.
~ Recommendations of relevant users, Research Objects and their aggregated resources
~ Converting workflows into Research Objects
~ Search for workflows by input parameters or frequency of use
~ Collaborative environment
~ Access and use of research objects and aggregated resources.
~ Synchronization with remote repositories
~ Visualization of correlation between similar objects
· TIMBUS context model and tools to semi-automatically capture the relevant context of a business process for preservation
~ The scope of context regarding business process preservation - technology, application and business context, aligned with enterprise architecture
~ The context meta-model, with domain independent and domain specific aspects
~ Demonstration of a context model instance of example processes (in the eScience domain)
~ Tools to automatically capture some parts of the context (software dependencies, data formats, licenses, ...)
~ Outlook on reasoning and preservation planning, based on the context model
________________________________
Dr Angela Dappert
Senior Project Officer
DPC c/o The British Library,
Floor 5, Room 14, 96 Euston Road,
London NW1 2DB
Digital Preservation Coalition<http://www.dpconline.org/>,
Innovation Centre,
York University Science Park,
Heslington, YORK YO10 5DG
Phone: +44 (0) 20 7412 7028
angela at dpconline.org<mailto:angela at dpconline.org>
[Description: dpc_250logo]
________________________________
The information contained in this e-mail is confidential and may be privileged. If you have received this message in error, please notify us and remove it from your system. The contents of this e-mail must not be disclosed or copied without the sender's consent and does not constitute legal advice. We cannot accept any responsibility for viruses, so please scan all attachments. The statements and opinions expressed in this message are those of the author and do not necessarily reflect those of the DPC. Registered Office, Innovation Centre, University Way, York Science Park, Heslington, YORK YO10 5DG Registered in England No: 4492292
Begin forwarded message:
From: Angela Dappert <angela at dpconline.org<mailto:angela at dpconline.org>>
Subject: [codata_international] Registration open for iPRES 2013 / DC-2013
Date: 12 June 2013 14:34:03 BST
To: CODATA_International <codata_international at kbx7.de<mailto:codata_international at kbx7.de>>
Reply-To: CODATA_International <codata_international at kbx7.de<mailto:codata_international at kbx7.de>>
*Apologies for cross-posting*
=== iPRES 2013 / DC-2013 INVITATION TO REGISTER ===
Registration for iPRES-2013 is now open.
The conference will take place 2-6 September 2013 in Lisbon, Portugal at the Instituto Superior Técnico.
Conference website: http://ipres2013.ist.utl.pt/index.html
Draft program: http://ipres2013.ist.utl.pt/prg_overview.html
The scientific program comprises 37 full and short papers, as well as 25 posters and demonstrations. There will be 8 tutorials, 12 workshops and a doctoral symposium.
Topics of interest include:
• Innovation in Digital Preservation: Novel Challenges and Scenarios; Innovative Approaches; Preservation at Scale; Domain-specific Challenges (Cultural Heritage, Technical and Scientific Processes and Data, Engineering Models and Simulation, Medical Records, Corporate Processes and Recordkeeping, Web Archiving, Personal Archiving, e-Procurement, etc.)
• Systems Life-cycle: Specific Digital Preservation Requirements and Implications in Modeling, Design, Development, Deployment and Maintenance
• Governance: Risk Analysis; Audit, Trust and Certification, Trusted Repositories; Information/Data Quality
• Business Models and Added-value of Digital Preservation: Benefits Analysis, Emerging Exploitation Scenarios, Long-Tail of Digital Preservation
• Theory of Digital Preservation: Interdisciplinary Modeling, Representation Concepts, Incentive Structures
• Case Studies and Best Practices: Processes, Metadata, Systems, Services, Infrastructures
• Training and Education
iPRES-2013 will be collocated with DC-2013. Both conferences will take place in the same venue and run in parallel. During the collocated events, delegates are welcome to choose sessions that best fit their interests from either conference. Keynotes are held in common plenaries; and, social events are shared, providing an excellent opportunity for iPRES and DCMI delegates to socialize, share common interests and network. Delegates of the two conferences may separately register for a mix of pre- and post-conference events organized by the conference committees of both iPRES and DCMI.
Important Dates:
• 08 July 2013: Deadline for early registration
• 02 September 2013: Tutorials sessions and Doctoral Symposium
• 03 September 2013: Conference starting…
• 05 September 2013: Conference closing (noon time)
• 05 September 2013: Workshops starting (afternoon)
€350 early regular, €250 early student (to 8 July)
€375 regular & student (after 8 July)
--Separate rates apply for pre-/post-conference sessions on Monday and Friday
--Day rates are available
Registration questions? Contact for DC/iPRES 2013: ipres2013 at ist.utl.pt<mailto:ipres2013 at ist.utl.pt>
We look forward to seeing you in Lisbon in September.
Tutorials:
T1.1 - Introduction to Linked Open Data (LOD)
T1.2 - Metadata Provenance
T1.3 - IGM: Maturidade da Governação da Informação (Tutorial in Portuguese)
T1.4 - Build it, Share it, Keep it safe
T1.5 - Personal Digital Archiving
T1.6 - Islandora Institutional Repository Tutorial
T2.1 - Introduction to Ontology Concepts and Terminology
T2.2 - PROV - the W3C Provenance Ontology
T2.3 - IGM: Information Governance Maturity (Tutorial in English)
T2.4 - Legal challenges in the preservation lifecycle – How to address and how to solve them!
T2.5 - From Preserving Data to Preserving Research – Curation of Process and Context
T2.6 - Tools for uncovering preservation risks in your large repositories
T3.1 - Datasets, Open Data and Digital Preservation
T3.2 - Getting Started in Web Archiving and Web Archives Preservation
Workshops
W1 - Digital Preservation Capabilities - How to assess and improve capabilities in digital preservation?
W2 - PREMIS Implementation Fair Workshop
W3 - Archiving Community Memories
W4 - Cost of Curation
W5 - Preservation at Scale
W6 - Interoperability of Persistent Identifiers Systems – services across PI domains
W7 – Open Research Challenges in Digital Preservation
W8 - CAMP-4-DATA
W9 -Vocabulary Day
iPREShack: SPRUCE, CURATEcamp and OPF Hackathon (from September 2nd to 5th)
Please forward this email to anybody who might be interested!
________________________________
Dr Angela Dappert
Senior Project Officer
DPC c/o The British Library,
Floor 5, Room 14, 96 Euston Road,
London NW1 2DB
Digital Preservation Coalition<http://www.dpconline.org/>,
Innovation Centre,
York University Science Park,
Heslington, YORK YO10 5DG
Phone: +44 (0) 20 7412 7028
angela at dpconline.org<mailto:angela at dpconline.org>
[Description: dpc_250logo]
________________________________
The information contained in this e-mail is confidential and may be privileged. If you have received this message in error, please notify us and remove it from your system. The contents of this e-mail must not be disclosed or copied without the sender's consent and does not constitute legal advice. We cannot accept any responsibility for viruses, so please scan all attachments. The statements and opinions expressed in this message are those of the author and do not necessarily reflect those of the DPC. Registered Office, Innovation Centre, University Way, York Science Park, Heslington, YORK YO10 5DG Registered in England No: 4492292
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image001.png
Type: image/png
Size: 5191 bytes
Desc: image001.png
URL: <http://lists.okfn.org/pipermail/open-science/attachments/20130612/f92b82a1/attachment-0004.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image001.png
Type: image/png
Size: 5191 bytes
Desc: image001.png
URL: <http://lists.okfn.org/pipermail/open-science/attachments/20130612/f92b82a1/attachment-0005.png>
More information about the open-science
mailing list