Utilizing Provenance in Reusable Research Objects

Journal Title: Informatics - Year 2018, Vol 5, Issue 1

Abstract

Science is conducted collaboratively, often requiring the sharing of knowledge about computational experiments. When experiments include only datasets, they can be shared using Uniform Resource Identifiers (URIs) or Digital Object Identifiers (DOIs). An experiment, however, seldom includes only datasets, but more often includes software, its past execution, provenance, and associated documentation. The Research Object has recently emerged as a comprehensive and systematic method for aggregation and identification of diverse elements of computational experiments. While a necessary method, mere aggregation is not sufficient for the sharing of computational experiments. Other users must be able to easily recompute on these shared research objects. Computational provenance is often the key to enable such reuse. In this paper, we show how reusable research objects can utilize provenance to correctly repeat a previous reference execution, to construct a subset of a research object for partial reuse, and to reuse existing contents of a research object for modified reuse. We describe two methods to summarize provenance that aid in understanding the contents and past executions of a research object. The first method obtains a process-view by collapsing low-level system information, and the second method obtains a summary graph by grouping related nodes and edges with the goal to obtain a graph view similar to application workflow. Through detailed experiments, we show the efficacy and efficiency of our algorithms.

Authors and Affiliations

Zhihao Yuan, Dai Hai Ton That, Siddhant Kothari, Gabriel Fils and Tanu Malik

Keywords

Related Articles

How Thumbelina Knows

In this paper, I take the book by Michel Serres, “Thumbelina”, as an occasion for reflection on the conceptual basis of knowledge management, as was built by Nonaka and co-workers. The direct access to knowledge that T...

Conceptualization and Non-Relational Implementation of Ontological and Epistemic Vagueness of Information in Digital Humanities

Research in the digital humanities often involves vague information, either because our objects of study lack clearly defined boundaries, or because our knowledge about them is incomplete or hypothetical, which is especi...

Detecting Transitions in Manual Tasks from Wearables: An Unsupervised Labeling Approach†

Authoring protocols for manual tasks such as following recipes, manufacturing processes or laboratory experiments requires significant effort. This paper presents a system that estimates individual procedure transition...

A Smart Sensor Data Transmission Technique for Logistics and Intelligent Transportation Systems

When it comes to Internet of Things systems that include both a logistics system and an intelligent transportation system, a smart sensor is one of the key elements to collect useful information whenever and wherever n...

Designing a Situational Awareness Information Display: Adopting an Affordance-Based Framework to Amplify User Experience in Environmental Interaction Design

User experience remains a crucial consideration when assessing the successfulness of information visualization systems. The theory of affordances provides a robust framework for user experience design. In this article,...

Download PDF file
  • EP ID EP44117
  • DOI https://doi.org/10.3390/informatics5010014
  • Views 253
  • Downloads 0

How To Cite

Zhihao Yuan, Dai Hai Ton That, Siddhant Kothari, Gabriel Fils and Tanu Malik (2018). Utilizing Provenance in Reusable Research Objects. Informatics, 5(1), -. https://europub.co.uk/articles/-A-44117