Utilizing Provenance in Reusable Research Objects

Journal Title: Informatics - Year 2018, Vol 5, Issue 1

Abstract

Science is conducted collaboratively, often requiring the sharing of knowledge about computational experiments. When experiments include only datasets, they can be shared using Uniform Resource Identifiers (URIs) or Digital Object Identifiers (DOIs). An experiment, however, seldom includes only datasets, but more often includes software, its past execution, provenance, and associated documentation. The Research Object has recently emerged as a comprehensive and systematic method for aggregation and identification of diverse elements of computational experiments. While a necessary method, mere aggregation is not sufficient for the sharing of computational experiments. Other users must be able to easily recompute on these shared research objects. Computational provenance is often the key to enable such reuse. In this paper, we show how reusable research objects can utilize provenance to correctly repeat a previous reference execution, to construct a subset of a research object for partial reuse, and to reuse existing contents of a research object for modified reuse. We describe two methods to summarize provenance that aid in understanding the contents and past executions of a research object. The first method obtains a process-view by collapsing low-level system information, and the second method obtains a summary graph by grouping related nodes and edges with the goal to obtain a graph view similar to application workflow. Through detailed experiments, we show the efficacy and efficiency of our algorithms.

Authors and Affiliations

Zhihao Yuan, Dai Hai Ton That, Siddhant Kothari, Gabriel Fils and Tanu Malik

Keywords

Related Articles

Social Media Providing an International Virtual Elective Experience for Student Nurses

The advances in social media offer many opportunities for developing understanding of different countries and cultures without any implications of travel. Nursing has a global presence and yet it appears as though stud...

A Novel Three-Stage Filter-Wrapper Framework for miRNA Subset Selection in Cancer Classification

Micro-Ribonucleic Acids (miRNAs) are small non-coding Ribonucleic Acid (RNA) molecules that play an important role in the cancer growth. There are a lot of miRNAs in the human body and not all of them are responsible f...

Web-Based Scientific Exploration and Analysis of 3D Scanned Cuneiform Datasets for Collaborative Research

The three-dimensional cuneiform script is one of the oldest known writing systems and a central object of research in Ancient Near Eastern Studies and Hittitology. An important step towards the understanding of the cun...

Analyzing Spatiotemporal Anomalies through Interactive Visualization

As we move into the big data era, data grows not just in size, but also in complexity, containing a rich set of attributes, including location and time information, such as data from mobile devices (e.g., smart phones),...

Data Governance in the Sustainable Smart City

The wisdom of ‘smart’ development increasingly shapes urban sustainability in Europe and beyond. Yet, the ‘smart city’ paradigm has been critiqued for favouring technological solutions and business interests over socia...

Download PDF file
  • EP ID EP44117
  • DOI https://doi.org/10.3390/informatics5010014
  • Views 272
  • Downloads 0

How To Cite

Zhihao Yuan, Dai Hai Ton That, Siddhant Kothari, Gabriel Fils and Tanu Malik (2018). Utilizing Provenance in Reusable Research Objects. Informatics, 5(1), -. https://europub.co.uk/articles/-A-44117