provenance systems
Recently Published Documents


TOTAL DOCUMENTS

11
(FIVE YEARS 2)

H-INDEX

3
(FIVE YEARS 0)

2021 ◽  
Vol 14 (12) ◽  
Author(s):  
Xiong Ding ◽  
Huachuan Jiang ◽  
Yuefeng Sun ◽  
Yuanhao Li ◽  
Min Li ◽  
...  

2018 ◽  
Vol 57 (3) ◽  
pp. 495-543 ◽  
Author(s):  
Beatriz Pérez ◽  
Julio Rubio ◽  
Carlos Sáenz-Adán

2015 ◽  
Vol 26 (2) ◽  
pp. 32-47 ◽  
Author(s):  
Salmin Sultana ◽  
Elisa Bertino

Existing provenance systems operate at a single layer of abstraction (workflow/process/OS) at which they record and store provenance. However, the provenance captured from different layers provides the highest benefit when integrated through a unified provenance framework. To build such a framework, a comprehensive provenance model able to represent the provenance of data objects with various semantics and granularity is the first step. In this paper, the authors propose a provenance model able to represent the provenance of any data object captured at any abstraction layer and present an abstract schema of the model. The expressive nature of the model enables a wide range of provenance queries. The authors also illustrate the utility of their model in real world data processing systems. In the paper, they also introduce a data provenance distributed middleware system composed of several different components and services that capture provenance according to their model and securely stores it in a central repository. As part of our middleware, the authors present a thin stackable file system, called FiPS, for capturing local provenance in a portable manner. FiPS is able to capture provenance at various degrees of granularity, transform provenance records into secure information, and direct the resulting provenance data to various persistent storage systems.


2013 ◽  
Vol 6 (3) ◽  
pp. 867-872 ◽  
Author(s):  
Anu Mary Chacko ◽  
Anu Mary Chacko ◽  
Dr. Madhukumar S D

The metadata that captures information about the origin of data is referred to as data provenance or data lineage. The provenance of a data item captures information about the processes and source data items that lead to its creation and its current representation. A provenance-aware application captures and stores adequate documentation about process executions to answer queries regarding provenance. Provenance information is very useful when we need to know the inter dependency of data to find error propagation or information flow. Provenance collected also helps to understand what went different in two identical workflows with same inputs but producing different outputs. Currently, most of the provenance systems designed is domain specific. Through this paper, we propose a general methodology for making an application provenance-aware from the basic UML design diagrams. As a starting point we have analyzed UML Class diagrams to generate information to make application provenance aware.  


Author(s):  
Fakhri Alam Khan ◽  
Sardar Hussain ◽  
Ivan Janciak ◽  
Peter Brezany

e-Science helps scientists to automate scientific discovery processes and experiments, and promote collaboration across organizational boundaries and disciplines. These experiments involve data discovery, knowledge discovery, integration, linking, and analysis through different software tools and activities. Scientific workflow is one technique through which such activities and processes can be interlinked, automated, and ultimately shared amongst the collaborating scientists. Workflows are realized by the workflow enactment engine, which interprets the process definition and interacts with the workflow participants. Since workflows are typically executed on a shared and distributed infrastructure, the information on the workflow activities, data processed, and results generated (also known as provenance), needs to be recorded in order to be reproduced and reused. A range of solutions and techniques have been suggested for the provenance of data collection and analysis; however, these are predominantly workflow enactment engine and domain dependent. This paper includes taxonomy of existing provenance techniques and a novel solution named VePS (The Vienna e-Science Provenance System) for e-Science provenance collection.


Author(s):  
Fakhri Alam Khan ◽  
Sardar Hussain ◽  
Ivan Janciak ◽  
Peter Brezany

e-Science helps scientists to automate scientific discovery processes and experiments, and promote collaboration across organizational boundaries and disciplines. These experiments involve data discovery, knowledge discovery, integration, linking, and analysis through different software tools and activities. Scientific workflow is one technique through which such activities and processes can be interlinked, automated, and ultimately shared amongst the collaborating scientists. Workflows are realized by the workflow enactment engine, which interprets the process definition and interacts with the workflow participants. Since workflows are typically executed on a shared and distributed infrastructure, the information on the workflow activities, data processed, and results generated (also known as provenance), needs to be recorded in order to be reproduced and reused. A range of solutions and techniques have been suggested for the provenance of data collection and analysis; however, these are predominantly workflow enactment engine and domain dependent. This paper includes taxonomy of existing provenance techniques and a novel solution named VePS (The Vienna e-Science Provenance System) for e-Science provenance collection.


Sign in / Sign up

Export Citation Format

Share Document