TetherlessPC3

From Tetherless World Wiki

Jump to: navigation, search

Tetherless World for the Third Provenance Challenge
type Research Project
status active
homepage http://tw.rpi.edu/wiki/TetherlessPC3/
Management
team Tetherless World Constellation
investigator Deborah L. McGuinness
participant James Michaelis,Li Ding,Rui Huang,Zhenning Shangguan
Tags
tag OWL,Provenance
relation Inference Web
Internal

Contents

Overview

We are investigating a solution to the 3rd provenance challenge on annotating and integrating provenance metadata for workflow.

facts

Challenges

with special focus on scientific workflow, we see the following challenges

  • provenance ontology - what provenance metadata can be captured, e.g. data flow and control flow
  • provenance metadata generation - how to capture provenance using the general purposed ontology
  • provenance integration - how to share, link and normalize distributed provenance data
  • provenance query - how to answer queries on the provenance graph

Our Solution

provenance generation

  • we build a common OWL ontology based on PML for fully capturing OPM concepts
  • we review the workflow/control flow of the challenge and establish strategies for capturing what provenance data
    • capture data flow
    • capture control flow
    • capture version of data
  • we generate provenance metadata with minimal changes on the original workflow source code
    • internal injection: inject provenance logging code into source code of workflow logics
    • external observation: add code to observe input/output/known-operation-semantics/system-state of workflow processes

provenance integration

  • we map OPM models generated by other teams to RDF data using our common ontology
  • we do additional computations (OWL inference, domain heuristics for model alignment) to enrich the mapped RDF data

provenance query

  • we clarify the semantics of provenance queries
  • we build SPARQL based solution to answer provenance queries

Results

Resources

Personal tools