TetherlessPC3
| Tetherless World for the Third Provenance Challenge | |
| type | Research Project |
| status | active |
| homepage | http://tw.rpi.edu/wiki/TetherlessPC3/ |
| Management | |
| team | Tetherless World Constellation |
| investigator | Deborah L. McGuinness |
| participant | James Michaelis, Li Ding, Rui Huang, Zhenning Shangguan |
| Tags | |
| tag | OWL, Provenance |
| relation | Inference Web |
| Internal | |
Overview
We are investigating a solution to the Third Provenance Challenge (PC3), which concerns annotating and integrating provenance metadata for workflows.
facts
- the 3rd Provenance Challenge (PC3 workshop held June 10-11 in Amsterdam)
- the challenge homepage - http://twiki.ipaw.info/bin/view/Challenge/ThirdProvenanceChallenge
- the queries - http://twiki.ipaw.info/bin/view/Challenge/ProvenanceQuestionsPc3
- the OPM 1.01 specification - http://openprovenance.org/
Challenges
With a special focus on scientific workflows, we see the following challenges:
- provenance ontology - what provenance metadata can be captured, e.g. data flow and control flow
- provenance metadata generation - how to capture provenance using a general-purpose ontology
- provenance integration - how to share, link and normalize distributed provenance data
- provenance query - how to answer queries on the provenance graph
Our Solution
provenance generation
- we build a common OWL ontology, based on PML (Proof Markup Language), that fully captures OPM concepts
- we review the challenge workflow and its control flow, and establish strategies for deciding which provenance data to capture
  - capture data flow
  - capture control flow
  - capture versions of data
- we generate provenance metadata with minimal changes to the original workflow source code
  - internal injection: inject provenance logging code into the source code of the workflow logic
  - external observation: add code to observe the inputs, outputs, known operation semantics, and system state of workflow processes
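The external observation strategy can be illustrated with a minimal Python sketch. It is not the project's actual code: the `observed` decorator, the `PROVENANCE_LOG` list, and the `normalize` step are hypothetical names invented for illustration. Only the edge names `used` and `wasGeneratedBy` come from OPM. The idea shown is that a wrapper records what a workflow step consumed and produced without touching the step's own logic.

```python
import functools
import itertools

# Hypothetical in-memory provenance log. Each entry is an OPM-style edge
# written as (edge_type, effect, cause).
PROVENANCE_LOG = []
_process_counter = itertools.count(1)


def observed(step):
    """Wrap a workflow step and record its inputs and output from outside."""
    @functools.wraps(step)
    def wrapper(*args):
        process_id = f"process:{step.__name__}#{next(_process_counter)}"
        # The process used each of its input artifacts.
        for arg in args:
            PROVENANCE_LOG.append(("used", process_id, f"artifact:{arg!r}"))
        result = step(*args)
        # The output artifact was generated by the process.
        PROVENANCE_LOG.append(
            ("wasGeneratedBy", f"artifact:{result!r}", process_id)
        )
        return result
    return wrapper


@observed
def normalize(value):
    # Stand-in for a real workflow step (hypothetical).
    return value.strip().lower()


normalize("  CSV_Batch_01 ")
for edge in PROVENANCE_LOG:
    print(edge)
```

Internal injection would instead place the logging calls inside `normalize` itself, which gives access to intermediate state at the cost of editing the workflow source.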
provenance integration
- we map OPM models generated by other teams to RDF data using our common ontology
- we perform additional computation (OWL inference, domain heuristics for model alignment) to enrich the mapped RDF data
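The mapping step can be pictured with a small sketch that serializes OPM edges as N-Triples lines. Everything here is an assumption made for illustration: the `http://example.org/prov#` namespace, the `opm_edge_to_triple` helper, and the node identifiers are hypothetical and do not reflect the project's actual ontology URIs or its PML-based vocabulary.

```python
# Hypothetical namespace; the project's real ontology URIs are not given here.
EX = "http://example.org/prov#"


def opm_edge_to_triple(edge_type, effect, cause):
    """Map one OPM edge (the effect depends on the cause) to an N-Triples line."""
    return f"<{EX}{effect}> <{EX}{edge_type}> <{EX}{cause}> ."


# Two OPM edges as another team might report them (made-up identifiers).
edges = [
    ("used", "p1", "a1"),            # process p1 used artifact a1
    ("wasGeneratedBy", "a2", "p1"),  # artifact a2 was generated by p1
]

triples = [opm_edge_to_triple(*edge) for edge in edges]
print("\n".join(triples))
```

Once every team's model is expressed in one vocabulary like this, inference and alignment heuristics can operate on a single merged graph.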
provenance query
- we clarify the semantics of provenance queries
- we build a SPARQL-based solution to answer provenance queries
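Many provenance queries reduce to finding everything a given artifact depends on. The sketch below shows that lineage computation as a plain breadth-first traversal in Python. It is a stand-in for illustration, not the SPARQL-based solution itself, and the `upstream` function and the graph data are hypothetical.

```python
from collections import deque

# Edges point from effect to cause, as in OPM (made-up identifiers).
edges = [
    ("a3", "wasGeneratedBy", "p2"),
    ("p2", "used", "a2"),
    ("a2", "wasGeneratedBy", "p1"),
    ("p1", "used", "a1"),
    ("a9", "wasGeneratedBy", "p9"),  # unrelated branch
]


def upstream(node, edges):
    """Return every process and artifact that `node` transitively depends on."""
    causes = {}
    for effect, _edge_type, cause in edges:
        causes.setdefault(effect, []).append(cause)

    seen = set()
    queue = deque([node])
    while queue:
        current = queue.popleft()
        for cause in causes.get(current, []):
            if cause not in seen:
                seen.add(cause)
                queue.append(cause)
    return seen


lineage = upstream("a3", edges)
print(sorted(lineage))
```

The unrelated `a9`/`p9` branch is not reached, which is the behavior a lineage query needs.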
Results
- the query results - http://twiki.ipaw.info/bin/view/Challenge/TetherlessPC3
- the source code - http://code.google.com/p/tetherlesspc3/
- Ontology
Resources
- PC3 Presentation Slides: Image:PC3Slides.ppt
- PC3 Query Semantics: Image:PC3QuerySemantics.pdf
- OPM-PML2 Mapping Strategy: Image:OPM-PML2Mapping.pdf
- Provenance Mapping System: Image:ProtoProv.pdf
