Ontology engineering in provenance enablement for the National Climate Assessment

The National Climate Assessment of the U.S. Global Change Research Program (USGCRP) analyzes and presents the impacts of climate change on the United States. The provenance information in the assessment is important because the assessment findings are of great public and academic concern and are used in policy and decision-making. By applying a use case-driven iterative methodology, we developed information models and ontology to represent the content structure of the recent National Climate Assessment draft report and its associated provenance information. We tested the ontology by using it in pilot systems serving information about instances of chapters, scientific findings, figures, tables, images, datasets, references, people, and organizations, etc. in the draft report, as well as interrelationships among those instances. The results successfully help users trace provenance in the draft report, such as finding all the journal articles from which a figure in the report was derived. The provenance information in our work was maintained in the context of the “Web of Data”. In addition to the pilot systems we developed, other tools and services are also able to retrieve and utilize the provenance information. Our work is part of a Global Change Information System coordinated by the USGCRP that will eventually cover provenance information for the entire scope of global change research. Such a system will greatly increase understanding, credibility and trust in the global change research and foster reproducibility of scientific results and conclusions.

View Publication

Associated Projects

The Tetherless World Constellation (TWC) at Rensselaer Polytechnic Institute (RPI) proposes to facilitate the vocabulary and ontology development within the context of the overall development of semantic prototypes for the National Climate Assessment (NCA) portals using a combination of environmental inter-agency collaborations in a use-case focused workshop setting, information modeling, and software developments and deployments.