Modelling Data Set Versioning Operations

Printer-friendly version

Authors:Benno Lee


Data sets do not remain stagnant after collection. They must often be corrected and grown to ensure data quality and coverage. The model in this poster demonstrates a method to encode versioning operations into semantically aware links so further metrics can be determined such as magnitude of change. This would allow for better automated versioning of data sets and data set consumption.


DateCreated ByLink
January 19, 2017
Benno LeeDownload
December 29, 2016
Benno LeeDownload

Related Projects:

MBVL Project LogoMarine Biodiversity Virtual Laboratory (MBVL)
Principal Investigator: Peter Fox, Stace Beaulieu, David Mark Welch, and Heidi Sosik
Description: This research effort brings together computational and information scientists, oceanographers and microbiologists to develop a Marine Biodiversity Virtual Laboratory (MBVL). In addition to research investigations of marine ecosystems, the Virtual Laboratory provides a platform for education via student diversity programs at the three institutions. The important learning opportunities will be two-fold for students: (1) to learn about, model, and make predictions for biodiversity in natural systems, and (2) to be exposed to working in an interdisciplinary team that includes both natural scientists and computer scientists.

Related Research Areas:

Semantic eScience
Lead Professor: Peter Fox
Science has fully entered a new mode of operation. E-science, defined as a combination of science, informatics, computer science, cyberinfrastructure and information technology is changing the way all of these disciplines do both their individual and collaborative work.
As semantic technologies have been gaining momentum in various e-Science areas (for example, W3C's new interest group for semantic web health care and life science), it is important to offer semantic-based methodologies, tools, middleware to facilitate scientific knowledge modeling, logical-based hypothesis checking, semantic data integration and application composition, integrated knowledge discovery and data analyzing for different e-Science applications.
Partially influenced by the Artificial Intelligence community, the Semantic Web researchers have largely focused on formal aspects of semantic representation languages or general-purpose semantic application development, with inadequate consideration of requirements from specific science areas. On the other hand, general science researchers are growing ever more dependent on the web, but they have no coherent agenda for exploring the emerging trends on the semantic web technologies. It urgently requires the development of a multi-disciplinary field to foster the growth and development of e-Science applications based on the semantic technologies and related knowledge-based approaches.