Parallel Identities for Managing Open Government Data

The widespread availability of Open Government Data is exposing significant challenges to trust in its unplanned applications. As data are accumulated, transformed, and presented through a chain of independent third parties, there is a growing need for sophisticated models of provenance. Significant progress has been made in describing data derivation, but that work has been limited by its inability to distinguish between transformations that change content and transformations that simply change representation. We have found that Functional Requirements for Bibliographic Records (FRBR) can, when paired with a derivational provenance model like the World Wide Web Consortium's emerging PROV standard, successfully represent web resource accession, distinguish between transformations of content and format, and facilitate verification using cryptographic digests. We show how cryptographic digest algorithms, applied to FRBR concepts, can provide an automated method and tools for coordinating the multiscale identity of information resources.


February 7, 2012
James McCusker

