The Importance of Authoritative URI Design Schemes for Open Government Data

Printer-friendly version

Abstract:

A major challenge when working with open government data is managing, connecting, and understanding the links between references to entities found across multiple datasets when these datasets use different vocabularies to refer to identical entities (i.e.: one dataset may refer to Microsoft as "Microsoft", another may refer to the company by its SEC filing number as "0000789019", and a third may use its stock ticker "MSFT".) In this paper we propose a naming scheme based on Web URLs that enables unambiguous naming and linking of datasets and, more importantly, data elements, across the Web. We further describe our ongoing work to demonstrate the implementation and authoritative management of such schemes through a class of web service we refer to as the "instance hub".

Related Projects:

DCO-DS LogoLinking Open Government Data (LOGD)
Principal Investigator: Jim Hendler and Deborah L. McGuinness
Description: The LOGD project investigates the role of Semantic Web technologies, especially Linked Data, in producing, enhancing and utilizing government data published on Data.gov and other websites.

Related Research Areas:

Semantic Foundations
Lead Professor: Deborah L. McGuinness
Description: Semantic Foundations
Concepts: Semantic Web