The Importance of Authoritative URI Design Schemes for Open Government Data

A major challenge when working with open government data is managing, connecting, and understanding the links between references to entities found across multiple datasets when these datasets use different vocabularies to refer to identical entities (i.e.: one dataset may refer to Microsoft as "Microsoft", another may refer to the company by its SEC filing number as "0000789019", and a third may use its stock ticker "MSFT".) In this paper we propose a naming scheme based on Web URLs that enables unambiguous naming and linking of datasets and, more importantly, data elements, across the Web. We further describe our ongoing work to demonstrate the implementation and authoritative management of such schemes through a class of web service we refer to as the "instance hub".

View Publication

Associated Projects

The LOGD project investigates the role of Semantic Web technologies, especially Linked Data, in producing, enhancing and utilizing government data published on and other websites. Large portion of government data published on the Web are not necessarily ready for mashups. The Tetherless World Constellation (TWC) is now publishing over 8 billions RDF triples converted from hundreds of government-related datasets from and other sources (e.g.