Converting governmental datasets into linked data

Printer-friendly version






Abstract:

Linked Data provide many benefits to data consumers, but many publicly available datasets are still released in the Comma Separated Values (CSV) format, a ubiquitous common denominator. We introduce a methodology to transform such datasets into Linked Data. Our design is based on requirements identified while surveying existing governmental datasets released by data.gov. We present an implementation-independent RDF vocabulary to describe how a CSV dataset should be promoted into Linked Data, and use a Java-based converter to produce 5.3 billion RDF triples from 312 data.gov datasets.

History

DateCreated ByLink
July 12, 2011
21:01:48
Tim LeboDownload
July 12, 2011
20:59:54
Tim LeboDownload

Related Projects:

DCO-DS LogoLinking Open Government Data (LOGD)
Principal Investigator: Deborah L. McGuinness and Jim Hendler
Description: The LOGD project investigates the role of Semantic Web technologies, especially Linked Data, in producing, enhancing and utilizing government data published on Data.gov and other websites.

Related Research Areas:

Future Web
Lead Professor: Jim Hendler
Description: Since its inception the World Wide Web has changed the ways people work, play, communicate, collaborate, and educate. There is, however, a growing realization among researchers across a number of disciplines that without new research aimed at understanding the current, evolving and potential Web, we may be missing or delaying opportunities for new and revolutionary capabilities. To model the Web, it is necessary to understand the architectural principles that have provided for its growth. Looking into the future, to be sure that it supports the basic social values of trustworthiness, personal control over information, and respect for social boundaries, a research agenda must be pursued that targets the Web and its use as a primary focus of attention. This research requires powerful scientific and mathematical techniques from many disciplines to explore the modeling of the Web from network- and information- centric views.
Concepts: Semantic Web
Semantic Foundations
Lead Professor: Deborah L. McGuinness
Description: Semantic Foundations
Concepts: Semantic Web