
Data.gov Datasets Translated in RDF!

July 22nd, 2009

We have created 16 RDF datasets covering 187 of the datasets published at data.gov (171 EPA datasets are subsets of three larger EPA datasets). The original datasets were published by the EPA, the US Census Bureau, the USGS, and the Office of Management and Budget in CSV-compatible formats, and they contributed 13,532,250 table entries. The translated RDF datasets include a total of 2,927,398,352 triples involving 2,526 properties.

We publish the RDF data in two alternative ways: (i) a collection of linked partition files in RDF/XML, so that users can browse the dataset and dereference the URIs with semantic web browsers, and (ii) one big N-Triples file (data.nt) that concatenates the partition files, so that machines, especially triple stores, can download and import it. The largest dataset is Dataset_91, which contributes 2.11 billion triples.
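For readers who want to check figures such as the triple and property counts on their own copy of a complete file, here is a minimal sketch in Python. It assumes the file is line-oriented N-Triples (one triple per line, with the predicate as the second whitespace-separated term) and simply streams it rather than loading it into memory. The function names and the sample triples are illustrative and are not part of our tooling.

```python
import gzip
from collections import Counter


def count_properties(lines):
    """Count triples per predicate in an iterable of N-Triples lines."""
    counts = Counter()
    for line in lines:
        line = line.strip()
        # Skip blank lines and comments.
        if not line or line.startswith("#"):
            continue
        # Subject and predicate never contain whitespace in N-Triples,
        # so the second field is always the predicate.
        parts = line.split(None, 2)
        if len(parts) < 3:
            continue  # not a well-formed triple; ignore it
        counts[parts[1]] += 1
    return counts


def count_properties_gz(path):
    """Stream a gzipped N-Triples file and count triples per predicate."""
    with gzip.open(path, "rt", encoding="utf-8") as fh:
        return count_properties(fh)


# Hypothetical sample triples (example.org URIs are made up for illustration).
SAMPLE = [
    '<http://example.org/row/1> <http://example.org/vocab/state> "NY" .',
    '<http://example.org/row/1> <http://example.org/vocab/population> "100" .',
    '<http://example.org/row/2> <http://example.org/vocab/state> "CA" .',
    "# a comment line",
    "",
]

if __name__ == "__main__":
    counts = count_properties(SAMPLE)
    print("triples:", sum(counts.values()))
    print("distinct properties:", len(counts))
```

Calling `count_properties_gz` on a downloaded `.nt.gz` file should give the same kind of summary. For the largest datasets a single pass may take a while, since the file is decompressed as it is read.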

To access the RDF datasets, users may go to Data.gov_Catalog with the following options:

  • follow links in the “rdf(index file)” column to access the index file in RDF/XML, which contains the property list, statistics, and links of the RDF dataset, e.g. http://data-gov.tw.rpi.edu/raw/401/index.rdf
  • follow links in the “rdf(partition files)” column to start an RDF browser (e.g. Tabulator) and surf the RDF/XML partition files, e.g. http://data-gov.tw.rpi.edu/raw/401/link00001.rdf
  • follow links in the “rdf(complete file)” column to download the complete RDF dataset in N-Triples format (gzipped), e.g. http://data-gov.tw.rpi.edu/raw/401/data-401.nt.gz
  • follow links in the “url(data.gov)” column to see the original metadata at data.gov
  • follow links in the “wiki page” column to see enhanced metadata about data.gov datasets
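
The three example URLs above suggest a naming pattern, and the following Python sketch builds the corresponding URLs for a given dataset number. This is only an assumption extrapolated from the Dataset 401 examples: the pattern, the five-digit zero padding of partition numbers, and the helper name `dataset_urls` are not documented guarantees, so the catalog links remain the authoritative source.

```python
BASE = "http://data-gov.tw.rpi.edu/raw"


def dataset_urls(dataset_id, partition=1):
    """Build index, partition, and complete-file URLs for one dataset.

    Assumes every dataset follows the layout seen for Dataset 401.
    """
    root = f"{BASE}/{dataset_id}"
    return {
        # RDF/XML index: property list, statistics, links
        "index": f"{root}/index.rdf",
        # One RDF/XML partition file, numbered with five digits (assumed)
        "partition": f"{root}/link{partition:05d}.rdf",
        # Complete dataset as gzipped N-Triples
        "complete": f"{root}/data-{dataset_id}.nt.gz",
    }


if __name__ == "__main__":
    for name, url in dataset_urls(401).items():
        print(name, url)
```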

More datasets are coming, so please stay tuned and come back to http://data-gov.tw.rpi.edu/.

Further reading:

Li Ding, Dominic DiFranzo, Sarah Magidson, and Jim Hendler

Categories: linked data, Semantic Web
  1. Melvin Carvalho
    July 23rd, 2009 at 05:37 | #1

    Great job!

  2. Mukkai Krishnamoorthy
    July 23rd, 2009 at 07:59 | #2

    Great Job. Lots of useful information may be gleaned from your great work. Are you also producing graphs of these data (linking information)? If so will that be in RGML format?

  3. Nicolas
    July 25th, 2009 at 09:37 | #3

    Doh! Great job, great news!
    I keep track of news about putting government data into Linked Data, and I thought the UK government would be faster (thanks to TBL mainly). Happy to see that the US community is translating non-standard formats into RDF.
    My sources: http://www.pearltrees.com/nicolas/map/1_53584/

    Thanks.
    Nicolas
