Beth Noveck, NYU law professor and previously Deputy Chief Technology Officer of the US, and I have written a blog posting that discusses the DATA act, a proposed US law to improve the transparency of US spending. I realized this may also be of interest to folks working on Linked Data, and especially linked open government data, so I thought I’d mention this here as well.
VN:F [1.9.13_1145]
Rating: 10.0/10 (1 vote cast)
VN:F [1.9.13_1145]
We have created 16 RDF datasets covering 187 of the datasets published at data.gov (171 EPA datasets are subsets of three larger EPA datasets). The original datasets were published by EPA, US Census Bureau, USGS and Office of Management and Budget in CSV compatible format, and they contributed 13,532,250 table entries. The translated RDF datasets includes a total of 2,927,398,352 triples involving 2,526 properties.
We publish the RDF data in two alternative ways: (i) a collection of linked partition files in RDF/XML for users to browse the dataset and dereference the URIs using semantic web browsers, and (ii) one big N-TRIPLE file (data.nt) concatenating the partition files for machines, especially triple stores, to download and import. The largest dataset is Dataset_91, which contributed 2.11 billion triples.
To access the RDF datasets, users may go to Data.gov_Catalog with the following options:
More datasets are coming, so please stay tuned and come back to http://data-gov.tw.rpi.edu/.
Further reading:
Li Ding, Dominic DiFranzo, Sarah Magidson, and Jim Hendler
VN:F [1.9.13_1145]
Rating: 9.6/10 (7 votes cast)
VN:F [1.9.13_1145]