Health Data Challenge

Printer-friendly version
Research Areas: Health Informatics, Web Science, Knowledge Provenance
Principal Investigator: Jim Hendler and Deborah L. McGuinness
Co Investigator: Kristine Gloria, Alvaro Graves, Tim Lebo, and James McCusker
Concepts: Web Science, Software Framework, Provenance, Health informatics, eScience, Data Science, Linked Data, Faceted Search, Xinformatics
Description:
A set of in-house developed tools which enable the discovery of, access to, and integration of the Health and Human Services' datasets. Over the past several years, TWC has built many of the tools, has set the standards, and has engineered the architecture of the Web that many of us use today. We mirrored the hub.healthdata.gov CKAN instance using its API to our own instance at healthdata.tw.rpi.edu/hub. This allowed us to both improve the CKAN-based metadata, including adding Data Dictionaries and Technical Documentation as Resources, and to improve the RDF generated by CKAN.


The Tetherless World Constellation (TWC) at Rensselaer Polytechnic Institute (RPI) is proud to announce that a team of RPI students has won first place in the HealthIT.org Health Data Platform (HDP) Metadata Challenge. The Office of the National Coordinator for Health Information Technology's HDP challenges were designed to create new functionalities for the U.S. Department of Health and Human Services' repository for open health data, HealthData.gov.

The Metadata Challenge was launched on June 5, 2012 in order to facilitate the application of common metadata standards to all open government data. Additionally, the challenge sought to improve designs for health specific metadata. The challenge entries were judged on the number of metadata and data sets the app was designed to accommodate, the use of open source software and the incorporation of best practices.

For the Metadata challenge, RPI's team of Jim McCusker, Timothy Lebo, Alvaro Graves and Kristine Gloria won the $20,000 first place award with an application that leveraged the healthdata.gov CKAN API (Application Programming Interface) and the complete catalog of datasets on healthdata.gov to create multiple resources for organizing data and automating many of the data processes. The Tetherless World Team presented a set of in-house developed tools enabling the discovery of, access to, and integration of the Health and Human Services’ datasets as Linked Government Data.