Towards Next Generation Health Data Exploration: A Data Cube-based Investigation into Population Statistics for Tobacco

Printer-friendly version

Abstract:

Increasingly, experts and interested laypeople are turning to the explosion of online data to form and explore hypotheses about relationships between public health intervention strategies and their possible impacts. We have engaged in a multi-year collaboration to use and design semantic techniques and tools to support the current and next generation of these explorations. We introduce a tool, qb.js, to enable access to multidimensional statistical data in ways that allow non-specialists to explore and create specific visualizations of that data. We focus on explorations of health data - in particular aimed at helping to support the formation and analysis of hypotheses about public health intervention strategies and their correlation with health-related behavior changes. We used qb.js to formulate and explore the hypothesis that youth tobacco access laws have consistent, measurable impacts on the rate of change in cigarette smoking among high school students over time. While focused in this instance on one particular intervention strategy (i.e., limiting youth access to tobacco), this analytics platform may be used for a wide range of correlationalcorrelational analyses. To address this hypothesis, we converted population science data on tobacco-related policy and behavior from ImpacTeen to an Resource Description framework (RDF) representation that was annotated with the RDF Data Cube vocabulary. A Semantic Data Dictionary enabled mapping between the original datasets and the RDF representation. This allowed for the creation and publication of data visualizations using qb.js. The RDF Data Cube representation made it possible to discover a significant downward effect from the introduction of nine youth tobacco access laws on the rate of change in smoking prevalence among high school-aged youth.

History

DateCreated ByLink
September 23, 2012
15:23:02
James McCuskerDownload
September 23, 2012
15:05:38
James McCuskerDownload
September 23, 2012
15:00:07
James McCuskerDownload

Related Projects:

PopSciGrid LogoPopulation Science Grid (PopSciGrid)
Principal Investigator: Deborah L. McGuinness
Description: The National Cancer Institute’s (NCI) PopSciGrid Community Health Portal is an evolving platform demonstrating how health behavior, policy, and demographic data can be integrated, visualized, and communicated to empower communities and support new avenues of research and policy for cancer prevention and control. As a proof of concept for cyber-enabled population health research, the PopSciGrid Portal is designed to encourage trans-disciplinary collaboration, data harmonization, and development of new computational methods for disparate health related data.

Related Research Areas:

Data Frameworks
Lead Professor: Peter Fox
Description: None.
Concepts: eScience
Health Informatics
Lead Professor: Deborah L. McGuinness
Description:

Health informatics is "the interdisciplinary study of the design, development, adoption and application of IT-based innovations in healthcare services delivery, management and planning." Procter, R. Dr. (Editor, Health Informatics Journal, Edinburgh, United Kingdom). (From the U.S. National Library of Medicine)


Concepts: None.
Inference And Trust
Lead Professor: Deborah L. McGuinness
Description: Inference And Trust
Concepts:
Knowledge Provenance
Lead Professor: Deborah L. McGuinness
Description: Knowledge Provenance
Concepts: Provenance,