Data Science Concept

Printer-friendly version

Description:

Data science is advancing inductive conduct of science driven by the greater volumes, complexity and heterogeneity of data being made available over the Internet. Data science combines of aspects of data management, library science, computer science, and physical science using supporting cyberinfrastructure and information technology. As such it is changing the way all of these disciplines do both their individual and collaborative work.

Data science is helping scienists face new global problems of a magnitude, complexity and interdisciplinary nature whose progress is presently limited by lack of available tools and a fully trained and agile workforce.

Projects:
DCO-DS LogoDeep Carbon Observatory Data Science (DCO-DS)
Principal Investigator: Peter Fox
Co Investigator: John S. Erickson and Jim Hendler
Description: Given this increasing data deluge, it is clear that each of the Directorates in the Deep Carbon Observatory face diverse data science and data management needs to fulfill both their decadal strategic objectives and their day-to-day tasks. This project will assess in detail the data science and data management needs for each DCO directorate and for the DCO as a whole, using a combination of informatics methods; use case development, requirements analysis, inventories and interviews.
EAGER Project LogoEAGER: Semantic Search (EAGER)
Principal Investigator: Jim Hendler
Description: NSF EAGER project to explore advanced semantic technology for data search.
Health Data Challenge (HealthData)
Principal Investigator: Deborah L. McGuinness and Jim Hendler
Co Investigator: Kristine Gloria, Alvaro Graves, Tim Lebo, and James McCusker
Description: An infrastructure for large-scale collaboration around aggregation, generation, and publication of health-related Linked Data.
Repurposing Drugs with Semantics (ReDrugS)
Principal Investigator: Deborah L. McGuinness and Jonathan Dordick
Description: We aim to find new effective treatments for disease using existing drugs. Our approach is to gather and integrate existing data using semantic technologies to help discover promising drug repurposing.
SEMMDD LogoSemantically Enabled Modeling of Major Depressive Disorder (SEMMDD)
Principal Investigator: Joanne S. Luciano
Description: In this project, we study the effects of how different antidepressant treatments, including non-pharmacological treatments, affect the underlying brain regions, clinical symptoms, and behaviors. We use mathematical modeling and computer simulation to combine clinical research with neuroscience research.
TW LogoTWC Web Observatory (WebObservatory)
Principal Investigator: Deborah L. McGuinness
Co Investigator: Jim Hendler
Description: The Web Science Research Center at TWC RPI is working with other members of the Web Science Trust to create a global "Web Observatory". The global movement toward Open Data and transparency have successfully motivated the release of very large institutional and commercial data sets describing social phenomena, economic indicators and geographic trends. This proliferation of data represents great opportunity for researchers and industry but this data abundance also threatens to make it ever more difficult to locate, analyse, compare and interpret useful information in a consistent and reliable way; a situation which can only get worse unless we can help stakeholders perform useful analysis rather than drowning in a sea of data. A global Web Observatory will offer an institutional framework to promote the use of W3C and other standards in the development of Semantic Catalogues to globally locate existing data sets, Collection Systems to gather new global data sets, and Analytics Tools and methodologies to analyse these data sets.
HADATAC LogoThe Human-Aware Data Acquisition Framework (HADatAc)
Principal Investigator: Paulo Pinheiro
Co Investigator: Deborah L. McGuinness
Description:
People:
Stephan Zednik

Stephan Zednik is a Senior Software Engineer with the Tetherless World Constellation at Rensselaer Polytechnic Institute. His research interests include researcher collaboration networks, quality representation and semantics, and provenance representation from data science tools. Stephan partici [...]