Projects

TWC Faculty: Jim Hendler, Deborah L. McGuinness
Funding Agency/Sponsor: Defense Advanced Research Projects Agency

This is a program led by the Defense Advanced Research Projects Agency (DARPA), which is developing a schema-based AI system that can identify complex events and bring them to the attention of users. KAIROS aims to understand complex events described in multimedia inputs by developing a semi-automated system that identifies, links, and temporally sequences their subsidiary elements, the participants involved, as well as the complex event type.

TWC Faculty: Jim Hendler, Deborah L. McGuinness
Research Staff: John S. Erickson, Daniel M. Gruen, Jamie McCusker, Paulo Pinheiro, Henrique Santos, Oshani Seneviratne, Samuel Stouffer
Funding Agency/Sponsor: IBM AI Horizons

The Center for Health Empowerment by Analytics, Learning, and Semantics (HEALS) is a five-year collaboration between Rensselaer and IBM aimed at researching how the application of advanced cognitive computing capabilities can help people to understand and improve their own health conditions.

TWC Faculty: Deborah L. McGuinness
Research Staff: Jamie McCusker, Paulo Pinheiro, Henrique Santos
Funding Agency/Sponsor: National Institute of Environmental Health Sciences (NIEHS)

In 2019 the Human Health Exposure Analysis Resource (HHEAR) Data Center was established by NIEHS as a continuation of the CHEAR Data Center expanding to include health outcomes at all ages. The goal is to provide approved HHEAR investigators their laboratory analysis results and incorporate them in statistical analyses of their study data. We then make that data publicly available as a means to improve our knowledge of the comprehensive effects of environmental exposures on human health throughout the life course.

TWC Faculty: Deborah L. McGuinness
Research Staff: Rebecca Cowan, Alice M. Mulvehill, Henrique Santos
Funding Agency/Sponsor: Defense Advanced Research Projects Agency
DARPA is the research and development office for the U.S. Department of Defense. It maintains the technological superiority of the U.S. military and prevents technological surprise from harming our national security. TWC has been funded by DARPA for numerous projects, including this one.
TWC Faculty: Deborah L. McGuinness
Research Staff: Jamie McCusker

Whyis is a nano-scale knowledge graph publishing, management, and analysis framework. Whyis aims to support domain-aware management and curation of knowledge from many different sources. Its primary goal is to enable creation of useful domain- and data-driven knowledge graphs. Knowledge can be contributed and managed through direct user interaction, statistical analysis, or data ingestion from many different kinds of data sources.

TWC Faculty: Deborah L. McGuinness
Research Staff: Rebecca Cowan, John S. Erickson, Jamie McCusker, Paulo Pinheiro, Oshani Seneviratne
The aim of the Semantic Data Dictionary (SDD) approach is to annotate datasets such that it is machine readable, uses best practice ontologies, and follows FAIR Guiding Principles. It is a project that was developed to address machines’ difficulty in understanding data dictionaries, a standard method used to describe datasets through the use of tables that identify information about data variables’ content, description, and format. With SDD, there is an extension and integration of data from multiple domains using a common metadata standard.
TWC Faculty: Deborah L. McGuinness
Research Staff: Jamie McCusker, Paulo Pinheiro
Funding Agency/Sponsor: National Institute of Environmental Health Sciences (NIEHS)
The United States’ National Institute of Environmental Health Sciences has established an infrastructure, the Children’s Health Exposure Analysis Resource (CHEAR), to provide the extramural research community access to laboratory and statistical analyses aimed at adding or expanding the inclusion of environmental exposures in their research.
TWC Faculty: Deborah L. McGuinness
Research Staff: Paulo Pinheiro
Funding Agency/Sponsor: McGill

Circular economy (CE) envisions a sustainable future where waste is eliminated in the built environment and materials and buildings are kept in use for as long as possible. ‘Housing passports’ (HP) are standardized digital descriptions of residential building characteristics. Data Homebase, provides digital HPs which identify the CE characteristics of current housing materials, products, components, and composition, showing their value for present use, reuse and recovery.

Research Staff: Jamie McCusker

Semantic Extract, Transform, and Load-er (SETLr) is a flexible, scalable tool for providing semantic interpretations to tabular, XML, and JSON-based data from local or web files. It has been used by diverse projects and has shown to be scalable and flexible, allowing for the simplified creation of arbitrary RDF, including ontologies and nanopublications, from many different data formats. Semantic ETL scripts use best practice standards for provenance (PROV-O) and support streaming conversion for RDF transformation using the JSON-LD based templating language, JSLDT.

TWC Faculty: Deborah L. McGuinness
Research Staff: John S. Erickson, Jamie McCusker, Paulo Pinheiro, Henrique Santos
HADatAc (Human-Aware Data Acquisition framework) is an open-source infrastructure that enables combined acquisitions of data and metadata in a way that metadata is properly and logically connected to data.
TWC Faculty: Deborah L. McGuinness

SemNExT combines numeric analysis of data with semantic understanding and explanation technologies to provide a holistic means of exploring robust datasets.

TWC Faculty: Deborah L. McGuinness
Research Staff: Paulo Pinheiro, Henrique Santos
The Jefferson Project at Lake George is building one of the world’s most sophisticated environmental monitoring and prediction systems, which will provide scientists and the community with a real-time picture of the health of the lake. Launched in June 2013, the project aims to understand and manage multiple complex factors—including road salt incursion, storm water runoff, and invasive species—all threatening one of the world’s most pristine natural ecosystems and an economic cornerstone of the New York tourism industry.
TWC Faculty: Deborah L. McGuinness
Research Staff: Rebecca Cowan, Jamie McCusker, Samuel Stouffer
Funding Agency/Sponsor: National Science Foundation

The MaterialsMine Team brings together expertise across five research institutions in the fields of mechanics, materials, design, manufacturing, data science, and computer science to build and develop an open-source, user-friendly materials data resource guided by FAIR principles, with current modules geared toward research communities in the domains of polymer nanocomposites (NanoMine) and mechanical metamaterials (MetaMine).

TWC Faculty: Jonathan Dordick, Deborah L. McGuinness
Research Staff: Jamie McCusker

We aim to find new effective treatments for disease using existing drugs. Our approach is to gather and integrate existing data using semantic technologies to help discover promising drug repurposing.

Many diseases are based on genetic or epigenetic changes that can be targeted indirectly via upstream regulatory pathways. Targets need to have a high likelihood of affecting all possible changes, and so need to have upstream interactions that cover multiple genotypes/epigenotypes that might drive the same phenotype.

TWC Faculty: Jim Hendler, Deborah L. McGuinness
Research Staff: John S. Erickson

The Web Science Research Center at TWC RPI is working with other members of the Web Science Trust to create a global "Web Observatory". The global movement toward Open Data and transparency have successfully motivated the release of very large institutional and commercial data sets describing social phenomena, economic indicators and geographic trends.

TWC Faculty: Peter Fox
Research Staff: John S. Erickson, Kathy Fontaine

Recent advances in data generation techniques, whether by experiments, measurements or computer simulation, quickly provide complex data characterized by source heterogeneity, multiple modalities, often high volume, high dimensionality, and multiple scales (temporal, spatial, and function).