Linked Data

Printer-friendly version Description:
Linked Data is about using the Web to connect related data that wasn't previously linked, or using the Web to lower the barriers to linking data currently linked using other methods. More specifically, Wikipedia defines Linked Data as "a term used to describe a recommended best practice for exposing, sharing, and connecting pieces of data, information, and knowledge on the Semantic Web using URIs and RDF."

The four rules of linked data are:

1. Use URIs as names for things (human readable)
2. Use HTTP URIs so that people can look up those names
3. When someone looks up a URI, provide useful information using standards (RDF*, SPARQL)
4. Includes links to other URIs, so they can discover more things.

5 Star Data

Projects:
CHEAR Project LogoChild Health Exposure Analysis Repository (CHEAR)
Principal Investigator: Deborah L. McGuinness
Co Investigator: Kristin Bennett
Description: Child Health Exposure Analysis Repository Data Science Semantics
EAGER Project LogoEAGER: Semantic Search (EAGER)
Principal Investigator: Jim Hendler
Description: NSF EAGER project to explore advanced semantic technology for data search.
Health Data Challenge (HealthData)
Principal Investigator: Deborah L. McGuinness and Jim Hendler
Co Investigator: Kristine Gloria, Alvaro Graves, Tim Lebo, and James McCusker
Description: An infrastructure for large-scale collaboration around aggregation, generation, and publication of health-related Linked Data.
Mobile Health Project LogoMobile Health
Principal Investigator: Deborah L. McGuinness
Description: The Mobile Health project aims to bring semantic representations of medical data collected from a variety of consumer and medical grade devices and integrate those data on an individual's mobile smartphone. Combined with the reasoning capabilities of semantic web and technologies such as IBM Watson, this project plans to enable personalized health care through the instrumented self.
NOCV Project LogoNational Ocean Council Vocabulary (NOCV)
Description: The objective of the NOCV project is to demonstrate technical capabilities that are available and can be deployed to implement solutions to key needs identified in the National Ocean Policy in regard to data and the decision-support requirements that arise from data-oriented questions.
SemantEco Annotator Project LogoSemantEco Annotator
Principal Investigator: Deborah L. McGuinness
Co Investigator: Patrice Seyed
Description: Generating useful RDF linked data is not a straightforward process for scientists using today's tools. In this project we introduce the SemantEco Annotator, a semantic web application that leverages community-based vocabularies and ontologies during the translation process itself to ease the process of drawing out implicit relationships in tabular data so that they may be immediately available for use within the LOD cloud. Our goal for the SemantEco Annotator is to make advanced RDF translation techniques available to the layperson.
SSIII LogoSemantic Sea Ice Interoperability Initiative (SSIII)
Principal Investigator: Siri Jodha Singh Khalsa, Mark Parsons, and Ruth Duerr
Co Investigator: Peter Fox and Deborah L. McGuinness
Description: SSIII is a National Science Foundation (NSF) funded effort to enhance the interoperability of sea ice data to establish a network of practitioners working to enhance semantic interoperability of all Arctic data. SSIII is a collaborative project between NSIDC and the Rensselaer Polytechnic Institute (RPI) Tetherless World Constellation project. We seek to build on the work initiated under the International Polar Year (IPY) and create a community of practice working to improve interoperability within the Polar Information Commons (PIC), the Sustained Arctic Observing Network (SAON), and broader global systems.
S2S Project LogoSemantically Enabled Facetd Browser (S2S)
Principal Investigator: Peter Fox
Co Investigator: Stephan Zednik
Description: S2S is a user interface framework that leverages the machine-readable semantics of data, services, and user interface components, or "widgets". S2S automates various tasks in UI development for search interfaces.
SPP Project LogoSocial Practices (SPP)
Principal Investigator: Jim Hendler
Description: The overall goal of this project is to explore and establish a better understanding of privacy in this highly-networked world. This page features the tools and workflow needed to accomplish such a task. We argue that while much has been written and discussed about privacy in various domains (e.g., law, psychology, economic behavior, security, etc.), it remains unclear what exactly is the privacy problem? Our aim is to reframe our own understanding of privacy by moving away from these traditional disjointed compartments of knowledge. Moreover, given the complexity, we advocate this research question as an exemplar for the value of combining efforts between human and machine. This project features tools, workflow(s) and best practices we've developed and implemented to accomplish such a task. This is and will be a work in progress. Any comments and or feedback are welcomed. Please email Kristine Gloria at glorim@rpi.edu for more information.
TW Website Project
Description: A semantically-powered Tetherless World Website running in the Drupal CMS. This combines many web standard technologies, including RDF, SPARQL, XSLT, and XHTML.
TWC schema.org Project LogoTWC Schema.org Vocabulary Development (TWC_Schemas)
Principal Investigator: Jim Hendler
Co Investigator: Joshua Shinavier
Description: schema.org provides a collection of schemas — html tags — that webmasters can use to markup their pages in ways recognized by major search providers. Search engines including Bing, Google, Yahoo! and Yandex rely on this markup to improve the display of search results, making it easier for people to find the right web pages. Since early 2012 researchers at TWC RPI have been working with government and research data providers to define vocabularies for expressing the structured data that powers their web sites, using on-page markup based on schema.org vocabularies. In particular, we developed the schema.org/Dataset extension, a concise vocabulary that extends schema.org for describing datasets and data catalogs. Current work includes applying Dataset to scientific datasets and developing new extensions for use by Web Observatories
TW LogoTWC Web Observatory (WebObservatory)
Principal Investigator: Deborah L. McGuinness
Co Investigator: Jim Hendler
Description: The Web Science Research Center at TWC RPI is working with other members of the Web Science Trust to create a global "Web Observatory". The global movement toward Open Data and transparency have successfully motivated the release of very large institutional and commercial data sets describing social phenomena, economic indicators and geographic trends. This proliferation of data represents great opportunity for researchers and industry but this data abundance also threatens to make it ever more difficult to locate, analyse, compare and interpret useful information in a consistent and reliable way; a situation which can only get worse unless we can help stakeholders perform useful analysis rather than drowning in a sea of data. A global Web Observatory will offer an institutional framework to promote the use of W3C and other standards in the development of Semantic Catalogues to globally locate existing data sets, Collection Systems to gather new global data sets, and Analytics Tools and methodologies to analyse these data sets.
HADATAC LogoThe Human-Aware Data Acquisition Framework (HADatAc)
Principal Investigator: Paulo Pinheiro
Co Investigator: Deborah L. McGuinness
Description:
ToolMatch LogoToolMatch (ToolMatch)
Description: or a given dataset, it is difficult to find the tools that can be used to work with the dataset. In many cases, the information that Tool A works with Dataset B is somewhere on the Web, but not in a readily identifiable or discoverable form. In other cases, particularly more generalized tools, the information does not exist at all, until somebody tries to use the tool on a given dataset. Thus, the simplest, most prevalent use case is for a user to search for the tools that can be used with a given dataset. A further refinement would be to specify what the tool can do with the dataset, e.g., read, visualize, map, analyze, reformat.
Wineagent LogoWine Agent
Principal Investigator: Deborah L. McGuinness
Description: The Wine Agent represents knowledge of wines and foods and is a demonstration platform for a large variety of Semantic Web technologies in a rich domain and is derived from previous work in the field of reasoning systems.
People:
Patrick West

Patrick West is a senior software engineer at Entangled Media, Inc. He is currently responsible for the design and development of full-stack web applications that help users manage their profile and subscription for the younity app. The younity app acts as a relay between media content that resi [...]