TWC Schema.org Vocabulary Development

Printer-friendly version

Research Areas: Web Science, Future Web, Data Science
Principal Investigator: Jim Hendler
Co Investigator: Joshua Shinavier
Concepts: Linked Data, Data Science, Web Science, Cyberinfrastructure, Data Management, Controlled Vocabulary, Semantic Web, Web Observatory
Description:

schema.org provides a collection of schemas — html tags — that webmasters can use to markup their pages in ways recognized by major search providers. Search engines including Bing, Google, Yahoo! and Yandex rely on this markup to improve the display of search results, making it easier for people to find the right web pages.

Since early 2012 researchers at TWC RPI have been working with government and research data providers to define vocabularies for expressing the structured data that powers their web sites, using on-page markup based on schema.org vocabularies. In particular, we developed the schema.org/Dataset extension, a concise vocabulary that extends schema.org for describing datasets and data catalogs. Current work includes applying Dataset to scientific datasets and developing new extensions for use by Web Observatories