Semantic eScience meeting 2013-11-01


General Meeting Information

[Semantic eScience meeting 2013-11-01 This Pad]
[Semantic eScience meeting 2013-10-18 Previous Meeting]
Call-in information
Goto Meeting
Please join my meeting.
Use your microphone and speakers (VoIP) - a headset is recommended. Or, call in using your telephone.
Dial +1 (805) 309-0012
Access Code: 776-009-689
Audio PIN: Shown after joining the meeting
Meeting ID: 776-009-689
Meeting Page


Fiona Murphy's presentation
Han Wang's presentation - Brief Introduction to DCO-DS
Marshall X Ma's presentation:


Peter Fox
Patrick West
Marshall X Ma
Linyun Fu
Katie Dunn
Massimo Di Stefano (remote)

Past Action Items

Action Items


Keep this list from week to week so we know who's presented and who will present. Please sign up if you have a good topic to share with others.
Sept. 20, 2013 - Linyun Fu - ORCID
Oct. 4, 2013 - Jin - From Ontology Matching To Information Extraction From Big Data (slides by Isabel F. Cruz from University of Illinois at Chicago)
Oct. 18, 2013 - Katie Dunn - A review of "Building semantic bridges between museums, libraries and archives: The CIDOC Conceptual Reference Model" by Tony Gill
Oct. 18, 2013 - Jun Xu - Endnote XML to RDF
Nov. 1, 2013 - Marshall X Ma - Connected China
Nov. 1, 2013 - Han Wang - Introduction to DCO
Nov. 15, 2013 - Mengyu - topic TBD
Nov. 29, 2013 - Massimo Di Stefano - topic TBD
Dec. 13, 2013


Fiona's presentation: Publishing Research Data: Issues and Actions
Publishing data
not same as publishing research
Persistent id
Peer review
Permanent id: to access the data
Landing page w/ metadata
Checklist for reviewers
data description doc
the dataset
Repository accreditation
access to the dataset
dataset persistence
dataset stability
People are interested in whether the repository is safe, is a good/trusted source of data, archives their data, and has good authorship and repository support.
Right now there isn't any organization that could authoritatively accredit a repository. There's no plan, no schedule, etc.
between Geoscience Data Journal (GDJ) and a data repository (BADC/NCAR)
A registry in the middle to certify data centers are "trustworthy"

Question/Comment: Peer reviewed datasets ... provenance could/should play a significant role in this, being able to trace back to the authors, publishers, algorithms, etc...

Question: Where do we discover citations of a given dataset? It seems that publications and presentations are often written citing the dataset without the dataset's publisher, curator, or author being notified.

Question: Data correction. The paper was written and published given a certain dataset. If the dataset is "corrected" then it could certainly affect the results that are published in the paper. So is the data citation more of a comment to that?

AGU session for winter meeting 2013

Han presents: Brief introduction to DCO-Data Science

DCO mission: "We are dedicated to achieving transformational understanding of carbon's chemical and biological roles in Earth."
Helping scientists get the most out of heterogeneous data sources so they can achieve their research goals.
Aiding discovery by integrating data sets with other associated products.
DCO Data Science platform diagram
Object registration & Deposit workflow
Data may be hosted in DCO, or externally.
DCO Objects
Person (and subclasses)
Organizations (and subclasses)
creating a new Fieldwork object
DCO-ID is assigned
Can associate other existing objects (people, funding sources, etc.) with the Fieldwork object.
Research Activities report for a certain team - populated by queries against VIVO triplestore.
Index page for Drupal platform
Dealing with differences in two metadata schemes - mapping.
Data and data integration are both important.
Also need to integrate non-data: information and knowledge.

Q (Fiona): people have DCO-IDs, but they could also have ORCIDs.
A: Marshall: A DCO-ID is like a DOI, but not just for papers; it can be assigned to anything. Both DCO-ID and DOI are based on the Handle System.
A: Patrick: DCO supports multiple identifiers - fields for ORCID iD, Scopus ID, etc.
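
Note on the Handle point above: a rough Python sketch of what "both based on Handle" means in practice. A handle is a prefix and a suffix separated by a slash, and DOIs are handles whose prefix starts with "10.". The two proxy URLs are the public Handle and DOI resolvers; the function names and the sample identifiers are made up for illustration and are not real DCO-IDs or DOIs, and this is not how the DCO platform itself is implemented.

```python
from typing import Tuple
from urllib.parse import quote

# Public proxy resolvers for the Handle System and for DOIs.
HANDLE_PROXY = "https://hdl.handle.net/"
DOI_PROXY = "https://doi.org/"


def split_handle(handle: str) -> Tuple[str, str]:
    """Split a handle into (prefix, suffix) at the first slash."""
    prefix, sep, suffix = handle.partition("/")
    if not sep or not prefix or not suffix:
        raise ValueError(f"not a valid handle: {handle!r}")
    return prefix, suffix


def resolver_url(handle: str) -> str:
    """Build a proxy URL; handles with a '10.' prefix are treated as DOIs."""
    prefix, suffix = split_handle(handle)
    proxy = DOI_PROXY if prefix.startswith("10.") else HANDLE_PROXY
    # Percent-encode unusual characters in the suffix, keeping slashes.
    return proxy + prefix + "/" + quote(suffix, safe="/")


# Hypothetical identifiers, for illustration only.
print(resolver_url("10.1234/example-dataset"))
print(resolver_url("99999/example-object"))
```

The same prefix/suffix form is why one object can carry a DCO-ID alongside other identifiers such as an ORCID iD without conflict: each identifier lives in its own namespace.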

Marshall's presentation
One of the main authors:
Cool visualizations on this portal with tons of data.