Semantic eScience meeting 2013-11-01

Printer-friendly version

General Meeting Information

[Semantic eScience meeting 2013-11-01 This Pad]
[Semantic eScience meeting 2013-10-18 Previous Meeting]
Call-in information
Goto Meeting
Please join my meeting. https://global.gotomeeting.com/join/776009689
Use your microphone and speakers (VoIP) - a headset is recommended. Or, call in using your telephone.
Dial +1 (805) 309-0012
Access Code: 776-009-689
Audio PIN: Shown after joining the meeting
Meeting ID: 776-009-689
Meeting Page

Agenda

Fiona Murphy's presentation
Han Wang's presentation - rief Introduction to DCO-DS
Marshall X Ma's presentation: http://orion.tw.rpi.edu/~xgmatwc/eScience20131101/20131101-eScience_XMa.pptx

Attendance

Peter Fox
Patrick West
Marshall X Ma
Fiona
Han
Linyun Fu
Chengcong
Jin
Katie Dunn
massimo di stefano (remote)

Past Action Items

Action Items

Presentations

Keep this list from week to week so we know who's presented and who will present. Please sign up if you have a good topic to share with others.
Sept. 20, 2013 - Linyun Fu - ORCID
Oct. 4, 2013 - Jin - From Ontology Matching To Information Extraction From Big Data (slides by Isabel F. Cruz from University of Illinois at Chicago)
Oct. 18, 2013 - Katie Dunn - A review of "Building semantic bridges between museums, libraries and archives: The CIDOC Conceptual Reference Model" by Tony Gill (http://firstmonday.org/ojs/index.php/fm/article/view/1145/1065)
Oct 18, 2013 - Jun Xu - Endnote XML to RDF
Nov. 1, 2013 - Marshall X Ma - Connected China (http://connectedchina.reuters.com)
Nov. 1, 2013 - Han Wang - Introduction to DCO
Nov. 15, 2013 - Mengyu - topic TBD
Nov. 29, 2013 - Massimo Di Stefano - topic TBD
Dec. 13, 2013

Notes

Fiona's presentation: Publishing Research Data: Issues and Actions
PREPARDE: http://proj.badc.rl.ac.uk/preparde
Publishing data
not same as publishing research
permanently
Persistent id
Metadata
Workflow
varied
Peer review
Permanent id: to access the data
Landing page w/ metadata
Checklist for reviewers
data description doc
metadata
the dataset
Repository accreditation
access to the dataset
dataset persistence
dataset stability
People are interested in wether the repository is safe, is a good/trusted source of data, archives their data, authorship is good, repository support is good
Right now there isn't any organization that would be able to do this, to authoritatively accredit a repository. There's no plan, no schedule, etc...
Cross-linking
between Geoscience Data Journal (GDJ) and a data repository (BACD/NCAR)
A registry in the middle to certify data centers are "trustworthy"

Question/Comment: Peer reviewed datasets ... provenance could/should play a significant role in this, being able to trace back to the authors, publishers, algorithms, etc...

Question: Where do we discover any citations of a given dataset? Seems that many times publications, presentations are written, citing the dataset, without the dataset publisher, curator, author being notified?

Question: Data correction. The paper was written and published given a certain dataset. If the dataset is "corrected" then it could certainly effect the results that are published in the paper. So is the data citation more of a comment to that?

AGU session for winter meeting 2013

Han presents: Brief introduction to DCO-Data Science

DCO is "We are dedicated to achieving transformational understanding of carbon's chemcial and biological roles in Earth."
Helping scientists get the most out of heterogeneous data sources so they can achieve their research goals.
Aiding discovery by integrating data sets with other associated products.
DCO Data Science platform diagram
Object registration & Deposit workflow
Data may be hosted in DCO, or externally.
DCO-ID
DCO Objects
Person (and subclasses)
Organizations (and subclasses)
Awards
Publications
etc.
Demo
creating a new Fieldwork object
DCO-ID is assigned
Can associate other existing objects (people, funding sources, etc.) with the Fieldwork object.
Research Activities report for a certain team - populated by queries against VIVO triplestore.
Index page for drupal platform - udco.tw.rpi.edu/vivo/browse
dealing with differences in two metadata schemes - mapping.
Data and data integration are both important.
Also need to integrate non-data: information and knowledge.

Q (Fiona): people have DCO-IDs, but they could also have ORCIDs.
A: Marshall: DCOID is like doi, but not just for papers - for anything. but DCOID and DOI both are based on Handle.
A: Patrick: DCO supports multiple identifiers - fields for OrCID id, Scopus id, etc.

Marshall's presentation
http://connectedchina.reuters.com/
One of the main authors: http://benfry.com/
Cool visualizations on this portal with tons of data.