eScience Meeting February 20, 2013

Printer-friendly version

Welcome to TitanPad!

General Meeting Information

Agenda

  • Presentation: Jin Guang Zheng, February 20, 2013 - Semantic Similarity based Entity Mapping: GCIS (Global Change Information System) Case: GCMD (Global Change Master Directory) to CLEAN (Climate Literacy and Energy Awareness Network pathway) Mapping
    • Goals are data integration and better search
    • Task is to recognize the same individual described differently at different places
    • Approach
      • triple-wise similarity
      • weight learning - weight learning as binary classification
      • speed up
        • reduce pair-wise computation
        • reduce triple-wise computation
    • Experiments
      • use OAEI 2012 Ontology Matching Conference Track Datasets
        • precision, recall, f-measure are all the best, with acceptable speed
      • use OAEI Instance Matching Conference Track Datasets
        • precision, recall, f-measure are all the best among competing systems
    • GCMD-CLEAN mapping status
      • work on skos:broaderMatch in addition to skos:exactMatch
      • using a voting method
  • Quick catch up
    • Everyone gives a brief for 2013 Spring
      • Linyun:
        • vocabulary service with ocean.data.gov and energy.data.gov projects
        • now have a demo (on my laptop, working on deploying it on aquarius)
        • research: data portal generation, need to read papers obtained from AGU authors, and prepare the literature review
        • thinking Submission to gradute symposium of SoS
      • Yu:
        • two courses this semester
        • co-authored a paper with Josh: semantic sensor network
        • ISWC2013: may further work with Josh
        • project: DCO-DS, website for registering and depositing research data, to do: semantic search
      • Eric
        • S2S Documentation
          • http://tw.rpi.edu/web/project/sesf/workinggroups/s2s
          • http://tw.rpi.edu/web/project/sesf/workinggroups/s2s/tutorials
      • Jin
        • Focus on proposal: perhaps next month
        • research: ontology mapping and schema mapping, Entity ranking, entity annotation
        • project: GCIS-IMSAP, entity mapping GCMD - CLEAN Vocabulary mapping
      • Marshall
        • research: DCO-DS, GCIS-IMSAP
        • teach: GIS in the Sciences
        • Publication: a few journal papers
      • Han:
        • two courses this semester
        • DQE literature review
        • project: DCO-DS, now preparing poster(s) for the March meeting at DC. also brain storm some research ideas with this project
        • Deborah suggests a brain storm meeting together
        • Semantic Vernacular System project slowly going on
      • Patrick:
        • DCO Portal work, performance issues
        • DCO vserver on aquarius, port 5000 is CKAN
        • ACTION: Get rid of dco1 vserver
        • VSTO portal problems with CEDAR database access
          • RDESC might allow us to not have to run scripts
        • dataservices (data.rpi.edu) authentication with shibeloth. Currently a problem even seeing the site (tomcat issue?)
        • RPI logo issues on our site (top priority)
        • TW web page issues, instance creation, document visualization
        • Would love to work with S2S for document search
      • Massimo:
        • outcome from Workshop on Data Visualization (NOAA)
          • good feedback from the audience, they showed interest in using the software developed for the ecoop
          • proposal to make an "ipython workshop" (NOAA will look for resources)
          • Invitation to Present at UNH CCOM/JHC (refernce point : Larry Mayer) - 12 April
        • set-up linux server (for each ECO-OP large group users) as test bed
          • multiuser geodatabase to allow geoprocessing trough the web
          • developed code to access timeseries of data from necdf / grib dataset using the ipython parallel computing features
        • Area of improvement :
          • metadata development for the notebook file (json) in order to allow ingestion of the notebook in triple store (visit to RPI to discuss this topic is needed)
        • Project ready for the next "Evaluation" step
  • Submissions to ISWC 2013
    • full paper due: May 01, 2013
  • eScience meeting presentation schdule for 2013
    • Linyun next
    • then begin the next round

Attendance

  • Deborah
  • Marshall
  • Han
  • Jin
  • Yu
  • Eric
  • Linyun
  • Massimo (remote)
  • Patrick (remote)

Past Action Items

Action Items

Presentations

  • Keep this list from week to week so we know who's presented and who will present
  • Stephan September 21, 2012 - Ontology documentation discussion
  • Yu Chen, October 5, 2012 - Continuous Flow Forcast in Southesk River: Where we are and how we proceed, semantically
  • Marshall X Ma, October 19, 2012 - Exploratory visualization of earth science data in a semantic web context
  • Eric Rozell, November 2, 2012 - Resource Discovery for Extreme Scale Collaboration
  • Massimo Di Stefano, November 16, 2012 - IPython Notebook (applied to the ECOOP use case)
  • Han Wang, November 30, 2012 - http://www.odata.org/ - Open Data Protocol: the current state, libraries, functionality
  • Jin Guang Zheng, February 20, 2013 - Semantic Similarity based Entity Mapping: GCIS Case: GCMD-CLEAN Mapping
  • Linyun Fu, March 6, 2013 - CMSPV and ELDA

Notes