Data Library by Lisa Raymond (MBLWHOI Library)

From Semantic Portal Wiki

Jump to: navigation, search

Slide - Data Library & Archives

What kind of weight do we give to combined data sets

Collection includes data, logs instruments, maps and much more

Slide - Data

Have compact shelving and climate control

Cruise data -Atlantis, Knorr, Oceanus

Vehicles - Alvin, Jason

Once got 600 DVD from Jason cruise

Slide 3 - Make digital files accessible

To Do

 * Organize the data
 * Migration of legacy formats
 * Data provenance/citation and long term preservation
 * Migration

Slide 4 - What the Library is doing Migration

From floppies, jaz disks, Alvin Reel to reel audio

Migration is done 'on demand' when a research has a question or when there is money for a pilot project

Slide 5 - Making data accessible

Data in a mysql database

Data has a 2 year embargo, everything that comes off the ships

They used audacity for the audio migration http://audacity.sourceforge.net/

Slide 6 - Bright Future

Problems include lack of metadata for material in the data library

Have been getting a warmer reception about data migration

Institutional repository

Relationship with crossref to get DOIs

Working on a project to make marine mammal audio available through the institutional repository

Slide 7 - A mess is "a complex issue that is not well formulated or defined ...."

We are somewhere between a mess and a problem, we are still defining the issues

Data library started in 1930 so we have the possibility to do long term pedagogical studies

Migration is the problem

Roger Goldsmith wanted to connect information in the data library, but he has left

Discussion:

Andrew M. : We really need references about who can help WHOI with video translation

Tom Moritz: Getty is doing digital conversion, might be worth contacting them

Andrew M. Getty Research institute has been doing digital conversion and has it setup and are using it

Lisa Raymond: we have equipment to read almost all of the data

Cyndy Chandler: isn't the limiting factor money

Lisa Raymond: yes, they are looking at stimulus money

Risk factor for converting video? Sometimes you can only play them once

Vicki Ferrini: Contacted a company about digitizing video

Guy at Getty doing video transfers has link to conservation as well

Lisa Raymond:Problem with DVDs too Jason alone has 12,000 DVDs

Peter Fox: What I have heard a lot from librarians is that they get handed a mess. get old data without metadata Librarians have gone out on the ship go to the instrument makers and 'get ahead of the mess'

Deborah M. so.... maybe that means our new mantra is "get out in front of the mess"

Cyndy Chandler: WHOI has actively brought the data library into the data collection (R to R ?)

Peter Fox: Data library scientists curate the data/metadata as it is being created and MLIS programs are beginning to teach

Andrew M. Ilinois, Cornell, and Purdue are beginning to look at how to get "out in front". Librarians at Purdue visit scientists in their offices

Cathy Norton: anyone hired in the WHOI data library gets out to sea to see how the data is collected

Vicki Ferrini: The NSF is forcing people to publish data and so they need the metadata

Peter Fox: but not the metadata, the context for metadata is extremely important. MD for data reuse is different than MD for preservation Newer projects are being recommended to assign someone to assign someone to do MD But getting push back due to expense etc

Semantic Web Community
Tetherless World constellation
maintenance