Data Library by Lisa Raymond (MBLWHOI Library)
From Semantic Portal Wiki
Slide - Data Library & Archives
What kind of weight do we give to combined data sets
Collection includes data, logs instruments, maps and much more
Slide - Data
Have compact shelving and climate control
Cruise data -Atlantis, Knorr, Oceanus
Vehicles - Alvin, Jason
Once got 600 DVD from Jason cruise
Slide 3 - Make digital files accessible
To Do
* Organize the data * Migration of legacy formats * Data provenance/citation and long term preservation * Migration
Slide 4 - What the Library is doing Migration
From floppies, jaz disks, Alvin Reel to reel audio
Migration is done 'on demand' when a research has a question or when there is money for a pilot project
Slide 5 - Making data accessible
Data in a mysql database
Data has a 2 year embargo, everything that comes off the ships
They used audacity for the audio migration http://audacity.sourceforge.net/
Slide 6 - Bright Future
Problems include lack of metadata for material in the data library
Have been getting a warmer reception about data migration
Institutional repository
Relationship with crossref to get DOIs
Working on a project to make marine mammal audio available through the institutional repository
Slide 7 - A mess is "a complex issue that is not well formulated or defined ...."
We are somewhere between a mess and a problem, we are still defining the issues
Data library started in 1930 so we have the possibility to do long term pedagogical studies
Migration is the problem
Roger Goldsmith wanted to connect information in the data library, but he has left
Discussion:
Andrew M. : We really need references about who can help WHOI with video translation
Tom Moritz: Getty is doing digital conversion, might be worth contacting them
Andrew M. Getty Research institute has been doing digital conversion and has it setup and are using it
Lisa Raymond: we have equipment to read almost all of the data
Cyndy Chandler: isn't the limiting factor money
Lisa Raymond: yes, they are looking at stimulus money
Risk factor for converting video? Sometimes you can only play them once
Vicki Ferrini: Contacted a company about digitizing video
Guy at Getty doing video transfers has link to conservation as well
Lisa Raymond:Problem with DVDs too Jason alone has 12,000 DVDs
Peter Fox: What I have heard a lot from librarians is that they get handed a mess. get old data without metadata Librarians have gone out on the ship go to the instrument makers and 'get ahead of the mess'
Deborah M. so.... maybe that means our new mantra is "get out in front of the mess"
Cyndy Chandler: WHOI has actively brought the data library into the data collection (R to R ?)
Peter Fox: Data library scientists curate the data/metadata as it is being created and MLIS programs are beginning to teach
Andrew M. Ilinois, Cornell, and Purdue are beginning to look at how to get "out in front". Librarians at Purdue visit scientists in their offices
Cathy Norton: anyone hired in the WHOI data library gets out to sea to see how the data is collected
Vicki Ferrini: The NSF is forcing people to publish data and so they need the metadata
Peter Fox: but not the metadata, the context for metadata is extremely important. MD for data reuse is different than MD for preservation Newer projects are being recommended to assign someone to assign someone to do MD But getting push back due to expense etc

