Peter Murray-Rust's Pecha Kucha presentation "Repositories for Scientific Data: An #animalgarden show" was delivered on Friday 2nd August 2013 at Repository Fringe 2013.
Repositories for Scientific Data: An #animalgarden show (Pecha Kucha) - Peter Murray-Rust
1. REPOSITORIES FOR SCIENTIFIC DATA: An #animalgarden show
Peter Murray-Rust, OKFN and University of Cambridge
Chuff OWL
Moomin
AMI
Gulliver
Sleepless
cleanTux
UncleSam
6. Australia has a national data service (ANDS)
We could use their TARDIS*
Let’s ask the crystallographers. They save their data.
7. I want to publish this paper
You MUST send ALL the data. The IUCr will check if it’s correct
8. It takes years to create vocabularies
Core dictionary (coreCIF) version 2.4.3
ID: _diffrn_ambient_temperature
For humans: Definition: The mean temperature in kelvins at which the intensities were measured.
For machines (constraint + type): Range: 0.0 -> infinity Type: numb
We need domain vocabularies through inter/national efforts
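The "for machines" half of a dictionary entry can be sketched in Python as follows. This is a minimal illustration, not IUCr software: the DictionaryItem class and its validate method are hypothetical names, and only the item name, definition, range and type are taken from the slide. It also assumes a plain number and does not handle CIF values written with a standard uncertainty in parentheses.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class DictionaryItem:
    """Hypothetical in-memory form of one dictionary entry."""
    name: str          # the ID
    definition: str    # for humans
    value_type: str    # for machines: type
    minimum: float     # for machines: lower bound of the range
    maximum: float     # for machines: upper bound of the range

    def validate(self, raw: str) -> float:
        """Parse raw text as a number and check it against the range."""
        value = float(raw)  # raises ValueError if raw is not numeric
        if math.isnan(value) or not (self.minimum <= value <= self.maximum):
            raise ValueError(f"{self.name}: {raw!r} is outside "
                             f"{self.minimum} -> {self.maximum}")
        return value


# Values copied from the coreCIF entry shown on the slide.
AMBIENT_TEMPERATURE = DictionaryItem(
    name="_diffrn_ambient_temperature",
    definition=("The mean temperature in kelvins at which the "
                "intensities were measured."),
    value_type="numb",
    minimum=0.0,
    maximum=math.inf,
)
```

Calling AMBIENT_TEMPERATURE.validate("293.0") returns the number, while a negative temperature or non-numeric text raises ValueError. A check like this is only possible because the vocabulary states the constraint and type explicitly.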
9. PMR group also built a crystal structure repo (Crystaleye)
It’s got 200,000 entries
But none from Elsevier, Wiley, Springer
10. And NONE of the results are archived
Computational materials scientists cost 1,000 million USD / year
PMR wrote software to turn FORTRAN into XML
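As a rough sketch of the idea, assuming "FORTRAN" here means the labelled text output that legacy FORTRAN codes print, the conversion could look like this in Python. It is not PMR's software: the function name, the "calculation" and "property" element names, and the sample output are all invented for illustration.

```python
import xml.etree.ElementTree as ET


def text_to_xml(output: str) -> ET.Element:
    """Turn 'LABEL = value [units]' lines into XML property elements."""
    root = ET.Element("calculation")  # hypothetical element name
    for line in output.splitlines():
        label, sep, rest = line.partition("=")
        fields = rest.split()
        if not sep or not fields:
            continue  # not a 'LABEL = value' line, so skip it
        prop = ET.SubElement(root, "property", name=label.strip())
        if len(fields) > 1:
            prop.set("units", fields[1])  # second field taken as units
        prop.text = fields[0]
    return root


# Invented sample output, not taken from any real program.
SAMPLE = """\
 TOTAL ENERGY =      -76.0267 HARTREE
 SCF CYCLES   =            12
 calculation finished
"""

xml_text = ET.tostring(text_to_xml(SAMPLE), encoding="unicode")
```

Once results are in XML, each value carries its own label and units, so they can be archived and searched instead of staying in log files.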
11. PMR and others have started a global effort to create vocabularies
It’s hard and slow work
PMR group built a compchem repository (Chempound): XML, RDF, NoSQL, SPARQL
19. Chuff
WE NEED DOMAIN REPOSITORIES FOR SCIENCE