Digital archiving 3.0

Data Archiving and Networked Services

Digital Archiving 3.0

“My data open on the Web, ok but how ?”

Christophe Guéret (@cgueret)

Open Data on the Web, 23 - 24 April 2013

DANS is een instituut van KNAW en NWO

A bit of context

http://cedar-project.nl

http://easy.dans.knaw.nl

Put your data open on the Web!

“E-Data & Research”, October 2011

“Sharing knowledge: EC-funded projects on scientific information in the digital age”

Where is your research data ?
Just get it from the web site
of the research project

I think I have have it somewhere
on a stick, let me check...

It is available as an RDF/XML
dump on my test server

All bad answers, really.
●
We need research data to be
– Accessible/readable/usable by anyone
– Available in many (>1) years from now
– With traceable provenance and usages

●
Dumping the data on a web site
somewhere is not enough

Solution: use a repository

“Sharing knowledge: EC-funded projects on scientific information in the digital age”

●
Data repositories will take over serving
the data and have a page for it!
●
Repository hold two type of data
– The data stored
– The meta-data about this data

Which format for meta-data ?
●
LOD is a perfect fit for describing data
– Use to refer to and link data items
– Facilitates discovery, easy to crawl/index
– One description per data item stored
– Redirects to actual location of the data

●
Remaining question: how much meta-data
is needed?

Which format for the data?
●
Many formats around : PDF, SDF, DSPL,
XLS, RDF, CSV, SHP, JSON-LD, ...
●
Translation will imply some extra work for
the data owner and not please everyone

Which format for the data?
●
Many formats around : PDF, SDF, DSPL,
XLS, RDF, CSV, SHP, JSON-LD, ...
●
Translation will imply some extra work for
the data owner and not please everyone
Express your data as Buy a DN, decide on a Select vocabularies to
described resources URI scheme for your data describe your resources

Just get the
●

data in the
Solution: use a repository
repository

●
Repositories
will take care
●
Data repositories will take over everything
of serving
your data

●
PS: forget
about HTTP
URIs for data

Format evolution
●
Use Content-negotiation to translate and
serve different data formats
●
Ensure everyone gets the format he wants

Format evolution
●
Use Content-negotiation to translate and
serve different data formats
●
Ensure everyone gets the format he wants

?
?

Next generation archives
●
Provide long term access to data in
several formats
●
Publish Linked Open Meta-Data about the
data stored (DCAT, ...)
●
Facilitate moving data around archives
(LDP, ...)

Digital archiving 3.0

Recommandé

Recommandé

Contenu connexe

Tendances

Tendances (20)

En vedette

En vedette (13)

Similaire à Digital archiving 3.0

Similaire à Digital archiving 3.0 (20)

Plus de Christophe Guéret

Plus de Christophe Guéret (20)

Dernier

Dernier (20)

Digital archiving 3.0