SlideShare une entreprise Scribd logo
1  sur  13
Télécharger pour lire hors ligne
Data Archiving and Networked Services




                     Digital Archiving 3.0

   “My data open on the Web, ok but how ?”


           Christophe Guéret (@cgueret)

      Open Data on the Web, 23 - 24 April 2013



DANS is een instituut van KNAW en NWO
A bit of context


http://cedar-project.nl




                          http://easy.dans.knaw.nl
Put your data open on the Web!



             “E-Data & Research”, October 2011




 “Sharing knowledge: EC-funded projects on scientific information in the digital age”
Where is your research data ?
  Just get it from the web site
    of the research project


                                      I think I have have it somewhere
                                           on a stick, let me check...



           It is available as an RDF/XML
                dump on my test server
All bad answers, really.
●
    We need research data to be
    –   Accessible/readable/usable by anyone
    –   Available in many (>1) years from now
    –   With traceable provenance and usages

●
    Dumping the data on a web site
    somewhere is not enough
Solution: use a repository


        “Sharing knowledge: EC-funded projects on scientific information in the digital age”


●
    Data repositories will take over serving
    the data and have a page for it!
●
    Repository hold two type of data
    –    The data stored
    –    The meta-data about this data
Which format for meta-data ?
●
    LOD is a perfect fit for describing data
    –   Use to refer to and link data items
    –   Facilitates discovery, easy to crawl/index
    –   One description per data item stored
    –   Redirects to actual location of the data

●
    Remaining question: how much meta-data
    is needed?
Which format for the data?
●
    Many formats around : PDF, SDF, DSPL,
    XLS, RDF, CSV, SHP, JSON-LD, ...
●
    Translation will imply some extra work for
    the data owner and not please everyone
Which format for the data?
 ●
     Many formats around : PDF, SDF, DSPL,
     XLS, RDF, CSV, SHP, JSON-LD, ...
 ●
     Translation will imply some extra work for
     the data owner and not please everyone
Express your data as    Buy a DN, decide on a      Select vocabularies to
described resources    URI scheme for your data   describe your resources
Just get the
                                ●


                               data in the
Solution: use a         repository
                               repository


                                ●
                                  Repositories
                                  will take care
●
    Data repositories will take over everything
                                  of serving
    your data

                                ●
                                    PS: forget
                                    about HTTP
                                    URIs for data
Format evolution
●
    Use Content-negotiation to translate and
    serve different data formats
●
    Ensure everyone gets the format he wants
Format evolution
●
    Use Content-negotiation to translate and
    serve different data formats
●
    Ensure everyone gets the format he wants




                  ?
                             ?
Next generation archives
●
    Provide long term access to data in
    several formats
●
    Publish Linked Open Meta-Data about the
    data stored (DCAT, ...)
●
    Facilitate moving data around archives
    (LDP, ...)

Contenu connexe

Tendances

Coherance in dissemination- Msis 2007
Coherance in dissemination- Msis 2007Coherance in dissemination- Msis 2007
Coherance in dissemination- Msis 2007
annegrete
 
Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014
Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014
Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014
KDZ - Zentrum für Verwaltungsforschung
 

Tendances (20)

Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...
Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...
Big Data e tecnologie semantiche - Utilizzare i Linked data come driver d'int...
 
TAO: Facebook's Distributed Data Store for the Social Graph
TAO: Facebook's Distributed Data Store for the Social GraphTAO: Facebook's Distributed Data Store for the Social Graph
TAO: Facebook's Distributed Data Store for the Social Graph
 
Python for data science
Python for data sciencePython for data science
Python for data science
 
Think like a Digital Curator
Think like a Digital CuratorThink like a Digital Curator
Think like a Digital Curator
 
Wikidata & dbpedia
Wikidata & dbpediaWikidata & dbpedia
Wikidata & dbpedia
 
Linked Open Data and American Art
Linked Open Data and American ArtLinked Open Data and American Art
Linked Open Data and American Art
 
Introduction to the FP7 CODE project @ BDBC
Introduction to the FP7 CODE project @ BDBCIntroduction to the FP7 CODE project @ BDBC
Introduction to the FP7 CODE project @ BDBC
 
Collaborative Creation of a Wikidata handbook
Collaborative Creation of a Wikidata handbookCollaborative Creation of a Wikidata handbook
Collaborative Creation of a Wikidata handbook
 
Smart Data Applications powered by the Wikidata Knowledge Graph
Smart Data Applications powered by the Wikidata Knowledge GraphSmart Data Applications powered by the Wikidata Knowledge Graph
Smart Data Applications powered by the Wikidata Knowledge Graph
 
Mantas Zimnickas - How Open is Lithuanian Government data? atviriduomenys.lt
Mantas Zimnickas - How Open is Lithuanian Government data? atviriduomenys.lt Mantas Zimnickas - How Open is Lithuanian Government data? atviriduomenys.lt
Mantas Zimnickas - How Open is Lithuanian Government data? atviriduomenys.lt
 
Thinking Outside the Table
Thinking Outside the TableThinking Outside the Table
Thinking Outside the Table
 
The AGINFRA+ Virtual Research Environment (VRE)
The AGINFRA+ Virtual Research Environment (VRE)The AGINFRA+ Virtual Research Environment (VRE)
The AGINFRA+ Virtual Research Environment (VRE)
 
Open content opens up new avenues of research
Open content opens up new avenues of researchOpen content opens up new avenues of research
Open content opens up new avenues of research
 
Bar camp bigdata
Bar camp bigdataBar camp bigdata
Bar camp bigdata
 
Coherance in dissemination- Msis 2007
Coherance in dissemination- Msis 2007Coherance in dissemination- Msis 2007
Coherance in dissemination- Msis 2007
 
Introducing SURF
Introducing SURF Introducing SURF
Introducing SURF
 
It summit2015
It summit2015It summit2015
It summit2015
 
Stardog Linked Data Catalog
Stardog Linked Data CatalogStardog Linked Data Catalog
Stardog Linked Data Catalog
 
Dataverse in the European Open Science Cloud
Dataverse in the European Open Science CloudDataverse in the European Open Science Cloud
Dataverse in the European Open Science Cloud
 
Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014
Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014
Enterprise linked data - open or closed, Andreas Blumauer, Keynote SMWCon 2014
 

En vedette

Google Solution Validation Process Certificate
Google Solution Validation Process CertificateGoogle Solution Validation Process Certificate
Google Solution Validation Process Certificate
Ehab Al Badry
 
Data Journalism
Data JournalismData Journalism
Data Journalism
pilhofer
 
Solution Validation & Assessments - A practical Approach
Solution Validation & Assessments - A practical ApproachSolution Validation & Assessments - A practical Approach
Solution Validation & Assessments - A practical Approach
Julen Mohanty
 

En vedette (13)

Solution validation best practices
Solution validation best practicesSolution validation best practices
Solution validation best practices
 
Novartis and ValiMation Present a SharePoint Solution for Risk Based Cleaning...
Novartis and ValiMation Present a SharePoint Solution for Risk Based Cleaning...Novartis and ValiMation Present a SharePoint Solution for Risk Based Cleaning...
Novartis and ValiMation Present a SharePoint Solution for Risk Based Cleaning...
 
Google Solution Validation Process Certificate
Google Solution Validation Process CertificateGoogle Solution Validation Process Certificate
Google Solution Validation Process Certificate
 
Digital Library Home Access: User Validation, E- Resources Proxying and Feder...
Digital Library Home Access: User Validation, E- Resources Proxying and Feder...Digital Library Home Access: User Validation, E- Resources Proxying and Feder...
Digital Library Home Access: User Validation, E- Resources Proxying and Feder...
 
Verification and Validation of Findings
Verification and Validation of FindingsVerification and Validation of Findings
Verification and Validation of Findings
 
TestNG Data Binding
TestNG Data BindingTestNG Data Binding
TestNG Data Binding
 
Data Journalism
Data JournalismData Journalism
Data Journalism
 
Data validation in the Digital Age
Data validation in the Digital AgeData validation in the Digital Age
Data validation in the Digital Age
 
Solution Validation & Assessments - A practical Approach
Solution Validation & Assessments - A practical ApproachSolution Validation & Assessments - A practical Approach
Solution Validation & Assessments - A practical Approach
 
Best Practice Solution Validation - Lean Startup Machine - Naples 2015
Best Practice Solution Validation - Lean Startup Machine - Naples 2015Best Practice Solution Validation - Lean Startup Machine - Naples 2015
Best Practice Solution Validation - Lean Startup Machine - Naples 2015
 
Calibration and validation of analytical instruments
Calibration and validation of analytical instrumentsCalibration and validation of analytical instruments
Calibration and validation of analytical instruments
 
Developing a Roadmap for Digital Transformation
Developing a Roadmap for Digital TransformationDeveloping a Roadmap for Digital Transformation
Developing a Roadmap for Digital Transformation
 
Digital Transformation: What it is and how to get there
Digital Transformation: What it is and how to get thereDigital Transformation: What it is and how to get there
Digital Transformation: What it is and how to get there
 

Similaire à Digital archiving 3.0

Wed batsakis tut_challenges of preservations
Wed batsakis tut_challenges of preservationsWed batsakis tut_challenges of preservations
Wed batsakis tut_challenges of preservations
eswcsummerschool
 
Wed batsakis tut_chalasdlenges of preservations
Wed batsakis tut_chalasdlenges of preservationsWed batsakis tut_chalasdlenges of preservations
Wed batsakis tut_chalasdlenges of preservations
eswcsummerschool
 
Intro to-technologies-Green-City-Hackathon-Athens
Intro to-technologies-Green-City-Hackathon-AthensIntro to-technologies-Green-City-Hackathon-Athens
Intro to-technologies-Green-City-Hackathon-Athens
Stoitsis Giannis
 

Similaire à Digital archiving 3.0 (20)

Illuminating DSpace's Linked Data Support
Illuminating DSpace's Linked Data SupportIlluminating DSpace's Linked Data Support
Illuminating DSpace's Linked Data Support
 
The Web of Data: The W3C Semantic Web Initiative
The Web of Data: The W3C Semantic Web InitiativeThe Web of Data: The W3C Semantic Web Initiative
The Web of Data: The W3C Semantic Web Initiative
 
Linked Data
Linked DataLinked Data
Linked Data
 
Publishing and Using Linked Open Data - Day 4
Publishing and Using Linked Open Data - Day 4Publishing and Using Linked Open Data - Day 4
Publishing and Using Linked Open Data - Day 4
 
Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...
 
DataverseNL as structured data hub
DataverseNL as structured data hubDataverseNL as structured data hub
DataverseNL as structured data hub
 
Wed batsakis tut_challenges of preservations
Wed batsakis tut_challenges of preservationsWed batsakis tut_challenges of preservations
Wed batsakis tut_challenges of preservations
 
Wed batsakis tut_chalasdlenges of preservations
Wed batsakis tut_chalasdlenges of preservationsWed batsakis tut_chalasdlenges of preservations
Wed batsakis tut_chalasdlenges of preservations
 
2007 iPres Beijing - MIXED: Preservation by migration to XML
2007 iPres Beijing - MIXED: Preservation by migration to XML2007 iPres Beijing - MIXED: Preservation by migration to XML
2007 iPres Beijing - MIXED: Preservation by migration to XML
 
SWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebSWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic Web
 
Intro to-technologies-Green-City-Hackathon-Athens
Intro to-technologies-Green-City-Hackathon-AthensIntro to-technologies-Green-City-Hackathon-Athens
Intro to-technologies-Green-City-Hackathon-Athens
 
Big Data Open Source Technologies
Big Data Open Source TechnologiesBig Data Open Source Technologies
Big Data Open Source Technologies
 
Scaling the (evolving) web data –at low cost-
Scaling the (evolving) web data –at low cost-Scaling the (evolving) web data –at low cost-
Scaling the (evolving) web data –at low cost-
 
Decentralised identifiers and knowledge graphs
Decentralised identifiers and knowledge graphs Decentralised identifiers and knowledge graphs
Decentralised identifiers and knowledge graphs
 
Implementing Linked Data in Low-Resource Conditions
Implementing Linked Data in Low-Resource ConditionsImplementing Linked Data in Low-Resource Conditions
Implementing Linked Data in Low-Resource Conditions
 
CLARIAH Toogdag 2018: A distributed network of digital heritage information
CLARIAH Toogdag 2018: A distributed network of digital heritage informationCLARIAH Toogdag 2018: A distributed network of digital heritage information
CLARIAH Toogdag 2018: A distributed network of digital heritage information
 
Overview of Big Data by Sunny
Overview of Big Data by SunnyOverview of Big Data by Sunny
Overview of Big Data by Sunny
 
ERA CoBioTech Data Management Webinar
ERA CoBioTech Data Management WebinarERA CoBioTech Data Management Webinar
ERA CoBioTech Data Management Webinar
 
Wednesday 6 May: Hand me the data! What you should know as a humanities resea...
Wednesday 6 May: Hand me the data! What you should know as a humanities resea...Wednesday 6 May: Hand me the data! What you should know as a humanities resea...
Wednesday 6 May: Hand me the data! What you should know as a humanities resea...
 
Integrating an electronic lab notebook with a data repository; American Chemi...
Integrating an electronic lab notebook with a data repository; American Chemi...Integrating an electronic lab notebook with a data repository; American Chemi...
Integrating an electronic lab notebook with a data repository; American Chemi...
 

Plus de Christophe Guéret

ICT4D course 2013 - Low resources infrastructure
ICT4D course 2013 - Low resources infrastructureICT4D course 2013 - Low resources infrastructure
ICT4D course 2013 - Low resources infrastructure
Christophe Guéret
 
ICT4D course 2013 - OLPC deployments
ICT4D course 2013 - OLPC deploymentsICT4D course 2013 - OLPC deployments
ICT4D course 2013 - OLPC deployments
Christophe Guéret
 

Plus de Christophe Guéret (20)

HHAI June 2022 - KGs and Hybrid Intelligence
HHAI June 2022 - KGs and Hybrid IntelligenceHHAI June 2022 - KGs and Hybrid Intelligence
HHAI June 2022 - KGs and Hybrid Intelligence
 
Informal presentation about RES
Informal presentation about RESInformal presentation about RES
Informal presentation about RES
 
Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"
Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"
Introduction about WorldWideSemanticWeb.org for the workshop "Making it Matter"
 
Let's downscale the semantic web !
Let's downscale the semantic web !Let's downscale the semantic web !
Let's downscale the semantic web !
 
Your next data viz gear should be a Wii-U
Your next data viz gear should be a Wii-UYour next data viz gear should be a Wii-U
Your next data viz gear should be a Wii-U
 
Linking knowledge spaces
Linking knowledge spacesLinking knowledge spaces
Linking knowledge spaces
 
The data behind the HuisKluis
The data behind the HuisKluisThe data behind the HuisKluis
The data behind the HuisKluis
 
The road towards a Web-based data ecosystem
The road towards a Web-based data ecosystemThe road towards a Web-based data ecosystem
The road towards a Web-based data ecosystem
 
Linked Open Data for Digital Humanities
Linked Open Data for Digital HumanitiesLinked Open Data for Digital Humanities
Linked Open Data for Digital Humanities
 
ICT4D course 2013 - Low resources infrastructure
ICT4D course 2013 - Low resources infrastructureICT4D course 2013 - Low resources infrastructure
ICT4D course 2013 - Low resources infrastructure
 
ICT4D course 2013 - OLPC deployments
ICT4D course 2013 - OLPC deploymentsICT4D course 2013 - OLPC deployments
ICT4D course 2013 - OLPC deployments
 
ICT4D course 2013 - Sugar
ICT4D course 2013 - SugarICT4D course 2013 - Sugar
ICT4D course 2013 - Sugar
 
Clarifier le sens de vos données publiques avec le Web de données
Clarifier le sens de vos données publiques avec le Web de donnéesClarifier le sens de vos données publiques avec le Web de données
Clarifier le sens de vos données publiques avec le Web de données
 
Embedding young learners into the information society
Embedding young learners into the information societyEmbedding young learners into the information society
Embedding young learners into the information society
 
Is linked data something for me?
Is linked data something for me?Is linked data something for me?
Is linked data something for me?
 
Decentralised entity registry “WikiReg”
Decentralised entity registry “WikiReg”Decentralised entity registry “WikiReg”
Decentralised entity registry “WikiReg”
 
Evolutionary and Swarm Computing for scaling up the Semantic Web
Evolutionary and Swarm Computing for scaling up the Semantic WebEvolutionary and Swarm Computing for scaling up the Semantic Web
Evolutionary and Swarm Computing for scaling up the Semantic Web
 
Decentralised Open Data for World Citizens
Decentralised Open Data  for World CitizensDecentralised Open Data  for World Citizens
Decentralised Open Data for World Citizens
 
Assessing Linked Data Mappings using Network Measures
Assessing Linked Data Mappings using Network MeasuresAssessing Linked Data Mappings using Network Measures
Assessing Linked Data Mappings using Network Measures
 
Finding and consuming (Linked) Open Data
Finding and consuming (Linked) Open DataFinding and consuming (Linked) Open Data
Finding and consuming (Linked) Open Data
 

Dernier

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
?#DUbAI#??##{{(☎️+971_581248768%)**%*]'#abortion pills for sale in dubai@
 

Dernier (20)

Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
CNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In PakistanCNIC Information System with Pakdata Cf In Pakistan
CNIC Information System with Pakdata Cf In Pakistan
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
Web Form Automation for Bonterra Impact Management (fka Social Solutions Apri...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Platformless Horizons for Digital Adaptability
Platformless Horizons for Digital AdaptabilityPlatformless Horizons for Digital Adaptability
Platformless Horizons for Digital Adaptability
 

Digital archiving 3.0

  • 1. Data Archiving and Networked Services Digital Archiving 3.0 “My data open on the Web, ok but how ?” Christophe Guéret (@cgueret) Open Data on the Web, 23 - 24 April 2013 DANS is een instituut van KNAW en NWO
  • 2. A bit of context http://cedar-project.nl http://easy.dans.knaw.nl
  • 3. Put your data open on the Web! “E-Data & Research”, October 2011 “Sharing knowledge: EC-funded projects on scientific information in the digital age”
  • 4. Where is your research data ? Just get it from the web site of the research project I think I have have it somewhere on a stick, let me check... It is available as an RDF/XML dump on my test server
  • 5. All bad answers, really. ● We need research data to be – Accessible/readable/usable by anyone – Available in many (>1) years from now – With traceable provenance and usages ● Dumping the data on a web site somewhere is not enough
  • 6. Solution: use a repository “Sharing knowledge: EC-funded projects on scientific information in the digital age” ● Data repositories will take over serving the data and have a page for it! ● Repository hold two type of data – The data stored – The meta-data about this data
  • 7. Which format for meta-data ? ● LOD is a perfect fit for describing data – Use to refer to and link data items – Facilitates discovery, easy to crawl/index – One description per data item stored – Redirects to actual location of the data ● Remaining question: how much meta-data is needed?
  • 8. Which format for the data? ● Many formats around : PDF, SDF, DSPL, XLS, RDF, CSV, SHP, JSON-LD, ... ● Translation will imply some extra work for the data owner and not please everyone
  • 9. Which format for the data? ● Many formats around : PDF, SDF, DSPL, XLS, RDF, CSV, SHP, JSON-LD, ... ● Translation will imply some extra work for the data owner and not please everyone Express your data as Buy a DN, decide on a Select vocabularies to described resources URI scheme for your data describe your resources
  • 10. Just get the ● data in the Solution: use a repository repository ● Repositories will take care ● Data repositories will take over everything of serving your data ● PS: forget about HTTP URIs for data
  • 11. Format evolution ● Use Content-negotiation to translate and serve different data formats ● Ensure everyone gets the format he wants
  • 12. Format evolution ● Use Content-negotiation to translate and serve different data formats ● Ensure everyone gets the format he wants ? ?
  • 13. Next generation archives ● Provide long term access to data in several formats ● Publish Linked Open Meta-Data about the data stored (DCAT, ...) ● Facilitate moving data around archives (LDP, ...)