SlideShare une entreprise Scribd logo
1  sur  23
datos.bne.es:
          Publishing and
            consuming
                      Daniel Vila Suero
                      dvila@fi.upm.es

Ontology Engineering Group, Universidad Politécnica de Madrid
   Acknowledgements: OEG Members, BNE staff (Elena
    Escolano, Marina Jimenez Piano, Ana Manchado, Mar
        HernándezAgustí, Ricardo Santos and others)
datos.bne.es

               2
Background
datos.bne.es
  • Initiative from BibliotecaNacional de
    Españatogether with OEG-UPM Madrid.

  • Multidisciplinary effort: Librarians, Computer
    scientists, linguists..

  • Close collaboration between library experts and
    computer scientists.

  • Initiated as a small scale proof-of-concept: the
    "Cervantes dataset" using IFLA vocabularies
    (FRBR, ISBD) and others (MADS, DC, RDA..)
                                                       3
Main goals
datos.bne.es
  • Perform the transformation incrementally and
    iteratively
  • Develop a system where library experts can define
    and assess the mappings to RDF independently
    from the IT people
  • Be vocabulary agnostic (BNE uses FRBR as core
    model, but the system would allow them to use RDA
    for example)
  • Have a clear picture of the source data before you
    start to transform (help to detect possible deficiencies
    in the source data)

                                                               4
Source MARC records
datos.bne.es


         AUTHORITY                    BIBLIOGRAPHIC




              Persons
                                      76576   Maps
              Corporate bodies        320727 Sound recordings
              Conferences             166017 Gravings, drawings, pictures
              Titles                  35770   Manuscripts
              Subject                 143959 Ancient books
                                      2696560 Modern books
                                      178473 Scores
                                      3021    Electronic resources
                                      156634 Serials
                                      96672   Videos




                                                                            5
Some figures
datos.bne.es
 •   Total number of authority records: 4.100.000
 •   Total number of bibliographical records: 2.390.140
 •   Total number of RDF triples: 58.053.215
 •   Number of links: (15% authorities): 587.520
 •   Linked sources:
     •   VIAF
     •   SUDOC (French Collective University Catalogue) FR
     •   GND (German National Library Authorities) GER
     •   LIBRIS Sweden
     •   DBPedia
     •   Soon BNF, BNB, German Bibliographie



                                                              6
Some statistics
datos.bne.es
                        282,879

              497,644

                                                Manifestation
                                  2,390,103
                                                Work
        1,114,719


                                                Person

                                                Expression
        1,163,764
                                                Thema

                            1,969,526
                                                Corporate Body




                                                                 7
Some statistics
datos.bne.es

 2,500,000        2,129,222
                              2,129,222
 2,000,000                                    1,246,773
                                                               1,054,736
  1,500,000                       1,246,773
  1,000,000                                        1,054,736

    500,000
              0
                                                                     85,347
                                                                              85,347
                                                                                       78,561
                                                                                                16,462
                                                                                                         16,462
                                                                                                                  755
                                                                                                                        755




                                                                                                                              8
Publishing

             9
Our data model
Publishing




                         10
Transformation process
Publishing

 • How to facilitate the mapping process to library
   experts?
      1. Use a familiar and intuitive interface: Spreadsheets
      2. Work only on what's in the database: Pre-process
         records to build the spreadsheets


  •   3 step-process 3 different spreadsheets

      1. Classification: is it a Person? a Work? a Manifestation?
      2. Annotation: name, birth date, title, language of expression
      3. Relation: find relationships between entities (Person is
         creator of a certain work)

                                                                       11
Publishing




             12
Mapping process
Publishing
Open mappings at: http://bne.linkeddata.es/mapping-marc21




                                                        13
Mapping process
Publishing




                          14
Mapping process
Publishing




                          15
Still a lot of work to do
Publishing
 • We cover only core relations of FRBR

 • There are a significant amount of
   manifestationsnot linked to their expressions 
   currently looking at more sophisticated clustering
   techniques

 • Manifestations are not linked to their corresponding
   digitalized materials at the digital library (Biblioteca
   Digital Hispánica)  Next version (to be published
   this year) will contain these links

 • Classification step can be further automatized             16
Consuming

            17
Perspectives
Consuming
 • 2 different perspectives:
    - Systems and applications:
       • SPARQL endpoint,
       • Linked Data API
    - End-user interfaces
 • + an interesting side-effect:
    - By applying FRBR and RDF mappings we can (and did)
      improve the catalogue


 • Using standard web technologies and more intuitive
   models we open the door to:

    - Data analytics and cleansing, catalogue enrichment, reuse
      by smaller institutions…                                    18
Graph analysis example
 Consuming


http://bne.linkeddata.es/graphvis




Using Open-source tools:
    Gephi for example
                                                   19
Enabling access to systems and apps
Consuming
Linked Data API: http://datos.bne.es/frontend/persons




                                                        20
Flexible access to data
Consuming    Out of the box:
                •Search by every field
                •Access cluster of resources
                •Filtering
                •Paging
                •Serve multiple formats: XML,
                Turtle, JSON




                                                21
Different views on the data
Consuming
                               XML
                           HTML




                                     22
END-user interfaces
Consuming


       Current linked data opens the door to:
       •Re-rank OPAC results
       •Better clustering of results
       •Recommendation
       •Enhance data from other sources




                                                23

Contenu connexe

Tendances

The Web of Data is Our Opportunity
The Web of Data is Our OpportunityThe Web of Data is Our Opportunity
The Web of Data is Our OpportunityRichard Wallis
 
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...Alison Hitchens
 
Identifying The Benefit of Linked Data
Identifying The Benefit of Linked DataIdentifying The Benefit of Linked Data
Identifying The Benefit of Linked DataRichard Wallis
 
Microdata for Dummies
Microdata for DummiesMicrodata for Dummies
Microdata for Dummiesgiurca
 
Let's Get Visible! with Karla Smith, Winnefox Library System
Let's Get Visible! with Karla Smith, Winnefox Library SystemLet's Get Visible! with Karla Smith, Winnefox Library System
Let's Get Visible! with Karla Smith, Winnefox Library SystemWiLS
 
Linked Data, Library Users, and the Discovery Tools of the Future
Linked Data, Library Users, and the Discovery Tools of the FutureLinked Data, Library Users, and the Discovery Tools of the Future
Linked Data, Library Users, and the Discovery Tools of the FutureEmily Nimsakont
 
IFLA 2012 - OCLC Linked Data round table
IFLA 2012 - OCLC Linked Data round tableIFLA 2012 - OCLC Linked Data round table
IFLA 2012 - OCLC Linked Data round tableFigoblog
 
Why SKOS should be a Focal Point of your Linked Data Strategy
Why SKOS should be a Focal Point of your Linked Data StrategyWhy SKOS should be a Focal Point of your Linked Data Strategy
Why SKOS should be a Focal Point of your Linked Data StrategySemantic Web Company
 
Small pieces loosely joined: getting louse research online.
Small pieces loosely joined: getting louse research online.Small pieces loosely joined: getting louse research online.
Small pieces loosely joined: getting louse research online.Vince Smith
 
Linked Data and Discovery with Steve Meyer
Linked Data and Discovery with Steve MeyerLinked Data and Discovery with Steve Meyer
Linked Data and Discovery with Steve MeyerWiLS
 
Generous Interfaces - rich websites for digital collections
Generous Interfaces - rich websites for digital collections Generous Interfaces - rich websites for digital collections
Generous Interfaces - rich websites for digital collections Mitchell Whitelaw
 
Web Driven Revolution For Library Data
Web Driven Revolution For Library DataWeb Driven Revolution For Library Data
Web Driven Revolution For Library DataRichard Wallis
 
Schema.org: What It Means For You and Your Library
Schema.org: What It Means For You and Your LibrarySchema.org: What It Means For You and Your Library
Schema.org: What It Means For You and Your LibraryRichard Wallis
 
Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)
Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)
Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)Vladimir Alexiev, PhD, PMP
 
Entification: The Route to 'Useful' Library Data
Entification: The Route to 'Useful' Library DataEntification: The Route to 'Useful' Library Data
Entification: The Route to 'Useful' Library DataRichard Wallis
 
Vila LOD-innovacion- bib-semweb-redux
Vila LOD-innovacion- bib-semweb-reduxVila LOD-innovacion- bib-semweb-redux
Vila LOD-innovacion- bib-semweb-reduxLIS EPI Meeting
 
Schema.org - An Extending Influence
Schema.org - An Extending InfluenceSchema.org - An Extending Influence
Schema.org - An Extending InfluenceRichard Wallis
 
Schema.org - Extending Benefits
Schema.org - Extending BenefitsSchema.org - Extending Benefits
Schema.org - Extending BenefitsRichard Wallis
 

Tendances (20)

The Web of Data is Our Opportunity
The Web of Data is Our OpportunityThe Web of Data is Our Opportunity
The Web of Data is Our Opportunity
 
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
What is #LODLAM?! Understanding linked open data in libraries, archives [and ...
 
Identifying The Benefit of Linked Data
Identifying The Benefit of Linked DataIdentifying The Benefit of Linked Data
Identifying The Benefit of Linked Data
 
Microdata for Dummies
Microdata for DummiesMicrodata for Dummies
Microdata for Dummies
 
Let's Get Visible! with Karla Smith, Winnefox Library System
Let's Get Visible! with Karla Smith, Winnefox Library SystemLet's Get Visible! with Karla Smith, Winnefox Library System
Let's Get Visible! with Karla Smith, Winnefox Library System
 
Linked Data, Library Users, and the Discovery Tools of the Future
Linked Data, Library Users, and the Discovery Tools of the FutureLinked Data, Library Users, and the Discovery Tools of the Future
Linked Data, Library Users, and the Discovery Tools of the Future
 
IFLA 2012 - OCLC Linked Data round table
IFLA 2012 - OCLC Linked Data round tableIFLA 2012 - OCLC Linked Data round table
IFLA 2012 - OCLC Linked Data round table
 
Why SKOS should be a Focal Point of your Linked Data Strategy
Why SKOS should be a Focal Point of your Linked Data StrategyWhy SKOS should be a Focal Point of your Linked Data Strategy
Why SKOS should be a Focal Point of your Linked Data Strategy
 
Small pieces loosely joined: getting louse research online.
Small pieces loosely joined: getting louse research online.Small pieces loosely joined: getting louse research online.
Small pieces loosely joined: getting louse research online.
 
Linked Data and Discovery with Steve Meyer
Linked Data and Discovery with Steve MeyerLinked Data and Discovery with Steve Meyer
Linked Data and Discovery with Steve Meyer
 
Generous Interfaces - rich websites for digital collections
Generous Interfaces - rich websites for digital collections Generous Interfaces - rich websites for digital collections
Generous Interfaces - rich websites for digital collections
 
Web Driven Revolution For Library Data
Web Driven Revolution For Library DataWeb Driven Revolution For Library Data
Web Driven Revolution For Library Data
 
Schema.org: What It Means For You and Your Library
Schema.org: What It Means For You and Your LibrarySchema.org: What It Means For You and Your Library
Schema.org: What It Means For You and Your Library
 
ITS Projects and Services Showcase - June 2013
ITS Projects and Services Showcase - June 2013ITS Projects and Services Showcase - June 2013
ITS Projects and Services Showcase - June 2013
 
Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)
Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)
Wikidata, a target for Europeana’s semantic strategy (Glam-Wiki 2015)
 
Entification: The Route to 'Useful' Library Data
Entification: The Route to 'Useful' Library DataEntification: The Route to 'Useful' Library Data
Entification: The Route to 'Useful' Library Data
 
Vila LOD-innovacion- bib-semweb-redux
Vila LOD-innovacion- bib-semweb-reduxVila LOD-innovacion- bib-semweb-redux
Vila LOD-innovacion- bib-semweb-redux
 
Embedding Linked Data Invisibly into Web Pages: Strategies and Workflows for ...
Embedding Linked Data Invisibly into Web Pages: Strategies and Workflows for ...Embedding Linked Data Invisibly into Web Pages: Strategies and Workflows for ...
Embedding Linked Data Invisibly into Web Pages: Strategies and Workflows for ...
 
Schema.org - An Extending Influence
Schema.org - An Extending InfluenceSchema.org - An Extending Influence
Schema.org - An Extending Influence
 
Schema.org - Extending Benefits
Schema.org - Extending BenefitsSchema.org - Extending Benefits
Schema.org - Extending Benefits
 

Similaire à datos.bne.es: Publishing and consuming

datos.bne.es: Publishing and Consuming
datos.bne.es: Publishing and Consumingdatos.bne.es: Publishing and Consuming
datos.bne.es: Publishing and ConsumingDaniel Vila Suero
 
What is New in W3C land?
What is New in W3C land?What is New in W3C land?
What is New in W3C land?Ivan Herman
 
Linked Open data: CNR
Linked Open data: CNRLinked Open data: CNR
Linked Open data: CNRDatiGovIT
 
An Introduction to NOSQL, Graph Databases and Neo4j
An Introduction to NOSQL, Graph Databases and Neo4jAn Introduction to NOSQL, Graph Databases and Neo4j
An Introduction to NOSQL, Graph Databases and Neo4jDebanjan Mahata
 
An Approach to Publish Spatial Data on the Web: The GeoLinked Data Use Case
An Approach to Publish Spatial Data on the Web: The GeoLinked Data Use CaseAn Approach to Publish Spatial Data on the Web: The GeoLinked Data Use Case
An Approach to Publish Spatial Data on the Web: The GeoLinked Data Use CaseBoris Villazón-Terrazas
 
OeRC Seminar
OeRC SeminarOeRC Seminar
OeRC Seminarseanb
 
A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...
A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...
A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...Ilkay Altintas, Ph.D.
 
Using Architectures for Semantic Interoperability to Create Journal Clubs for...
Using Architectures for Semantic Interoperability to Create Journal Clubs for...Using Architectures for Semantic Interoperability to Create Journal Clubs for...
Using Architectures for Semantic Interoperability to Create Journal Clubs for...James Powell
 
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Vince Smith
 
Scratchpads past,present,future
Scratchpads past,present,futureScratchpads past,present,future
Scratchpads past,present,futureEdward Baker
 
Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero
Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila SueroLinked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero
Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila SueroBiblioteca Nacional de España
 
Building a Digital Library
Building a Digital LibraryBuilding a Digital Library
Building a Digital LibraryEd Fay
 
Minimizing the Complexities of Machine Learning with Data Virtualization
Minimizing the Complexities of Machine Learning with Data VirtualizationMinimizing the Complexities of Machine Learning with Data Virtualization
Minimizing the Complexities of Machine Learning with Data VirtualizationDenodo
 
Visualization library and tools
Visualization library and toolsVisualization library and tools
Visualization library and toolsseung hyun Seo
 
Piloting an E-Journals Preservation Registry Service (PEPRS)
Piloting an E-Journals Preservation Registry Service (PEPRS)Piloting an E-Journals Preservation Registry Service (PEPRS)
Piloting an E-Journals Preservation Registry Service (PEPRS)EDINA, University of Edinburgh
 
Scratchpads training course introduction
Scratchpads training course introductionScratchpads training course introduction
Scratchpads training course introductionDimitrios Koureas
 
The Inside Out Library.
The Inside Out Library. The Inside Out Library.
The Inside Out Library. lisld
 

Similaire à datos.bne.es: Publishing and consuming (20)

datos.bne.es: Publishing and Consuming
datos.bne.es: Publishing and Consumingdatos.bne.es: Publishing and Consuming
datos.bne.es: Publishing and Consuming
 
What is New in W3C land?
What is New in W3C land?What is New in W3C land?
What is New in W3C land?
 
Linked Open data: CNR
Linked Open data: CNRLinked Open data: CNR
Linked Open data: CNR
 
An Introduction to NOSQL, Graph Databases and Neo4j
An Introduction to NOSQL, Graph Databases and Neo4jAn Introduction to NOSQL, Graph Databases and Neo4j
An Introduction to NOSQL, Graph Databases and Neo4j
 
An Approach to Publish Spatial Data on the Web: The GeoLinked Data Use Case
An Approach to Publish Spatial Data on the Web: The GeoLinked Data Use CaseAn Approach to Publish Spatial Data on the Web: The GeoLinked Data Use Case
An Approach to Publish Spatial Data on the Web: The GeoLinked Data Use Case
 
Geo linked data lstd10(v2-boris)
Geo linked data lstd10(v2-boris)Geo linked data lstd10(v2-boris)
Geo linked data lstd10(v2-boris)
 
OeRC Seminar
OeRC SeminarOeRC Seminar
OeRC Seminar
 
A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...
A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...
A Workflow-Driven Discovery and Training Ecosystem for Distributed Analysis o...
 
Using Architectures for Semantic Interoperability to Create Journal Clubs for...
Using Architectures for Semantic Interoperability to Create Journal Clubs for...Using Architectures for Semantic Interoperability to Create Journal Clubs for...
Using Architectures for Semantic Interoperability to Create Journal Clubs for...
 
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...Making your data work for you: Scratchpads, publishing & the biodiversity dat...
Making your data work for you: Scratchpads, publishing & the biodiversity dat...
 
Scratchpads past,present,future
Scratchpads past,present,futureScratchpads past,present,future
Scratchpads past,present,future
 
Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero
Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila SueroLinked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero
Linked Data at the BNE. Elena Escolano Rodríguez, Daniel Vila Suero
 
Building a Digital Library
Building a Digital LibraryBuilding a Digital Library
Building a Digital Library
 
Minimizing the Complexities of Machine Learning with Data Virtualization
Minimizing the Complexities of Machine Learning with Data VirtualizationMinimizing the Complexities of Machine Learning with Data Virtualization
Minimizing the Complexities of Machine Learning with Data Virtualization
 
Visualization library and tools
Visualization library and toolsVisualization library and tools
Visualization library and tools
 
Piloting an E-Journals Preservation Registry Service (PEPRS)
Piloting an E-Journals Preservation Registry Service (PEPRS)Piloting an E-Journals Preservation Registry Service (PEPRS)
Piloting an E-Journals Preservation Registry Service (PEPRS)
 
Scratchpads training course introduction
Scratchpads training course introductionScratchpads training course introduction
Scratchpads training course introduction
 
Krnarich "Assessing Contribution & Value"
Krnarich "Assessing Contribution & Value"Krnarich "Assessing Contribution & Value"
Krnarich "Assessing Contribution & Value"
 
NISO BISG Forum: Bibliographic Roadmap
NISO BISG Forum: Bibliographic RoadmapNISO BISG Forum: Bibliographic Roadmap
NISO BISG Forum: Bibliographic Roadmap
 
The Inside Out Library.
The Inside Out Library. The Inside Out Library.
The Inside Out Library.
 

Plus de Scottish Library & Information Council (SLIC), CILIP in Scotland (CILIPS)

Plus de Scottish Library & Information Council (SLIC), CILIP in Scotland (CILIPS) (20)

Why link?
Why link?Why link?
Why link?
 
Will's World: Walking Through Shakespeare
Will's World: Walking Through ShakespeareWill's World: Walking Through Shakespeare
Will's World: Walking Through Shakespeare
 
Linked Open Data stuff
Linked Open Data stuffLinked Open Data stuff
Linked Open Data stuff
 
The University of Edinburgh's Mobile App
The University of Edinburgh's Mobile App The University of Edinburgh's Mobile App
The University of Edinburgh's Mobile App
 
Social Media and National Libraries
Social Media and National LibrariesSocial Media and National Libraries
Social Media and National Libraries
 
Growing Knowledge : Supporting the Digital Researcher
Growing Knowledge : Supporting the Digital Researcher Growing Knowledge : Supporting the Digital Researcher
Growing Knowledge : Supporting the Digital Researcher
 
Libguides in Academic Libraries
Libguides in Academic Libraries Libguides in Academic Libraries
Libguides in Academic Libraries
 
SLIC FE 2011 Karen Stevenson
SLIC FE 2011 Karen StevensonSLIC FE 2011 Karen Stevenson
SLIC FE 2011 Karen Stevenson
 
SLIC FE 2011Tom MacMaster
SLIC FE 2011Tom MacMasterSLIC FE 2011Tom MacMaster
SLIC FE 2011Tom MacMaster
 
SLICFE2011 Elaine Fulton
SLICFE2011 Elaine FultonSLICFE2011 Elaine Fulton
SLICFE2011 Elaine Fulton
 
Introducing Reader Development
Introducing Reader DevelopmentIntroducing Reader Development
Introducing Reader Development
 
SCURL Walk in Access Project
SCURL Walk in Access ProjectSCURL Walk in Access Project
SCURL Walk in Access Project
 
Innovation with reducing budgets British Library
Innovation with reducing budgets British LibraryInnovation with reducing budgets British Library
Innovation with reducing budgets British Library
 
mlibrary project Napier University
mlibrary project Napier Universitymlibrary project Napier University
mlibrary project Napier University
 
Wendy Walker - Ebooks Unbound at University of Glasgow – Power to the Users?
Wendy Walker - Ebooks Unbound at University of Glasgow – Power to the Users?Wendy Walker - Ebooks Unbound at University of Glasgow – Power to the Users?
Wendy Walker - Ebooks Unbound at University of Glasgow – Power to the Users?
 
Nora Dale - Growing Knowledge: The evolution of research
Nora Dale - Growing Knowledge: The evolution of researchNora Dale - Growing Knowledge: The evolution of research
Nora Dale - Growing Knowledge: The evolution of research
 
Ken Chad - ebooks: metadata & patron (demand) driven acquisitions
Ken Chad - ebooks: metadata & patron (demand) driven acquisitionsKen Chad - ebooks: metadata & patron (demand) driven acquisitions
Ken Chad - ebooks: metadata & patron (demand) driven acquisitions
 
Jean Inness - Browse, Checkout, Download: How South Ayrshire Libraries embrac...
Jean Inness - Browse, Checkout, Download: How South Ayrshire Libraries embrac...Jean Inness - Browse, Checkout, Download: How South Ayrshire Libraries embrac...
Jean Inness - Browse, Checkout, Download: How South Ayrshire Libraries embrac...
 
Who needs social media cilips
Who needs social media cilipsWho needs social media cilips
Who needs social media cilips
 
Sca
ScaSca
Sca
 

datos.bne.es: Publishing and consuming

  • 1. datos.bne.es: Publishing and consuming Daniel Vila Suero dvila@fi.upm.es Ontology Engineering Group, Universidad Politécnica de Madrid Acknowledgements: OEG Members, BNE staff (Elena Escolano, Marina Jimenez Piano, Ana Manchado, Mar HernándezAgustí, Ricardo Santos and others)
  • 3. Background datos.bne.es • Initiative from BibliotecaNacional de Españatogether with OEG-UPM Madrid. • Multidisciplinary effort: Librarians, Computer scientists, linguists.. • Close collaboration between library experts and computer scientists. • Initiated as a small scale proof-of-concept: the "Cervantes dataset" using IFLA vocabularies (FRBR, ISBD) and others (MADS, DC, RDA..) 3
  • 4. Main goals datos.bne.es • Perform the transformation incrementally and iteratively • Develop a system where library experts can define and assess the mappings to RDF independently from the IT people • Be vocabulary agnostic (BNE uses FRBR as core model, but the system would allow them to use RDA for example) • Have a clear picture of the source data before you start to transform (help to detect possible deficiencies in the source data) 4
  • 5. Source MARC records datos.bne.es AUTHORITY BIBLIOGRAPHIC Persons 76576 Maps Corporate bodies 320727 Sound recordings Conferences 166017 Gravings, drawings, pictures Titles 35770 Manuscripts Subject 143959 Ancient books 2696560 Modern books 178473 Scores 3021 Electronic resources 156634 Serials 96672 Videos 5
  • 6. Some figures datos.bne.es • Total number of authority records: 4.100.000 • Total number of bibliographical records: 2.390.140 • Total number of RDF triples: 58.053.215 • Number of links: (15% authorities): 587.520 • Linked sources: • VIAF • SUDOC (French Collective University Catalogue) FR • GND (German National Library Authorities) GER • LIBRIS Sweden • DBPedia • Soon BNF, BNB, German Bibliographie 6
  • 7. Some statistics datos.bne.es 282,879 497,644 Manifestation 2,390,103 Work 1,114,719 Person Expression 1,163,764 Thema 1,969,526 Corporate Body 7
  • 8. Some statistics datos.bne.es 2,500,000 2,129,222 2,129,222 2,000,000 1,246,773 1,054,736 1,500,000 1,246,773 1,000,000 1,054,736 500,000 0 85,347 85,347 78,561 16,462 16,462 755 755 8
  • 11. Transformation process Publishing • How to facilitate the mapping process to library experts? 1. Use a familiar and intuitive interface: Spreadsheets 2. Work only on what's in the database: Pre-process records to build the spreadsheets • 3 step-process 3 different spreadsheets 1. Classification: is it a Person? a Work? a Manifestation? 2. Annotation: name, birth date, title, language of expression 3. Relation: find relationships between entities (Person is creator of a certain work) 11
  • 13. Mapping process Publishing Open mappings at: http://bne.linkeddata.es/mapping-marc21 13
  • 16. Still a lot of work to do Publishing • We cover only core relations of FRBR • There are a significant amount of manifestationsnot linked to their expressions  currently looking at more sophisticated clustering techniques • Manifestations are not linked to their corresponding digitalized materials at the digital library (Biblioteca Digital Hispánica)  Next version (to be published this year) will contain these links • Classification step can be further automatized 16
  • 17. Consuming 17
  • 18. Perspectives Consuming • 2 different perspectives: - Systems and applications: • SPARQL endpoint, • Linked Data API - End-user interfaces • + an interesting side-effect: - By applying FRBR and RDF mappings we can (and did) improve the catalogue • Using standard web technologies and more intuitive models we open the door to: - Data analytics and cleansing, catalogue enrichment, reuse by smaller institutions… 18
  • 19. Graph analysis example Consuming http://bne.linkeddata.es/graphvis Using Open-source tools: Gephi for example 19
  • 20. Enabling access to systems and apps Consuming Linked Data API: http://datos.bne.es/frontend/persons 20
  • 21. Flexible access to data Consuming Out of the box: •Search by every field •Access cluster of resources •Filtering •Paging •Serve multiple formats: XML, Turtle, JSON 21
  • 22. Different views on the data Consuming XML HTML 22
  • 23. END-user interfaces Consuming Current linked data opens the door to: •Re-rank OPAC results •Better clustering of results •Recommendation •Enhance data from other sources 23