SlideShare une entreprise Scribd logo
1  sur  53
IATUL • 20 June 2017
Data Designed for Discovery
Roy Tennant
Senior Program Officer, OCLC Research
The world’s largest and most
consulted bibliographic database
• 2.5 Billion holdings
• 400 Million bibliographic
records
• 10 Million Italian records
• 57% non-English
Where librarians and library
patrons search
• This is the Research view of linked data
• We (OCLC) have experiments and prototypes,
but no products or production services (yet)
• We (OCLC Research) have been working with
linked data for as long as anyone in the library
world
• Our (OCLC Research) playground is the entirety
of WorldCat ( million records) and a parallel
computing cluster
• Stay tuned for more information on production
services
A few introductory remarks
WHY LINKED DATA?
What we have to work with
• A collection of text strings…
• Taken from the piece itself…
• Sometimes “enhanced” with inferred
parentheticals (e.g., [1975] )…
• Or additional statements not on the piece (e.g.,
subject headings)
• Punctuation, which may or may not be present,
is used (inconsistently) for structure
• Mostly uncontrolled and only loosely connected
to anything else
• Designed for description rather than discovery
What we have to work with
THE PROBLEM
• Identification Problems (two illustrated next):
– The Title Problem
– The Names Problem
• Quality Problems (one illustrated next):
– The Legacy Problem (strings are not controlled
terms; often, they cannot be turned into them)
• Linkage Problems (just two examples):
– The Web Problem (records aren’t enough, you need
links)
– The Language Problem (showing the right translation
for a given user)
Actually, A Number of Problems
Data Quality Problems
THE SOLUTION
First,
define ALL
THE
THINGS
Quick Definitions
entity
/ˈɛntɪti/
noun
a thing with distinct and independent
existence.
relationship
/rɪˈleɪʃ(ə)nʃɪp/
noun
the way in which two or more people or
things are connected
Albert Einstein
Person
Relativity: The Special and General Theory
Work
Physics
Concept
author
about
…then establish relationships with other entities
https://www.wikidata.org/wiki/Q937 and
http://viaf.org/viaf/75121530
Wikidata and VIAF
http://experiment.worldcat.org/entity/work/data/369081611
WorldCat Works
http://id.loc.gov/authorities/subjects/sh85101653.html
Library of Congress Subject Headings
author
about
…with actionable links from authoritative data hubs
A REAL WORLD EXAMPLE
From Records to Entities: Works
OCLC Production Services
External OCLC Research Systems
Internal OCLC Research
Resources
enhanced
WorldCat
WORKS
Kindred Works
Classify
Identities
FictionFinder
Cookbook
Finder
LCSH
FAST
VIAF
GMGPC
GSAFD
GTT
DDC
LCTGM
MeSH
Linked Data Entities
OCLC’s linked data resources
WorldCat Catalog:
15 billion triples
WorldCat Works:
5 billion RDF triples
FAST:
23 million
triples
VIAF: 2 billion triples
ISNI: 10-50 million triples
VIAF aggregates identifiers
Wikidata disseminates identifiers
OCLC’S 2015 INTERNATIONAL
LINKED DATA SURVEY
SOURCE: KAREN SMITH-YOSHIMURA
Academic library
National library
Network
Government
Scholarly
Public Library
Museum
Other
31%
20%14%
10%
8%
7%
4% 6%
2015 responding institutions by type
71 institutions total
What is published as linked data
0 10 20 30 40 50 60
Authority files
Bibliographic data
Data about musuem objects
Datasets
Descriptive metadata
Digital collections
Encoded archival descriptions
Geographic data
Ontologies/vocabularies
Other
2015 linked data sources most consumed 2015
VIAF (Virtual International Authority File) 41
DBpedia 36
GeoNames 35
id.loc.gov 35
Resources we convert to linked data
ourselves 17
Getty's AAT 16
FAST (Faceted Application of Subject
Terminology) 15
WorldCat.org 15
data.bnf.fr 12
Deutsche National Bib Linked Data Service 12
SOLVING PROBLEMS & MOVING
TOWARD A LINKED DATA FUTURE
Improving the Discovery Experience
Exploring Ways to Use Linked Data
Title: Journey to the West
Language: English
Translator: Anthony C. Yu
Date: 1977
IsTranslationOf:
Title: Journey to the West
Language: English
Translator: W. J. F. Jenner
Date: 1982-1984
IsTranslationOf:
Title: 西遊記
Language: Chinese
Author: 吳承恩
Created: 1592
HasTranslation:
Title: Tây du ký bình khảo
Language: Vietnamese
Translator: Phan Quân
Date: 1980
IsTranslationOf:
Title: 西遊記
Language: Japanese
Translator: 中野美代子
Date: 1986
IsTranslationOf:
Title: Pilgerfahrt
Language: German
Translator: Georgette Boner
Date: 1983
IsTranslationOf:
Offering the right translation
Title: Journey to the West
Language: English
Translator: Anthony C. Yu
Date: 1977
IsTranslationOf:
Title: Journey to the West
Language: English
Translator: W. J. F. Jenner
Date: 1982-1984
IsTranslationOf:
Title: 西遊記
Language: Chinese
Author: 吳承恩
Created: 1592
HasTranslation:
Title: Tây du ký bình khảo
Language: Vietnamese
Translator: Phan Quân
Date: 1980
IsTranslationOf:
Title: 西遊記
Language: Japanese
Translator: 中野美代子
Date: 1986
IsTranslationOf:
Title: Pilgerfahrt
Language: German
Translator: Georgette Boner
Date: 1983
IsTranslationOf:
Offering the right translation
Bringing Authority Control to the Web
• Person Lookup Service – An experimental service for
looking up OCLC Person Entities
• Scenario:
– A library wants to disambiguate a name
– It sends the name text string to our API
– We check all of our aggregated authority files and
send back the best match(es)
– Each response comes with one or more URIs (e.g., to
LCNAF, Wikidata, ISNI, etc.)
– The library inserts this data into their record, turning a
text string into an actionable link on the web
Prototyping New Services
Replicate existing library
functions more cheaply and
efficiently
Improve data integration
A better user
experience
Greater Web
visibility
Develop better models of
resources not well served by
current standards
Improve internal data
management
In Summary: Why Linked Data?
EASING THE TRANSITION
• Working with the Library of Congress and others to
finalize the BIBFRAME standard
• Beginning to explore what working with it at scale will
mean
Collaborating on BIBFRAME
• Modeling bibliographic data using Schema.org
• Collaborating on expanding the Schema.org with
additional bibliographic elements at bib.schema.org
• Syndicating WorldCat data to search engines using
Schema.org markup
Working With the Web
Learning About Changing Workflows
Photo by https://www.flickr.com/photos/sanjoselibrary/ - CC BY-SA 2.0
• Use uniform titles
• Use added entries with role codes (7xx and $4)
• Use 041 for translations, including intermediate translations
• Use indicators to refine the meaning
• Use the most specific fields appropriate for a
descriptive task
• Minimize the use of 500 fields
• Obey field semantics
• Avoid redundancy
If you must use free text:
• Use established conventions
• Use standardized terms
Least machine-processable
Most machine-processable
Algorithmically recoverable
Making MARC “Linked Data Ready”
‘Work’ Task Force
‘URI’ Task Force
Analyze the ‘Work’ definitions referenced in library linked data.
• How are they similar or different?
• How do they relate to the classic FRBR definition?
• What are the use cases for ‘Work?’
How should Work URIs be represented in MARC records?
• What are the best practices for adding URIs to MARC records to ease the conversion to linked data?
• How will cataloging or resource description workflows be affected?
Working With the PCC To Make MARC LD Ready
• We are in a major transition that will take
YEARS to navigate
• We don’t know yet exactly what the future
holds…
• ...but we know that it will be more linked
and machine actionable (not just
readable) than ever before
• And that’s a Good Thing
Summary Remarks
For More Information
SM
Together we make breakthroughs possible.
Thank you!
Roy Tennant
@rtennant
tennantr@oclc.org
facebook.com/roytennant
IATUL • 20 June 2017
©2017 OCLC. This work is licensed under a Creative Commons Attribution 4.0 International License. Suggested attribution:
“This work uses content from “Data Designed for Discovery” © OCLC, used under a Creative Commons Attribution 4.0
International License: http://creativecommons.org/licenses/by/4.0/.”

Contenu connexe

Tendances

The facilitated collection: collections and collecting in a network environment
The facilitated collection: collections and collecting in a network environmentThe facilitated collection: collections and collecting in a network environment
The facilitated collection: collections and collecting in a network environmentlisld
 
OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.OCLC
 
The network reshapes the research library collection
The network reshapes the research library collectionThe network reshapes the research library collection
The network reshapes the research library collectionlisld
 
Libraries, collections, technology: presented at Pennylvania State University...
Libraries, collections, technology: presented at Pennylvania State University...Libraries, collections, technology: presented at Pennylvania State University...
Libraries, collections, technology: presented at Pennylvania State University...lisld
 
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI PresentationOpen Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentationekansa
 
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...Charleston Conference
 
The library in the life of the user
The library in the life of the userThe library in the life of the user
The library in the life of the userlisld
 
Irish Studies - making library data work harder
Irish Studies - making library data work harderIrish Studies - making library data work harder
Irish Studies - making library data work harderlisld
 
Exploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataExploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataShenghui Wang
 
OCLC and the Social Web: Building tools, providing platforms, engaging the co...
OCLC and the Social Web:Building tools, providing platforms, engaging the co...OCLC and the Social Web:Building tools, providing platforms, engaging the co...
OCLC and the Social Web: Building tools, providing platforms, engaging the co...Andy Havens
 
Environmental trends and OCLC Research, a presentation at the University of N...
Environmental trends and OCLC Research, a presentation at the University of N...Environmental trends and OCLC Research, a presentation at the University of N...
Environmental trends and OCLC Research, a presentation at the University of N...lisld
 
Library futures: converging and diverging directions for public and academic ...
Library futures: converging and diverging directions for public and academic ...Library futures: converging and diverging directions for public and academic ...
Library futures: converging and diverging directions for public and academic ...lisld
 
OA in the Library Collection: The Challenge of Identifying and Managing Open ...
OA in the Library Collection: The Challenge of Identifying and Managing Open ...OA in the Library Collection: The Challenge of Identifying and Managing Open ...
OA in the Library Collection: The Challenge of Identifying and Managing Open ...NASIG
 
Collection Directions: Some Reflections on Libraries and Stewardship of the ...
 Collection Directions: Some Reflections on Libraries and Stewardship of the ... Collection Directions: Some Reflections on Libraries and Stewardship of the ...
Collection Directions: Some Reflections on Libraries and Stewardship of the ...OCLC
 
Library collections and the emerging scholarly record
Library collections and the emerging scholarly recordLibrary collections and the emerging scholarly record
Library collections and the emerging scholarly recordlisld
 
Understanding the Collective Collection: Concepts, Implications, and Futures
Understanding the Collective Collection: Concepts, Implications, and FuturesUnderstanding the Collective Collection: Concepts, Implications, and Futures
Understanding the Collective Collection: Concepts, Implications, and FuturesOCLC
 
Towards collaboration at scale: Libraries, the social and the technical
Towards collaboration at scale:  Libraries, the social and the technicalTowards collaboration at scale:  Libraries, the social and the technical
Towards collaboration at scale: Libraries, the social and the technicallisld
 
From local infrastructure to engagement - thinking about the library in the l...
From local infrastructure to engagement - thinking about the library in the l...From local infrastructure to engagement - thinking about the library in the l...
From local infrastructure to engagement - thinking about the library in the l...lisld
 
Cloud Library: Precipitating change in library infrastructure
Cloud Library: Precipitating change in library infrastructureCloud Library: Precipitating change in library infrastructure
Cloud Library: Precipitating change in library infrastructureOCLC Research
 

Tendances (20)

The facilitated collection: collections and collecting in a network environment
The facilitated collection: collections and collecting in a network environmentThe facilitated collection: collections and collecting in a network environment
The facilitated collection: collections and collecting in a network environment
 
OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.
 
The network reshapes the research library collection
The network reshapes the research library collectionThe network reshapes the research library collection
The network reshapes the research library collection
 
Libraries, collections, technology: presented at Pennylvania State University...
Libraries, collections, technology: presented at Pennylvania State University...Libraries, collections, technology: presented at Pennylvania State University...
Libraries, collections, technology: presented at Pennylvania State University...
 
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI PresentationOpen Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
Open Context and Publishing to the Web of Data: Eric Kansa's LAWDI Presentation
 
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...
Full Spectrum Stewardship of the Scholarly Record by Brian E. C. Schottlaende...
 
The library in the life of the user
The library in the life of the userThe library in the life of the user
The library in the life of the user
 
Irish Studies - making library data work harder
Irish Studies - making library data work harderIrish Studies - making library data work harder
Irish Studies - making library data work harder
 
Exploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadataExploring a world of networked information built from free-text metadata
Exploring a world of networked information built from free-text metadata
 
OCLC and the Social Web: Building tools, providing platforms, engaging the co...
OCLC and the Social Web:Building tools, providing platforms, engaging the co...OCLC and the Social Web:Building tools, providing platforms, engaging the co...
OCLC and the Social Web: Building tools, providing platforms, engaging the co...
 
NISO Webinar: The Future of Integrated Library Systems PART 2: User Interaction
NISO Webinar: The Future of Integrated Library Systems PART 2: User InteractionNISO Webinar: The Future of Integrated Library Systems PART 2: User Interaction
NISO Webinar: The Future of Integrated Library Systems PART 2: User Interaction
 
Environmental trends and OCLC Research, a presentation at the University of N...
Environmental trends and OCLC Research, a presentation at the University of N...Environmental trends and OCLC Research, a presentation at the University of N...
Environmental trends and OCLC Research, a presentation at the University of N...
 
Library futures: converging and diverging directions for public and academic ...
Library futures: converging and diverging directions for public and academic ...Library futures: converging and diverging directions for public and academic ...
Library futures: converging and diverging directions for public and academic ...
 
OA in the Library Collection: The Challenge of Identifying and Managing Open ...
OA in the Library Collection: The Challenge of Identifying and Managing Open ...OA in the Library Collection: The Challenge of Identifying and Managing Open ...
OA in the Library Collection: The Challenge of Identifying and Managing Open ...
 
Collection Directions: Some Reflections on Libraries and Stewardship of the ...
 Collection Directions: Some Reflections on Libraries and Stewardship of the ... Collection Directions: Some Reflections on Libraries and Stewardship of the ...
Collection Directions: Some Reflections on Libraries and Stewardship of the ...
 
Library collections and the emerging scholarly record
Library collections and the emerging scholarly recordLibrary collections and the emerging scholarly record
Library collections and the emerging scholarly record
 
Understanding the Collective Collection: Concepts, Implications, and Futures
Understanding the Collective Collection: Concepts, Implications, and FuturesUnderstanding the Collective Collection: Concepts, Implications, and Futures
Understanding the Collective Collection: Concepts, Implications, and Futures
 
Towards collaboration at scale: Libraries, the social and the technical
Towards collaboration at scale:  Libraries, the social and the technicalTowards collaboration at scale:  Libraries, the social and the technical
Towards collaboration at scale: Libraries, the social and the technical
 
From local infrastructure to engagement - thinking about the library in the l...
From local infrastructure to engagement - thinking about the library in the l...From local infrastructure to engagement - thinking about the library in the l...
From local infrastructure to engagement - thinking about the library in the l...
 
Cloud Library: Precipitating change in library infrastructure
Cloud Library: Precipitating change in library infrastructureCloud Library: Precipitating change in library infrastructure
Cloud Library: Precipitating change in library infrastructure
 

Similaire à Data Designed for Discovery

It's not rocket surgery - Linked In: ALA 2011
It's not rocket surgery - Linked In: ALA 2011It's not rocket surgery - Linked In: ALA 2011
It's not rocket surgery - Linked In: ALA 2011Ross Singer
 
The Semantic Web meets the Code of Federal Regulations
The Semantic Web meets the Code of Federal RegulationsThe Semantic Web meets the Code of Federal Regulations
The Semantic Web meets the Code of Federal Regulationstbruce
 
Importing life science at a into Neo4j
Importing life science at a into Neo4jImporting life science at a into Neo4j
Importing life science at a into Neo4jSimon Jupp
 
Beyond the catalogue : BibFrame, Linked Data and Ending the Invisible Library
Beyond the catalogue : BibFrame, Linked Data and Ending the 	Invisible LibraryBeyond the catalogue : BibFrame, Linked Data and Ending the 	Invisible Library
Beyond the catalogue : BibFrame, Linked Data and Ending the Invisible LibraryKsenija Mincic Obradovic
 
Building the new open linked library: Theory and Practice
Building the new open linked library: Theory and PracticeBuilding the new open linked library: Theory and Practice
Building the new open linked library: Theory and PracticeTrish Rose-Sandler
 
Annotations Supporting Scholarly Editing
Annotations Supporting Scholarly EditingAnnotations Supporting Scholarly Editing
Annotations Supporting Scholarly EditingAnna Gerber
 
Cornell20080516
Cornell20080516Cornell20080516
Cornell20080516charper
 
INSPIRE Hackathon Webinar Intro to Linked Data and Semantics
INSPIRE Hackathon Webinar   Intro to Linked Data and SemanticsINSPIRE Hackathon Webinar   Intro to Linked Data and Semantics
INSPIRE Hackathon Webinar Intro to Linked Data and Semanticsplan4all
 
Porting Library Vocabularies to the Semantic Web - IFLA 2010
Porting Library Vocabularies to the Semantic Web - IFLA 2010Porting Library Vocabularies to the Semantic Web - IFLA 2010
Porting Library Vocabularies to the Semantic Web - IFLA 2010Bernard Vatant
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsJon Voss
 
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014eswcsummerschool
 
How Libraries Use Publisher Metadata Redux (Steven Shadle)
How Libraries Use Publisher Metadata Redux (Steven Shadle)How Libraries Use Publisher Metadata Redux (Steven Shadle)
How Libraries Use Publisher Metadata Redux (Steven Shadle)Charleston Conference
 
Building the New Open Linked Library
Building the New Open Linked LibraryBuilding the New Open Linked Library
Building the New Open Linked LibraryJoel Richard
 
Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1 Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1 Richard Urban
 
Library OKRA: A Matter of Semantics? Intelligence, Open Data and the Future o...
Library OKRA: A Matter of Semantics? Intelligence, Open Data and the Future o...Library OKRA: A Matter of Semantics? Intelligence, Open Data and the Future o...
Library OKRA: A Matter of Semantics? Intelligence, Open Data and the Future o...Jonathan Blackburn
 
How Libraries Use Publisher Metadata - Crossref Community Webinar
How Libraries Use Publisher Metadata - Crossref Community WebinarHow Libraries Use Publisher Metadata - Crossref Community Webinar
How Libraries Use Publisher Metadata - Crossref Community WebinarCrossref
 
Putting the Pieces Together: Creating a National Educational Television Catalog
Putting the Pieces Together: Creating a National Educational Television CatalogPutting the Pieces Together: Creating a National Educational Television Catalog
Putting the Pieces Together: Creating a National Educational Television CatalogWGBH Media Library and Archives
 
Studying archives of online behavior
Studying archives of online behaviorStudying archives of online behavior
Studying archives of online behaviorJames Howison
 

Similaire à Data Designed for Discovery (20)

Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
 
It's not rocket surgery - Linked In: ALA 2011
It's not rocket surgery - Linked In: ALA 2011It's not rocket surgery - Linked In: ALA 2011
It's not rocket surgery - Linked In: ALA 2011
 
The Semantic Web meets the Code of Federal Regulations
The Semantic Web meets the Code of Federal RegulationsThe Semantic Web meets the Code of Federal Regulations
The Semantic Web meets the Code of Federal Regulations
 
Importing life science at a into Neo4j
Importing life science at a into Neo4jImporting life science at a into Neo4j
Importing life science at a into Neo4j
 
Beyond the catalogue : BibFrame, Linked Data and Ending the Invisible Library
Beyond the catalogue : BibFrame, Linked Data and Ending the 	Invisible LibraryBeyond the catalogue : BibFrame, Linked Data and Ending the 	Invisible Library
Beyond the catalogue : BibFrame, Linked Data and Ending the Invisible Library
 
Building the new open linked library: Theory and Practice
Building the new open linked library: Theory and PracticeBuilding the new open linked library: Theory and Practice
Building the new open linked library: Theory and Practice
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
 
Annotations Supporting Scholarly Editing
Annotations Supporting Scholarly EditingAnnotations Supporting Scholarly Editing
Annotations Supporting Scholarly Editing
 
Cornell20080516
Cornell20080516Cornell20080516
Cornell20080516
 
INSPIRE Hackathon Webinar Intro to Linked Data and Semantics
INSPIRE Hackathon Webinar   Intro to Linked Data and SemanticsINSPIRE Hackathon Webinar   Intro to Linked Data and Semantics
INSPIRE Hackathon Webinar Intro to Linked Data and Semantics
 
Porting Library Vocabularies to the Semantic Web - IFLA 2010
Porting Library Vocabularies to the Semantic Web - IFLA 2010Porting Library Vocabularies to the Semantic Web - IFLA 2010
Porting Library Vocabularies to the Semantic Web - IFLA 2010
 
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & MuseumsALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
ALIAOnline Practical Linked (Open) Data for Libraries, Archives & Museums
 
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
Keynote: Global Media Monitoring - M. Grobelnik - ESWC SS 2014
 
How Libraries Use Publisher Metadata Redux (Steven Shadle)
How Libraries Use Publisher Metadata Redux (Steven Shadle)How Libraries Use Publisher Metadata Redux (Steven Shadle)
How Libraries Use Publisher Metadata Redux (Steven Shadle)
 
Building the New Open Linked Library
Building the New Open Linked LibraryBuilding the New Open Linked Library
Building the New Open Linked Library
 
Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1 Publishing and Using Linked Open Data - Day 1
Publishing and Using Linked Open Data - Day 1
 
Library OKRA: A Matter of Semantics? Intelligence, Open Data and the Future o...
Library OKRA: A Matter of Semantics? Intelligence, Open Data and the Future o...Library OKRA: A Matter of Semantics? Intelligence, Open Data and the Future o...
Library OKRA: A Matter of Semantics? Intelligence, Open Data and the Future o...
 
How Libraries Use Publisher Metadata - Crossref Community Webinar
How Libraries Use Publisher Metadata - Crossref Community WebinarHow Libraries Use Publisher Metadata - Crossref Community Webinar
How Libraries Use Publisher Metadata - Crossref Community Webinar
 
Putting the Pieces Together: Creating a National Educational Television Catalog
Putting the Pieces Together: Creating a National Educational Television CatalogPutting the Pieces Together: Creating a National Educational Television Catalog
Putting the Pieces Together: Creating a National Educational Television Catalog
 
Studying archives of online behavior
Studying archives of online behaviorStudying archives of online behavior
Studying archives of online behavior
 

Plus de OCLC

Communicating library impact beyond library walls: Findings from an action-or...
Communicating library impact beyond library walls: Findings from an action-or...Communicating library impact beyond library walls: Findings from an action-or...
Communicating library impact beyond library walls: Findings from an action-or...OCLC
 
"You can just tell whether a website looks reliable or not." People's modes o...
"You can just tell whether a website looks reliable or not." People's modes o..."You can just tell whether a website looks reliable or not." People's modes o...
"You can just tell whether a website looks reliable or not." People's modes o...OCLC
 
Factors influencing research data management programs.
Factors influencing research data management programs.Factors influencing research data management programs.
Factors influencing research data management programs.OCLC
 
Teaching research methods in LIS programs: Approaches, formats, and innovativ...
Teaching research methods in LIS programs: Approaches, formats, and innovativ...Teaching research methods in LIS programs: Approaches, formats, and innovativ...
Teaching research methods in LIS programs: Approaches, formats, and innovativ...OCLC
 
OCLC ALISE Library & Information Science Research Grant Program
OCLC ALISE Library & Information Science Research Grant ProgramOCLC ALISE Library & Information Science Research Grant Program
OCLC ALISE Library & Information Science Research Grant ProgramOCLC
 
Investing in library users and potential users: The Many Faces of Digital Vi...
 Investing in library users and potential users: The Many Faces of Digital Vi... Investing in library users and potential users: The Many Faces of Digital Vi...
Investing in library users and potential users: The Many Faces of Digital Vi...OCLC
 
Academic library impact: Improving practice and essential areas to research
Academic library impact: Improving practice and essential areas to researchAcademic library impact: Improving practice and essential areas to research
Academic library impact: Improving practice and essential areas to researchOCLC
 
Studying information behavior: The Many Faces of Digital Visitors and Residents
Studying information behavior: The Many Faces of Digital Visitors and ResidentsStudying information behavior: The Many Faces of Digital Visitors and Residents
Studying information behavior: The Many Faces of Digital Visitors and ResidentsOCLC
 
Online engagement and information literacy: The Many Face of Digital Visitors...
Online engagement and information literacy: The Many Face of Digital Visitors...Online engagement and information literacy: The Many Face of Digital Visitors...
Online engagement and information literacy: The Many Face of Digital Visitors...OCLC
 
People's mode of online engagement: The Many Faces of Digital Visitors and R...
 People's mode of online engagement: The Many Faces of Digital Visitors and R... People's mode of online engagement: The Many Faces of Digital Visitors and R...
People's mode of online engagement: The Many Faces of Digital Visitors and R...OCLC
 
Applying research methods: Investigating the Many Faces of Digital Visitors &...
Applying research methods: Investigating the Many Faces of Digital Visitors &...Applying research methods: Investigating the Many Faces of Digital Visitors &...
Applying research methods: Investigating the Many Faces of Digital Visitors &...OCLC
 
OCLC RLP @ RLUK
OCLC RLP @ RLUKOCLC RLP @ RLUK
OCLC RLP @ RLUKOCLC
 
Using Qualitative Methods for Library Evaluation: An Interactive Workshop
Using Qualitative Methods for Library Evaluation: An Interactive WorkshopUsing Qualitative Methods for Library Evaluation: An Interactive Workshop
Using Qualitative Methods for Library Evaluation: An Interactive WorkshopOCLC
 
Visitors and Residents: The Hows and Whys of Engagement with Technology
Visitors and Residents: The Hows and Whys of Engagement with TechnologyVisitors and Residents: The Hows and Whys of Engagement with Technology
Visitors and Residents: The Hows and Whys of Engagement with TechnologyOCLC
 
Action-Oriented Research Agenda on Library Contributions to Student Learning ...
Action-Oriented Research Agenda on Library Contributions to Student Learning ...Action-Oriented Research Agenda on Library Contributions to Student Learning ...
Action-Oriented Research Agenda on Library Contributions to Student Learning ...OCLC
 
Visitors and Residents: Interactive Mapping Exercise Workshop
Visitors and Residents: Interactive Mapping Exercise WorkshopVisitors and Residents: Interactive Mapping Exercise Workshop
Visitors and Residents: Interactive Mapping Exercise WorkshopOCLC
 
The Library in the Life of the User
The Library in the Life of the UserThe Library in the Life of the User
The Library in the Life of the UserOCLC
 
Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...
Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...
Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...OCLC
 
Changing Tack: A Future-Focused ACRL Research Agenda
Changing Tack: A Future-Focused ACRL Research AgendaChanging Tack: A Future-Focused ACRL Research Agenda
Changing Tack: A Future-Focused ACRL Research AgendaOCLC
 
Qualitative Research Methods in LIS
Qualitative Research Methods in LISQualitative Research Methods in LIS
Qualitative Research Methods in LISOCLC
 

Plus de OCLC (20)

Communicating library impact beyond library walls: Findings from an action-or...
Communicating library impact beyond library walls: Findings from an action-or...Communicating library impact beyond library walls: Findings from an action-or...
Communicating library impact beyond library walls: Findings from an action-or...
 
"You can just tell whether a website looks reliable or not." People's modes o...
"You can just tell whether a website looks reliable or not." People's modes o..."You can just tell whether a website looks reliable or not." People's modes o...
"You can just tell whether a website looks reliable or not." People's modes o...
 
Factors influencing research data management programs.
Factors influencing research data management programs.Factors influencing research data management programs.
Factors influencing research data management programs.
 
Teaching research methods in LIS programs: Approaches, formats, and innovativ...
Teaching research methods in LIS programs: Approaches, formats, and innovativ...Teaching research methods in LIS programs: Approaches, formats, and innovativ...
Teaching research methods in LIS programs: Approaches, formats, and innovativ...
 
OCLC ALISE Library & Information Science Research Grant Program
OCLC ALISE Library & Information Science Research Grant ProgramOCLC ALISE Library & Information Science Research Grant Program
OCLC ALISE Library & Information Science Research Grant Program
 
Investing in library users and potential users: The Many Faces of Digital Vi...
 Investing in library users and potential users: The Many Faces of Digital Vi... Investing in library users and potential users: The Many Faces of Digital Vi...
Investing in library users and potential users: The Many Faces of Digital Vi...
 
Academic library impact: Improving practice and essential areas to research
Academic library impact: Improving practice and essential areas to researchAcademic library impact: Improving practice and essential areas to research
Academic library impact: Improving practice and essential areas to research
 
Studying information behavior: The Many Faces of Digital Visitors and Residents
Studying information behavior: The Many Faces of Digital Visitors and ResidentsStudying information behavior: The Many Faces of Digital Visitors and Residents
Studying information behavior: The Many Faces of Digital Visitors and Residents
 
Online engagement and information literacy: The Many Face of Digital Visitors...
Online engagement and information literacy: The Many Face of Digital Visitors...Online engagement and information literacy: The Many Face of Digital Visitors...
Online engagement and information literacy: The Many Face of Digital Visitors...
 
People's mode of online engagement: The Many Faces of Digital Visitors and R...
 People's mode of online engagement: The Many Faces of Digital Visitors and R... People's mode of online engagement: The Many Faces of Digital Visitors and R...
People's mode of online engagement: The Many Faces of Digital Visitors and R...
 
Applying research methods: Investigating the Many Faces of Digital Visitors &...
Applying research methods: Investigating the Many Faces of Digital Visitors &...Applying research methods: Investigating the Many Faces of Digital Visitors &...
Applying research methods: Investigating the Many Faces of Digital Visitors &...
 
OCLC RLP @ RLUK
OCLC RLP @ RLUKOCLC RLP @ RLUK
OCLC RLP @ RLUK
 
Using Qualitative Methods for Library Evaluation: An Interactive Workshop
Using Qualitative Methods for Library Evaluation: An Interactive WorkshopUsing Qualitative Methods for Library Evaluation: An Interactive Workshop
Using Qualitative Methods for Library Evaluation: An Interactive Workshop
 
Visitors and Residents: The Hows and Whys of Engagement with Technology
Visitors and Residents: The Hows and Whys of Engagement with TechnologyVisitors and Residents: The Hows and Whys of Engagement with Technology
Visitors and Residents: The Hows and Whys of Engagement with Technology
 
Action-Oriented Research Agenda on Library Contributions to Student Learning ...
Action-Oriented Research Agenda on Library Contributions to Student Learning ...Action-Oriented Research Agenda on Library Contributions to Student Learning ...
Action-Oriented Research Agenda on Library Contributions to Student Learning ...
 
Visitors and Residents: Interactive Mapping Exercise Workshop
Visitors and Residents: Interactive Mapping Exercise WorkshopVisitors and Residents: Interactive Mapping Exercise Workshop
Visitors and Residents: Interactive Mapping Exercise Workshop
 
The Library in the Life of the User
The Library in the Life of the UserThe Library in the Life of the User
The Library in the Life of the User
 
Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...
Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...
Where are We Going and What Do We Do Next? Demonstrating the Value of Academi...
 
Changing Tack: A Future-Focused ACRL Research Agenda
Changing Tack: A Future-Focused ACRL Research AgendaChanging Tack: A Future-Focused ACRL Research Agenda
Changing Tack: A Future-Focused ACRL Research Agenda
 
Qualitative Research Methods in LIS
Qualitative Research Methods in LISQualitative Research Methods in LIS
Qualitative Research Methods in LIS
 

Dernier

UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfNirmal Dwivedi
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701bronxfugly43
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxAreebaZafar22
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxAmanpreet Kaur
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfagholdier
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxVishalSingh1417
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...Nguyen Thanh Tu Collection
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsMebane Rash
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...christianmathematics
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxJisc
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptxMaritesTamaniVerdade
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Jisc
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfPoh-Sun Goh
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...Poonam Aher Patil
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxheathfieldcps1
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the ClassroomPooky Knightsmith
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdfssuserdda66b
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxDenish Jangid
 

Dernier (20)

UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701ComPTIA Overview | Comptia Security+ Book SY0-701
ComPTIA Overview | Comptia Security+ Book SY0-701
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptxSKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
SKILL OF INTRODUCING THE LESSON MICRO SKILLS.pptx
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Spatium Project Simulation student brief
Spatium Project Simulation student briefSpatium Project Simulation student brief
Spatium Project Simulation student brief
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
2024-NATIONAL-LEARNING-CAMP-AND-OTHER.pptx
 
Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)Accessible Digital Futures project (20/03/2024)
Accessible Digital Futures project (20/03/2024)
 
Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Fostering Friendships - Enhancing Social Bonds in the Classroom
Fostering Friendships - Enhancing Social Bonds  in the ClassroomFostering Friendships - Enhancing Social Bonds  in the Classroom
Fostering Friendships - Enhancing Social Bonds in the Classroom
 
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdfVishram Singh - Textbook of Anatomy  Upper Limb and Thorax.. Volume 1 (1).pdf
Vishram Singh - Textbook of Anatomy Upper Limb and Thorax.. Volume 1 (1).pdf
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptxBasic Civil Engineering first year Notes- Chapter 4 Building.pptx
Basic Civil Engineering first year Notes- Chapter 4 Building.pptx
 

Data Designed for Discovery

  • 1. IATUL • 20 June 2017 Data Designed for Discovery Roy Tennant Senior Program Officer, OCLC Research
  • 2.
  • 3. The world’s largest and most consulted bibliographic database • 2.5 Billion holdings • 400 Million bibliographic records • 10 Million Italian records • 57% non-English Where librarians and library patrons search
  • 4. • This is the Research view of linked data • We (OCLC) have experiments and prototypes, but no products or production services (yet) • We (OCLC Research) have been working with linked data for as long as anyone in the library world • Our (OCLC Research) playground is the entirety of WorldCat ( million records) and a parallel computing cluster • Stay tuned for more information on production services A few introductory remarks
  • 6. What we have to work with
  • 7. • A collection of text strings… • Taken from the piece itself… • Sometimes “enhanced” with inferred parentheticals (e.g., [1975] )… • Or additional statements not on the piece (e.g., subject headings) • Punctuation, which may or may not be present, is used (inconsistently) for structure • Mostly uncontrolled and only loosely connected to anything else • Designed for description rather than discovery What we have to work with
  • 9. • Identification Problems (two illustrated next): – The Title Problem – The Names Problem • Quality Problems (one illustrated next): – The Legacy Problem (strings are not controlled terms; often, they cannot be turned into them) • Linkage Problems (just two examples): – The Web Problem (records aren’t enough, you need links) – The Language Problem (showing the right translation for a given user) Actually, A Number of Problems
  • 10.
  • 11.
  • 15. Quick Definitions entity /ˈɛntɪti/ noun a thing with distinct and independent existence. relationship /rɪˈleɪʃ(ə)nʃɪp/ noun the way in which two or more people or things are connected
  • 16. Albert Einstein Person Relativity: The Special and General Theory Work Physics Concept author about …then establish relationships with other entities
  • 17. https://www.wikidata.org/wiki/Q937 and http://viaf.org/viaf/75121530 Wikidata and VIAF http://experiment.worldcat.org/entity/work/data/369081611 WorldCat Works http://id.loc.gov/authorities/subjects/sh85101653.html Library of Congress Subject Headings author about …with actionable links from authoritative data hubs
  • 18. A REAL WORLD EXAMPLE
  • 19. From Records to Entities: Works
  • 20.
  • 21.
  • 22.
  • 23.
  • 24. OCLC Production Services External OCLC Research Systems Internal OCLC Research Resources enhanced WorldCat WORKS Kindred Works Classify Identities FictionFinder Cookbook Finder LCSH FAST VIAF GMGPC GSAFD GTT DDC LCTGM MeSH Linked Data Entities
  • 25. OCLC’s linked data resources WorldCat Catalog: 15 billion triples WorldCat Works: 5 billion RDF triples FAST: 23 million triples VIAF: 2 billion triples ISNI: 10-50 million triples
  • 28. OCLC’S 2015 INTERNATIONAL LINKED DATA SURVEY SOURCE: KAREN SMITH-YOSHIMURA
  • 29. Academic library National library Network Government Scholarly Public Library Museum Other 31% 20%14% 10% 8% 7% 4% 6% 2015 responding institutions by type 71 institutions total
  • 30. What is published as linked data 0 10 20 30 40 50 60 Authority files Bibliographic data Data about musuem objects Datasets Descriptive metadata Digital collections Encoded archival descriptions Geographic data Ontologies/vocabularies Other
  • 31. 2015 linked data sources most consumed 2015 VIAF (Virtual International Authority File) 41 DBpedia 36 GeoNames 35 id.loc.gov 35 Resources we convert to linked data ourselves 17 Getty's AAT 16 FAST (Faceted Application of Subject Terminology) 15 WorldCat.org 15 data.bnf.fr 12 Deutsche National Bib Linked Data Service 12
  • 32. SOLVING PROBLEMS & MOVING TOWARD A LINKED DATA FUTURE
  • 34.
  • 35.
  • 36. Exploring Ways to Use Linked Data
  • 37.
  • 38.
  • 39. Title: Journey to the West Language: English Translator: Anthony C. Yu Date: 1977 IsTranslationOf: Title: Journey to the West Language: English Translator: W. J. F. Jenner Date: 1982-1984 IsTranslationOf: Title: 西遊記 Language: Chinese Author: 吳承恩 Created: 1592 HasTranslation: Title: Tây du ký bình khảo Language: Vietnamese Translator: Phan Quân Date: 1980 IsTranslationOf: Title: 西遊記 Language: Japanese Translator: 中野美代子 Date: 1986 IsTranslationOf: Title: Pilgerfahrt Language: German Translator: Georgette Boner Date: 1983 IsTranslationOf: Offering the right translation
  • 40. Title: Journey to the West Language: English Translator: Anthony C. Yu Date: 1977 IsTranslationOf: Title: Journey to the West Language: English Translator: W. J. F. Jenner Date: 1982-1984 IsTranslationOf: Title: 西遊記 Language: Chinese Author: 吳承恩 Created: 1592 HasTranslation: Title: Tây du ký bình khảo Language: Vietnamese Translator: Phan Quân Date: 1980 IsTranslationOf: Title: 西遊記 Language: Japanese Translator: 中野美代子 Date: 1986 IsTranslationOf: Title: Pilgerfahrt Language: German Translator: Georgette Boner Date: 1983 IsTranslationOf: Offering the right translation
  • 42. • Person Lookup Service – An experimental service for looking up OCLC Person Entities • Scenario: – A library wants to disambiguate a name – It sends the name text string to our API – We check all of our aggregated authority files and send back the best match(es) – Each response comes with one or more URIs (e.g., to LCNAF, Wikidata, ISNI, etc.) – The library inserts this data into their record, turning a text string into an actionable link on the web Prototyping New Services
  • 43. Replicate existing library functions more cheaply and efficiently Improve data integration A better user experience Greater Web visibility Develop better models of resources not well served by current standards Improve internal data management In Summary: Why Linked Data?
  • 45. • Working with the Library of Congress and others to finalize the BIBFRAME standard • Beginning to explore what working with it at scale will mean Collaborating on BIBFRAME
  • 46. • Modeling bibliographic data using Schema.org • Collaborating on expanding the Schema.org with additional bibliographic elements at bib.schema.org • Syndicating WorldCat data to search engines using Schema.org markup Working With the Web
  • 47. Learning About Changing Workflows Photo by https://www.flickr.com/photos/sanjoselibrary/ - CC BY-SA 2.0
  • 48.
  • 49. • Use uniform titles • Use added entries with role codes (7xx and $4) • Use 041 for translations, including intermediate translations • Use indicators to refine the meaning • Use the most specific fields appropriate for a descriptive task • Minimize the use of 500 fields • Obey field semantics • Avoid redundancy If you must use free text: • Use established conventions • Use standardized terms Least machine-processable Most machine-processable Algorithmically recoverable Making MARC “Linked Data Ready”
  • 50. ‘Work’ Task Force ‘URI’ Task Force Analyze the ‘Work’ definitions referenced in library linked data. • How are they similar or different? • How do they relate to the classic FRBR definition? • What are the use cases for ‘Work?’ How should Work URIs be represented in MARC records? • What are the best practices for adding URIs to MARC records to ease the conversion to linked data? • How will cataloging or resource description workflows be affected? Working With the PCC To Make MARC LD Ready
  • 51. • We are in a major transition that will take YEARS to navigate • We don’t know yet exactly what the future holds… • ...but we know that it will be more linked and machine actionable (not just readable) than ever before • And that’s a Good Thing Summary Remarks
  • 53. SM Together we make breakthroughs possible. Thank you! Roy Tennant @rtennant tennantr@oclc.org facebook.com/roytennant IATUL • 20 June 2017 ©2017 OCLC. This work is licensed under a Creative Commons Attribution 4.0 International License. Suggested attribution: “This work uses content from “Data Designed for Discovery” © OCLC, used under a Creative Commons Attribution 4.0 International License: http://creativecommons.org/licenses/by/4.0/.”

Notes de l'éditeur

  1. Having an entry for every specific manifestation of a work presents particular problems for users. Imagine you are a student with a paper due tomorrow (as it always is) and you must choose which entry to click on to find a copy of the book. This kind of screen display is no better than a “hunting license”.
  2. When two different people have the same name, how can you be sure you have the right person? Unambiguous identifiers are needed.
  3. So the goal of linked data is to produce machine-understandable knowledge about things we are interested in. As librarians, we can jumpstart this process by upgrading the descriptions of things that librarians have always collected information about…authors, works, subjects.
  4. What’s new and different about linked data. URIs – a web location that is unique in the world and persistent. When referenced, they provide information about things. They may include links to other sources of information (VIAF & Wikidata both provide information about Albert Einstein…reinforcing and complementary. Make machine-understandable statements that link the sources of information. “Triples” – <Albert Einstein> <is the author of> <The General Theory of Relativity>
  5. As of August 2014, we can say that OCLC has published over 20 billion RDF triples extracted from MARC records and library authority files.
  6. Each of the sources contributing to VIAF has its own identifier, so VIAF can be viewed as an “ID aggregator”. This is the VIAF cluster for Noam Chomsky. VIAF publishes this information as linked data. <Click> This RDF states this is for a person. <Click> This RDF shows the different languages representing this person – further annotated with a geographic location, in this case Arabic in Egypt, Lebanon and Israel. This can be useful when multiple countries use the same language and writing system but with variations. Think of the differences in British or Canadian English and American English. <click> And the RDF gives the “same as” property for the identifier in each of the VIAF contributing sources .
  7. Wikidata not only aggregates identifiers but also disseminates them. In this case, the VIAF identifier in Wikidata is also included in <click> the English Wikipedia and the <click> Korean Wikipedia page for Jerry Brown, our California governor.
  8. 80 respondents; not a scientific sample; repeat of survey conducted in 2014. Karen will talk more about this at the CNI meeting in April. She will give a view of the tabulated responses. I’m going to do something different and complementary. Look at the corpus of linked data sites mentioned to try to understand why linked data is interesting to the library community and how mature the efforts are.
  9. This is how I categorized the responding institutions, but others may do it differently. National Libraries which responded (14): Biblioteca. Real Academia Nacional de Medicina, Bibliotheque nationale de France, British Library, German National Library, Koninklijke Bibliotheek, Library of Congress, National Diet Library, National Library of Malaysia, National Library of Medicine, National Library of Portugal, National Library of Spain, National Library of Sweden, National Library of Wales, National Széchényi Library [Hungary] Categorized as “network” (10): ABES, BIBSYS, Consorci de Serveis Universitaris de Catalunya, Digital Public Library of America, Europeana Foundation, Haute école de gestion de Genève (SwissBib), North Rhine-Westphalian Library Service Center, OCLC, RERO - Library Network of Western Switzerland, and The European Library. Government (7): Agencia Española de Cooperación Internacional para el Desarrollo (AECID). Biblioteca della Camera dei deputati (Italy), Biblioteca Valenciana Nicolau Primitiu, Biblioteca Virtual de Derecho Aragonés, Consejería de Educación, Cultura y Deportes Gobierno de Castilla-La Mancha, España, Diputación de Málaga. Cultura y Deportes. Biblioteca Cánovas del Castillo, Ministry of Defense (Spain) Scholarly (based at one institution but multi-institutional on a theme/discipline) (6): Big Data Institute [Muninn Project, Canadian Writing Research Collaboratory]; Colorado State [datasets from the NSF-funded Shortgrass Steppe-Long-Term Ecological Research station in northern Colorado, for researchers in natural sciences]; Fundacción Ignacio Larramendi (Spain); Pratt Institute [Linked jazz]; University of Alberta Libraries [Canadiana, partners with Pan-Canadian Documentary Heritage Network]; University of Applied Sciences St. Poelten [encyclopedic music data for music magazines, legal information for publishers and semantic tagging/indexing for video files at community TV network.] Public library/libraries (5): Anythink Libraries, Arapahoe Library District, Evansville Vanderburgh Public Library, New York Public Library, Oslo Public Library Museum (3): British Museum, J. Paul Getty Trust, Smithsonian Other: 1 publisher (Springer) and 3 societies (American Numismatic Society, Chemical Heritage Foundation, Minnesota Historical Society)
  10. Given the relatively large representation of libraries among respondents, no surprise that bibliographic and authority data are the most common types of data published, with descriptive metadata a close third. Other: 5 of the 11 “other” were about organizational data; 2 were data about people (researchers, library staff). 1 about performance works (e.g., shows).
  11. These are the sources 12 or more of the 2015 survey respondents reported that they consumed. I’ve starred the ones which also responded to the survey. Note that “resources we convert to linked data ourselves” is one of the top linked data sources consumed. One advice from linked data implementers is to first consume the linked data you publish. These could be considered successful publishers of linked data by the degree to which others consume the data provided. Three of the twelve are OCLC linked data sources. VIAF is the #1 linked data resource consumed by the respondents, partially because so many more national libraries responded to the 2015 survey.
  12. By using the concept of a “work” it is possible to aggregate all of the various manifestations of a title under one work depiction. This conceivably will allow users to use filters to locate the particular item they want, such as “show me only the books that are in English”, or “show me only the books that are on the shelf”.
  13. The second is the Person Lookup Service. This was a prototype service, used in a pilot study, that provided a means for users to lookup People and pull back string labels and descriptions (across a wide range of languages) as well as sameAs links to outside resources that described the Person. A good example of this would be finding the Person Abraham Lincoln. The service could provide you names and descriptions for him in 15+ languages as well as links to URIs in other datasets for him (such as LAC, WikiData, DNB, BNF, etc.)
  14. The second is the Person Lookup Service. This was a prototype service, used in a pilot study, that provided a means for users to lookup People and pull back string labels and descriptions (across a wide range of languages) as well as sameAs links to outside resources that described the Person. A good example of this would be finding the Person Abraham Lincoln. The service could provide you names and descriptions for him in 15+ languages as well as links to URIs in other datasets for him (such as LAC, WikiData, DNB, BNF, etc.)
  15. From the survey participants Triangle represents: level of effort; visibility of user-apparent benefit. Looks like an iceberg. Lots of invisible effort. But it accumulates. Bottom tier: Essentially a technology assessment exercise. Using URIs, not strings. Understanding and using data produced by third parties. Most of the datasets were from within the library community. Respondents reported that third-party datasets were too small and too unstable; semantics too hard to understand. BNF – connecting data resources that were in siloes before. Monographs + archives and digital descriptions. Oslo Public Library – reports a success Middle tier: Europeana; Digital Public Library of America Many smaller projects around digitization and archives—National Diet library Top tier: Scattered comments. Needs were met, but didn’t say how. SEO improvements. Best example is BNF. Montana State University report at CNI in April. Small-scale experiments with the user experience. Best example is Linked Jazz. Popular on the conference circuit in the U.S.
  16. Working with partners such as the UC Davis BIBFLOW project and the Linked Data for Libraries (LD4L) project to understand how linked data changes our work
  17. The list of recommendations can be organized into a sort of metadata “food pyramid.” Those at the top may be necessary, but should be used sparingly. Those further down should form the foundation of practice, if the goal is improved machine understanding of MARC metadata.
  18. Both committees are due to deliver recommendations later in 2017.