SlideShare une entreprise Scribd logo
1  sur  31
The world’s libraries. Connected.
Multilingual WorldCatpresented by Janifer Gatenby
IFLA, Singapore, 2013-08-19
Karen Smith Yoshimura
Eric Childress
Janifer Gatenby
Jean Godby
Richard Greene
Jenny Toves
Diane Vizine Goetz
Robert Bremer
JD Shipengrover
Gail Thornburg
Jay Weitz
The world’s libraries. Connected.
WorldCat Today
• Resources in nearly
all languages
• Contributed by more
than 20,000 libraries
worldwide
• More than half the
database is for works
not in English
The world’s libraries. Connected.
WorldCat Today
• Bibliographic Records
• Hybrid records
• Parallel records
• Clustered at Work
level (FRBR)
The world’s libraries. Connected.
Existing Architecture
Author
sAuthor
sAuthors
Subj
Classif
Subj
ClassifSubj
Classif
Holdin
gHoldin
gHoldings
Bibliographic
recordWork
cluster
Content
cluster
Manifes
tation
cluster
The world’s libraries. Connected.
Complementary Initiatives
Work Level
Record
GLIMIR
Manifestation
& Content
Clusters
Multi-lingual
Bibliographic
Structure
The world’s libraries. Connected.
Work Level Record
http://www.oclc.org/research/activities/workrecs.html
The world’s libraries. Connected.
Create a landing
page summarizing
content for a work
Work Level Record: Objective
The world’s libraries. Connected.
• The Content Cluster
• Enables better work record displays by reducing the number of lines that display
for large works
• Enables a choice of format and presents the formats that could be acceptable
substitutes
• Consolidates holdings for identical content
• The Manifestation Cluster is important
• Consolidates holdings at manifestation level
• In the short term allows the record catalogued in the language of the interface to
be chosen for display
• Reduces apparent duplication
• Allows a more accurate count of the number of manifestations in WorldCat (as
opposed to the number of records)
GLIMIR
The world’s libraries. Connected.
Creates true multi-lingual displays
• At work and manifestation levels
• Using all available data instead of “most appropriate
record”
• Generates data
Corrects many of the 28 million records coded “und”
Better control and linking of translations
Input to refinement of work clusters
Smarter data storage
Multilingual Bibliographic Structure Project
The world’s libraries. Connected.
• Worldcat.org selects the most appropriate record
to show to a user as representative of the work in
the short result list and beyond
• The end result will not be very satisfactory from a
multi-lingual viewpoint… here’s why
“Most appropriate” questioned
The world’s libraries. Connected.
Which record is better to present to a German speaker?
The world’s libraries. Connected.
Incomplete Swedish Record
The world’s libraries. Connected.
Hybrid record
The world’s libraries. Connected.
Most appropriate display
The world’s libraries. Connected.
• Work level data, mined from all associated
bibliographic records will be displayed
supplemented with expression / manifestation
level data as the user drills through the short to
fuller versions of the metadata.
Multilingual Bibliographic Structure Project
End user interface will show works and manifestations not bibliographic
records; the cataloguing client will also show bibliographic records
The world’s libraries. Connected.
Proposed new architecture
Work
eng
fre
ger
jpn
Manif
eng
Manif
eng
Manif
eng
Manif
eng
Manif
eng Manif
eng
o freNotes
Contents
++
Holdin
gHoldin
gHolding
Holdin
g
Subj
sif
Subj
Classif
eng
fre
ger
jpn
Author
sAuthor
sAuthorseng
fre
ger
jpn
eng
fre
ger
jpn
eng
fre
ger
jpn
Translations
(Language of work)
Manif
fre
Holding
The world’s libraries. Connected.
• Language tagging of elements, particularly
• Summaries (M21 520)
• Subject headings
• Display in script preferred by the user if data is
available
• Improve translated interfaces
• Show consolidated holdings as appropriate
Important principles
The world’s libraries. Connected.
The world’s libraries. Connected.
The world’s libraries. Connected.
The world’s libraries. Connected.
The world’s libraries. Connected.
Translations
The world’s libraries. Connected.
• The cream of the world’s cultural and knowledge
heritage is shared by being translated
• WorldCat contains many rich cataloguing records
for these translations
Great works are translated
GOAL: Data mine the really good records to
improve clustering, presentation, authority
records and linked data
The world’s libraries. Connected.
• Inconsistencies causing work clusters to be
incomplete & less than optimal search results
• Titles without subtitles
• Different forms of uniform title or missing uniform title
• Inverted title
• Different coding of original and translated information
Translations
Generated uniform title authority records will overcome most of these
differences without needing to edit individual records
The world’s libraries. Connected.
• Improve FRBR work groups
• Made by data mining
• Contribute to VIAF
• Diffuse via VIAF as linked data
• Possibility to create web page / web service
Generate uniform title authority records
The world’s libraries. Connected.
The world’s libraries. Connected.
Translation records in VIAF
• Will enrich VIAF significantly
• New elements - translated title and translator
Author Title Expressions in VIAF Translation count in
WorldCat
Atwood Blind assassin 8 31
Guevara Notas de viaje 0 11
Hawking Grand design 0 18
Lenard Grosse naturforscher 1 3
Loti Pêcheur d’Islande 1 31
The world’s libraries. Connected.
• Records are freely available to the world from
VIAF in
• MARC-21
• XML
• RDF (linked data)
• Just links in JSON
• And other formats as introduced
Diffusion of Translation records
The world’s libraries. Connected.
• # of manifestations as
opposed to # of records
• # of works that have
translations
• Top translated authors
and works
• And more 
We don’t know now, but soon will

Contenu connexe

Tendances

OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.OCLC
 
OCLC and the Social Web: Building tools, providing platforms, engaging the co...
OCLC and the Social Web:Building tools, providing platforms, engaging the co...OCLC and the Social Web:Building tools, providing platforms, engaging the co...
OCLC and the Social Web: Building tools, providing platforms, engaging the co...Andy Havens
 
OA in the Library Collection: The Challenge of Identifying and Managing Open ...
OA in the Library Collection: The Challenge of Identifying and Managing Open ...OA in the Library Collection: The Challenge of Identifying and Managing Open ...
OA in the Library Collection: The Challenge of Identifying and Managing Open ...NASIG
 
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosConnecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosOCLC
 
Collection Directions: Some Reflections on Libraries and Stewardship of the ...
 Collection Directions: Some Reflections on Libraries and Stewardship of the ... Collection Directions: Some Reflections on Libraries and Stewardship of the ...
Collection Directions: Some Reflections on Libraries and Stewardship of the ...OCLC
 
Best Practices for Descriptive Metadata for Web Archiving
Best Practices for Descriptive Metadata for Web ArchivingBest Practices for Descriptive Metadata for Web Archiving
Best Practices for Descriptive Metadata for Web ArchivingOCLC
 
Social metadata for libraries, archives and museums: Research findings from t...
Social metadata for libraries, archives and museums: Research findings from t...Social metadata for libraries, archives and museums: Research findings from t...
Social metadata for libraries, archives and museums: Research findings from t...Rose Holley
 
Cloud Library: Precipitating change in library infrastructure
Cloud Library: Precipitating change in library infrastructureCloud Library: Precipitating change in library infrastructure
Cloud Library: Precipitating change in library infrastructureOCLC Research
 
Let's Get Visible! with Karla Smith, Winnefox Library System
Let's Get Visible! with Karla Smith, Winnefox Library SystemLet's Get Visible! with Karla Smith, Winnefox Library System
Let's Get Visible! with Karla Smith, Winnefox Library SystemWiLS
 
IASSIT Kansa Presentation
IASSIT Kansa PresentationIASSIT Kansa Presentation
IASSIT Kansa Presentationekansa
 
The OCLC Research Library Partnership
The OCLC Research Library PartnershipThe OCLC Research Library Partnership
The OCLC Research Library PartnershipOCLC
 
Library Assessment Toolkit & Dashboard Scoping Research Final Report and Path...
Library Assessment Toolkit & Dashboard Scoping Research Final Report and Path...Library Assessment Toolkit & Dashboard Scoping Research Final Report and Path...
Library Assessment Toolkit & Dashboard Scoping Research Final Report and Path...Megan Hurst
 
The network reshapes the research library collection
The network reshapes the research library collectionThe network reshapes the research library collection
The network reshapes the research library collectionlisld
 

Tendances (20)

OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.OCLC Research Update at ALA Chicago. June 26, 2017.
OCLC Research Update at ALA Chicago. June 26, 2017.
 
OCLC and the Social Web: Building tools, providing platforms, engaging the co...
OCLC and the Social Web:Building tools, providing platforms, engaging the co...OCLC and the Social Web:Building tools, providing platforms, engaging the co...
OCLC and the Social Web: Building tools, providing platforms, engaging the co...
 
OA in the Library Collection: The Challenge of Identifying and Managing Open ...
OA in the Library Collection: The Challenge of Identifying and Managing Open ...OA in the Library Collection: The Challenge of Identifying and Managing Open ...
OA in the Library Collection: The Challenge of Identifying and Managing Open ...
 
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata SilosConnecting the Dots: Linking Digitized Collections Across Metadata Silos
Connecting the Dots: Linking Digitized Collections Across Metadata Silos
 
Collection Directions: Some Reflections on Libraries and Stewardship of the ...
 Collection Directions: Some Reflections on Libraries and Stewardship of the ... Collection Directions: Some Reflections on Libraries and Stewardship of the ...
Collection Directions: Some Reflections on Libraries and Stewardship of the ...
 
NISO Webinar: The Future of Integrated Library Systems PART 2: User Interaction
NISO Webinar: The Future of Integrated Library Systems PART 2: User InteractionNISO Webinar: The Future of Integrated Library Systems PART 2: User Interaction
NISO Webinar: The Future of Integrated Library Systems PART 2: User Interaction
 
Thompson 6-jun15-final
Thompson 6-jun15-finalThompson 6-jun15-final
Thompson 6-jun15-final
 
Best Practices for Descriptive Metadata for Web Archiving
Best Practices for Descriptive Metadata for Web ArchivingBest Practices for Descriptive Metadata for Web Archiving
Best Practices for Descriptive Metadata for Web Archiving
 
Gonzalez-8-jun15
Gonzalez-8-jun15Gonzalez-8-jun15
Gonzalez-8-jun15
 
Social metadata for libraries, archives and museums: Research findings from t...
Social metadata for libraries, archives and museums: Research findings from t...Social metadata for libraries, archives and museums: Research findings from t...
Social metadata for libraries, archives and museums: Research findings from t...
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
 
Cloud Library: Precipitating change in library infrastructure
Cloud Library: Precipitating change in library infrastructureCloud Library: Precipitating change in library infrastructure
Cloud Library: Precipitating change in library infrastructure
 
Let's Get Visible! with Karla Smith, Winnefox Library System
Let's Get Visible! with Karla Smith, Winnefox Library SystemLet's Get Visible! with Karla Smith, Winnefox Library System
Let's Get Visible! with Karla Smith, Winnefox Library System
 
IASSIT Kansa Presentation
IASSIT Kansa PresentationIASSIT Kansa Presentation
IASSIT Kansa Presentation
 
Lauruhn-5-jun15
Lauruhn-5-jun15Lauruhn-5-jun15
Lauruhn-5-jun15
 
The OCLC Research Library Partnership
The OCLC Research Library PartnershipThe OCLC Research Library Partnership
The OCLC Research Library Partnership
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
 
Library Assessment Toolkit & Dashboard Scoping Research Final Report and Path...
Library Assessment Toolkit & Dashboard Scoping Research Final Report and Path...Library Assessment Toolkit & Dashboard Scoping Research Final Report and Path...
Library Assessment Toolkit & Dashboard Scoping Research Final Report and Path...
 
2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery2015 NISO Forum: The Future of Library Resource Discovery
2015 NISO Forum: The Future of Library Resource Discovery
 
The network reshapes the research library collection
The network reshapes the research library collectionThe network reshapes the research library collection
The network reshapes the research library collection
 

En vedette

Lupus (Manifestaciones cutáneas)
Lupus (Manifestaciones cutáneas)Lupus (Manifestaciones cutáneas)
Lupus (Manifestaciones cutáneas)Ricardo Zavala
 
Multilingualism ifla 2014 08
Multilingualism ifla 2014 08Multilingualism ifla 2014 08
Multilingualism ifla 2014 08Janifer Gatenby
 
Isni behind the scenes gatenby nadav manes harvard 201411
Isni behind the scenes gatenby nadav manes harvard 201411Isni behind the scenes gatenby nadav manes harvard 201411
Isni behind the scenes gatenby nadav manes harvard 201411Janifer Gatenby
 
Mapping Ballet Méchanique
Mapping Ballet MéchaniqueMapping Ballet Méchanique
Mapping Ballet MéchaniqueJohn Howrey
 
Viaf and isni ifla 2013 08-16
Viaf and isni  ifla 2013 08-16Viaf and isni  ifla 2013 08-16
Viaf and isni ifla 2013 08-16Janifer Gatenby
 
SparxITSolutions-Company-Profile
SparxITSolutions-Company-ProfileSparxITSolutions-Company-Profile
SparxITSolutions-Company-ProfileVikash Sharma
 
Art discovery group catalogue: Usage, content and new horizons
Art discovery group catalogue:  Usage, content and new horizonsArt discovery group catalogue:  Usage, content and new horizons
Art discovery group catalogue: Usage, content and new horizonsJanifer Gatenby
 
It19 20140721 linked data personal perspective
It19 20140721 linked data personal perspectiveIt19 20140721 linked data personal perspective
It19 20140721 linked data personal perspectiveJanifer Gatenby
 
Art discovery view to future content and interface ifla lyon 20140820
Art discovery view to future content and interface   ifla lyon 20140820Art discovery view to future content and interface   ifla lyon 20140820
Art discovery view to future content and interface ifla lyon 20140820Janifer Gatenby
 
Hipertensión pulmonar
Hipertensión pulmonarHipertensión pulmonar
Hipertensión pulmonarRicardo Zavala
 
Isni where are we now gatenby harvard 2014 11
Isni where are we now gatenby harvard 2014 11Isni where are we now gatenby harvard 2014 11
Isni where are we now gatenby harvard 2014 11Janifer Gatenby
 
Viaf and isni ifla 2014 08-15
Viaf and isni ifla 2014 08-15Viaf and isni ifla 2014 08-15
Viaf and isni ifla 2014 08-15Janifer Gatenby
 
Diagnóstico y tratamiento de aterosclerosis.
Diagnóstico y tratamiento de aterosclerosis.Diagnóstico y tratamiento de aterosclerosis.
Diagnóstico y tratamiento de aterosclerosis.Ricardo Zavala
 

En vedette (16)

Lupus (Manifestaciones cutáneas)
Lupus (Manifestaciones cutáneas)Lupus (Manifestaciones cutáneas)
Lupus (Manifestaciones cutáneas)
 
Multilingualism ifla 2014 08
Multilingualism ifla 2014 08Multilingualism ifla 2014 08
Multilingualism ifla 2014 08
 
Tecnologia Educativa
Tecnologia EducativaTecnologia Educativa
Tecnologia Educativa
 
Grammar book
Grammar bookGrammar book
Grammar book
 
Isni behind the scenes gatenby nadav manes harvard 201411
Isni behind the scenes gatenby nadav manes harvard 201411Isni behind the scenes gatenby nadav manes harvard 201411
Isni behind the scenes gatenby nadav manes harvard 201411
 
Juan presentacion 2
Juan presentacion 2Juan presentacion 2
Juan presentacion 2
 
Mapping Ballet Méchanique
Mapping Ballet MéchaniqueMapping Ballet Méchanique
Mapping Ballet Méchanique
 
Viaf and isni ifla 2013 08-16
Viaf and isni  ifla 2013 08-16Viaf and isni  ifla 2013 08-16
Viaf and isni ifla 2013 08-16
 
SparxITSolutions-Company-Profile
SparxITSolutions-Company-ProfileSparxITSolutions-Company-Profile
SparxITSolutions-Company-Profile
 
Art discovery group catalogue: Usage, content and new horizons
Art discovery group catalogue:  Usage, content and new horizonsArt discovery group catalogue:  Usage, content and new horizons
Art discovery group catalogue: Usage, content and new horizons
 
It19 20140721 linked data personal perspective
It19 20140721 linked data personal perspectiveIt19 20140721 linked data personal perspective
It19 20140721 linked data personal perspective
 
Art discovery view to future content and interface ifla lyon 20140820
Art discovery view to future content and interface   ifla lyon 20140820Art discovery view to future content and interface   ifla lyon 20140820
Art discovery view to future content and interface ifla lyon 20140820
 
Hipertensión pulmonar
Hipertensión pulmonarHipertensión pulmonar
Hipertensión pulmonar
 
Isni where are we now gatenby harvard 2014 11
Isni where are we now gatenby harvard 2014 11Isni where are we now gatenby harvard 2014 11
Isni where are we now gatenby harvard 2014 11
 
Viaf and isni ifla 2014 08-15
Viaf and isni ifla 2014 08-15Viaf and isni ifla 2014 08-15
Viaf and isni ifla 2014 08-15
 
Diagnóstico y tratamiento de aterosclerosis.
Diagnóstico y tratamiento de aterosclerosis.Diagnóstico y tratamiento de aterosclerosis.
Diagnóstico y tratamiento de aterosclerosis.
 

Similaire à Multilingual presentation ifla 2013 08-19

EDUC 5N99 ISP
EDUC 5N99 ISPEDUC 5N99 ISP
EDUC 5N99 ISPjthiessen
 
Everything you always wanted to know about WorldCat (but were afraid to ask) ...
Everything you always wanted to know about WorldCat (but were afraid to ask) ...Everything you always wanted to know about WorldCat (but were afraid to ask) ...
Everything you always wanted to know about WorldCat (but were afraid to ask) ...CILIP MDG
 
The Power of Sharing Linked Data: Giving the Web What It Wants
The Power of Sharing Linked Data: Giving the Web What It WantsThe Power of Sharing Linked Data: Giving the Web What It Wants
The Power of Sharing Linked Data: Giving the Web What It WantsNASIG
 
The Power of Sharing Linked Data (NASIG)
The Power of Sharing Linked Data (NASIG)The Power of Sharing Linked Data (NASIG)
The Power of Sharing Linked Data (NASIG)Richard Wallis
 
Webscale Discovery EDS / WorldCat Local "quick start" Charleston 2012 - Expanded
Webscale Discovery EDS / WorldCat Local "quick start" Charleston 2012 - ExpandedWebscale Discovery EDS / WorldCat Local "quick start" Charleston 2012 - Expanded
Webscale Discovery EDS / WorldCat Local "quick start" Charleston 2012 - ExpandedRafal Kasprowski
 
Richard Wallis Linked Data
Richard Wallis Linked DataRichard Wallis Linked Data
Richard Wallis Linked DataIncisive_Events
 
Multilingual issues in the representation of international bibliographic stan...
Multilingual issues in the representation of international bibliographic stan...Multilingual issues in the representation of international bibliographic stan...
Multilingual issues in the representation of international bibliographic stan...Gordon Dunsire
 
Kampmeier ecn 2012
Kampmeier ecn 2012Kampmeier ecn 2012
Kampmeier ecn 2012ECNOfficer
 
The web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedThe web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedSören Auer
 
When the Web of Linked Data Arrives
When the Web of Linked Data ArrivesWhen the Web of Linked Data Arrives
When the Web of Linked Data ArrivesRichard Wallis
 
NLW Linked Open Data Sets
NLW Linked Open Data SetsNLW Linked Open Data Sets
NLW Linked Open Data SetsGlen Robson
 
Building the New Open Linked Library
Building the New Open Linked LibraryBuilding the New Open Linked Library
Building the New Open Linked LibraryJoel Richard
 
Linked Data and cultural heritage data: an overview of the approaches from Eu...
Linked Data and cultural heritage data: an overview of the approaches from Eu...Linked Data and cultural heritage data: an overview of the approaches from Eu...
Linked Data and cultural heritage data: an overview of the approaches from Eu...The European Library
 
Archives' User Studies & Archival WorldCat Records
Archives' User Studies & Archival WorldCat RecordsArchives' User Studies & Archival WorldCat Records
Archives' User Studies & Archival WorldCat RecordsOCLC Research
 
Jabes 2010 - Réseaux étrangers "Libris, réseau suédois"
Jabes 2010 - Réseaux étrangers "Libris, réseau suédois"Jabes 2010 - Réseaux étrangers "Libris, réseau suédois"
Jabes 2010 - Réseaux étrangers "Libris, réseau suédois"ABES
 
Radicalize Your Library Catalog with Ebooks Your Patrons Can Keep Forever
Radicalize Your Library Catalog with Ebooks Your Patrons Can Keep ForeverRadicalize Your Library Catalog with Ebooks Your Patrons Can Keep Forever
Radicalize Your Library Catalog with Ebooks Your Patrons Can Keep Foreverloriayre
 
Which Data Quality is Needed and Affordable?
Which Data Quality is Needed and Affordable?Which Data Quality is Needed and Affordable?
Which Data Quality is Needed and Affordable?Charleston Conference
 
The Power of Sharing Linked Data: Bibliothekartag 2014
The Power of Sharing Linked Data: Bibliothekartag 2014The Power of Sharing Linked Data: Bibliothekartag 2014
The Power of Sharing Linked Data: Bibliothekartag 2014Richard Wallis
 
Publishing the British National Bibliography as Linked Open Data / Corine Del...
Publishing the British National Bibliography as Linked Open Data / Corine Del...Publishing the British National Bibliography as Linked Open Data / Corine Del...
Publishing the British National Bibliography as Linked Open Data / Corine Del...CIGScotland
 
New Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data CitationsNew Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data CitationsJohn Kunze
 

Similaire à Multilingual presentation ifla 2013 08-19 (20)

EDUC 5N99 ISP
EDUC 5N99 ISPEDUC 5N99 ISP
EDUC 5N99 ISP
 
Everything you always wanted to know about WorldCat (but were afraid to ask) ...
Everything you always wanted to know about WorldCat (but were afraid to ask) ...Everything you always wanted to know about WorldCat (but were afraid to ask) ...
Everything you always wanted to know about WorldCat (but were afraid to ask) ...
 
The Power of Sharing Linked Data: Giving the Web What It Wants
The Power of Sharing Linked Data: Giving the Web What It WantsThe Power of Sharing Linked Data: Giving the Web What It Wants
The Power of Sharing Linked Data: Giving the Web What It Wants
 
The Power of Sharing Linked Data (NASIG)
The Power of Sharing Linked Data (NASIG)The Power of Sharing Linked Data (NASIG)
The Power of Sharing Linked Data (NASIG)
 
Webscale Discovery EDS / WorldCat Local "quick start" Charleston 2012 - Expanded
Webscale Discovery EDS / WorldCat Local "quick start" Charleston 2012 - ExpandedWebscale Discovery EDS / WorldCat Local "quick start" Charleston 2012 - Expanded
Webscale Discovery EDS / WorldCat Local "quick start" Charleston 2012 - Expanded
 
Richard Wallis Linked Data
Richard Wallis Linked DataRichard Wallis Linked Data
Richard Wallis Linked Data
 
Multilingual issues in the representation of international bibliographic stan...
Multilingual issues in the representation of international bibliographic stan...Multilingual issues in the representation of international bibliographic stan...
Multilingual issues in the representation of international bibliographic stan...
 
Kampmeier ecn 2012
Kampmeier ecn 2012Kampmeier ecn 2012
Kampmeier ecn 2012
 
The web of interlinked data and knowledge stripped
The web of interlinked data and knowledge strippedThe web of interlinked data and knowledge stripped
The web of interlinked data and knowledge stripped
 
When the Web of Linked Data Arrives
When the Web of Linked Data ArrivesWhen the Web of Linked Data Arrives
When the Web of Linked Data Arrives
 
NLW Linked Open Data Sets
NLW Linked Open Data SetsNLW Linked Open Data Sets
NLW Linked Open Data Sets
 
Building the New Open Linked Library
Building the New Open Linked LibraryBuilding the New Open Linked Library
Building the New Open Linked Library
 
Linked Data and cultural heritage data: an overview of the approaches from Eu...
Linked Data and cultural heritage data: an overview of the approaches from Eu...Linked Data and cultural heritage data: an overview of the approaches from Eu...
Linked Data and cultural heritage data: an overview of the approaches from Eu...
 
Archives' User Studies & Archival WorldCat Records
Archives' User Studies & Archival WorldCat RecordsArchives' User Studies & Archival WorldCat Records
Archives' User Studies & Archival WorldCat Records
 
Jabes 2010 - Réseaux étrangers "Libris, réseau suédois"
Jabes 2010 - Réseaux étrangers "Libris, réseau suédois"Jabes 2010 - Réseaux étrangers "Libris, réseau suédois"
Jabes 2010 - Réseaux étrangers "Libris, réseau suédois"
 
Radicalize Your Library Catalog with Ebooks Your Patrons Can Keep Forever
Radicalize Your Library Catalog with Ebooks Your Patrons Can Keep ForeverRadicalize Your Library Catalog with Ebooks Your Patrons Can Keep Forever
Radicalize Your Library Catalog with Ebooks Your Patrons Can Keep Forever
 
Which Data Quality is Needed and Affordable?
Which Data Quality is Needed and Affordable?Which Data Quality is Needed and Affordable?
Which Data Quality is Needed and Affordable?
 
The Power of Sharing Linked Data: Bibliothekartag 2014
The Power of Sharing Linked Data: Bibliothekartag 2014The Power of Sharing Linked Data: Bibliothekartag 2014
The Power of Sharing Linked Data: Bibliothekartag 2014
 
Publishing the British National Bibliography as Linked Open Data / Corine Del...
Publishing the British National Bibliography as Linked Open Data / Corine Del...Publishing the British National Bibliography as Linked Open Data / Corine Del...
Publishing the British National Bibliography as Linked Open Data / Corine Del...
 
New Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data CitationsNew Metaphors: Data Papers and Data Citations
New Metaphors: Data Papers and Data Citations
 

Dernier

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking MenDelhi Call girls
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024Results
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Servicegiselly40
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Miguel Araújo
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slidespraypatel2
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024The Digital Insurer
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationMichael W. Hawkins
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking MenDelhi Call girls
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...gurkirankumar98700
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slidevu2urc
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationSafe Software
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking MenDelhi Call girls
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 

Dernier (20)

How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024A Call to Action for Generative AI in 2024
A Call to Action for Generative AI in 2024
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
CNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of ServiceCNv6 Instructor Chapter 6 Quality of Service
CNv6 Instructor Chapter 6 Quality of Service
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024
 
Slack Application Development 101 Slides
Slack Application Development 101 SlidesSlack Application Development 101 Slides
Slack Application Development 101 Slides
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
Kalyanpur ) Call Girls in Lucknow Finest Escorts Service 🍸 8923113531 🎰 Avail...
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 

Multilingual presentation ifla 2013 08-19

  • 1. The world’s libraries. Connected. Multilingual WorldCatpresented by Janifer Gatenby IFLA, Singapore, 2013-08-19 Karen Smith Yoshimura Eric Childress Janifer Gatenby Jean Godby Richard Greene Jenny Toves Diane Vizine Goetz Robert Bremer JD Shipengrover Gail Thornburg Jay Weitz
  • 2. The world’s libraries. Connected. WorldCat Today • Resources in nearly all languages • Contributed by more than 20,000 libraries worldwide • More than half the database is for works not in English
  • 3. The world’s libraries. Connected. WorldCat Today • Bibliographic Records • Hybrid records • Parallel records • Clustered at Work level (FRBR)
  • 4. The world’s libraries. Connected. Existing Architecture Author sAuthor sAuthors Subj Classif Subj ClassifSubj Classif Holdin gHoldin gHoldings Bibliographic recordWork cluster Content cluster Manifes tation cluster
  • 5. The world’s libraries. Connected. Complementary Initiatives Work Level Record GLIMIR Manifestation & Content Clusters Multi-lingual Bibliographic Structure
  • 6. The world’s libraries. Connected. Work Level Record http://www.oclc.org/research/activities/workrecs.html
  • 7. The world’s libraries. Connected. Create a landing page summarizing content for a work Work Level Record: Objective
  • 8. The world’s libraries. Connected. • The Content Cluster • Enables better work record displays by reducing the number of lines that display for large works • Enables a choice of format and presents the formats that could be acceptable substitutes • Consolidates holdings for identical content • The Manifestation Cluster is important • Consolidates holdings at manifestation level • In the short term allows the record catalogued in the language of the interface to be chosen for display • Reduces apparent duplication • Allows a more accurate count of the number of manifestations in WorldCat (as opposed to the number of records) GLIMIR
  • 9. The world’s libraries. Connected. Creates true multi-lingual displays • At work and manifestation levels • Using all available data instead of “most appropriate record” • Generates data Corrects many of the 28 million records coded “und” Better control and linking of translations Input to refinement of work clusters Smarter data storage Multilingual Bibliographic Structure Project
  • 10. The world’s libraries. Connected. • Worldcat.org selects the most appropriate record to show to a user as representative of the work in the short result list and beyond • The end result will not be very satisfactory from a multi-lingual viewpoint… here’s why “Most appropriate” questioned
  • 11. The world’s libraries. Connected. Which record is better to present to a German speaker?
  • 12. The world’s libraries. Connected. Incomplete Swedish Record
  • 13. The world’s libraries. Connected. Hybrid record
  • 14. The world’s libraries. Connected. Most appropriate display
  • 15. The world’s libraries. Connected. • Work level data, mined from all associated bibliographic records will be displayed supplemented with expression / manifestation level data as the user drills through the short to fuller versions of the metadata. Multilingual Bibliographic Structure Project End user interface will show works and manifestations not bibliographic records; the cataloguing client will also show bibliographic records
  • 16. The world’s libraries. Connected. Proposed new architecture Work eng fre ger jpn Manif eng Manif eng Manif eng Manif eng Manif eng Manif eng o freNotes Contents ++ Holdin gHoldin gHolding Holdin g Subj sif Subj Classif eng fre ger jpn Author sAuthor sAuthorseng fre ger jpn eng fre ger jpn eng fre ger jpn Translations (Language of work) Manif fre Holding
  • 17. The world’s libraries. Connected. • Language tagging of elements, particularly • Summaries (M21 520) • Subject headings • Display in script preferred by the user if data is available • Improve translated interfaces • Show consolidated holdings as appropriate Important principles
  • 22. The world’s libraries. Connected. Translations
  • 23. The world’s libraries. Connected. • The cream of the world’s cultural and knowledge heritage is shared by being translated • WorldCat contains many rich cataloguing records for these translations Great works are translated GOAL: Data mine the really good records to improve clustering, presentation, authority records and linked data
  • 24. The world’s libraries. Connected. • Inconsistencies causing work clusters to be incomplete & less than optimal search results • Titles without subtitles • Different forms of uniform title or missing uniform title • Inverted title • Different coding of original and translated information Translations Generated uniform title authority records will overcome most of these differences without needing to edit individual records
  • 25. The world’s libraries. Connected. • Improve FRBR work groups • Made by data mining • Contribute to VIAF • Diffuse via VIAF as linked data • Possibility to create web page / web service Generate uniform title authority records
  • 26.
  • 27.
  • 29. The world’s libraries. Connected. Translation records in VIAF • Will enrich VIAF significantly • New elements - translated title and translator Author Title Expressions in VIAF Translation count in WorldCat Atwood Blind assassin 8 31 Guevara Notas de viaje 0 11 Hawking Grand design 0 18 Lenard Grosse naturforscher 1 3 Loti Pêcheur d’Islande 1 31
  • 30. The world’s libraries. Connected. • Records are freely available to the world from VIAF in • MARC-21 • XML • RDF (linked data) • Just links in JSON • And other formats as introduced Diffusion of Translation records
  • 31. The world’s libraries. Connected. • # of manifestations as opposed to # of records • # of works that have translations • Top translated authors and works • And more  We don’t know now, but soon will

Notes de l'éditeur

  1. There are more than 300 million records in WorldCat today representing holdings of the world’s libraries. There are also another 200 million articles. Of the 300 million, more than half are in languages other than English.
  2. The 300 million records are clustered into works. The records may be in just one language of cataloguing or they may have a mixture of language of cataloguing, e.g. subject heading in more than one language. Parallel records may exist, i.e. records in different languages of cataloguing describing the same resource.
  3. The existing architecture is bibliographicrecord centric. There may be links to author and subject authority records. The work, manifestation and content clusters only contain identifiers, no metadata at present.
  4. Three complementary initiatives are in progress concerning multi-lingualism and WorldCat.
  5. The first project – generation of metadata at work level to make better presentations.
  6. The GLIMIR project creates clusters of parallel records for the same manifestation (manifestation cluster) and also clusters for the same content, though the form may be different (print, microform, digital). We are in the process of trawling thorugh the database to make these clusters.Only the part in blue will be changed by the multi-lingual bibliographic structure approach
  7. Concerning correcting records coded as “und” = “undetermined”, we expect that we can correct about 7 million by searching for the same title string in other records.
  8. Currently when the short list and full displays are created, the system selects the most appropriate record for display. The most appropriate record is determined by its size and the number of associated holdings and it was envisaged to extend this to include the language of cataloguing. But we think we can do much better by questioning the “most appropriate record” concept.
  9. Here the first record is catalogued in German but has no significant German content – no subject headings and no notes. There are other richer records that could inform the user better.
  10. Hyrid records also mean that significant linguistic information may be buried in other records.
  11. This record has subject headings in 3 languages.
  12. Build the information to display to users from all available records and show them all relevant holdings. Do not display just one selected record from the work set. Cataloguers too will benefit from this, being able to drill down to actual records where appropriate.
  13. This is the theoretical bibliographic structure that will no longer be bibliographic record centric but work centric. All information – authors, notes, summaries, subject headings will be flagged with the language of cataloguing.
  14. The work level metadata will be tagged with language for each data element.Instead of always showing the data from the main title in the record, the alternative script fields may be chosen for display, depending on what the system can determine about the user, e.g. from IP address range or expressed preference.Worldcat.org already includes the ability to change language of display, but the numbers of fields that change will be enhanced and the tabulation of several displays will be improved.Consolidate holdings from all records that re applicable to a display (work, content, manifestation level)
  15. An Iceland Fisherman, original in French displayed in English UI with Translations table.
  16. Same record with Chinese interface
  17. The Grand Design; English in Italian UI with Translations table.
  18. Same in Japanese interface
  19. Instead of working directly with lesser quality records to improve the quality in WorldCat and instead of working with the long tail, we are turning our attention to the most important works and working ways to use the good records to improve the quality.
  20. Translated titles not always consistent, causing work grouping failure.  Sometimes:caused by titles without sub titles, caused by different forms of uniform title, i.e. in Gujarati and in English (several forms)caused by inverting the titles, by placing the name Gandhi before “Autobiography”.  Some figures:  French – 15 records, 6 work sets; German -  9 records, 9 work sets; Italian 7 records, 5 work sets; Spanish 8 records, 4 work sets
  21. VIAFConsolidated display – shows work with expressions summary.
  22. This is the full expressions summary for the work Pêcheurd’Islande
  23. VIAFFull display – whos all the expressions, e.g. different translations with the title and the translator and the earliest determined publication date
  24. These records will enrich VIAF. There is no risk of generating bad records. The translation records will include both the translated title and the translator (both are not usually included in existing records in VIAF)Most of the expression records will be new to VIAF.
  25. The newly generated records will be available in the formats as distributed by VIAF.
  26. Just think. How many of the 150+ million records that are for non English language works are actually translations of English language records. And how many of the English language records are also translations? It could be as high as 25%? Once we have these records generated, it will open many new possibilities.Also, we know we have 300 million records, but how many real resources do we have? GLIMIR will produce these figures. We are just starting…