SlideShare une entreprise Scribd logo
1  sur  21
Télécharger pour lire hors ligne
Linking Library of
                 Congress Subject Headings
                                 Owen Stephens 14th July 2011
                                        @ostephens
                         http://www.meanboyfriend.com/overdue_ideas




Thursday, 14 July 2011
LNKNLCSH
                                      @ostephens




Thursday, 14 July 2011

This is the lightning version

I should precursor this talk by saying I’m really pleased that the LoC have invested in
experimenting with Linked Data representations of aspects of their data. Anything in this talk
isn’t a criticism of this, but about the issues we encountered using aspects of the data. It’s
possible that some or all of these problems may have been down to my lack of understanding
of LCSH and Linked Data :)
Library chops



Thursday, 14 July 2011

I’m a librarian - by nature and qualification :) - see http://www.meanboyfriend.com/
overdue_ideas/2010/11/library-routes/

Been working on the cusp between libraries and IT since 1995. Spending early part of my
career in small libraries means I have worked in just about every area of library front of house
and back office. However although I’ve catalogued books, and have more than a passing
familiarly with MARC, I’m not a cataloguer, and not an expert on LCSH
Linked Data chops



Thursday, 14 July 2011

I’ve been trying to understand the Semantic Web/Linked Data for several years :) My
understanding has been accelerated over the last couple of years by involvement in several
projects in the Linked Data space. Specifically the Lucero and CORE projects at the Open
University
Thursday, 14 July 2011

Expressing similarity between published papers in UK research repositories
Harvest metadata and full-text (50k papers from 143 UK repos so far)
Text mine for relationships
Expose ‘similarity’ measure as RDF triples using MuSIM Ontology (originally developed for
Music, but equally applicable)
For more information http://core-project.kmi.open.ac.uk
Exposing RDF




Thursday, 14 July 2011
Three ʻproductsʼ
CORE Portal - search or SPARQL metadata for harvested papers
CORE Mobile – Android application to search & navigate across related papers & downloading articles
CORE Plugin - Designed to integrate into existing repository interface to link to ʻrelated papersʼ in other repos, based on CORE ʻsimilarityʼ

For more information http://core-project.kmi.open.ac.uk
SPARQL Endpoint at http://core.kmi.open.ac.uk:8081/COREWeb/squery
How we express data in RDF: http://core-project.kmi.open.ac.uk/node/13
Lucero




Thursday, 14 July 2011

For more information see http://lucero-project.info
Data and SPARQL Endpoint available via http://data.open.ac.uk

Lucero published variety of data from the Open University as linked open data - admin data
(buildings), course data (course catalogue, OERs), research data and data about bibliographic
resources - including materials in the library (focussed on materials related to course
materials - around 30k catalogue records)
LCSH



Thursday, 14 July 2011
Lots been written about LCSH, it’s structure, whether it should be replaced. I don’t want to spend too much time on this today but it may come up in places

However it is probably worth recapping my understanding (if only to let those more knowledgeable correct it)

Key aspect in the context of this talk is that LCSH is primarily a pre-coordinated system - that is facets of subject headings are pre-combined into a single,
multi-faceted heading.
Although....
“LCSH itself requires some degree of post-coordination of the pre-coordinated strings to bring out specific topics of works.” (http://www.loc.gov/catdir/
cpso/pre_vs_post.pdf)

In fact the way that LCSH is structured in MARC records, and the way that indexes can be built on this in library management systems means that

I’m going to focus on ‘Topical’ subject headings (confusingly to me, LCSH can also cover Name, Title and Geographic headings)

Topical Terms can represent “a concrete object, animal, etc.; a category of people, animals, or objects; a more abstract concept, belief, process, or
phenomenon; an institution, etc.” (http://www.tulane.edu/~techserv/lcsh%20introd.html)

Topical LC Subject Headings are built by combining ‘Topical Terms’ with qualifiers (‘subdivisions’) which allow you to contextualise the term.
The types of subdivision available are:

General (a high level general qualifier - e.g. ‘History’)
Chronological (period of time - e.g. ‘20th Century’)
Geographic (place - e.g. ‘Great Britain’)
Form (the type/genre of material - e.g. ‘Dictionary’)

There are large number of rules that express how these subdivisions can be used in conjunction with Topical Terms, and the order in which they should be
expressed. Not all combinations are valid - for example only certain General subdivisions may be further subdivided Geographically. The rules are not always
black and white - they have ‘examples’ lists which you can use to inform you if it might be valid in a given situation.

Perhaps suffice to say that a document called ‘BASIC SUBJECT CATALOGING USING LCSH: Trainee’s Manual’ is 382 pages long.

Subject heading strings can be valid (i.e. constructed according to rules/patterns) while not being ‘Authorized’ - in this context and Authorized Heading is “A
preferred subject term as decided and established by the Library of Congress by means of an authority record.” (Thanks to Tom Meehan for this definition)
Thursday, 14 July 2011

Thanks to work of Ed Summers and others, the Library of Congress have a Linked Data
representation of LCSH in SKOS. However, this only covers ‘Authorized’ LCSH - presumably
because only those LCSH with an Authority record have an identifier within LoC systems? (I’m
speculating)
Thursday, 14 July 2011

This is a catalogue record from the OU - the two strings listed as ‘Subjects’ are LCSH (for
cataloguers amongst you MARC 650s)

Can see the linked data representation at http://data.open.ac.uk/page/library/289148
General
                                   Subdivision


 Science--Study and Teaching--Research

                Topical                                             General
                 Term                                              Subdivision




Thursday, 14 July 2011

This is made up of a Topical Term - Science and two general subdivisions ‘Study and
Teaching’ and ‘Research’
Science--Study and Teaching--Research
                                        id.loc.gov ?




Thursday, 14 July 2011

This is (afaik - I trust the cataloguers) a valid LCSH ... however it is not authorized ... and so
does not have a URI on id.loc.gov
Science--Study and Teaching--Research
   http://id.loc.gov/authorities/sh85118587#concept




Thursday, 14 July 2011

“Science--Study and Teaching”, however, is an authorized heading
Science--Study and Teaching--Research
   http://id.loc.gov/authorities/sh85118553#concept

   N.B. This is URI for Science as Topical Term not
  http://id.loc.gov/authorities/sh00007934#concept
  which is URI for Science as a General Subdivision
Thursday, 14 July 2011

As is “Science”
Science--Study and Teaching--Research
http://id.loc.gov/authorities/sh2001008697#concept




Thursday, 14 July 2011

Also “Study and Teaching” (as a topical subdivision) is an authorized heading
Science--Study and Teaching--Research
http://id.loc.gov/authorities/sh2002006576#concept
    N.B. This is URI for Research as General
   Subdivision not http://id.loc.gov/authorities/
sh85113021#concept which is URI for Research as
                  a Topical Term
Thursday, 14 July 2011

Also “Research” (as a topical subdivision) is an authorized heading
More links please


Thursday, 14 July 2011

If we only used id.loc.gov URIs where we had an authorised LCSH, we would end up with only
a small number of links. Some URIs in id.loc.gov would never be used in this way as they only
represent subdivisions - never valid by themselves.

Therefor decided to check a variety of combinations against id.loc.gov
Science--Study and Teaching--Research
    Science--Study and teaching     http://id.loc.gov/authorities/
                                       sh85118587#concept

                                    http://id.loc.gov/authorities/
    Science                            sh85118553#concept

                                    http://id.loc.gov/authorities/
    Study and Teaching                sh2001008697#concept

                                    http://id.loc.gov/authorities/
    Research                          sh2002006576#concept

Science--Study and Teaching--     http://data.open.ac.uk/page/topic/
                                           library/science--
          Research                 study_and_teaching--research
Thursday, 14 July 2011
MADS?


                         http://www.loc.gov/standards/mads/rdf/




Thursday, 14 July 2011

As far as I can see MADS (apart from looking complex) models the Authority - not the
heading - this doesn’t solve the problem we saw here!

That is MADS would solve the problem only for Authorized headings (which it does represent
as component parts - which I think addresses the issues raised by Karen Coyle at http://
kcoyle.blogspot.com/2009/05/lcsh-as-linked-data-beyond-dash-dash.html)

Happy to be corrected...
A different approach?
       bibo:authorList ( <http://examples.net/contributors/2>
               <http://examples.net/contributors/1>)


          lcsh:headingList ( <http://id.loc.gov/authorities/
       sh85118553#concept> <http://id.loc.gov/authorities/
      sh2001008697#concept> <http://id.loc.gov/authorities/
                    sh2002006576#concept>)




Thursday, 14 July 2011

If we could use rdfs:list to represent the pre-coordinated string of headings - then wouldn’t
care about whether ‘authorized’ or not, and would have all the individual headings there as
well (bibo lists authors individual and as a list)

Again copying BIBO which has each author as a dc:author as well, could represent each part
of the subject string as a separate dc:subject.

In a MADS world there would be advantage to expressing full authorized heading as well (for
relationships derived in MADS) although there is still the question of expressing ‘authorized
fragments’ which seems to me would also be useful with MADS for the same reasons

This feels like a simple approach that would at least allow us to capture the component parts
of subject string (and personally I’m not sure we ought to go further than this? do we need
to? why?). My feeling is lots of the work goes into representing the ‘Authority file’ as opposed
to how subject headings are used in the real world ... is this fair?
Details: http://discovery.ac.uk/developers/
   competition/
   Datasets: http://ckan.net/group/
   ukdiscovery
   Ask Questions: http://getthedata.org
   or #discodev
Thursday, 14 July 2011

Finally just an advert - if you are interested in open data in the library/archive/museum
space please consider entering this competition :) - really show the value of this stuff!

Contenu connexe

Tendances

A tour of the library of the future
A tour of the library of the futureA tour of the library of the future
A tour of the library of the futureBethan Ruddock
 
iPod eVil
iPod eViliPod eVil
iPod eVilannbee
 
Defying Domains Draft Presentation
Defying Domains Draft PresentationDefying Domains Draft Presentation
Defying Domains Draft PresentationChanteus
 
Honours Year Library Tutorial 2011
Honours Year Library Tutorial 2011Honours Year Library Tutorial 2011
Honours Year Library Tutorial 2011Suenn Ng
 
High and Lows of Library Linked Data
High and Lows of Library Linked DataHigh and Lows of Library Linked Data
High and Lows of Library Linked DataAdrian Stevenson
 
Faculty of Education - Undergradudate
Faculty of Education - UndergradudateFaculty of Education - Undergradudate
Faculty of Education - UndergradudateCC Library
 
Hard won: the challenges of obtaining scholarly communication knowledge & skills
Hard won: the challenges of obtaining scholarly communication knowledge & skillsHard won: the challenges of obtaining scholarly communication knowledge & skills
Hard won: the challenges of obtaining scholarly communication knowledge & skillsDanny Kingsley
 
Literature searching for your dissertation - Translation 2017
Literature searching for your dissertation - Translation 2017 Literature searching for your dissertation - Translation 2017
Literature searching for your dissertation - Translation 2017 Alex Asman
 
NYLA: De-mystifying 2.0 at the Metropolitan Museum of Art
NYLA: De-mystifying 2.0 at the Metropolitan Museum of ArtNYLA: De-mystifying 2.0 at the Metropolitan Museum of Art
NYLA: De-mystifying 2.0 at the Metropolitan Museum of Artguest7dbf306
 
New York Library Association: Web 2.0 at the Metropolitan Museum of Art
New York Library Association: Web 2.0 at the Metropolitan Museum of ArtNew York Library Association: Web 2.0 at the Metropolitan Museum of Art
New York Library Association: Web 2.0 at the Metropolitan Museum of ArtJennie Pu
 
Data Designed for Discovery
Data Designed for DiscoveryData Designed for Discovery
Data Designed for DiscoveryOCLC
 
We need to talk about cataloguing - a report from a beginner's workshop / Amy...
We need to talk about cataloguing - a report from a beginner's workshop / Amy...We need to talk about cataloguing - a report from a beginner's workshop / Amy...
We need to talk about cataloguing - a report from a beginner's workshop / Amy...CILIP MDG
 
New Life to Old Serials:
New Life to Old Serials: New Life to Old Serials:
New Life to Old Serials: NASIG
 
Mendeley%20 presentation%20iaald
Mendeley%20 presentation%20iaaldMendeley%20 presentation%20iaald
Mendeley%20 presentation%20iaaldBarbara Hutchinson
 
Engl317 gateways intro
Engl317 gateways introEngl317 gateways intro
Engl317 gateways introsgass
 

Tendances (20)

A tour of the library of the future
A tour of the library of the futureA tour of the library of the future
A tour of the library of the future
 
Advanced information and research skills for music
Advanced information and research skills for musicAdvanced information and research skills for music
Advanced information and research skills for music
 
iPod eVil
iPod eViliPod eVil
iPod eVil
 
Defying Domains Draft Presentation
Defying Domains Draft PresentationDefying Domains Draft Presentation
Defying Domains Draft Presentation
 
Honours Year Library Tutorial 2011
Honours Year Library Tutorial 2011Honours Year Library Tutorial 2011
Honours Year Library Tutorial 2011
 
High and Lows of Library Linked Data
High and Lows of Library Linked DataHigh and Lows of Library Linked Data
High and Lows of Library Linked Data
 
The opac and the web
The opac and the webThe opac and the web
The opac and the web
 
Faculty of Education - Undergradudate
Faculty of Education - UndergradudateFaculty of Education - Undergradudate
Faculty of Education - Undergradudate
 
Hard won: the challenges of obtaining scholarly communication knowledge & skills
Hard won: the challenges of obtaining scholarly communication knowledge & skillsHard won: the challenges of obtaining scholarly communication knowledge & skills
Hard won: the challenges of obtaining scholarly communication knowledge & skills
 
Snyder_A_LIS457
Snyder_A_LIS457Snyder_A_LIS457
Snyder_A_LIS457
 
Literature searching for your dissertation - Translation 2017
Literature searching for your dissertation - Translation 2017 Literature searching for your dissertation - Translation 2017
Literature searching for your dissertation - Translation 2017
 
NYLA: De-mystifying 2.0 at the Metropolitan Museum of Art
NYLA: De-mystifying 2.0 at the Metropolitan Museum of ArtNYLA: De-mystifying 2.0 at the Metropolitan Museum of Art
NYLA: De-mystifying 2.0 at the Metropolitan Museum of Art
 
New York Library Association: Web 2.0 at the Metropolitan Museum of Art
New York Library Association: Web 2.0 at the Metropolitan Museum of ArtNew York Library Association: Web 2.0 at the Metropolitan Museum of Art
New York Library Association: Web 2.0 at the Metropolitan Museum of Art
 
Information and research skills for Undergraduates
Information and research skills for UndergraduatesInformation and research skills for Undergraduates
Information and research skills for Undergraduates
 
SMLLC - Dissertations: information and research skills
SMLLC - Dissertations: information and research skillsSMLLC - Dissertations: information and research skills
SMLLC - Dissertations: information and research skills
 
Data Designed for Discovery
Data Designed for DiscoveryData Designed for Discovery
Data Designed for Discovery
 
We need to talk about cataloguing - a report from a beginner's workshop / Amy...
We need to talk about cataloguing - a report from a beginner's workshop / Amy...We need to talk about cataloguing - a report from a beginner's workshop / Amy...
We need to talk about cataloguing - a report from a beginner's workshop / Amy...
 
New Life to Old Serials:
New Life to Old Serials: New Life to Old Serials:
New Life to Old Serials:
 
Mendeley%20 presentation%20iaald
Mendeley%20 presentation%20iaaldMendeley%20 presentation%20iaald
Mendeley%20 presentation%20iaald
 
Engl317 gateways intro
Engl317 gateways introEngl317 gateways intro
Engl317 gateways intro
 

En vedette

References on the web
References on the webReferences on the web
References on the webostephens
 
Open, Linked, Hacked
Open, Linked, HackedOpen, Linked, Hacked
Open, Linked, Hackedostephens
 
Mashing libraries to build communities - CILIPS 2011
Mashing libraries to build communities - CILIPS 2011Mashing libraries to build communities - CILIPS 2011
Mashing libraries to build communities - CILIPS 2011ostephens
 
A Chrismash Carol
A Chrismash CarolA Chrismash Carol
A Chrismash Carolostephens
 
Where are you from? and other stupid questions
Where are you from? and other stupid questionsWhere are you from? and other stupid questions
Where are you from? and other stupid questionsostephens
 
Knowledge net pres 22 sept 2
Knowledge net pres 22 sept 2Knowledge net pres 22 sept 2
Knowledge net pres 22 sept 2Natasha Low
 
RefWorks for DEPARTMENT OF FAMILY MEDICINE - Faculty Development
RefWorks for DEPARTMENT OF FAMILY MEDICINE - Faculty Development RefWorks for DEPARTMENT OF FAMILY MEDICINE - Faculty Development
RefWorks for DEPARTMENT OF FAMILY MEDICINE - Faculty Development Naz Torabi
 

En vedette (9)

References on the web
References on the webReferences on the web
References on the web
 
Open, Linked, Hacked
Open, Linked, HackedOpen, Linked, Hacked
Open, Linked, Hacked
 
Mashing libraries to build communities - CILIPS 2011
Mashing libraries to build communities - CILIPS 2011Mashing libraries to build communities - CILIPS 2011
Mashing libraries to build communities - CILIPS 2011
 
TELSTAR
TELSTARTELSTAR
TELSTAR
 
A Chrismash Carol
A Chrismash CarolA Chrismash Carol
A Chrismash Carol
 
Where are you from? and other stupid questions
Where are you from? and other stupid questionsWhere are you from? and other stupid questions
Where are you from? and other stupid questions
 
Knowledge net pres 22 sept 2
Knowledge net pres 22 sept 2Knowledge net pres 22 sept 2
Knowledge net pres 22 sept 2
 
Refworks Website
Refworks WebsiteRefworks Website
Refworks Website
 
RefWorks for DEPARTMENT OF FAMILY MEDICINE - Faculty Development
RefWorks for DEPARTMENT OF FAMILY MEDICINE - Faculty Development RefWorks for DEPARTMENT OF FAMILY MEDICINE - Faculty Development
RefWorks for DEPARTMENT OF FAMILY MEDICINE - Faculty Development
 

Similaire à Linking lcsh and other stuff

PaLA JC common core presentation
PaLA JC common core presentationPaLA JC common core presentation
PaLA JC common core presentationEllysa
 
Teaching with WorldCat Local: What's Different? (Slide captions)
Teaching with WorldCat Local: What's Different? (Slide captions)Teaching with WorldCat Local: What's Different? (Slide captions)
Teaching with WorldCat Local: What's Different? (Slide captions)kslovesbooks
 
Guest Lecture for GSLIS 522 (Science Reference) Feb 20, 2012
Guest Lecture for GSLIS 522 (Science Reference) Feb 20, 2012Guest Lecture for GSLIS 522 (Science Reference) Feb 20, 2012
Guest Lecture for GSLIS 522 (Science Reference) Feb 20, 2012Laksamee Putnam
 
Copac: Reengineering the UK national academic union catalogue to serve the 21...
Copac: Reengineering the UK national academic union catalogue to serve the 21...Copac: Reengineering the UK national academic union catalogue to serve the 21...
Copac: Reengineering the UK national academic union catalogue to serve the 21...Joy Palmer
 
Isr 2531
Isr 2531Isr 2531
Isr 2531Traciwm
 
Library skills for EAP slides
Library skills for EAP slidesLibrary skills for EAP slides
Library skills for EAP slidesLynne Meehan
 
Writing Seminar
Writing Seminar Writing Seminar
Writing Seminar Traciwm
 
20110929 tpdl2011 dl-research-humboldt
20110929 tpdl2011 dl-research-humboldt20110929 tpdl2011 dl-research-humboldt
20110929 tpdl2011 dl-research-humboldtStefan Gradmann
 
Knowledge Organisation Systems in Digital Libraries: A Comparative Study
Knowledge Organisation Systems in Digital Libraries: A Comparative StudyKnowledge Organisation Systems in Digital Libraries: A Comparative Study
Knowledge Organisation Systems in Digital Libraries: A Comparative StudyBhojaraju Gunjal
 
Do the LOCAH-Motion: How to Make Bibliographic and Archival Linked Data
Do the LOCAH-Motion: How to Make Bibliographic and Archival Linked DataDo the LOCAH-Motion: How to Make Bibliographic and Archival Linked Data
Do the LOCAH-Motion: How to Make Bibliographic and Archival Linked DataAdrian Stevenson
 
Should libraries discontinue using and maintaining controlled subject vocabul...
Should libraries discontinue using and maintaining controlled subject vocabul...Should libraries discontinue using and maintaining controlled subject vocabul...
Should libraries discontinue using and maintaining controlled subject vocabul...Ryan Scicluna
 
The Culture of Content Sharing and Learning Objects
The Culture of Content Sharing and Learning ObjectsThe Culture of Content Sharing and Learning Objects
The Culture of Content Sharing and Learning ObjectsLisa Johnson, PhD
 
Engl 1221 Putt 2011
Engl 1221 Putt 2011Engl 1221 Putt 2011
Engl 1221 Putt 2011Traciwm
 
Exploring Open Educational Resources
Exploring Open Educational ResourcesExploring Open Educational Resources
Exploring Open Educational ResourcesCSAPSubjectCentre
 
Corpus Report
Corpus ReportCorpus Report
Corpus ReportCharlesKo
 
Open Annotation Collaboration Introduction
Open Annotation Collaboration IntroductionOpen Annotation Collaboration Introduction
Open Annotation Collaboration IntroductionTimothy Cole
 
Wollumbin Guide to RTRL eresources - introduction to rtrl 24.7
Wollumbin Guide to RTRL eresources - introduction to rtrl 24.7Wollumbin Guide to RTRL eresources - introduction to rtrl 24.7
Wollumbin Guide to RTRL eresources - introduction to rtrl 24.7jokunev
 
Intertwingularity, Semantic Web and linked Geo data
Intertwingularity, Semantic Web and linked Geo dataIntertwingularity, Semantic Web and linked Geo data
Intertwingularity, Semantic Web and linked Geo dataDan Brickley
 
A Theoretical Framework for Physics Education Research Modeling Student Thin...
A Theoretical Framework for Physics Education Research  Modeling Student Thin...A Theoretical Framework for Physics Education Research  Modeling Student Thin...
A Theoretical Framework for Physics Education Research Modeling Student Thin...Sarah Marie
 
Social Science Research_ Principles Methods and Practices.pdf
Social Science Research_ Principles Methods and Practices.pdfSocial Science Research_ Principles Methods and Practices.pdf
Social Science Research_ Principles Methods and Practices.pdfValriaFerreira59
 

Similaire à Linking lcsh and other stuff (20)

PaLA JC common core presentation
PaLA JC common core presentationPaLA JC common core presentation
PaLA JC common core presentation
 
Teaching with WorldCat Local: What's Different? (Slide captions)
Teaching with WorldCat Local: What's Different? (Slide captions)Teaching with WorldCat Local: What's Different? (Slide captions)
Teaching with WorldCat Local: What's Different? (Slide captions)
 
Guest Lecture for GSLIS 522 (Science Reference) Feb 20, 2012
Guest Lecture for GSLIS 522 (Science Reference) Feb 20, 2012Guest Lecture for GSLIS 522 (Science Reference) Feb 20, 2012
Guest Lecture for GSLIS 522 (Science Reference) Feb 20, 2012
 
Copac: Reengineering the UK national academic union catalogue to serve the 21...
Copac: Reengineering the UK national academic union catalogue to serve the 21...Copac: Reengineering the UK national academic union catalogue to serve the 21...
Copac: Reengineering the UK national academic union catalogue to serve the 21...
 
Isr 2531
Isr 2531Isr 2531
Isr 2531
 
Library skills for EAP slides
Library skills for EAP slidesLibrary skills for EAP slides
Library skills for EAP slides
 
Writing Seminar
Writing Seminar Writing Seminar
Writing Seminar
 
20110929 tpdl2011 dl-research-humboldt
20110929 tpdl2011 dl-research-humboldt20110929 tpdl2011 dl-research-humboldt
20110929 tpdl2011 dl-research-humboldt
 
Knowledge Organisation Systems in Digital Libraries: A Comparative Study
Knowledge Organisation Systems in Digital Libraries: A Comparative StudyKnowledge Organisation Systems in Digital Libraries: A Comparative Study
Knowledge Organisation Systems in Digital Libraries: A Comparative Study
 
Do the LOCAH-Motion: How to Make Bibliographic and Archival Linked Data
Do the LOCAH-Motion: How to Make Bibliographic and Archival Linked DataDo the LOCAH-Motion: How to Make Bibliographic and Archival Linked Data
Do the LOCAH-Motion: How to Make Bibliographic and Archival Linked Data
 
Should libraries discontinue using and maintaining controlled subject vocabul...
Should libraries discontinue using and maintaining controlled subject vocabul...Should libraries discontinue using and maintaining controlled subject vocabul...
Should libraries discontinue using and maintaining controlled subject vocabul...
 
The Culture of Content Sharing and Learning Objects
The Culture of Content Sharing and Learning ObjectsThe Culture of Content Sharing and Learning Objects
The Culture of Content Sharing and Learning Objects
 
Engl 1221 Putt 2011
Engl 1221 Putt 2011Engl 1221 Putt 2011
Engl 1221 Putt 2011
 
Exploring Open Educational Resources
Exploring Open Educational ResourcesExploring Open Educational Resources
Exploring Open Educational Resources
 
Corpus Report
Corpus ReportCorpus Report
Corpus Report
 
Open Annotation Collaboration Introduction
Open Annotation Collaboration IntroductionOpen Annotation Collaboration Introduction
Open Annotation Collaboration Introduction
 
Wollumbin Guide to RTRL eresources - introduction to rtrl 24.7
Wollumbin Guide to RTRL eresources - introduction to rtrl 24.7Wollumbin Guide to RTRL eresources - introduction to rtrl 24.7
Wollumbin Guide to RTRL eresources - introduction to rtrl 24.7
 
Intertwingularity, Semantic Web and linked Geo data
Intertwingularity, Semantic Web and linked Geo dataIntertwingularity, Semantic Web and linked Geo data
Intertwingularity, Semantic Web and linked Geo data
 
A Theoretical Framework for Physics Education Research Modeling Student Thin...
A Theoretical Framework for Physics Education Research  Modeling Student Thin...A Theoretical Framework for Physics Education Research  Modeling Student Thin...
A Theoretical Framework for Physics Education Research Modeling Student Thin...
 
Social Science Research_ Principles Methods and Practices.pdf
Social Science Research_ Principles Methods and Practices.pdfSocial Science Research_ Principles Methods and Practices.pdf
Social Science Research_ Principles Methods and Practices.pdf
 

Plus de ostephens

Selecting with SPARQL
Selecting with SPARQLSelecting with SPARQL
Selecting with SPARQLostephens
 
Publishing and Using Linked Data
Publishing and Using Linked DataPublishing and Using Linked Data
Publishing and Using Linked Dataostephens
 
Open for Reuse: Library data and mashups
Open for Reuse: Library data and mashupsOpen for Reuse: Library data and mashups
Open for Reuse: Library data and mashupsostephens
 
Lucero Library Update 03/11/10
Lucero Library Update 03/11/10Lucero Library Update 03/11/10
Lucero Library Update 03/11/10ostephens
 
Mashing libraries to build communities
Mashing libraries to build communitiesMashing libraries to build communities
Mashing libraries to build communitiesostephens
 
Project Management Tools
Project Management ToolsProject Management Tools
Project Management Toolsostephens
 
The Semantic Web
The Semantic WebThe Semantic Web
The Semantic Webostephens
 
Resource Discovery Infrastructure - what if we were starting from scratch?
Resource Discovery Infrastructure - what if we were starting from scratch?Resource Discovery Infrastructure - what if we were starting from scratch?
Resource Discovery Infrastructure - what if we were starting from scratch?ostephens
 
Digital Future
Digital FutureDigital Future
Digital Futureostephens
 

Plus de ostephens (9)

Selecting with SPARQL
Selecting with SPARQLSelecting with SPARQL
Selecting with SPARQL
 
Publishing and Using Linked Data
Publishing and Using Linked DataPublishing and Using Linked Data
Publishing and Using Linked Data
 
Open for Reuse: Library data and mashups
Open for Reuse: Library data and mashupsOpen for Reuse: Library data and mashups
Open for Reuse: Library data and mashups
 
Lucero Library Update 03/11/10
Lucero Library Update 03/11/10Lucero Library Update 03/11/10
Lucero Library Update 03/11/10
 
Mashing libraries to build communities
Mashing libraries to build communitiesMashing libraries to build communities
Mashing libraries to build communities
 
Project Management Tools
Project Management ToolsProject Management Tools
Project Management Tools
 
The Semantic Web
The Semantic WebThe Semantic Web
The Semantic Web
 
Resource Discovery Infrastructure - what if we were starting from scratch?
Resource Discovery Infrastructure - what if we were starting from scratch?Resource Discovery Infrastructure - what if we were starting from scratch?
Resource Discovery Infrastructure - what if we were starting from scratch?
 
Digital Future
Digital FutureDigital Future
Digital Future
 

Dernier

Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)eniolaolutunde
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDThiyagu K
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfsanyamsingh5019
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxGaneshChakor2
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Krashi Coaching
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsTechSoup
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphThiyagu K
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxShobhayan Kirtania
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room servicediscovermytutordmt
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpinRaunakKeshri1
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 

Dernier (20)

Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)Software Engineering Methodologies (overview)
Software Engineering Methodologies (overview)
 
Measures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SDMeasures of Dispersion and Variability: Range, QD, AD and SD
Measures of Dispersion and Variability: Range, QD, AD and SD
 
Sanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdfSanyam Choudhary Chemistry practical.pdf
Sanyam Choudhary Chemistry practical.pdf
 
CARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptxCARE OF CHILD IN INCUBATOR..........pptx
CARE OF CHILD IN INCUBATOR..........pptx
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
Kisan Call Centre - To harness potential of ICT in Agriculture by answer farm...
 
Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1Código Creativo y Arte de Software | Unidad 1
Código Creativo y Arte de Software | Unidad 1
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Introduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The BasicsIntroduction to Nonprofit Accounting: The Basics
Introduction to Nonprofit Accounting: The Basics
 
Z Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot GraphZ Score,T Score, Percential Rank and Box Plot Graph
Z Score,T Score, Percential Rank and Box Plot Graph
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptx
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
9548086042 for call girls in Indira Nagar with room service
9548086042  for call girls in Indira Nagar  with room service9548086042  for call girls in Indira Nagar  with room service
9548086042 for call girls in Indira Nagar with room service
 
Student login on Anyboli platform.helpin
Student login on Anyboli platform.helpinStudent login on Anyboli platform.helpin
Student login on Anyboli platform.helpin
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 

Linking lcsh and other stuff

  • 1. Linking Library of Congress Subject Headings Owen Stephens 14th July 2011 @ostephens http://www.meanboyfriend.com/overdue_ideas Thursday, 14 July 2011
  • 2. LNKNLCSH @ostephens Thursday, 14 July 2011 This is the lightning version I should precursor this talk by saying I’m really pleased that the LoC have invested in experimenting with Linked Data representations of aspects of their data. Anything in this talk isn’t a criticism of this, but about the issues we encountered using aspects of the data. It’s possible that some or all of these problems may have been down to my lack of understanding of LCSH and Linked Data :)
  • 3. Library chops Thursday, 14 July 2011 I’m a librarian - by nature and qualification :) - see http://www.meanboyfriend.com/ overdue_ideas/2010/11/library-routes/ Been working on the cusp between libraries and IT since 1995. Spending early part of my career in small libraries means I have worked in just about every area of library front of house and back office. However although I’ve catalogued books, and have more than a passing familiarly with MARC, I’m not a cataloguer, and not an expert on LCSH
  • 4. Linked Data chops Thursday, 14 July 2011 I’ve been trying to understand the Semantic Web/Linked Data for several years :) My understanding has been accelerated over the last couple of years by involvement in several projects in the Linked Data space. Specifically the Lucero and CORE projects at the Open University
  • 5. Thursday, 14 July 2011 Expressing similarity between published papers in UK research repositories Harvest metadata and full-text (50k papers from 143 UK repos so far) Text mine for relationships Expose ‘similarity’ measure as RDF triples using MuSIM Ontology (originally developed for Music, but equally applicable) For more information http://core-project.kmi.open.ac.uk
  • 6. Exposing RDF Thursday, 14 July 2011 Three ʻproductsʼ CORE Portal - search or SPARQL metadata for harvested papers CORE Mobile – Android application to search & navigate across related papers & downloading articles CORE Plugin - Designed to integrate into existing repository interface to link to ʻrelated papersʼ in other repos, based on CORE ʻsimilarityʼ For more information http://core-project.kmi.open.ac.uk SPARQL Endpoint at http://core.kmi.open.ac.uk:8081/COREWeb/squery How we express data in RDF: http://core-project.kmi.open.ac.uk/node/13
  • 7. Lucero Thursday, 14 July 2011 For more information see http://lucero-project.info Data and SPARQL Endpoint available via http://data.open.ac.uk Lucero published variety of data from the Open University as linked open data - admin data (buildings), course data (course catalogue, OERs), research data and data about bibliographic resources - including materials in the library (focussed on materials related to course materials - around 30k catalogue records)
  • 8. LCSH Thursday, 14 July 2011 Lots been written about LCSH, it’s structure, whether it should be replaced. I don’t want to spend too much time on this today but it may come up in places However it is probably worth recapping my understanding (if only to let those more knowledgeable correct it) Key aspect in the context of this talk is that LCSH is primarily a pre-coordinated system - that is facets of subject headings are pre-combined into a single, multi-faceted heading. Although.... “LCSH itself requires some degree of post-coordination of the pre-coordinated strings to bring out specific topics of works.” (http://www.loc.gov/catdir/ cpso/pre_vs_post.pdf) In fact the way that LCSH is structured in MARC records, and the way that indexes can be built on this in library management systems means that I’m going to focus on ‘Topical’ subject headings (confusingly to me, LCSH can also cover Name, Title and Geographic headings) Topical Terms can represent “a concrete object, animal, etc.; a category of people, animals, or objects; a more abstract concept, belief, process, or phenomenon; an institution, etc.” (http://www.tulane.edu/~techserv/lcsh%20introd.html) Topical LC Subject Headings are built by combining ‘Topical Terms’ with qualifiers (‘subdivisions’) which allow you to contextualise the term. The types of subdivision available are: General (a high level general qualifier - e.g. ‘History’) Chronological (period of time - e.g. ‘20th Century’) Geographic (place - e.g. ‘Great Britain’) Form (the type/genre of material - e.g. ‘Dictionary’) There are large number of rules that express how these subdivisions can be used in conjunction with Topical Terms, and the order in which they should be expressed. Not all combinations are valid - for example only certain General subdivisions may be further subdivided Geographically. The rules are not always black and white - they have ‘examples’ lists which you can use to inform you if it might be valid in a given situation. Perhaps suffice to say that a document called ‘BASIC SUBJECT CATALOGING USING LCSH: Trainee’s Manual’ is 382 pages long. Subject heading strings can be valid (i.e. constructed according to rules/patterns) while not being ‘Authorized’ - in this context and Authorized Heading is “A preferred subject term as decided and established by the Library of Congress by means of an authority record.” (Thanks to Tom Meehan for this definition)
  • 9. Thursday, 14 July 2011 Thanks to work of Ed Summers and others, the Library of Congress have a Linked Data representation of LCSH in SKOS. However, this only covers ‘Authorized’ LCSH - presumably because only those LCSH with an Authority record have an identifier within LoC systems? (I’m speculating)
  • 10. Thursday, 14 July 2011 This is a catalogue record from the OU - the two strings listed as ‘Subjects’ are LCSH (for cataloguers amongst you MARC 650s) Can see the linked data representation at http://data.open.ac.uk/page/library/289148
  • 11. General Subdivision Science--Study and Teaching--Research Topical General Term Subdivision Thursday, 14 July 2011 This is made up of a Topical Term - Science and two general subdivisions ‘Study and Teaching’ and ‘Research’
  • 12. Science--Study and Teaching--Research id.loc.gov ? Thursday, 14 July 2011 This is (afaik - I trust the cataloguers) a valid LCSH ... however it is not authorized ... and so does not have a URI on id.loc.gov
  • 13. Science--Study and Teaching--Research http://id.loc.gov/authorities/sh85118587#concept Thursday, 14 July 2011 “Science--Study and Teaching”, however, is an authorized heading
  • 14. Science--Study and Teaching--Research http://id.loc.gov/authorities/sh85118553#concept N.B. This is URI for Science as Topical Term not http://id.loc.gov/authorities/sh00007934#concept which is URI for Science as a General Subdivision Thursday, 14 July 2011 As is “Science”
  • 15. Science--Study and Teaching--Research http://id.loc.gov/authorities/sh2001008697#concept Thursday, 14 July 2011 Also “Study and Teaching” (as a topical subdivision) is an authorized heading
  • 16. Science--Study and Teaching--Research http://id.loc.gov/authorities/sh2002006576#concept N.B. This is URI for Research as General Subdivision not http://id.loc.gov/authorities/ sh85113021#concept which is URI for Research as a Topical Term Thursday, 14 July 2011 Also “Research” (as a topical subdivision) is an authorized heading
  • 17. More links please Thursday, 14 July 2011 If we only used id.loc.gov URIs where we had an authorised LCSH, we would end up with only a small number of links. Some URIs in id.loc.gov would never be used in this way as they only represent subdivisions - never valid by themselves. Therefor decided to check a variety of combinations against id.loc.gov
  • 18. Science--Study and Teaching--Research Science--Study and teaching http://id.loc.gov/authorities/ sh85118587#concept http://id.loc.gov/authorities/ Science sh85118553#concept http://id.loc.gov/authorities/ Study and Teaching sh2001008697#concept http://id.loc.gov/authorities/ Research sh2002006576#concept Science--Study and Teaching-- http://data.open.ac.uk/page/topic/ library/science-- Research study_and_teaching--research Thursday, 14 July 2011
  • 19. MADS? http://www.loc.gov/standards/mads/rdf/ Thursday, 14 July 2011 As far as I can see MADS (apart from looking complex) models the Authority - not the heading - this doesn’t solve the problem we saw here! That is MADS would solve the problem only for Authorized headings (which it does represent as component parts - which I think addresses the issues raised by Karen Coyle at http:// kcoyle.blogspot.com/2009/05/lcsh-as-linked-data-beyond-dash-dash.html) Happy to be corrected...
  • 20. A different approach? bibo:authorList ( <http://examples.net/contributors/2> <http://examples.net/contributors/1>) lcsh:headingList ( <http://id.loc.gov/authorities/ sh85118553#concept> <http://id.loc.gov/authorities/ sh2001008697#concept> <http://id.loc.gov/authorities/ sh2002006576#concept>) Thursday, 14 July 2011 If we could use rdfs:list to represent the pre-coordinated string of headings - then wouldn’t care about whether ‘authorized’ or not, and would have all the individual headings there as well (bibo lists authors individual and as a list) Again copying BIBO which has each author as a dc:author as well, could represent each part of the subject string as a separate dc:subject. In a MADS world there would be advantage to expressing full authorized heading as well (for relationships derived in MADS) although there is still the question of expressing ‘authorized fragments’ which seems to me would also be useful with MADS for the same reasons This feels like a simple approach that would at least allow us to capture the component parts of subject string (and personally I’m not sure we ought to go further than this? do we need to? why?). My feeling is lots of the work goes into representing the ‘Authority file’ as opposed to how subject headings are used in the real world ... is this fair?
  • 21. Details: http://discovery.ac.uk/developers/ competition/ Datasets: http://ckan.net/group/ ukdiscovery Ask Questions: http://getthedata.org or #discodev Thursday, 14 July 2011 Finally just an advert - if you are interested in open data in the library/archive/museum space please consider entering this competition :) - really show the value of this stuff!