Bridging Between CAD & GIS: 6 Ways to Automate Your Data Integration
Linked data - NCompass presentation
1. LINKED DATA AND LIBRARIES: AN
OVERVIEW Robin Hastings
NEKLS
rhastings@nekls.org
2. TODAY’S TOPICS
What is linked data?
How is linked data being used right now?
BIBFRAME
Open Library
What are some potential uses for linked data?
Extending library data’s reach
Reduce, Reuse, Recycle
More info: Previous NCompass Live episode at
https://nlc.nebraska.gov/scripts/calendar/eventshow.asp?ProgID=14798 (Life
after MARC - Cataloging Tools of the Future
This presentation:
3. WHAT IS LINKED DATA?
Photo Credit: https://www.flickr.com/photos/12033805@N00/2268295018/ - gerlos via Compfight - https://creativecommons.org/licenses/by-
sa/2.0/
4. ESSENTIAL PRINCIPLES:
Use URIs (Universal Resource Identifiers) as names for objects
Use HTTP URIs so people can look up those names
Provide information (at the URI) in standard form (RDF, JSON, Turtle, etc.)
Include links to other URI objects so that other objects can be discovered via
links
Tim-Burners Lee, back in 2006
5. SEMANTICS & LINKED DATA
Photo Credit: https://openclipart.org/detail/140701/brain
6. RDA VS. RDF
Resource Description and Access (RDA)
Library-land created
Describes and provides access points
for bibliographic information
means of encoding metadata for
library resources
Structured on FRBR
Can be expressed in multiple
“languages” such as MARC21,
BIBFRAME, etc.
Content descriptor, not a specific
format
Resource Description Framework (RDF)
WWW created
Describes and provides access points
for semantically valid information on
the web
means of encoding metadata for web-
based resources
Structured on semantic language (OWL
- Working Ontology Language)
Can be expressed in multiple
“languages” such as XML, Turtle, etc.
13. 4 Main Classes
WORK - a resource reflecting a conceptual
essence of the cataloging resource. (Audio,
Text, title, creator, place)
INSTANCE - a resource reflecting an
individual, material embodiment of the Work.
(extent, format, frequency)
AUTHORITY - a resource reflecting key
authority concepts that have defined
relationships reflected in the Work
and Instance. (agent, place, temporal, topic)
ANNOTATION - a resource that enhances our
knowledge about another resource when
knowing, minimally, 'who' is doing the
annotating is important. (reviews, etc.)
The BIBFRAME Vocabulary is comprised of the RDF
properties, classes, and relationships between and
among them. (bibframe.org)
14. OPEN LIBRARY
One page per book for every book published ever
Record available in RDF or
JSON web formats
15. By Max Schmachtenberg, Christian Bizer,
Anja Jentzsch and Richard Cyganiak -
http://lod-cloud.net/, CC BY-SA 3.0,
https://commons.wikimedia.org/w/index.php
?curid=36956792
17. CURRENT USES OF LINKED DATA
Linked Data = Knowledge Graph
Google, Facebook, etc. all use Linked
Data to provide knowledge (see hours,
phone number, map based on address,
Wikipedia entry, etc.)
Schema.org - the linked data standard
used by Google -->
18. US GOV’T LINKED DATA
Local severe weather warning systems in Missouri
Product recall data from fed government
Higher Ed datasets, including information on every institution of higher
education that participates in the federal student financial aid programs
19.
20. USES FOR LINKED DATA OUTSIDE THE US
England: British National Bibliography (bnb.data.bl.uk) - working on
creating a repository of linked data objects based on national library
holdings
Germany: Culturegraph1 - linked open data service that intends to
create
an individual information object for each kind of material held by
libraries
in Germany
(http://www.dnb.de/EN/Service/DigitaleDienste/LinkedData/linkeddata_nod
England: https://openclipart.org/detail/221971/big-ben-tower-illustration
Germany: https://openclipart.org/detail/240797/germany-map-flag-2
21. REDUCE, REUSE, RECYCLE
Reduce - the amount of work going into reinventing the record at each library (at
least somewhat)
Reuse - the library’s data in new and exciting ways, taking advantage of linked
open data on the web
Recycle - Shared MARC records - even more sharing with common URIs that all
libraries can use
Join Robin Hastings, Director of Technology Services for Northeast Kansas Library System (NEKLS),
as she goes over the basics of what linked data is, what the potential of linked data can be and how libraries (and other organizations) are using it right now to make information more easily accessible on the web. Learn what the Library of Congress is doing with BIBFRAME and how projects like the Open Library are making use of linked data to extend their reach and make their information reusable.
Linked data is data that is on the web in a format that computers can understand. It should be human readable, too, but it needs mostly to be specially coded so that computers can read it and, crucially, understand it.
Linked Data vs. Linked Open Data
Our brain understands non-standard English (or Turkish, or Russian, or whatever human language you speak), but computers understand NOTHING that is not perfectly formed and precisely standard.
Things not strings
Discuss differences
On click, Arrow drops down - most of the rest of the presentation will be focused on RDF, more info on RDA and library-specific cataloging in the presentation mentioned earlier.
URI’s are names for things (things rather than strings!)
Subject == noun
Predicate == verb (or property)
Object == noun
MARC standard - AARC2 encoding
Data is locked up and never appears in a search result because it doesn’t conform to linked data standards (like BIBFRAME); makes our data silo’d and impossible to find where our patrons are looking
BIBFRAME with RDF coding - Output from Zepheira's python code (work)
Uses the linked vocabularies (Subject, Predicate, Object) to add richness to our records -> BIBFRAME is a specifically created vocabulary for bib information (unlike Dublin Core, FOAF, etc.)
(Instance - manifestation level, mostly)
Output from LC's xquery code into RDF
These Information Resources can then be re-assembled into a coherent architecture that allows for cooperative cataloging at a far more granular level than before. Then, as we leverage the Web as an architecture for data, whenever updates to these Resources are performed (e.g. someone adds new information about a Person, new mappings related to a Subject, etc.) notification events can occur to automatically update systems that reference these Resources. Further, these information assets can now be more effectively utilized at a granular level and provide a richer substrate in which local collections, special collections and third party data can easily annotate and contextualize cooperative library content.
Linked Open Datasets published up through April of 2014 (wikimedia)
Pie in the sky sort of uses:
Today is Steve Job’s birthday (in 1955). Because Steve has an entry in the VIAF (Virtual Authority File) which gives his birthday (among lots of other information), the web page knows it’s his birthday. It pulls a picture from the VIAF file, then starts searching in Wikipedia for media he might be connected to. There are biographies (book and movie) listed there, so it pulls the information about those and uses the info stored in Amazon to give rich and detailed descriptions of that media. It also knows, from Wikipedia, that he founded the Apple company, so it goes to Yahoo! Finance and pulls the stock information for Apple and provides that information as well. If you happen to have a library-lover coding this, it will also check Worldcat for links to any media referencing Steve Jobs or Apple in a library near you and give you links into the catalog, availability information and offer to put any of it on hold for you - all you have to do is supply your library card number.
Searching: Who is that actor from Pulp Fiction (IMDB for cast list, Wikipedia for info/pictures)
Internationalisation and multilingualism in RDA - the ability with linked data to choose the language on the fly (or term that applies to your particular field/user group, etc.)
records from multiple sources (Dublin Core, etc.) all pulled into a single ILS and understood and displayed as easily as native info
www.linkedjazz.org/network
For libraries? Using similar linked data could create a book box with links to author information, publisher, etc. plus information about its availability in your local library at the moment of searching
WP plugin for Schema.org; could use it here to provide hours/phone info
Useful in websites or apps; freely available
One example of using data.gov data - search for Lawrence, KS, get data from:
Open Government Data Used: American Community Survey, Bureau of Labor Statistics, National Weather Service, US Census TIGER Database, Federal Housing Finance Agency Mortgage Data
https://www.imls.gov/research-evaluation/data-collection/public-libraries-united-states-survey - uses Public Library Survey 2012 and Archives Library Information Center