DAPI Diem: Using Linked Data and the WorldCat Discovery API to surface timely holdings

•Télécharger en tant que PPTX, PDF•

0 j'aime•383 vues

Presented as part of "Linked Data-Driven Discovery: Applications and APIs from a User-Centered Perspective" session at LITA Forum, Nov 13, 2015

Technologie

DAPI Diem
Or, Using Linked Data & the WorldCat Discovery
API to surface timely holdings
Scott Hanrath | shanrath@ku.edu | @rshanrath

Approach: Outside -> In
1. Use external sources to find interesting entities related to
a given date
2. Feed those entities into a query to the Discovery API
3. Present the entities and a set of related holdings from
WorldCat

[subject] [predicate] [object]
[subject] [predicate] day
SPARQL: http://dbpedia.org/sparql

?entity a dbpedia-owl:Writer .
?entity ont:birthDate ?date .
?entity a ont:Book .
?entity ont:publicationDate ?date .
?entity a dbpedia-owl:Country.
?entity dbpedia-owl:foundingDate ?date .

<http://dbpedia.org/resource/David_Foster_Wallace>
<http://www.w3.org/1999/02/22-rdf-syntax-ns#type>
<http://dbpedia.org/ontology/Writer> .
<http://dbpedia.org/resource/David_Foster_Wallace>
<http://dbpedia.org/property/dateOfBirth>
"1962-02-21"^^<http://www.w3.org/2001/XMLSchema#date> .
<http://dbpedia.org/resource/David_Foster_Wallace>
<http://dbpedia.org/ontology/viafId>
"68975157"^^<http://www.w3.org/2001/XMLSchema#string> .

WikiPedia API: https://www.mediawiki.org/wiki/API:Main_page
rank = revisions_count +
(10 * article_length) +
number_external_links

WorldCat Discovery API
Things -> Strings -> Things
creator:[author name]
name:[book title]
subject:[author name | country name | book title]
Add number of results to ranking
rank += 10 * number_dapi_results

Credits
Francis Kayiwa
Emily Flynn
Shawn Denny
Scott Hanrath
Bilal Khalid
Rachel Maderik
w/ OCLC’s Jeff Young, SPARQL coach
github.com/oclc-developer-house/thirdpartyapi

Recommandé

Commodity Semantic Search: A Case Study of DiscoverEdNathan Yergler

A hint of_mintPeter Sefton

Finding sci tech grey literature informationMatthew Von Hendy

Bridging Batch and Real-time Systems for Anomaly DetectionDataWorks Summit

The agINFRA Linked Data layer by Valeria Pesce, Giovanni l'Abate, Luca Mattei...CIARD Movement

Bioschemas overviewBioschemas

Texas sla presentation finding sci tech grey literature informationMatthew Von Hendy

2015 09 rda-pre-meeting_jkJohannes Keizer

Recommandé

Commodity Semantic Search: A Case Study of DiscoverEdNathan Yergler

A hint of_mintPeter Sefton

Finding sci tech grey literature informationMatthew Von Hendy

Bridging Batch and Real-time Systems for Anomaly DetectionDataWorks Summit

The agINFRA Linked Data layer by Valeria Pesce, Giovanni l'Abate, Luca Mattei...CIARD Movement

Bioschemas overviewBioschemas

Texas sla presentation finding sci tech grey literature informationMatthew Von Hendy

2015 09 rda-pre-meeting_jkJohannes Keizer

Linked data - NCompass presentationRobin Hastings

How open is open? An evaluation rubric for public knowledgebasesmhaendel

Finding grey literatureKosjanka

2009 11 icudlJohannes Keizer

Describing Scientific Datasets: The HCLS Community ProfileAlasdair Gray

Accidental Discovery, Intentional Inquiry: Leveraging Linked Data to Uncover ...Cristina Pattuelli

Science in the open, what does it take?mhaendel

Semantic HTMLhchen1

The Open Access Community, and OAIsterJessica Hedgecock and John Shannon

Make your data great again - Ver 2Daniel JACOB

Drupal Calendaring, A Technological SolutionMatthew Farina

The ENCODE Portal REST API ENCODE-DCC

Reusable data for biomedicine: A data licensing odysseymhaendel

05 SPARQL queries over Open Land Use, Open Transport Net and Smart Points Of ...plan4all

Data Vault vs Data Lake: What's the difference?Fru Louis

Bio2RDF presentation at Combine 2012François Belleau

FundRef October 2013Crossref

Sherborn: Lyal - Digitising legacy taxonomic literature: processes, products ...ICZN

Linked data experiments at the National Library of Scotland / Alexandra De Pr...CIGScotland

New Tools for an Old Art: Rhetorical Analysis Through Visualization and PlayShannan Butler

Test map 1startmetjan

Bomenpark groep 3a 28 februari 2011startmetjan

Contenu connexe

Tendances

Linked data - NCompass presentationRobin Hastings

How open is open? An evaluation rubric for public knowledgebasesmhaendel

Finding grey literatureKosjanka

2009 11 icudlJohannes Keizer

Describing Scientific Datasets: The HCLS Community ProfileAlasdair Gray

Accidental Discovery, Intentional Inquiry: Leveraging Linked Data to Uncover ...Cristina Pattuelli

Science in the open, what does it take?mhaendel

Semantic HTMLhchen1

The Open Access Community, and OAIsterJessica Hedgecock and John Shannon

Make your data great again - Ver 2Daniel JACOB

Drupal Calendaring, A Technological SolutionMatthew Farina

The ENCODE Portal REST API ENCODE-DCC

Reusable data for biomedicine: A data licensing odysseymhaendel

05 SPARQL queries over Open Land Use, Open Transport Net and Smart Points Of ...plan4all

Data Vault vs Data Lake: What's the difference?Fru Louis

Bio2RDF presentation at Combine 2012François Belleau

FundRef October 2013Crossref

Sherborn: Lyal - Digitising legacy taxonomic literature: processes, products ...ICZN

Linked data experiments at the National Library of Scotland / Alexandra De Pr...CIGScotland

New Tools for an Old Art: Rhetorical Analysis Through Visualization and PlayShannan Butler

Tendances (20)

Linked data - NCompass presentation

How open is open? An evaluation rubric for public knowledgebases

Finding grey literature

2009 11 icudl

Describing Scientific Datasets: The HCLS Community Profile

Accidental Discovery, Intentional Inquiry: Leveraging Linked Data to Uncover ...

Science in the open, what does it take?

Semantic HTML

The Open Access Community, and OAIster

Make your data great again - Ver 2

Drupal Calendaring, A Technological Solution

The ENCODE Portal REST API

Reusable data for biomedicine: A data licensing odyssey

05 SPARQL queries over Open Land Use, Open Transport Net and Smart Points Of ...

Data Vault vs Data Lake: What's the difference?

Bio2RDF presentation at Combine 2012

FundRef October 2013

Sherborn: Lyal - Digitising legacy taxonomic literature: processes, products ...

Linked data experiments at the National Library of Scotland / Alexandra De Pr...

New Tools for an Old Art: Rhetorical Analysis Through Visualization and Play

En vedette

Test map 1startmetjan

Bomenpark groep 3a 28 februari 2011startmetjan

Using Event Tracking to Enhance Library Web Interfacesrshanrath

Streetwise groep 3a 2010startmetjan

Virtual Environments at the University of Kansas Librariesrshanrath

Verjaardagfeest van meneer janstartmetjan

Nationale voorleesdagen 2011startmetjan

Schoolreis 2010 september groep 3astartmetjan

En vedette (8)

Test map 1

Bomenpark groep 3a 28 februari 2011

Using Event Tracking to Enhance Library Web Interfaces

Streetwise groep 3a 2010

Virtual Environments at the University of Kansas Libraries

Verjaardagfeest van meneer jan

Nationale voorleesdagen 2011

Schoolreis 2010 september groep 3a

Similaire à DAPI Diem: Using Linked Data and the WorldCat Discovery API to surface timely holdings

GDG Meets U event - Big data & Wikidata - no lies codelabCAMELIA BOBAN

Linked Open Data Fundamentals for Libraries, Archives and Museumstrevorthornton

Exploring and using the Semantic Web - SSSW09 tutorialMathieu d'Aquin

Knowledge Technologies: Opportunities and ChallengesFariz Darari

20160818 Semantics and Linkage of Archived Catalogsandrea huang

SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)net2-project

Lifting the Lid on Linked DataJane Stevenson

IBC FAIR Data Prototype Implementation slideshowMark Wilkinson

RO-Crate: packaging metadata love notes into FAIR Digital ObjectsCarole Goble

Qpat 2007Sharon Brown-Peters

Metadata as Linked Data for Research Data Repositoriesandrea huang

FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...Mark Wilkinson

Identifying The Benefit of Linked DataRichard Wallis

Linked Data and Locah, UKSG2011 Jane Stevenson

The nature.com ontologies portal: nature.com/ontologiesTony Hammond

Informal presentation about RESChristophe Guéret

The Nature.com ontologies portal - Linked Science 2015Michele Pasin

Linked dataworkshopintro14aug2014Jane Stevenson

Linked Data and Discovery with Steve MeyerWiLS

December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...DeVonne Parks, CEM

Similaire à DAPI Diem: Using Linked Data and the WorldCat Discovery API to surface timely holdings (20)

GDG Meets U event - Big data & Wikidata - no lies codelab

Linked Open Data Fundamentals for Libraries, Archives and Museums

Exploring and using the Semantic Web - SSSW09 tutorial

Knowledge Technologies: Opportunities and Challenges

20160818 Semantics and Linkage of Archived Catalogs

SPARQL1.1 Tutorial, given in UChile by Axel Polleres (DERI)

Lifting the Lid on Linked Data

IBC FAIR Data Prototype Implementation slideshow

RO-Crate: packaging metadata love notes into FAIR Digital Objects

Qpat 2007

Metadata as Linked Data for Research Data Repositories

FAIR Data Prototype - Interoperability and FAIRness through a novel combinati...

Identifying The Benefit of Linked Data

Linked Data and Locah, UKSG2011

The nature.com ontologies portal: nature.com/ontologies

Informal presentation about RES

The Nature.com ontologies portal - Linked Science 2015

Linked dataworkshopintro14aug2014

Linked Data and Discovery with Steve Meyer

December 2, 2015: NISO/NFAIS Virtual Conference: Semantic Web: What's New and...

Dernier

Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko

Unblocking The Main Thread Solving ANRs and Frozen FramesSinan KOZAK

Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard

GenCyber Cyber Security Day PresentationMichael W. Hawkins

Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik

IAC 2024 - IA Fast Track to Search Focused AI SolutionsEnterprise Knowledge

CNv6 Instructor Chapter 6 Quality of Servicegiselly40

A Call to Action for Generative AI in 2024Results

Finology Group – Insurtech Innovation Award 2024The Digital Insurer

A Domino Admins Adventures (Engage 2024)Gabriella Davis

Swan(sea) Song – personal research during my six years at Swansea ... and bey...Alan Dix

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...Neo4j

Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure servicePooja Nehwal

Scaling API-first – The story of a global engineering organizationRadu Cotescu

The 7 Things I Know About Cyber Security After 25 Years | April 2024Rafal Los

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...HostedbyConfluent

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Drew Madelung

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024BookNet Canada

Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC

Dernier (20)

Handwritten Text Recognition for manuscripts and early printed texts

Unblocking The Main Thread Solving ANRs and Frozen Frames

Maximizing Board Effectiveness 2024 Webinar.pptx

GenCyber Cyber Security Day Presentation

Injustice - Developers Among Us (SciFiDevCon 2024)

IAC 2024 - IA Fast Track to Search Focused AI Solutions

CNv6 Instructor Chapter 6 Quality of Service

A Call to Action for Generative AI in 2024

Finology Group – Insurtech Innovation Award 2024

A Domino Admins Adventures (Engage 2024)

Swan(sea) Song – personal research during my six years at Swansea ... and bey...

Neo4j - How KGs are shaping the future of Generative AI at AWS Summit London ...

Transcript: #StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024

WhatsApp 9892124323 ✓Call Girls In Kalyan ( Mumbai ) secure service

Scaling API-first – The story of a global engineering organization

The 7 Things I Know About Cyber Security After 25 Years | April 2024

Transforming Data Streams with Kafka Connect: An Introduction to Single Messa...

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...

#StandardsGoals for 2024: What’s new for BISAC - Tech Forum 2024

Breaking the Kubernetes Kill Chain: Host Path Mount

DAPI Diem: Using Linked Data and the WorldCat Discovery API to surface timely holdings

1. DAPI Diem Or, Using Linked Data & the WorldCat Discovery API to surface timely holdings Scott Hanrath | shanrath@ku.edu | @rshanrath

2. Today in History...

3. Approach: Outside -> In 1. Use external sources to find interesting entities related to a given date 2. Feed those entities into a query to the Discovery API 3. Present the entities and a set of related holdings from WorldCat

4. [subject] [predicate] [object] [subject] [predicate] day SPARQL: http://dbpedia.org/sparql

5. ?entity a dbpedia-owl:Writer . ?entity ont:birthDate ?date . ?entity a ont:Book . ?entity ont:publicationDate ?date . ?entity a dbpedia-owl:Country. ?entity dbpedia-owl:foundingDate ?date .

6. <http://dbpedia.org/resource/David_Foster_Wallace> <http://www.w3.org/1999/02/22-rdf-syntax-ns#type> <http://dbpedia.org/ontology/Writer> . <http://dbpedia.org/resource/David_Foster_Wallace> <http://dbpedia.org/property/dateOfBirth> "1962-02-21"^^<http://www.w3.org/2001/XMLSchema#date> . <http://dbpedia.org/resource/David_Foster_Wallace> <http://dbpedia.org/ontology/viafId> "68975157"^^<http://www.w3.org/2001/XMLSchema#string> .

7. WikiPedia API: https://www.mediawiki.org/wiki/API:Main_page rank = revisions_count + (10 * article_length) + number_external_links

8. WorldCat Discovery API Things -> Strings -> Things creator:[author name] name:[book title] subject:[author name | country name | book title] Add number of results to ranking rank += 10 * number_dapi_results

10. DBPedia WorldCat

11. Good!

12. Not so good.

13. Judgement call?

14. Credits Francis Kayiwa Emily Flynn Shawn Denny Scott Hanrath Bilal Khalid Rachel Maderik w/ OCLC’s Jeff Young, SPARQL coach github.com/oclc-developer-house/thirdpartyapi

15. Thank you

Notes de l'éditeur

Important to consider the context: * OCLC's Developer House event, December 2015 * Focus on Linked Data and the WorldCat Discovery API - OCLC-provided venue and staff to help get up to speed on the Discovery API as well as Linked Data concepts * Work with colleagues to prototype a tool or service PROTOTYPE: I'll present the prototype my team -- and it was a team, who I am attempting to represent -- worked on, what it was intended to demonstrate, and how it works. I think it's cool, but don't confuse this with anything production-ready. Which won't be hard, because I want to highlight some "opportunities" for future along these lines ... by which I mean: not all of this worked very well, and I want to talk about the shortcomings and hard problems we encountered as well as the things that worked.
The pitch for this tool would be something like "This day is history" * could we leverage Linked Data and the discovery API to highlight holdings for a given day? * applications could be: - use to drive a recommendation tile on a search results page or a website - use this to drive content for digital signage or other building displays * Something fully automated would be great, but even a weekly email of "interesting things for the next week" could be useful.
Approach: Outside -> In * use external sources to find *interesting* *entities* related to a given date * feed those entities into a query to the Discovery API * present the entities and a set of interesting holdings from WorldCat Two things here: * entities: what should we be searching for? how is that thing related to the date? * interesting: how do we know whether an entity is interesting?
DBpedia So the entities...Find entities via DBpedia * "DBpedia is a crowd-sourced community effort to extract structured information from Wikipedia and make this information available on the Web" * WikiPedia as Linked Data * Things not strings * RDF (Resource Description Framework) data model makes statements using triples; subject -> predicate -> object * we were looking for subject -> predicate -> date DBpedia has a SPARQL interface * (SPARQL = SPAQRL Protocol and RDF Query Language) * So we can create queries against DBpedia to find entities related to our data
Tried out a bunch of different examples to see what worked. * Authors who were born on a date * Books that were published on a date * Countries that were founded on a date (Not going to get too into the nitty-gritty here, code is on github)
Statements about the dbPedia resource David Foster Wallace * fits our criteria * also, sorta-kinda gives us interesting information like a VIAF id
Now about "interesting" * I don't know if you've noticed, but there's a lot of stuff in WikiPedia -- how do we rank the date-related entities? * each entity has a WikiPedia page and each WikiPedia will tell you things about itself through the WikiPedia API * using that relationship we can pull some data to use to compute a score for each entity that we can sort on Rank = revision_count + (10 * article_length) + number_external_links (all of the WikiPedia variables are normalized) Political ELement to this that I don't want to gloss over entirely: we're highlighting things that are well-represented in WikiPedia. Worth considering what limitations that puts on what we're highlighting -- and more importantly what we're not seeing
Here's where the "outside-in" approach gets tricky * A lot of what me might like to do in a linked data way, we can't do with the Discovery API at this point * We need to go back from our "thing" to some "strings" -- we map some string values from our entities to the query-able indices in the Discovery API * And we add the number of results back to the ranking to indicate more interest * Big opportunity for improvement, even for string, by, say, reconciling against FAST and plugging those strings (if not the URIs) into DAPI Diem * Could also further mine the Statements about the entity to find futher information for use with DAPI Diem (e.g., look up works and query for them more specifically), look up related topcis/authors
Example of some things DAPI Diem brings up for November 11 - image where available - statement about the relationship to the day - Description - 5 items from WorldCat
Bigger image * sources * thumbnail images never made it out of the backlog... * This works pretty well, some duplication but useful enough...
Not everything works that well the good...
….the bad...
...the ugly
Credits and code My fantastic teammates A nod to OCLC’s help along the way Code is on GitHub