SlideShare une entreprise Scribd logo
1  sur  16
Télécharger pour lire hors ligne
LINKED DATA EXPERIENCE AT MACMILLAN 
Building discovery services for scientific and 
scholarly content on top of a semantic data model 
22 October 2014 
Tony Hammond 
Michele Pasin
Background 
About Macmillan and what we are doing 
Linked Data at Macmillan | 22 October 2014 1
Macmillan Science and Education 
Group brands and businesses 
Linked Data at Macmillan | 22 October 2014
MS&E Current trends 
Developing a richer graph of objects 
Change Drivers 
● Digital first workflow 
– print becomes secondary 
– support for multiple workflows 
● User-centric design 
– things, not data 
– focus on user experience 
● Deeply integrated datasets 
– standard naming convention 
– common metadata model 
– flexible schema management 
– rich dataset descriptions 
Linked Data at Macmillan | 22 October 2014
NPG Linked Data Platform (2012) 
data.nature.com 
Deliverables (2012–2014) 
● Prototype for external use 
● Two RDF dataset releases in 2012 
– April 2012 (22m triples) 
– July 2012 (270m triples) 
● Live updates to query endpoint 
● SPARQL query service (now terminated) 
Current Work (2014–) 
● Focus on internal use-cases 
● Publish ontology pages 
● Periodic data snapshots (no endpoint) 
Linked Data at Macmillan | 22 October 2014
NPG Core Ontology (2014) 
Things: assets, documents, events, types 
Features 
● Classes: ~65 
● Properties: ~200 
● Named graphs (per class) 
Namespaces 
● npg: => http://ns.nature.com/terms/ 
● npgg: => http://ns.nature.com/graphs/ 
Approach 
● Minimal commitment to external vocabs 
● Incremental formalization (RDF, RDFS, OWL-DL) 
● Shared metamodel vs. automatic inference 
Linked Data at Macmillan | 22 October 2014
NPG Subject Pages (2014) 
Topical access to content 
Features 
● Based on SKOS taxonomy 
– >2750 scientific terms 
– content inherited via SKOS tree 
● Completely automated 
– one webpage per subject term 
– structure based on article type 
– secondary pages for specific types 
● Various formats e.g. eAlerts, feeds, etc. 
– allows people to ‘follow’ a subject 
● Customized related content 
– ads, jobs, events, etc. 
Linked Data at Macmillan | 22 October 2014
Data Storage and Query 
Achieving speed by means of a hybrid architecture 
Linked Data at Macmillan | 22 October 2014 2
Content Hub 
Managed content warehouse for data discovery 
Capabilities 
● Discovery – Graph 
● Storage – Content Repos 
Features 
● Hybrid RDF + XML architecture 
– MarkLogic for XML, RDF/XML 
– Triplestore (TDB) for RDF validation 
● Repo’s for binary assets 
Datasets 
● Documents (large; >1m) 
● Ontologies (small; <10k) 
Linked Data at Macmillan | 22 October 2014
System Architecture 
Hub content 
Linked Data at Macmillan | 22 October 2014
Content Discovery – Principles 
Readying the API for applications 
Generations 
● 1st – Generic linked data API (RDF/*) 
● 2nd – Specific page model API (JSON) 
Concerns 
● Speed (20ms single object; 200ms filtered object) 
● Simplicity (data construction) 
● Stability (backup, clustering, security, transactions) 
Principles 
● Chunky not chatty, all data in a single response 
● Data as consumed, rather than as stored 
● Support common use cases in simple, obvious ways 
● Ensure a guaranteed, consistent speed of response for more complex queries 
● Build on foundation of standard, pragmatic REST (collections, items) 
Linked Data at Macmillan | 22 October 2014
Content Discovery – Optimization 
Tuning the API for performance 
Approaches 
● TDB + Fuseki – SPARQL 
● MarkLogic Semantics – SPARQL 
● MarkLogic – XQuery 
● MarkLogic (Optimized) – XQuery 
Techniques 
● Partitioning – RDF/XML objects 
● Streaming – serialization 
● Hashing – dictionary lookup 
● Cacheing – Varnish 
Linked Data at Macmillan | 22 October 2014
Content Storage – Layout and Indexing 
Readying the data for page delivery 
Challenges 
● Sort orders 
● RDF Lists 
● Facetting, counting 
Layout 
● Semantic RDF/XML includes in XML 
● RDF objects serialized in list order 
● Application XML for subject hierarchy 
Indexes 
● Indexes over all elements 
● Range indexes for datatypes (e.g. dates) 
Linked Data at Macmillan | 22 October 2014
Content Storage – Example 
Semantic metadata 
Techniques 
● XML header for semantic metadata 
● All article data is localized 
● Maintain named graphs via 
<graph/> elements 
● RDF/XML-ABBREV 
● Simple XML :: JSON mapping 
Linked Data at Macmillan | 22 October 2014
In Conclusion 
A few lessons learned 
Summary 
● An RDF metamodel allows for scalable enterprise-level data organization 
● It is crucial to adequately distinguish between internal and external use cases 
● A hybrid architecture proved to be an efficient internal solution for content delivery 
Future Work 
● Grow the ontology so that it matches product requirements more closely 
● Allow for more advanced automatic inferencing 
● Provide richer query options both via the API and SPARQL endpoints 
● Maintain and expand the vision of a shared semantic model as a core enterprise asset 
Linked Data at Macmillan | 22 October 2014
For more information 
please contact 
TONY HAMMOND 
Data Architect, Content Data Services 
tony.hammond@macmillan.com 
MICHELE PASIN 
Information Architect, Product Office 
michele.pasin@macmillan.com 
Thank you

Contenu connexe

Tendances

Analytics and Access to the UK web archive
Analytics and Access to the UK web archiveAnalytics and Access to the UK web archive
Analytics and Access to the UK web archive
Lewis Crawford
 
Rdf and open linked data a first approach
Rdf and open linked data a first approach Rdf and open linked data a first approach
Rdf and open linked data a first approach
@CULT Srl
 
Cenitpede: Analyzing Webcrawl
Cenitpede: Analyzing WebcrawlCenitpede: Analyzing Webcrawl
Cenitpede: Analyzing Webcrawl
Primal Pappachan
 

Tendances (20)

Ephedra: efficiently combining RDF data and services using SPARQL federation
Ephedra: efficiently combining RDF data and services using SPARQL federationEphedra: efficiently combining RDF data and services using SPARQL federation
Ephedra: efficiently combining RDF data and services using SPARQL federation
 
Scripting User Contributed Interlinking
Scripting User Contributed InterlinkingScripting User Contributed Interlinking
Scripting User Contributed Interlinking
 
Scalable Web Data Management using RDF
Scalable Web Data Management using RDF  Scalable Web Data Management using RDF
Scalable Web Data Management using RDF
 
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...Benchmarking RDF Metadata Representations: Reification, Singleton Property an...
Benchmarking RDF Metadata Representations: Reification, Singleton Property an...
 
Beyond 2022 project presentation 2021
Beyond 2022 project presentation 2021Beyond 2022 project presentation 2021
Beyond 2022 project presentation 2021
 
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the CloudFirst Steps in Semantic Data Modelling and Search & Analytics in the Cloud
First Steps in Semantic Data Modelling and Search & Analytics in the Cloud
 
Contextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of EntitiesContextual Computing - Knowledge Graphs & Web of Entities
Contextual Computing - Knowledge Graphs & Web of Entities
 
Getting Started with Knowledge Graphs
Getting Started with Knowledge GraphsGetting Started with Knowledge Graphs
Getting Started with Knowledge Graphs
 
Analytics and Access to the UK web archive
Analytics and Access to the UK web archiveAnalytics and Access to the UK web archive
Analytics and Access to the UK web archive
 
LD4KD 2015 - Demos and tools
LD4KD 2015 - Demos and toolsLD4KD 2015 - Demos and tools
LD4KD 2015 - Demos and tools
 
The Bounties of Semantic Data Integration for the Enterprise
The Bounties of Semantic Data Integration for the Enterprise The Bounties of Semantic Data Integration for the Enterprise
The Bounties of Semantic Data Integration for the Enterprise
 
Enterprise knowledge graphs
Enterprise knowledge graphsEnterprise knowledge graphs
Enterprise knowledge graphs
 
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open DataMuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
MuseoTorino, first italian project using a GraphDB, RDFa, Linked Open Data
 
Finding Data Sets
Finding Data SetsFinding Data Sets
Finding Data Sets
 
The RDF Report Card: Beyond the Triple Count
The RDF Report Card: Beyond the Triple CountThe RDF Report Card: Beyond the Triple Count
The RDF Report Card: Beyond the Triple Count
 
Do it on your own - From 3 to 5 Star Linked Open Data with RMLio
Do it on your own - From 3 to 5 Star Linked Open Data with RMLioDo it on your own - From 3 to 5 Star Linked Open Data with RMLio
Do it on your own - From 3 to 5 Star Linked Open Data with RMLio
 
Rdf and open linked data a first approach
Rdf and open linked data a first approach Rdf and open linked data a first approach
Rdf and open linked data a first approach
 
Text and Data Mining at Springer Nature
Text and Data Mining at Springer NatureText and Data Mining at Springer Nature
Text and Data Mining at Springer Nature
 
Cenitpede: Analyzing Webcrawl
Cenitpede: Analyzing WebcrawlCenitpede: Analyzing Webcrawl
Cenitpede: Analyzing Webcrawl
 
Data Integration & Disintegration: Managing SN SciGraph with SHACL and OWL
Data Integration & Disintegration: Managing SN SciGraph with SHACL and OWLData Integration & Disintegration: Managing SN SciGraph with SHACL and OWL
Data Integration & Disintegration: Managing SN SciGraph with SHACL and OWL
 

Similaire à Linked data experience at Macmillan: Building discovery services for scientific and scholarly content on top of a semantic data model

A Mobile-First, Cloud-First Stack at Pearson
A Mobile-First, Cloud-First Stack at PearsonA Mobile-First, Cloud-First Stack at Pearson
A Mobile-First, Cloud-First Stack at Pearson
MongoDB
 
Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012
scorlosquet
 
Tools for Next Generation of CMS: XML, RDF, & GRDDL
Tools for Next Generation of CMS: XML, RDF, & GRDDLTools for Next Generation of CMS: XML, RDF, & GRDDL
Tools for Next Generation of CMS: XML, RDF, & GRDDL
Chimezie Ogbuji
 
Michael Lang Sr. Presentation
Michael Lang Sr. PresentationMichael Lang Sr. Presentation
Michael Lang Sr. Presentation
Mediabistro
 
Intro to-technologies-Green-City-Hackathon-Athens
Intro to-technologies-Green-City-Hackathon-AthensIntro to-technologies-Green-City-Hackathon-Athens
Intro to-technologies-Green-City-Hackathon-Athens
Stoitsis Giannis
 
Semantic Technologies for Big Data
Semantic Technologies for Big DataSemantic Technologies for Big Data
Semantic Technologies for Big Data
Marin Dimitrov
 
E-ARK-iPRES2016-Bern-October-2016
E-ARK-iPRES2016-Bern-October-2016E-ARK-iPRES2016-Bern-October-2016
E-ARK-iPRES2016-Bern-October-2016
Sven Schlarb
 

Similaire à Linked data experience at Macmillan: Building discovery services for scientific and scholarly content on top of a semantic data model (20)

Graph basedrdf storeforapachecassandra
Graph basedrdf storeforapachecassandraGraph basedrdf storeforapachecassandra
Graph basedrdf storeforapachecassandra
 
A Mobile-First, Cloud-First Stack at Pearson
A Mobile-First, Cloud-First Stack at PearsonA Mobile-First, Cloud-First Stack at Pearson
A Mobile-First, Cloud-First Stack at Pearson
 
The Semantic Web and Drupal 7 - Loja 2013
The Semantic Web and Drupal 7 - Loja 2013The Semantic Web and Drupal 7 - Loja 2013
The Semantic Web and Drupal 7 - Loja 2013
 
Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012Slides semantic web and Drupal 7 NYCCamp 2012
Slides semantic web and Drupal 7 NYCCamp 2012
 
Why I don't use Semantic Web technologies anymore, event if they still influe...
Why I don't use Semantic Web technologies anymore, event if they still influe...Why I don't use Semantic Web technologies anymore, event if they still influe...
Why I don't use Semantic Web technologies anymore, event if they still influe...
 
Tools for Next Generation of CMS: XML, RDF, & GRDDL
Tools for Next Generation of CMS: XML, RDF, & GRDDLTools for Next Generation of CMS: XML, RDF, & GRDDL
Tools for Next Generation of CMS: XML, RDF, & GRDDL
 
Rdap12 wrap up reagan moore
Rdap12 wrap up reagan mooreRdap12 wrap up reagan moore
Rdap12 wrap up reagan moore
 
Drupal and the Semantic Web - ESIP Webinar
Drupal and the Semantic Web - ESIP WebinarDrupal and the Semantic Web - ESIP Webinar
Drupal and the Semantic Web - ESIP Webinar
 
Manchesterjan2015
Manchesterjan2015Manchesterjan2015
Manchesterjan2015
 
Michael Lang Sr. Presentation
Michael Lang Sr. PresentationMichael Lang Sr. Presentation
Michael Lang Sr. Presentation
 
Opening up MOOCs for OER management on the Web of linked data
Opening up MOOCs for OER management on the Web of linked dataOpening up MOOCs for OER management on the Web of linked data
Opening up MOOCs for OER management on the Web of linked data
 
The nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologiesThe nature.com ontologies portal: nature.com/ontologies
The nature.com ontologies portal: nature.com/ontologies
 
Intro to-technologies-Green-City-Hackathon-Athens
Intro to-technologies-Green-City-Hackathon-AthensIntro to-technologies-Green-City-Hackathon-Athens
Intro to-technologies-Green-City-Hackathon-Athens
 
Semantic Technologies for Big Data
Semantic Technologies for Big DataSemantic Technologies for Big Data
Semantic Technologies for Big Data
 
Linked Data Competency Index : Mapping the field for teachers and learners
 Linked Data Competency Index : Mapping the field for teachers and learners Linked Data Competency Index : Mapping the field for teachers and learners
Linked Data Competency Index : Mapping the field for teachers and learners
 
SWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic WebSWIB14 Weaving repository contents into the Semantic Web
SWIB14 Weaving repository contents into the Semantic Web
 
Describing Theses and Dissertations Using Schema.org
Describing Theses and Dissertations Using Schema.orgDescribing Theses and Dissertations Using Schema.org
Describing Theses and Dissertations Using Schema.org
 
Research Plan 2014
Research Plan 2014Research Plan 2014
Research Plan 2014
 
E-ARK-iPRES2016-Bern-October-2016
E-ARK-iPRES2016-Bern-October-2016E-ARK-iPRES2016-Bern-October-2016
E-ARK-iPRES2016-Bern-October-2016
 
On-Demand RDF Graph Databases in the Cloud
On-Demand RDF Graph Databases in the CloudOn-Demand RDF Graph Databases in the Cloud
On-Demand RDF Graph Databases in the Cloud
 

Plus de Michele Pasin

Prosopography and Computer Ontologies: Towards a Formal Representation of the...
Prosopography and Computer Ontologies: Towards a Formal Representation of the...Prosopography and Computer Ontologies: Towards a Formal Representation of the...
Prosopography and Computer Ontologies: Towards a Formal Representation of the...
Michele Pasin
 
Digital Humanities 2009 - Laying out the conceptual foundations for data inte...
Digital Humanities 2009 - Laying out the conceptual foundations for data inte...Digital Humanities 2009 - Laying out the conceptual foundations for data inte...
Digital Humanities 2009 - Laying out the conceptual foundations for data inte...
Michele Pasin
 
An Ontological View of Canonical Citations
An Ontological View of Canonical CitationsAn Ontological View of Canonical Citations
An Ontological View of Canonical Citations
Michele Pasin
 
DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...
DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...
DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...
Michele Pasin
 
Livecoding with impromptu
Livecoding with impromptuLivecoding with impromptu
Livecoding with impromptu
Michele Pasin
 
Introducing FRBR-OO (CCH KR workshop 2.2)
Introducing FRBR-OO (CCH KR workshop 2.2)Introducing FRBR-OO (CCH KR workshop 2.2)
Introducing FRBR-OO (CCH KR workshop 2.2)
Michele Pasin
 
Introducing CIDOC-CRM (Cch KR workshop #2.1)
Introducing CIDOC-CRM (Cch KR workshop #2.1)Introducing CIDOC-CRM (Cch KR workshop #2.1)
Introducing CIDOC-CRM (Cch KR workshop #2.1)
Michele Pasin
 
KR Workshop 1 - Ontologies
KR Workshop 1 - OntologiesKR Workshop 1 - Ontologies
KR Workshop 1 - Ontologies
Michele Pasin
 

Plus de Michele Pasin (13)

Designing great dashboards: a slidedeck for dashboard developers
Designing great dashboards: a slidedeck for dashboard developersDesigning great dashboards: a slidedeck for dashboard developers
Designing great dashboards: a slidedeck for dashboard developers
 
STI 2022 - Generating large-scale network analyses of scientific landscapes i...
STI 2022 - Generating large-scale network analyses of scientific landscapes i...STI 2022 - Generating large-scale network analyses of scientific landscapes i...
STI 2022 - Generating large-scale network analyses of scientific landscapes i...
 
How do philosophers think their own disciplines?
How do philosophers think their own disciplines?How do philosophers think their own disciplines?
How do philosophers think their own disciplines?
 
Exploring highly interconnected humanities data: are faceted browsers always ...
Exploring highly interconnected humanities data: are faceted browsers always ...Exploring highly interconnected humanities data: are faceted browsers always ...
Exploring highly interconnected humanities data: are faceted browsers always ...
 
Semantic Web Approaches in Digital History: an Introduction
Semantic Web Approaches in Digital History: an IntroductionSemantic Web Approaches in Digital History: an Introduction
Semantic Web Approaches in Digital History: an Introduction
 
Prosopography and Computer Ontologies: Towards a Formal Representation of the...
Prosopography and Computer Ontologies: Towards a Formal Representation of the...Prosopography and Computer Ontologies: Towards a Formal Representation of the...
Prosopography and Computer Ontologies: Towards a Formal Representation of the...
 
Digital Humanities 2009 - Laying out the conceptual foundations for data inte...
Digital Humanities 2009 - Laying out the conceptual foundations for data inte...Digital Humanities 2009 - Laying out the conceptual foundations for data inte...
Digital Humanities 2009 - Laying out the conceptual foundations for data inte...
 
An Ontological View of Canonical Citations
An Ontological View of Canonical CitationsAn Ontological View of Canonical Citations
An Ontological View of Canonical Citations
 
DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...
DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...
DH11: Browsing Highly Interconnected Humanities Databases Through Multi-Resul...
 
Livecoding with impromptu
Livecoding with impromptuLivecoding with impromptu
Livecoding with impromptu
 
Introducing FRBR-OO (CCH KR workshop 2.2)
Introducing FRBR-OO (CCH KR workshop 2.2)Introducing FRBR-OO (CCH KR workshop 2.2)
Introducing FRBR-OO (CCH KR workshop 2.2)
 
Introducing CIDOC-CRM (Cch KR workshop #2.1)
Introducing CIDOC-CRM (Cch KR workshop #2.1)Introducing CIDOC-CRM (Cch KR workshop #2.1)
Introducing CIDOC-CRM (Cch KR workshop #2.1)
 
KR Workshop 1 - Ontologies
KR Workshop 1 - OntologiesKR Workshop 1 - Ontologies
KR Workshop 1 - Ontologies
 

Dernier

Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRLLucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
imonikaupta
 
Low Sexy Call Girls In Mohali 9053900678 🥵Have Save And Good Place 🥵
Low Sexy Call Girls In Mohali 9053900678 🥵Have Save And Good Place 🥵Low Sexy Call Girls In Mohali 9053900678 🥵Have Save And Good Place 🥵
Low Sexy Call Girls In Mohali 9053900678 🥵Have Save And Good Place 🥵
Chandigarh Call girls 9053900678 Call girls in Chandigarh
 
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
@Chandigarh #call #Girls 9053900678 @Call #Girls in @Punjab 9053900678
 

Dernier (20)

Russian Call Girls Pune (Adult Only) 8005736733 Escort Service 24x7 Cash Pay...
Russian Call Girls Pune  (Adult Only) 8005736733 Escort Service 24x7 Cash Pay...Russian Call Girls Pune  (Adult Only) 8005736733 Escort Service 24x7 Cash Pay...
Russian Call Girls Pune (Adult Only) 8005736733 Escort Service 24x7 Cash Pay...
 
Yerawada ] Independent Escorts in Pune - Book 8005736733 Call Girls Available...
Yerawada ] Independent Escorts in Pune - Book 8005736733 Call Girls Available...Yerawada ] Independent Escorts in Pune - Book 8005736733 Call Girls Available...
Yerawada ] Independent Escorts in Pune - Book 8005736733 Call Girls Available...
 
Moving Beyond Twitter/X and Facebook - Social Media for local news providers
Moving Beyond Twitter/X and Facebook - Social Media for local news providersMoving Beyond Twitter/X and Facebook - Social Media for local news providers
Moving Beyond Twitter/X and Facebook - Social Media for local news providers
 
VVIP Pune Call Girls Sinhagad WhatSapp Number 8005736733 With Elite Staff And...
VVIP Pune Call Girls Sinhagad WhatSapp Number 8005736733 With Elite Staff And...VVIP Pune Call Girls Sinhagad WhatSapp Number 8005736733 With Elite Staff And...
VVIP Pune Call Girls Sinhagad WhatSapp Number 8005736733 With Elite Staff And...
 
Shikrapur - Call Girls in Pune Neha 8005736733 | 100% Gennuine High Class Ind...
Shikrapur - Call Girls in Pune Neha 8005736733 | 100% Gennuine High Class Ind...Shikrapur - Call Girls in Pune Neha 8005736733 | 100% Gennuine High Class Ind...
Shikrapur - Call Girls in Pune Neha 8005736733 | 100% Gennuine High Class Ind...
 
Russian Call girl in Ajman +971563133746 Ajman Call girl Service
Russian Call girl in Ajman +971563133746 Ajman Call girl ServiceRussian Call girl in Ajman +971563133746 Ajman Call girl Service
Russian Call girl in Ajman +971563133746 Ajman Call girl Service
 
Call Now ☎ 8264348440 !! Call Girls in Rani Bagh Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Rani Bagh Escort Service Delhi N.C.R.Call Now ☎ 8264348440 !! Call Girls in Rani Bagh Escort Service Delhi N.C.R.
Call Now ☎ 8264348440 !! Call Girls in Rani Bagh Escort Service Delhi N.C.R.
 
Enjoy Night⚡Call Girls Samalka Delhi >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Samalka Delhi >༒8448380779 Escort ServiceEnjoy Night⚡Call Girls Samalka Delhi >༒8448380779 Escort Service
Enjoy Night⚡Call Girls Samalka Delhi >༒8448380779 Escort Service
 
WhatsApp 📞 8448380779 ✅Call Girls In Mamura Sector 66 ( Noida)
WhatsApp 📞 8448380779 ✅Call Girls In Mamura Sector 66 ( Noida)WhatsApp 📞 8448380779 ✅Call Girls In Mamura Sector 66 ( Noida)
WhatsApp 📞 8448380779 ✅Call Girls In Mamura Sector 66 ( Noida)
 
(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7
(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7
(INDIRA) Call Girl Pune Call Now 8250077686 Pune Escorts 24x7
 
Al Barsha Night Partner +0567686026 Call Girls Dubai
Al Barsha Night Partner +0567686026 Call Girls  DubaiAl Barsha Night Partner +0567686026 Call Girls  Dubai
Al Barsha Night Partner +0567686026 Call Girls Dubai
 
Nanded City ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready ...
Nanded City ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready ...Nanded City ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready ...
Nanded City ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready ...
 
Katraj ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For S...
Katraj ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For S...Katraj ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For S...
Katraj ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For S...
 
Dubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls Dubai
Dubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls DubaiDubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls Dubai
Dubai=Desi Dubai Call Girls O525547819 Outdoor Call Girls Dubai
 
Hire↠Young Call Girls in Tilak nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esc...
Hire↠Young Call Girls in Tilak nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esc...Hire↠Young Call Girls in Tilak nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esc...
Hire↠Young Call Girls in Tilak nagar (Delhi) ☎️ 9205541914 ☎️ Independent Esc...
 
Wagholi & High Class Call Girls Pune Neha 8005736733 | 100% Gennuine High Cla...
Wagholi & High Class Call Girls Pune Neha 8005736733 | 100% Gennuine High Cla...Wagholi & High Class Call Girls Pune Neha 8005736733 | 100% Gennuine High Cla...
Wagholi & High Class Call Girls Pune Neha 8005736733 | 100% Gennuine High Cla...
 
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRLLucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
Lucknow ❤CALL GIRL 88759*99948 ❤CALL GIRLS IN Lucknow ESCORT SERVICE❤CALL GIRL
 
Real Men Wear Diapers T Shirts sweatshirt
Real Men Wear Diapers T Shirts sweatshirtReal Men Wear Diapers T Shirts sweatshirt
Real Men Wear Diapers T Shirts sweatshirt
 
Low Sexy Call Girls In Mohali 9053900678 🥵Have Save And Good Place 🥵
Low Sexy Call Girls In Mohali 9053900678 🥵Have Save And Good Place 🥵Low Sexy Call Girls In Mohali 9053900678 🥵Have Save And Good Place 🥵
Low Sexy Call Girls In Mohali 9053900678 🥵Have Save And Good Place 🥵
 
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
6.High Profile Call Girls In Punjab +919053900678 Punjab Call GirlHigh Profil...
 

Linked data experience at Macmillan: Building discovery services for scientific and scholarly content on top of a semantic data model

  • 1. LINKED DATA EXPERIENCE AT MACMILLAN Building discovery services for scientific and scholarly content on top of a semantic data model 22 October 2014 Tony Hammond Michele Pasin
  • 2. Background About Macmillan and what we are doing Linked Data at Macmillan | 22 October 2014 1
  • 3. Macmillan Science and Education Group brands and businesses Linked Data at Macmillan | 22 October 2014
  • 4. MS&E Current trends Developing a richer graph of objects Change Drivers ● Digital first workflow – print becomes secondary – support for multiple workflows ● User-centric design – things, not data – focus on user experience ● Deeply integrated datasets – standard naming convention – common metadata model – flexible schema management – rich dataset descriptions Linked Data at Macmillan | 22 October 2014
  • 5. NPG Linked Data Platform (2012) data.nature.com Deliverables (2012–2014) ● Prototype for external use ● Two RDF dataset releases in 2012 – April 2012 (22m triples) – July 2012 (270m triples) ● Live updates to query endpoint ● SPARQL query service (now terminated) Current Work (2014–) ● Focus on internal use-cases ● Publish ontology pages ● Periodic data snapshots (no endpoint) Linked Data at Macmillan | 22 October 2014
  • 6. NPG Core Ontology (2014) Things: assets, documents, events, types Features ● Classes: ~65 ● Properties: ~200 ● Named graphs (per class) Namespaces ● npg: => http://ns.nature.com/terms/ ● npgg: => http://ns.nature.com/graphs/ Approach ● Minimal commitment to external vocabs ● Incremental formalization (RDF, RDFS, OWL-DL) ● Shared metamodel vs. automatic inference Linked Data at Macmillan | 22 October 2014
  • 7. NPG Subject Pages (2014) Topical access to content Features ● Based on SKOS taxonomy – >2750 scientific terms – content inherited via SKOS tree ● Completely automated – one webpage per subject term – structure based on article type – secondary pages for specific types ● Various formats e.g. eAlerts, feeds, etc. – allows people to ‘follow’ a subject ● Customized related content – ads, jobs, events, etc. Linked Data at Macmillan | 22 October 2014
  • 8. Data Storage and Query Achieving speed by means of a hybrid architecture Linked Data at Macmillan | 22 October 2014 2
  • 9. Content Hub Managed content warehouse for data discovery Capabilities ● Discovery – Graph ● Storage – Content Repos Features ● Hybrid RDF + XML architecture – MarkLogic for XML, RDF/XML – Triplestore (TDB) for RDF validation ● Repo’s for binary assets Datasets ● Documents (large; >1m) ● Ontologies (small; <10k) Linked Data at Macmillan | 22 October 2014
  • 10. System Architecture Hub content Linked Data at Macmillan | 22 October 2014
  • 11. Content Discovery – Principles Readying the API for applications Generations ● 1st – Generic linked data API (RDF/*) ● 2nd – Specific page model API (JSON) Concerns ● Speed (20ms single object; 200ms filtered object) ● Simplicity (data construction) ● Stability (backup, clustering, security, transactions) Principles ● Chunky not chatty, all data in a single response ● Data as consumed, rather than as stored ● Support common use cases in simple, obvious ways ● Ensure a guaranteed, consistent speed of response for more complex queries ● Build on foundation of standard, pragmatic REST (collections, items) Linked Data at Macmillan | 22 October 2014
  • 12. Content Discovery – Optimization Tuning the API for performance Approaches ● TDB + Fuseki – SPARQL ● MarkLogic Semantics – SPARQL ● MarkLogic – XQuery ● MarkLogic (Optimized) – XQuery Techniques ● Partitioning – RDF/XML objects ● Streaming – serialization ● Hashing – dictionary lookup ● Cacheing – Varnish Linked Data at Macmillan | 22 October 2014
  • 13. Content Storage – Layout and Indexing Readying the data for page delivery Challenges ● Sort orders ● RDF Lists ● Facetting, counting Layout ● Semantic RDF/XML includes in XML ● RDF objects serialized in list order ● Application XML for subject hierarchy Indexes ● Indexes over all elements ● Range indexes for datatypes (e.g. dates) Linked Data at Macmillan | 22 October 2014
  • 14. Content Storage – Example Semantic metadata Techniques ● XML header for semantic metadata ● All article data is localized ● Maintain named graphs via <graph/> elements ● RDF/XML-ABBREV ● Simple XML :: JSON mapping Linked Data at Macmillan | 22 October 2014
  • 15. In Conclusion A few lessons learned Summary ● An RDF metamodel allows for scalable enterprise-level data organization ● It is crucial to adequately distinguish between internal and external use cases ● A hybrid architecture proved to be an efficient internal solution for content delivery Future Work ● Grow the ontology so that it matches product requirements more closely ● Allow for more advanced automatic inferencing ● Provide richer query options both via the API and SPARQL endpoints ● Maintain and expand the vision of a shared semantic model as a core enterprise asset Linked Data at Macmillan | 22 October 2014
  • 16. For more information please contact TONY HAMMOND Data Architect, Content Data Services tony.hammond@macmillan.com MICHELE PASIN Information Architect, Product Office michele.pasin@macmillan.com Thank you