This document discusses a two-phase approach to introducing linked data from WorldCat records. Phase 1 involves mining existing MARC records to identify entities like persons, organizations, and subjects, and linking those strings to controlled vocabularies. Phase 2 models the data using schemas like Schema.org that are of interest to the web in order to share resources via linked data. The goal is to draw people to library resources by sharing in a web-native linked data format.
3. Representing the collective collection
in WorldCat Discovery and WorldCat.org
322+ million
bibliographic records
2+ billion holdings
980million records
38 million items
(Institutional repositories,
Google, HathiTrust, OAIster)
Bibliographic
information in
WorldCat
Licensed digital
content/articles in
library collections
Digitized
local library content
As of 11 June 2013
8. Structured
Data
Objectives
• Linking
with
hubs
of
authority
on
the
web
• viaf.org
–
persons
• Library
of
congress
–
subjects,
etc
9. Structured
Data
Objectives
• Linking
with
hubs
of
authority
on
the
web
• viaf.org
–
persons
• Library
of
congress
–
subjects,
etc
• Dewey.info
–
classifications
10. Structured
Data
Objectives
• Linking
with
hubs
of
authority
on
the
web
• viaf.org
–
persons
• Library
of
congress
–
subjects,
etc
• Dewey.info
–
classifications
• Dbpedia
–
most
things
11. Structured
Data
Objectives
• Linking
with
hubs
of
authority
on
the
web
• viaf.org
–
persons
• Library
of
congress
–
subjects,
etc
• Dewey.info
–
classifications
• Dbpedia
–
most
things
• Widely
distributed
&
understood
12. Structured
Data
Objectives
• Linking
with
hubs
of
authority
on
the
web
• viaf.org
–
persons
• Library
of
congress
–
subjects,
etc
• Dewey.info
–
classifications
• Dbpedia
–
most
things
• Widely
distributed
&
understood
• Web
standard
data
access
patterns
13. Structured
Data
Objectives
• Linking
with
hubs
of
authority
on
the
web
• viaf.org
–
persons
• Library
of
congress
–
subjects,
etc
• Dewey.info
–
classifications
• Dbpedia
–
most
things
• Widely
distributed
&
understood
• Web
standard
data
access
patterns
• Common
vocabularies
on
the
web
14. Structured
Data
Objectives
• Linking
with
hubs
of
authority
on
the
web
• viaf.org
–
persons
• Library
of
congress
–
subjects,
etc
• Dewey.info
–
classifications
• Dbpedia
–
most
things
• Widely
distributed
&
understood
• Web
standard
data
access
patterns
• Common
vocabularies
on
the
web
• Visibility
in
search
engines
15. Structured
Data
Objectives
• Linking
with
hubs
of
authority
on
the
web
• viaf.org
–
persons
• Library
of
congress
–
subjects,
etc
• Dewey.info
–
classifications
• Dbpedia
–
most
things
• Widely
distributed
&
understood
• Web
standard
data
access
patterns
• Common
vocabularies
on
the
web
• Visibility
Conclusions
in
search
engines
16. Structured
Data
Objectives
• Linking
with
hubs
of
authority
on
the
web
• viaf.org
–
persons
• Library
of
congress
–
subjects,
etc
• Dewey.info
–
classifications
• Dbpedia
–
most
things
• Widely
distributed
&
understood
• Web
standard
data
access
patterns
• Common
vocabularies
on
the
web
• Visibility
Conclusions
in
search
engines
• Linked
Data
17. Structured
Data
Objectives
• Linking
with
hubs
of
authority
on
the
web
• viaf.org
–
persons
• Library
of
congress
–
subjects,
etc
• Dewey.info
–
classifications
• Dbpedia
–
most
things
• Widely
distributed
&
understood
• Web
standard
data
access
patterns
• Common
vocabularies
on
the
web
• Visibility
Conclusions
in
search
engines
• Linked
Data
• RDF
–
RDFa,
RDF/XML,
JSON-‐LD,
Turtle,
nTriples
18. Structured
Data
Objectives
• Linking
with
hubs
of
authority
on
the
web
• viaf.org
–
persons
• Library
of
congress
–
subjects,
etc
• Dewey.info
–
classifications
• Dbpedia
–
most
things
• Widely
distributed
&
understood
• Web
standard
data
access
patterns
• Common
vocabularies
on
the
web
• Visibility
Conclusions
in
search
engines
• Linked
Data
• RDF
–
RDFa,
RDF/XML,
JSON-‐LD,
Turtle,
nTriples
• Canonical
URIs
19. Structured
Data
Objectives
• Linking
with
hubs
of
authority
on
the
web
• viaf.org
–
persons
• Library
of
congress
–
subjects,
etc
• Dewey.info
–
classifications
• Dbpedia
–
most
things
• Widely
distributed
&
understood
• Web
standard
data
access
patterns
• Common
vocabularies
on
the
web
• Visibility
Conclusions
in
search
engines
• Linked
Data
• RDF
–
RDFa,
RDF/XML,
JSON-‐LD,
Turtle,
nTriples
• Canonical
URIs
• Schema.org
20. Structured
Data
Objectives
• Linking
with
hubs
of
authority
on
the
web
• viaf.org
–
persons
• Library
of
congress
–
subjects,
etc
• Dewey.info
–
classifications
• Dbpedia
–
most
things
• Widely
distributed
&
understood
• Web
standard
data
access
patterns
• Common
vocabularies
on
the
web
• Visibility
Conclusions
in
search
engines
• Linked
Data
• RDF
–
RDFa,
RDF/XML,
JSON-‐LD,
Turtle,
nTriples
• Canonical
URIs
• Schema.org
• Backed
and
recognized
by
Google,
Bing,
Yahoo!,
Yandex
21. Structured
Data
Objectives
• Linking
with
hubs
of
authority
on
the
web
• viaf.org
–
persons
• Library
of
congress
–
subjects,
etc
• Dewey.info
–
classifications
• Dbpedia
–
most
things
• Widely
distributed
&
understood
• Web
standard
data
access
patterns
• Common
vocabularies
on
the
web
• Visibility
Conclusions
in
search
engines
• Linked
Data
• RDF
–
RDFa,
RDF/XML,
JSON-‐LD,
Turtle,
nTriples
• Canonical
URIs
• Schema.org
• Backed
and
recognized
by
Google,
Bing,
Yahoo!,
Yandex
• Widely
adopted
&
understood
–
15%
of
web
sites
22. • Linking
with
hubs
of
authority
on
the
web
• viaf.org
–
persons
• Library
of
congress
–
subjects,
etc
• Dewey.info
–
classifications
• Dbpedia
–
most
things
• Widely
distributed
&
understood
• Web
standard
data
access
patterns
• Common
vocabularies
on
the
web
• Visibility
obvious
Conclusions
in
search
engines
• Linked
Data
• RDF
–
RDFa,
RDF/XML,
JSON-‐LD,
Turtle,
nTriples
• Canonical
URIs
• Schema.org
• Backed
and
recognized
by
Google,
Bing,
Yahoo!,
Yandex
• Widely
adopted
&
understood
–
15%
of
web
sites
fairly
y
23. • Linking
with
hubs
of
authority
on
the
web
• viaf.org
–
persons
• Library
of
congress
–
subjects,
etc
• Dewey.info
–
classifications
• Dbpedia
–
most
things
• Widely
distributed
&
understood
• Web
standard
data
access
patterns
• Common
vocabularies
on
the
web
• Visibility
obvious
Conclusions
in
search
engines
• Linked
Data
• RDF
–
RDFa,
RDF/XML,
JSON-‐LD,
Turtle,
nTriples
• Canonical
URIs
• Schema.org
• Backed
and
recognized
by
Google,
Bing,
Yahoo!,
Yandex
• Widely
adopted
&
understood
–
15%
of
web
sites
fairly
y
27. Introducing
Linked
Data
Phase
1
• First
mine
the
data
• Records
held
in
Marc
• Identify
the
entities
28. Introducing
Linked
Data
Phase
1
• First
mine
the
data
• Records
held
in
Marc
• Identify
the
entities
• Person,
Organization,
CreativeWork,
etc.
29. Introducing
Linked
Data
Phase
1
• First
mine
the
data
• Records
held
in
Marc
• Identify
the
entities
• Person,
Organization,
CreativeWork,
etc.
• Match
strings
to
things
30. Introducing
Linked
Data
Phase
1
• First
mine
the
data
• Records
held
in
Marc
• Identify
the
entities
• Person,
Organization,
CreativeWork,
etc.
• Match
strings
to
things
• People/Organization
names
–
viaf.org,
etc
31. Introducing
Linked
Data
Phase
1
• First
mine
the
data
• Records
held
in
Marc
• Identify
the
entities
• Person,
Organization,
CreativeWork,
etc.
• Match
strings
to
things
• People/Organization
names
–
viaf.org,
etc
• Subjects
–
Library
of
Congress
32. Introducing
Linked
Data
Phase
1
• First
mine
the
data
• Records
held
in
Marc
• Identify
the
entities
• Person,
Organization,
CreativeWork,
etc.
• Match
strings
to
things
• People/Organization
names
–
viaf.org,
etc
• Subjects
–
Library
of
Congress
33. Introducing
Linked
Data
Phase
1
• First
mine
the
data
• Records
held
in
Marc
• Identify
the
entities
• Person,
Organization,
CreativeWork,
etc.
• Match
strings
to
things
• People/Organization
names
–
viaf.org,
etc
• Subjects
–
Library
of
Congress
Phase
2
34. Introducing
Linked
Data
Phase
1
• First
mine
the
data
• Records
held
in
Marc
• Identify
the
entities
• Person,
Organization,
CreativeWork,
etc.
• Match
strings
to
things
• People/Organization
names
–
viaf.org,
etc
• Subjects
–
Library
of
Congress
Phase
2
• Model
what
is
of
interest
to
the
Web
35. Introducing
Linked
Data
Phase
1
• First
mine
the
data
• Records
held
in
Marc
• Identify
the
entities
• Person,
Organization,
CreativeWork,
etc.
• Match
strings
to
things
• People/Organization
names
–
viaf.org,
etc
• Subjects
–
Library
of
Congress
Phase
2
• Model
what
is
of
interest
to
the
Web
• All
our
data
is
important
to
us
36. Introducing
Linked
Data
Phase
1
• First
mine
the
data
• Records
held
in
Marc
• Identify
the
entities
• Person,
Organization,
CreativeWork,
etc.
• Match
strings
to
things
• People/Organization
names
–
viaf.org,
etc
• Subjects
–
Library
of
Congress
Phase
2
• Model
what
is
of
interest
to
the
Web
• All
our
data
is
important
to
us
• What
will
draw
people
to
our
resources?
37. Introducing
Linked
Data
Phase
1
• First
mine
the
data
• Records
held
in
Marc
• Identify
the
entities
• Person,
Organization,
CreativeWork,
etc.
• Match
strings
to
things
• People/Organization
names
–
viaf.org,
etc
• Subjects
–
Library
of
Congress
Phase
2
• Model
what
is
of
interest
to
the
Web
• All
our
data
is
important
to
us
• What
will
draw
people
to
our
resources?
• Share
the
way
the
web
does
38. Introducing
Linked
Data
Phase
1
• First
mine
the
data
• Records
held
in
Marc
• Identify
the
entities
• Person,
Organization,
CreativeWork,
etc.
• Match
strings
to
things
• People/Organization
names
–
viaf.org,
etc
• Subjects
–
Library
of
Congress
Phase
2
• Model
what
is
of
interest
to
the
Web
• All
our
data
is
important
to
us
• What
will
draw
people
to
our
resources?
• Share
the
way
the
web
does
• Linked
Data
39. Introducing
Linked
Data
Phase
1
• First
mine
the
data
• Records
held
in
Marc
• Identify
the
entities
• Person,
Organization,
CreativeWork,
etc.
• Match
strings
to
things
• People/Organization
names
–
viaf.org,
etc
• Subjects
–
Library
of
Congress
Phase
2
• Model
what
is
of
interest
to
the
Web
• All
our
data
is
important
to
us
• What
will
draw
people
to
our
resources?
• Share
the
way
the
web
does
• Linked
Data
• Schema.org
40. Introducing
Linked
Data
Phase
1
• First
mine
the
data
• Records
held
in
Marc
• Identify
the
entities
• Person,
Organization,
CreativeWork,
etc.
• Match
strings
to
things
• People/Organization
names
–
viaf.org,
etc
• Subjects
–
Library
of
Congress
Phase
2
• Model
what
is
of
interest
to
the
Web
• All
our
data
is
important
to
us
• What
will
draw
people
to
our
resources?
• Share
the
way
the
web
does
• Linked
Data
• Schema.org
Phase
3
41. Introducing
Linked
Data
Phase
1
• First
mine
the
data
• Records
held
in
Marc
• Identify
the
entities
• Person,
Organization,
CreativeWork,
etc.
• Match
strings
to
things
• People/Organization
names
–
viaf.org,
etc
• Subjects
–
Library
of
Congress
Phase
2
• Model
what
is
of
interest
to
the
Web
• All
our
data
is
important
to
us
• What
will
draw
people
to
our
resources?
• Share
the
way
the
web
does
• Linked
Data
• Schema.org
Phase
3 -‐
Try
it
out!
50. What
to
do
about
Gaps
in
Schema.org
coverage
Lobby
them
for
updates
/
extensions
51. What
to
do
about
Gaps
in
Schema.org
coverage
Lobby
them
for
updates
/
extensions
52. What
to
do
about
Gaps
in
Schema.org
coverage
Lobby
them
for
updates
/
extensions
53. What
to
do
about
Gaps
in
Schema.org
coverage
Lobby
them
for
updates
/
extensions
54. What
to
do
about
Gaps
in
Schema.org
coverage
Lobby
them
for
updates
/
extensions
http://lists.w3.org/Archives/Public/public-‐vocabs
http://www.w3.org/wiki/WebSchemas
55. What
to
do
about
Gaps
in
Schema.org
coverage
Lobby
them
for
updates
/
extensions
http://lists.w3.org/Archives/Public/public-‐vocabs
http://www.w3.org/wiki/WebSchemas
…
or
form
a
group
to
do
it
56. What
to
do
about
Gaps
in
Schema.org
coverage
…
or
form
a
group
to
do
it
64. I’ll
create
my
own
vocabulary
But doesn’t that loose all the
benefits of Schema?
65. I’ll
create
my
own
vocabulary
But doesn’t that loose all the
benefits of Schema?
Not
if
it
is
an
extension
vocabulary
??
Just
add
your
terms
with
Schema
at
the
core
66. I’ll
create
my
own
vocabulary
But doesn’t that loose all the
benefits of Schema?
Not
if
it
is
an
extension
vocabulary
Just
add
your
terms
with
Schema
at
the
core Like frosting
on a cake?
74. Extension
Vocabulary
• Adds
your
domain
specifics
• Mostly
Schema.org
• Only
need
to
fill
in
the
gaps
75. Extension
Vocabulary
• Adds
your
domain
specifics
• Mostly
Schema.org
• Only
need
to
fill
in
the
gaps
• Search
engines
will
understand
most
76. Extension
Vocabulary
• Adds
your
domain
specifics
• Mostly
Schema.org
• Only
need
to
fill
in
the
gaps
• Search
engines
will
understand
most
• Familiar
77. Extension
Vocabulary
• Adds
your
domain
specifics
• Mostly
Schema.org
• Only
need
to
fill
in
the
gaps
• Search
engines
will
understand
most
• Familiar
• Eases
adoption
78. Extension
Vocabulary
• Adds
your
domain
specifics
• Mostly
Schema.org
• Only
need
to
fill
in
the
gaps
• Search
engines
will
understand
most
• Familiar
• Eases
adoption
• Minimal
namespaces
79. Extension
Vocabulary
• Adds
your
domain
specifics
• Mostly
Schema.org
• Only
need
to
fill
in
the
gaps
• Search
engines
will
understand
most
• Familiar
• Eases
adoption
• Minimal
namespaces
• Eases
adoption
80. Extension
Vocabulary
• Adds
your
domain
specifics
• Mostly
Schema.org
• Only
need
to
fill
in
the
gaps
• Search
engines
will
understand
most
• Familiar
• Eases
adoption
• Minimal
namespaces
• Eases
adoption
• Repeatable
pattern
90. @prefix
schema:
<http://schema.org/>
@prefix
bgn:
<http://bibliograph.net/>
!!!!!
• Not
as
good
as
a
single
namespace
• But
next
best
thing
and
understandable
by:
91. @prefix
schema:
<http://schema.org/>
@prefix
bgn:
<http://bibliograph.net/>
!!!!!
• Not
as
good
as
a
single
namespace
• But
next
best
thing
and
understandable
by:
• my
domain
92. @prefix
schema:
<http://schema.org/>
@prefix
bgn:
<http://bibliograph.net/>
!!!!!
• Not
as
good
as
a
single
namespace
• But
next
best
thing
and
understandable
by:
• my
domain
• the
rest
of
the
world
-‐
mostly
93. @prefix
schema:
<http://schema.org/>
@prefix
bgn:
<http://bibliograph.net/>
!!!!!
• Not
as
good
as
a
single
namespace
• But
next
best
thing
and
understandable
by:
• my
domain
• the
rest
of
the
world
-‐
mostly
96. An
extension
to
Schema.org…
• to
fill
in
some
[temporary
?]
domain
specific
gaps
97. An
extension
to
Schema.org…
• to
fill
in
some
[temporary
?]
domain
specific
gaps
• light
weight
access
to
rich
data
98. An
extension
to
Schema.org…
• to
fill
in
some
[temporary
?]
domain
specific
gaps
• light
weight
access
to
rich
data
• domain
specific
extensions
in
single
namespace
99. An
extension
to
Schema.org…
• to
fill
in
some
[temporary
?]
domain
specific
gaps
• light
weight
access
to
rich
data
• domain
specific
extensions
in
single
namespace
• currently
used
by
VIAF
and
WorldCat
linked
data
100. An
extension
to
Schema.org…
• to
fill
in
some
[temporary
?]
domain
specific
gaps
• light
weight
access
to
rich
data
• domain
specific
extensions
in
single
namespace
• currently
used
by
VIAF
and
WorldCat
linked
data
An
extension
to
Schema.org…
101. An
extension
to
Schema.org…
• to
fill
in
some
[temporary
?]
domain
specific
gaps
• light
weight
access
to
rich
data
• domain
specific
extensions
in
single
namespace
• currently
used
by
VIAF
and
WorldCat
linked
data
An
extension
to
Schema.org…
• not
a
standalone
vocabulary
–
needs
Schema.org
102. An
extension
to
Schema.org…
• to
fill
in
some
[temporary
?]
domain
specific
gaps
• light
weight
access
to
rich
data
• domain
specific
extensions
in
single
namespace
• currently
used
by
VIAF
and
WorldCat
linked
data
An
extension
to
Schema.org…
• not
a
standalone
vocabulary
–
needs
Schema.org
• not
a
replacement
for
rich
domain
specific
vocab(s)
103. An
extension
to
Schema.org…
• to
fill
in
some
[temporary
?]
domain
specific
gaps
• light
weight
access
to
rich
data
• domain
specific
extensions
in
single
namespace
• currently
used
by
VIAF
and
WorldCat
linked
data
An
extension
to
Schema.org…
• not
a
standalone
vocabulary
–
needs
Schema.org
• not
a
replacement
for
rich
domain
specific
vocab(s)
• complementary
[rest
of
the
world
friendly]
104. An
extension
to
Schema.org…
• to
fill
in
some
[temporary
?]
domain
specific
gaps
• light
weight
access
to
rich
data
• domain
specific
extensions
in
single
namespace
• currently
used
by
VIAF
and
WorldCat
linked
data
An
extension
to
Schema.org…
• not
a
standalone
vocabulary
–
needs
Schema.org
• not
a
replacement
for
rich
domain
specific
vocab(s)
• complementary
[rest
of
the
world
friendly]
An
extension
to
Schema.org…
105. An
extension
to
Schema.org…
• to
fill
in
some
[temporary
?]
domain
specific
gaps
• light
weight
access
to
rich
data
• domain
specific
extensions
in
single
namespace
• currently
used
by
VIAF
and
WorldCat
linked
data
An
extension
to
Schema.org…
• not
a
standalone
vocabulary
–
needs
Schema.org
• not
a
replacement
for
rich
domain
specific
vocab(s)
• complementary
[rest
of
the
world
friendly]
An
extension
to
Schema.org…
• an
example
of
how
others
might
do
it.
107. OCLC
Entity
Based
Data
Strategy
✓ VIAF,
ISNI,
FAST
Publish
Linked
Data
108. OCLC
Entity
Based
Data
Strategy
✓ VIAF,
ISNI,
FAST
Publish
Linked
Data
✓ WorldCat.org
Linked
Data
Release
2012
–
using
Schema.org
109. OCLC
Entity
Based
Data
Strategy
✓ VIAF,
ISNI,
FAST
Publish
Linked
Data
✓ WorldCat.org
Linked
Data
Release
–
using
Schema.org
✓ Internal
agreement
on
data
strategy
2012
2013
110. OCLC
Entity
Based
Data
Strategy
✓ VIAF,
ISNI,
FAST
Publish
Linked
Data
✓ WorldCat.org
Linked
Data
Release
–
using
Schema.org
✓ Internal
agreement
on
data
strategy
✓ Evangelism
2012
2013
111. OCLC
Entity
Based
Data
Strategy
✓ VIAF,
ISNI,
FAST
Publish
Linked
Data
✓ WorldCat.org
Linked
Data
Release
–
using
Schema.org
✓ Internal
agreement
on
data
strategy
✓ Evangelism
✓ Research
&
Design
with
Data
Architecture
Group
2012
2013
112. OCLC
Entity
Based
Data
Strategy
✓ VIAF,
ISNI,
FAST
Publish
Linked
Data
✓ WorldCat.org
Linked
Data
Release
–
using
Schema.org
✓ Internal
agreement
on
data
strategy
✓ Evangelism
✓ Research
&
Design
with
Data
Architecture
Group
✓ Data
mining
of
WorldCat
resources
2012
2013
113. OCLC
Entity
Based
Data
Strategy
✓ VIAF,
ISNI,
FAST
Publish
Linked
Data
✓ WorldCat.org
Linked
Data
Release
–
using
Schema.org
✓ Internal
agreement
on
data
strategy
✓ Evangelism
✓ Research
&
Design
with
Data
Architecture
Group
✓ Data
mining
of
WorldCat
resources
✓ WorldCat
Works
Released
2012
2013
2014
114. OCLC
Entity
Based
Data
Strategy
✓ VIAF,
ISNI,
FAST
Publish
Linked
Data
✓ WorldCat.org
Linked
Data
Release
–
using
Schema.org
✓ Internal
agreement
on
data
strategy
✓ Evangelism
✓ Research
&
Design
with
Data
Architecture
Group
✓ Data
mining
of
WorldCat
resources
✓ WorldCat
Works
Released
2012
2013
2014
➢Application
Integration
➢WorldCat
Discovery
➢Analytics
➢Discovery
API
➢Cataloging
2015
115. OCLC
Entity
Based
Data
Strategy
✓ VIAF,
ISNI,
FAST
Publish
Linked
Data
✓ WorldCat.org
Linked
Data
Release
–
using
Schema.org
✓ Internal
agreement
on
data
strategy
✓ Evangelism
✓ Research
&
Design
with
Data
Architecture
Group
✓ Data
mining
of
WorldCat
resources
✓ WorldCat
Works
Released
2012
2014
➢Application
Integration
➢WorldCat
Discovery
➢Analytics
➢Discovery
API
➢Cataloging
2015
➢More
Entities
Released
➢Person
➢Organization
➢Event
➢Concept
2013
116. OCLC
Entity
Based
Data
Strategy
✓ VIAF,
ISNI,
FAST
Publish
Linked
Data
✓ WorldCat.org
Linked
Data
Release
–
using
Schema.org
✓ Internal
agreement
on
data
strategy
✓ Evangelism
✓ Research
&
Design
with
Data
Architecture
Group
✓ Data
mining
of
WorldCat
resources
✓ WorldCat
Works
Released
2012
2014
➢Application
Integration
➢WorldCat
Discovery
➢Analytics
➢Discovery
API
➢Cataloging
2015
➢More
Entities
Released
➢Person
➢Organization
➢Event
➢Concept
➢New
Products
➢New
Services
2013
2016
117. OCLC
Entity
Based
Data
Strategy
✓ VIAF,
ISNI,
FAST
Publish
Linked
Data
✓ WorldCat.org
Linked
Data
Release
–
using
Schema.org
✓ Internal
agreement
on
data
strategy
✓ Evangelism
✓ Research
&
Design
with
Data
Architecture
Group
✓ Data
mining
of
WorldCat
resources
✓ WorldCat
Works
Released
2012
2014
➢Application
Integration
➢WorldCat
Discovery
➢Analytics
➢Discovery
API
➢Cataloging
2015
➢More
Entities
Released
➢Person
➢Organization
➢Event
➢Concept
➢New
Products
➢Continuing
Evangelism
➢New
Services
➢Continuing
Innovation
2013
2016
118. OCLC
Entity
Based
Data
Strategy
✓ VIAF,
ISNI,
FAST
Publish
Linked
Data
✓ WorldCat.org
Linked
Data
Release
–
using
Schema.org
✓ Internal
agreement
on
data
strategy
✓ Evangelism
✓ Research
&
Design
with
Data
Architecture
Group
✓ Data
mining
of
WorldCat
resources
✓ WorldCat
Works
Released
2012
2014
➢Application
Integration
➢WorldCat
Discovery
➢Analytics
➢Discovery
API
➢Cataloging
2015
➢More
Entities
Released
➢Person
➢Organization
➢Event
➢Concept
➢New
Products
➢Continuing
Evangelism
➢New
Services
➢Continuing
Innovation
2013
2016
119. OCLC
Entity
Based
Data
Strategy
✓ VIAF,
ISNI,
FAST
Publish
Linked
Data
✓ WorldCat.org
Linked
Data
Release
–
using
Schema.org
✓ Internal
agreement
on
data
strategy
✓ Evangelism
✓ Research
&
Design
with
Data
Architecture
Group
✓ Data
mining
of
WorldCat
resources
✓ WorldCat
Works
Released
2012
2014
➢Application
Integration
➢WorldCat
Discovery
➢Analytics
➢Discovery
API
➢Cataloging
2015
➢More
Entities
Released
➢Person
➢Organization
➢Event
➢Concept
➢New
Products
➢Continuing
Evangelism
➢New
Services
➢Continuing
Innovation
2013
2016
120. OCLC
Entity
Based
Data
Strategy
✓ VIAF,
ISNI,
FAST
Publish
Linked
Data
✓ WorldCat.org
Linked
Data
Release
–
using
Schema.org
✓ Internal
agreement
on
data
strategy
✓ Evangelism
✓ Research
&
Design
with
Data
Architecture
Group
✓ Data
mining
of
WorldCat
resources
✓ WorldCat
Works
Released
2012
2014
➢Application
Integration
➢WorldCat
Discovery
➢Analytics
➢Discovery
API
➢Cataloging
2015
➢More
Entities
Released
➢Person
➢Organization
➢Event
➢Concept
➢New
Products
➢Continuing
Evangelism
➢New
Services
➢Continuing
Innovation
2013
2016
121. OCLC
Entity
Based
Data
Strategy
✓ VIAF,
ISNI,
FAST
Publish
Linked
Data
✓ WorldCat.org
Linked
Data
Release
–
using
Schema.org
✓ Internal
agreement
on
data
strategy
✓ Evangelism
✓ Research
&
Design
with
Data
Architecture
Group
✓ Data
mining
of
WorldCat
resources
✓ WorldCat
Works
Released
2012
2014
➢Application
Integration
➢WorldCat
Discovery
➢Analytics
➢Discovery
API
➢Cataloging
2015
➢More
Entities
Released
➢Person
➢Organization
➢Event
➢Concept
➢New
Products
➢Continuing
Evangelism
➢New
Services
➢Continuing
Innovation
2013
2016
122. OCLC
Entity
Based
Data
Strategy
✓ VIAF,
ISNI,
FAST
Publish
Linked
Data
✓ WorldCat.org
Linked
Data
Release
–
using
Schema.org
✓ Internal
agreement
on
data
strategy
✓ Evangelism
✓ Research
&
Design
with
Data
Architecture
Group
✓ Data
mining
of
WorldCat
resources
✓ WorldCat
Works
Released
2012
2014
➢Application
Integration
➢WorldCat
Discovery
➢Analytics
➢Discovery
API
➢Cataloging
2015
➢More
Entities
Released
➢Person
➢Organization
➢Event
➢Concept
➢New
Products
➢Continuing
Evangelism
➢New
Services
➢Continuing
Innovation
2013
2016
123. OCLC
Entity
Based
Data
Strategy
✓ VIAF,
ISNI,
FAST
Publish
Linked
Data
✓ WorldCat.org
Linked
Data
Release
–
using
Schema.org
✓ Internal
agreement
on
data
strategy
✓ Evangelism
✓ Research
&
Design
with
Data
Architecture
Group
✓ Data
mining
of
WorldCat
resources
✓ WorldCat
Works
Released
2012
2014
➢Application
Integration
➢WorldCat
Discovery
➢Analytics
➢Discovery
API
➢Cataloging
2015
➢More
Entities
Released
➢Person
➢Organization
➢Event
➢Concept
➢New
Products
➢Continuing
Evangelism
➢New
Services
➢Continuing
Innovation
2013
2016
124. OCLC
Entity
Based
Data
Strategy
✓ VIAF,
ISNI,
FAST
Publish
Linked
Data
✓ WorldCat.org
Linked
Data
Release
–
using
Schema.org
✓ Internal
agreement
on
data
strategy
✓ Evangelism
✓ Research
&
Design
with
Data
Architecture
Group
✓ Data
mining
of
WorldCat
resources
✓ WorldCat
Works
Released
2012
2014
➢Application
Integration
➢WorldCat
Discovery
➢Analytics
➢Discovery
API
➢Cataloging
2015
➢More
Entities
Released
➢Person
➢Organization
➢Event
➢Concept
➢New
Products
➢Continuing
Evangelism
➢New
Services
➢Continuing
Innovation
2013
2016
125. Structured
Data
Objectives
obvious
Conclusions
• Linked
Data
• RDF
–
RDFa,
RDF/XML,
JSON-‐LD,
Turtle,
nTriples
• Canonical
URIs
• Schema.org
+
BiblioGraph.net
• Core
widely
adopted
&
understood
–
15%
of
web
sites
fairly
y
126. Structured
Data
Objectives
obvious
Conclusions
• Linked
Data
• RDF
–
RDFa,
RDF/XML,
JSON-‐LD,
Turtle,
nTriples
• Canonical
URIs
• Schema.org
+
BiblioGraph.net
• Core
widely
adopted
&
understood
–
15%
of
web
sites
fairly
y
Your
• Widely
distributed
&
understood
• Web
standard
data
access
patterns
• Common
vocabularies
on
the
web
• Visibility
in
search
engines