Using a Linked Data approach for publication & consumption of data on the Web is significantly reducing the costs and complexity of reaching many more consumers of your content. This presentation highlights how Best Buy, BBC, US EPA and Sentara Healthcare are leveraging a Linked Data approach. Session delivered at Enterprise Data World 2012 in Atlanta GA, USA on 2-May-2012.
"LLMs for Python Engineers: Advanced Data Analysis and Semantic Kernel",Oleks...
Linked Data Warehouses: A new breed of Business Intelligence
1. Linked Data Warehouses:
A New Generation of BI
ENTERPRISE DATA WORLD 2012
ATLANTA 2-MAY-2012
By: Bernadette Hyland,
Chair, W3C Government Linked Data WG
CEO, 3 Round Stones, Inc
Email. bhyland@3roundstones.com
Twitter: @BernHyland
This presentation: http://slideshare.net/3roundstones
Wednesday, May 2, 12
2. • Linked Data is
about publishing
and consuming
data using
international data
standards
• Based on 20 year
old idea
• A system of linked
information systems
Wednesday, May 2, 12
5. A HISTORY OF SILOS
$ cat foo.txt
| grep blah |
sort
1970s 1980s 1990s
A neat little package Client-Server The Early Web
Wednesday, May 2, 12
6. There is a better way to
connect data silos ...
•No one vendor owns it
•It scales ... to Web-scale
•Doesn t require a super model
•Based on International Data Exchange
Standards (RDF, SPARQL)
Wednesday, May 2, 12
7. ACCEPTABLE ROI FOR IT
4% 17%
13%
16%
49% 6 months
12 months
18 months
24 months
More than 24 months
Wednesday, May 2, 12
15. Linked Data in Context
Universal Client Ubiquitous,
reusable applications
URL Curation
Universal Connection Logic and interlinking
Web
of Data
Universal Database
Wednesday, May 2, 12
21. Why is RDF important?
• It is an international standard for publishing
data on the Web (public and private)
• Data exchange model
• Serializations include RDF/XML, N-triples,
N3, Turtle, ...
• It is the future of using the Web
Wednesday, May 2, 12
22. Today s data warehouses
• Data warehouse costs are high
• Failure rates are high
• Requires a lot of cooperation ...
• Vocabulary alignment & data harmonization
• Data formats not inter-operable
• Cooperation requires coordination
• 18 months or longer ...
Wednesday, May 2, 12
23. Alternatives include ...
• Use Data Exchange Standards to host
structured content
• Create Linked data warehouses
• Faster & less expensive
• Web architecture, Web-scale
Wednesday, May 2, 12
25. store name
hours
address
phone
geo
ratings
services
events
Wednesday, May 2, 12
26. Why?
58% of Americans research online before they buy.
Wednesday, May 2, 12
27. “We really didn’t go into it with any expectations. We
just wanted to see if it was something we might want
to do. That’s why we were caught by surprise by
the results… we weren’t really expecting any.”
-- Jay Myers, Lead Development Engineer, Best Buy
Wednesday, May 2, 12
28. The impact:
30% increase in organic search results
15% increase in click-through rate (CTR)
Wednesday, May 2, 12
31. BBC WANTED
The BBC publishes
large amounts of
content online, as
text, audio and video.
As the amount of
content grows, we
need to make it easy
for users to locate
items of interest
Wednesday, May 2, 12
32. "The RDF representations of these web identifiers allow
developers to use our data to build applications."
-- Yves Raimond, BBC
Wednesday, May 2, 12
45. CDC Open
Government
Linked
Data
EPA Data Cloud DBpedia
US
Census Pub
Med
Clinical
Ontology NLM
Business
Ontology
Social
Media Internal
Portal
Data
Facebook Physicians
TwiCer Services
EMR Loca*ons
Data
Clinical
Condi*on
Specific
Wednesday, May 2, 12
46. Value Proposition
• Decrease costly emergency department visits
• Reduce hospital re-admissions after treatment
• Improved self-care and medication compliance
• Education of triggers and disease management
Wednesday, May 2, 12
47. Func*onal
Model
1.
Define
target
popula*on
and
clinical
data
from
electronic
medical
record
2.
Iden*fy
sources
of
open
government
data
related
to
environmental,
weather,
and
other
variables
related
to
chronic
pulmonary
disease
exacerba*ons
3.
Combine
open
content
from
NLM,
PubMed,
Medline
to
support
educa*on
4.
Leverage
a
Linked
Data
approach,
using
Open
Source
and
interna*onal
data
exchange
standards
(RDF)
5.
Alert
pa*ent
of
possible
hazardous
condi*ons
and
recommend
appropriate
ac*ons
Wednesday, May 2, 12
48. Leverage
Linked
Data,
Open
Source
&
Standards
Web
of
Data SMS
CDC DBpedia
EPA Pub
Med
US
Census NLM Email
CA-‐email-‐message.jpg
Web
EMR
Wednesday, May 2, 12
50. Shows:
1) Air Quality data from US EPA
2) Anonymized EMR data
3) Doctor’s details from CSV file
Uses Callimachus,
a Linked Data Management
Platform
Wednesday, May 2, 12
51. Tools & best practices?
• Large and small vendors are involved in Linked Data
• From Oracle, IBM to 3 Round Stones
• Listing of active projects, companies and research See
http://dir.w3.org/
• Best practices, see http://www.w3.org/2011/gld/charter
Wednesday, May 2, 12
52. • Callimachus is a framework for data-driven applications
based on Linked Data principles
• Callimachus allows Web developers to easily create data
driven applications for the Web
• Callimachus Enterprise
• http://3roundstones.com
Wednesday, May 2, 12
53. “Linked Data means
Cooperation without coordination”
-- David Wood, PhD
Wednesday, May 2, 12
54. Where the Web has been,
the enterprise is
going ...
Wednesday, May 2, 12
55. • Additional information
available on the Web, in
books ...
• Open Source Linked Data
Management System
http://callimachusproject.org
Bernadette Hyland
Contact me at
@BernHyland
bhyland@3roundstones.com
Wednesday, May 2, 12
56. If you’d like to learn more ...
http://semtechbizsf2012.semanticweb.com
Wednesday, May 2, 12