Welcome to
this webinar!
Andreas Blumauer
CEO of Semantic Web Company
About Semantic Web Company (SWC)
SWC was founded in 2001 and is headquartered in Vienna
25 experts in Linked Data technologies
PoolParty Suite based on RDF Graph Data Model
Serving customers from all over the world
EU- & US-based consulting services
Our Ecosystem: Customers & Partners
Some of our Customers
● Credit Suisse
● Boehringer Ingelheim
● Roche
● adidas
● The Pokémon Company
● Canadian Broadcasting Corporation (CBC)
● Red Bull Media House
● Wolters Kluwer
● TC Media
● Techtarget
● BMJ Publishing Group
● CafePress
● Pearson - Always Learning
● Education Services Australia
● American Physical Society
● Healthdirect Australia
● World Bank Group
● Inter-American Development Bank (IADB)
● Renewable Energy Partnership
● Wood MacKenzie
● Development Initiatives
● International Atomic Energy Agency (IAEA)
Finance / Automotive / Publisher / Health Care / Public Administration / Energy / Education
Selected Partners
● PwC
● EPAM Systems
● iQuest
● EBCONT
● Gravity Zero
● MarkLogic
● OpenLink Software
● Ontotext
● Wolters Kluwer
● Data to Value
● Digirati
● Term Management
● Altotech
We are all working on the
replacement of data chaos
by networking information
● Norwegian Directorate of Immigration
● Ministry of Finance (A)
● Council of the E.U.
● Australian National Data Service
PoolParty Core Modules
Bain Capital is a venture capital
company based in Boston, MA.
Since inception it has invested in
hundreds of companies including
AMC Entertainment, Brookstone,
and Burger King. The company was
co-founded by Mitt Romney.
Taxonomy &
Ontology Server
Entity Extraction &
Text Mining
Semantic Search,
Analytics & Visualization
Why Graph
Databases?
The Enterprise Perspective:
The End of the Document
“Life is no longer as
simple as making
PDF documents.”
John Walker
Business Analyst at NXP Semiconductors
The Enterprise Perspective:
Graph Databases are Smart Data Lakes
“Data in a large corporation is
often scattered over various tools,
comes in different formats and with
different levels of quality.”
Fabian Heinemann
Data Scientist at Roche
The NPO Perspective:
Using common Definitions and Standards
“Very few datasets
tell a story
in isolation.”
The Data Manifesto
Development Initiatives
The warehouse approach
seems to be broken in a complex world
Data Warehouse
- structures and categories predefine the kind of analysis that is possible
- excludes data to simplify the data model
- does not efficiently handle new types of data
- supports efficient indexing
- enforces consistency
Data Lake
- includes all data that may be used and even data that may never be used
- all data regardless of source and structure is kept
- data kept in its raw form and only transformed when used
- handles structured and unstructured data
- data models emerge with usage over time
The Analyst’s Perspective: Data Lakes don’t fix the problem of lacking semantics
“Organizations should focus on
semantic consistency and
performance in upstream applications
and data stores instead of information
consolidation in a data lake.”
Gartner
Beware of the Data Lake Fallacy
Data Lakes have all the information to
answer complex queries, but….
Country GDP Pop
AUS 1,560 23.14
SVE 580 9.60
WITH A COMBINED NUMBER of
357,100 registered asylum claims
in 2013, Germany, the United
States of America, France, Sweden
and Turkey were the top five
receiving countries, together
accounting for nearly six out of ten
asylum claims submitted in the 44
industrialized countries covered by
this report.
Place Asylum seekers Year
Australia 24,300 2013
Sweden 54,300 2013
Show me all reports, in which EU member countries
are mentioned with regards to their asylum politics,
which have more than 10 asylum-seekers per 1,000
inhabitants.
...taxonomies link constantly changing data
sources while analytic needs are evolving
Countries
European
Union
Sweden
SVE
France
FRA
Austria
AUT
Oceania
Country GDP Pop
AUS 1,560 23.14
SVE 580 9.60
Place Asylum seekers Year
Australia 24,300 2013
Sweden 54,300 2013
WITH A COMBINED NUMBER of
357,100 registered asylum claims
in 2013, Germany, the United
States of America, France, Sweden
and Turkey were the top five
receiving countries, together
accounting for nearly six out of ten
asylum claims submitted in the 44
industrialized countries covered by
this report.
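The idea on this slide can be sketched in a few lines of Python: a small taxonomy supplies the links (country name, country code, broader concept) that neither table carries on its own. This is only an illustration under stated assumptions, not how PoolParty implements it. The figures are taken from the two tables above; that "Pop" is given in millions, that Australia sits under "Oceania", and the names `TAXONOMY`, `asylum_rate_per_1000` and `eu_members_above` are assumptions made for the example.

```python
# Taxonomy concepts: preferred label -> code used in the statistics table
# and broader concept. Sweden under "European Union" follows the slide;
# Australia under "Oceania" is an assumption.
TAXONOMY = {
    "Sweden": {"code": "SVE", "broader": "European Union"},
    "Australia": {"code": "AUS", "broader": "Oceania"},
}

# First table from the slide, keyed by country code.
# Population is assumed to be in millions.
POPULATION_MILLIONS = {"AUS": 23.14, "SVE": 9.60}

# Second table from the slide, keyed by place name (year 2013).
ASYLUM_SEEKERS_2013 = {"Australia": 24_300, "Sweden": 54_300}


def asylum_rate_per_1000(place: str) -> float:
    """Join both tables through the taxonomy and return seekers per 1,000 inhabitants."""
    code = TAXONOMY[place]["code"]
    inhabitants = POPULATION_MILLIONS[code] * 1_000_000
    return ASYLUM_SEEKERS_2013[place] / inhabitants * 1000


def eu_members_above(threshold: float) -> list:
    """EU member countries whose rate exceeds the given threshold."""
    return [
        place
        for place, concept in TAXONOMY.items()
        if concept["broader"] == "European Union"
        and asylum_rate_per_1000(place) > threshold
    ]
```

Calling `eu_members_above(10.0)` answers the numeric part of the question on the previous slide. The remaining part, finding the reports that mention those countries, would use the same taxonomy labels to match the text.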
Linked Data Warehouses are Smart Data Lakes
Data Lake
Data Warehouse
- supports efficient indexing
- enforces consistency
- handles structured & unstructured data
- data models emerge with usage over time
- standards-based
- unified data model
- powerful query language
What if questions
emerge when one
starts analyzing the
data?
The power of knowledge graphs:
Agility, flexibility, complexity
doc doc doc
Norway France Austria Canada
doc
Norway France Austria Canada
doc
Show me all
documents about
European countries
Traditional approach Graph-based approach
doc doc doc
The power of knowledge graphs:
Agility, flexibility, complexity
doc doc doc
Europe,
Norway
Europe,
France
Europe,
Austria
America,
Canada
doc
Norway France Austria Canada
doc
Show me all
documents about
European countries
Europe
Traditional approach Graph-based approach
doc doc doc
The power of knowledge graphs:
Agility, flexibility, complexity
doc doc doc
Europe,
Norway
Europe,
France
Europe,
Austria
America,
Canada
doc
Norway France Austria Canada
doc
Show me all
documents about
European countries
Europe
Traditional approach Graph-based approach
Show me all
documents about EU
member countries
doc doc doc
Norway France Austria Canada
The power of knowledge graphs:
Agility, flexibility, complexity
doc doc doc
Europe,
Norway
E.U,
Europe,
France
E.U,
Europe,
Austria
America,
Canada
doc doc doc doc doc
Show me all
documents about
European countries
Europe
Traditional approach Graph-based approach
Show me all
documents about EU
member countries
E.U
Norway France Austria Canada
The power of knowledge graphs:
Agility, flexibility, complexity
doc doc doc
Europe,
Norway
French,
EU,
Europe,
France
EU,
Europe,
Austria
French,
America,
Canada
doc doc doc doc doc
Show me all
documents about
European countries
Europe
Traditional approach Graph-based approach
Show me all
documents about EU
member countries
French-
speaking?
French-
speaking
EU
Norway France Austria Canada
The power of knowledge graphs:
Agility, flexibility, complexity
doc doc doc
Europe,
Norway
French,
EU,
Europe,
France
EU,
Europe,
Austria
French,
America,
Canada
doc doc doc doc doc
Show me all
documents about
European countries
Europe
Traditional approach Graph-based approach
Show me all
documents from EU
member countries
French-
speaking?
French-
speaking
EU
Metadata per
document
1. No or little network effects
2. No reuse of metadata
3. Metadata resides in silos
4. Data quality hard to measure
5. Not machine-readable
Knowledge about
metadata
1. Explicit knowledge models
2. Reusable and measurable
3. Metadata is machine-processable
4. Standards-based metadata
5. Linkable metadata opens silos
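The contrast between the two approaches can be sketched in Python. In the graph-based variant each document is tagged with a country only, and everything known about the country (continent, EU membership, language) is stored once in the graph, so a new question needs no re-tagging of documents. The country facts follow the slides; the document identifiers, the one-document-per-country mapping, and the function name `docs_about` are assumptions made for the example.

```python
# Traditional approach: every document carries its own copy of the metadata.
# Answering a new kind of question means re-tagging every document.
DOCS_TRADITIONAL = {
    "doc1": {"Norway", "Europe"},
    "doc2": {"France", "Europe", "EU", "French-speaking"},
    "doc3": {"Austria", "Europe", "EU"},
    "doc4": {"Canada", "America", "French-speaking"},
}

# Graph-based approach: documents are tagged with a country only ...
DOCS_GRAPH = {"doc1": "Norway", "doc2": "France", "doc3": "Austria", "doc4": "Canada"}

# ... and knowledge about the countries lives once, in the knowledge graph.
KNOWLEDGE_GRAPH = {
    "Norway": {"Europe"},
    "France": {"Europe", "EU", "French-speaking"},
    "Austria": {"Europe", "EU"},
    "Canada": {"America", "French-speaking"},
}


def docs_about(*concepts):
    """Documents whose country is linked to all of the given concepts."""
    wanted = set(concepts)
    return sorted(
        doc
        for doc, country in DOCS_GRAPH.items()
        if wanted <= KNOWLEDGE_GRAPH[country]
    )


print(docs_about("Europe"))                 # European countries
print(docs_about("EU"))                     # EU member countries
print(docs_about("EU", "French-speaking"))  # French-speaking EU members
```

Adding a further criterion later only requires a new link in `KNOWLEDGE_GRAPH`; the document tags in `DOCS_GRAPH` stay untouched.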
Better Together:
Unstructured and
Structured Data.
Towards a Linked Data based search
Bringing structure to text:
PoolParty GraphSearch
PoolParty GraphSearch =
Semantic Search + Analytics
Complex Queries based on
SPARQL and Linked Data
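# Prefixes added so the query is self-contained.
# The dbpedia: namespace is an assumption; adjust it to the data set in use.
PREFIX skos: <http://www.w3.org/2004/02/skos/core#>
PREFIX foaf: <http://xmlns.com/foaf/0.1/>
PREFIX dbpedia: <http://dbpedia.org/ontology/>
SELECT DISTINCT ?personname ?countryname ?hdi ?picture
WHERE
{
  ?person skos:prefLabel ?personname .
  ?country skos:prefLabel ?countryname .
  ?person a dbpedia:Person .
  ?country a dbpedia:Country .
  # Link between person and country comes from the thesaurus
  ?person skos:related ?country .
  ?country <http://dbpedia.org/property/hdi> ?hdi .
  # Keep only countries with an HDI value below 0.6
  FILTER ( ?hdi < 0.6 )
  # A picture is returned when available, but is not required
  OPTIONAL
  {
    ?person foaf:depiction ?picture .
  }
} ORDER BY DESC(?hdi)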
I want to explore medical
research trends in relation
to regional prosperity.
Organizing data in graphs using links
Graph nervous_system_diseases-abstracts
Graph en.dbpedia.org
Graph www.nlm.nih.gov/mesh
Graph www.geonames.org
PoolParty Semantic Integrator
System Architecture
Classified documents +
Linked taxonomies +
Knowledge graphs
● Dynamic filter criteria
● BI-like interface
● Large scale RDF store
● Fully RDF compatible
● All queries via SPARQL
UnifiedViews as part of
PoolParty Semantic Integrator
UnifiedViews differs
from other ETL
frameworks by natively
supporting RDF data and
ontologies.
UnifiedViews has a
graphical user interface
for the administration,
debugging, and
monitoring of the ETL
process.
Use Cases
Success story: Healthdirect Australia
Over 120 information partners and sources
Great variety of category and metadata systems
One central vocabulary hub:
Australian Health Thesaurus (AHT)
Single point of access incl. harmonized search facets:
http://www.healthdirect.gov.au/
Clean Energy Data - Country Profiles
sOnr webMining for Confluence
Complex queries with SPARQL
PREFIX mrv-schema: <http://gbpn.org/mrv-schema/>
PREFIX qb: <http://purl.org/linked-data/cube#>
SELECT DISTINCT *
WHERE {
GRAPH <http://gbpn.org/mrv> {
?observation mrv-schema:year ?year.
?observation mrv-schema:region ?region.
?observation mrv-schema:region <http://gbpn.org/mrv-thes/region/India>.
?observation mrv-schema:scenario ?scenario.
?observation mrv-schema:scenario <http://gbpn.org/mrv-thes/scenario/deep-efficiency>.
{
?observation mrv-schema:urbanizationType ?urbanizationType.
?observation mrv-schema:urbanizationType <http://gbpn.org/mrv-thes/urbanization-type/urban>.
?observation mrv-schema:buildingType ?buildingType.
?observation mrv-schema:buildingType <http://gbpn.org/mrv-thes/building-type/MF>.
?observation mrv-schema:publicBuildingType ?publicBuildingType.
?observation mrv-schema:publicBuildingType <http://gbpn.org/mrv-thes/public-building-type/NO>.
}
UNION
{
?observation mrv-schema:urbanizationType ?urbanizationType.
?observation mrv-schema:urbanizationType <http://gbpn.org/mrv-thes/urbanization-type/urban>.
?observation mrv-schema:buildingType ?buildingType.
?observation mrv-schema:buildingType <http://gbpn.org/mrv-thes/building-type/Slums>.
?observation mrv-schema:publicBuildingType ?publicBuildingType.
?observation mrv-schema:publicBuildingType <http://gbpn.org/mrv-thes/public-building-type/NO>.
}
UNION
{
…….
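The two UNION branches shown above differ only in the building type, which suggests that such queries are generated from the selected filter values rather than written by hand. The following Python sketch shows one way this could be done. It is an assumption about the approach, not the application's actual code; the helper names `facet`, `union_branch` and `union_block` are hypothetical, and only the two building types visible in the query are used.

```python
# Base URI of the thesaurus concepts, as it appears in the query above.
THESAURUS = "http://gbpn.org/mrv-thes/"


def facet(prop, scheme, value):
    """Two triple patterns: bind the variable, then pin it to one concept."""
    return [
        f"  ?observation mrv-schema:{prop} ?{prop}.",
        f"  ?observation mrv-schema:{prop} <{THESAURUS}{scheme}/{value}>.",
    ]


def union_branch(building_type):
    """One { ... } group for a single building type (urban, non-public)."""
    lines = ["{"]
    lines += facet("urbanizationType", "urbanization-type", "urban")
    lines += facet("buildingType", "building-type", building_type)
    lines += facet("publicBuildingType", "public-building-type", "NO")
    lines.append("}")
    return "\n".join(lines)


def union_block(building_types):
    """Join one branch per building type with UNION."""
    return "\nUNION\n".join(union_branch(bt) for bt in building_types)


print(union_block(["MF", "Slums"]))
```

The printed text reproduces the two branches above and can be placed inside the `GRAPH <http://gbpn.org/mrv> { ... }` block of the query.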
More PoolParty Applications & Demos
Thesaurus Publishing Business Intelligence Content Recommendation Semantic Expert Finder
Web Mining Semantic Search Linked Data Visualization Symptom Checker
PoolParty 5.1
Highly precise entity extraction
Domain-specific extraction, highly
performant, language-agnostic,
disambiguation rules, REST API
Providing context in the knowledge graph
Activating disambiguation
Semantic Records Management: Integration
with Confluence Blueprints
⇒ Solution for Semantic
Records Management
Fully integrated web crawler
Make use of text corpus analysis:
Retrieve documents from various
sources, like RSS or from websites
Web Crawler extracts candidate terms
from any website
Extended ontology management &
semantic reasoning
From SKOS taxonomies to full-blown ontologies:
PoolParty supports various levels of
knowledge modeling
Publishing custom schemes
Further extension of PoolParty API
● API method for skos:notes
● API method for skosxl:labels
● API methods for skos:collections
● API method to collect custom properties, attributes and types
● API method to R/W workflow status
● Retrieve history API method
● Retrieve SKOS subtree
Developer
Get started with PoolParty. Try it out now!
Get your PoolParty 5.1
Thesaurus Server &
Entity Extractor trial:
http://www.poolparty.biz/test-demo/
Contact points & further information
Andreas Blumauer, MSc IT
a.blumauer@semantic-web.at
https://www.linkedin.com/in/andreasblumauer
Semantic Web Company GmbH
Mariahilfer Strasse 70/8, A-1070 Vienna
+43-1-4021235
http://www.semantic-web.at
http://www.poolparty-software.com
Social Media Channels
http://slideshare.net/semwebcompany
http://youtube.com/semwebcompany
https://www.linkedin.com/groups?home=&gid=4059165

Boost PC performance: How more available memory can improve productivity
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 

Dive deep into your Data Pools

  • 1. Welcome to this webinar! Andreas Blumauer, CEO of Semantic Web Company
  • 2. About Semantic Web Company (SWC): founded in 2001, headquartered in Vienna; 25 experts in Linked Data technologies; PoolParty Suite based on the RDF graph data model; serving customers from all over the world; EU- and US-based consulting services
  • 3. Our Ecosystem: Customers & Partners
       We are all working on the replacement of data chaos by networking information.
       Some of our Customers (Finance / Automotive / Publisher / Health Care / Public Administration / Energy / Education):
       ● Credit Suisse ● Boehringer Ingelheim ● Roche ● adidas ● The Pokémon Company ● Canadian Broadcasting Corporation (CBC) ● Red Bull Media House ● Wolters Kluwer ● TC Media ● Techtarget ● BMJ Publishing Group ● CafePress ● Pearson - Always Learning ● Education Services Australia ● American Physical Society ● Healthdirect Australia ● World Bank Group ● Inter-American Development Bank (IADB) ● Renewable Energy Partnership ● Wood MacKenzie ● Development Initiatives ● International Atomic Energy Agency (IAEA) ● Norwegian Directorate of Immigration ● Ministry of Finance (A) ● Council of the E.U. ● Australian National Data Service
       Selected Partners:
       ● PwC ● EPAM Systems ● iQuest ● EBCONT ● Gravity Zero ● MarkLogic ● OpenLink Software ● Ontotext ● Wolters Kluwer ● Data to Value ● Digirati ● Term Management ● Altotech
  • 4. PoolParty Core Modules: Taxonomy & Ontology Server; Entity Extraction & Text Mining; Semantic Search, Analytics & Visualization
       Example text shown on the slide: “Bain Capital is a venture capital company based in Boston, MA. Since inception it has invested in hundreds of companies including AMC Entertainment, Brookstone, and Burger King. The company was co-founded by Mitt Romney.”
  • 6. The Enterprise Perspective: The End of the Document. “Life is no longer as simple as making PDF documents.” (John Walker, Business Analyst at NXP Semiconductors)
  • 7. The Enterprise Perspective: Graph Databases are Smart Data Lakes. “Data in a large corporation is often scattered over various tools, comes in different formats and with different levels of quality.” (Fabian Heinemann, Data Scientist at Roche)
  • 8. The NPO Perspective: Using common Definitions and Standards. “Very few datasets tell a story in isolation.” (The Data Manifesto, Development Initiatives)
  • 9. The warehouse approach seems to be broken in a complex world
       Data Warehouse:
       - structures and categories predefine the kind of analysis that is possible
       - excludes data to simplify the data model
       - does not efficiently handle new types of data
       - supports efficient indexing
       - enforces consistency
       Data Lake:
       - includes all data that may be used and even data that may never be used
       - all data regardless of source and structure is kept
       - data kept in its raw form and only transformed when used
       - handles structured and unstructured data
       - data models emerge with usage over time
  • 10. The Analyst’s Perspective: Data Lakes don’t fix the problem of lacking semantics. “Organizations should focus on semantic consistency and performance in upstream applications and data stores instead of information consolidation in a data lake.” (Gartner, “Beware of the Data Lake Fallacy”)
  • 11. Data Lakes have all the information to answer complex queries, but…
       Query: “Show me all reports in which EU member countries are mentioned with regard to their asylum policies and which have more than 10 asylum seekers per 1,000 inhabitants.”
       Table:
         Country  GDP    Pop
         AUS      1,560  23.14
         SVE      580    9.60
       Table:
         Place      Asylum seekers  Year
         Australia  24,300          2013
         Sweden     54,300          2013
       Report excerpt: “With a combined number of 357,100 registered asylum claims in 2013, Germany, the United States of America, France, Sweden and Turkey were the top five receiving countries, together accounting for nearly six out of ten asylum claims submitted in the 44 industrialized countries covered by this report.”
  • 12. ...taxonomies link constantly changing data sources while analytic needs are evolving
       Taxonomy:
         Countries
           European Union
             Sweden (SVE)
             France (FRA)
             Austria (AUT)
           Oceania
       Table:
         Country  GDP    Pop
         AUS      1,560  23.14
         SVE      580    9.60
       Table:
         Place      Asylum seekers  Year
         Australia  24,300          2013
         Sweden     54,300          2013
       Report excerpt: “With a combined number of 357,100 registered asylum claims in 2013, Germany, the United States of America, France, Sweden and Turkey were the top five receiving countries, together accounting for nearly six out of ten asylum claims submitted in the 44 industrialized countries covered by this report.”
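
The linking idea on slides 11 and 12 can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not how PoolParty implements it: the dictionary layout and function names are hypothetical, the "Pop" column is assumed to be in millions, and placing Australia (AUS) under Oceania is assumed, since the slide shows the Oceania branch without its members.

```python
# Taxonomy: preferred label -> alternative code and parent concept.
# The Australia entry is an assumption; the slide only shows "Oceania".
TAXONOMY = {
    "Sweden": {"code": "SVE", "broader": "European Union"},
    "France": {"code": "FRA", "broader": "European Union"},
    "Austria": {"code": "AUT", "broader": "European Union"},
    "Australia": {"code": "AUS", "broader": "Oceania"},
}

# First table on the slide, keyed by country code (Pop assumed to be in millions).
ECONOMY = {
    "AUS": {"gdp": 1560, "pop_millions": 23.14},
    "SVE": {"gdp": 580, "pop_millions": 9.60},
}

# Second table on the slide, keyed by place name (year 2013).
ASYLUM_SEEKERS = {"Australia": 24300, "Sweden": 54300}


def asylum_rate_per_1000(label):
    """Join both tables through the taxonomy and return seekers per 1,000 inhabitants."""
    code = TAXONOMY[label]["code"]
    population = ECONOMY[code]["pop_millions"] * 1_000_000
    return ASYLUM_SEEKERS[label] / population * 1000


def eu_members_above(threshold):
    """Return EU member countries whose rate exceeds the threshold."""
    result = []
    for label, concept in TAXONOMY.items():
        if concept["broader"] != "European Union":
            continue
        # Skip countries that are missing from either table.
        if label not in ASYLUM_SEEKERS or concept["code"] not in ECONOMY:
            continue
        if asylum_rate_per_1000(label) > threshold:
            result.append(label)
    return result


sweden_rate = asylum_rate_per_1000("Sweden")  # roughly 5.66 per 1,000
```

Neither table alone can answer the question: one uses codes such as "SVE", the other uses names such as "Sweden", and only the taxonomy knows which countries belong to the European Union. With the sample figures, and under the population assumption above, no EU member in the tables exceeds 10 per 1,000.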
  • 13. Linked Data Warehouses are Smart Data Lakes
       Data Warehouse:
       - supports efficient indexing
       - enforces consistency
       Data Lake:
       - handles structured & unstructured data
       - data models emerge with usage over time
       Linked Data Warehouse:
       - standards-based
       - unified data model
       - powerful query language
  • 14. What if questions emerge when one starts analyzing the data?
  • 15. The power of knowledge graphs: Agility, flexibility, complexity
       Query: “Show me all documents about European countries”
       Traditional approach (tags per document): Norway | France | Austria | Canada
       Graph-based approach (concepts linked to documents): Norway, France, Austria, Canada
  • 16. The power of knowledge graphs: Agility, flexibility, complexity
       Query: “Show me all documents about European countries”
       Traditional approach (tags per document): Europe, Norway | Europe, France | Europe, Austria | America, Canada
       Graph-based approach (concepts linked to documents): Norway, France, Austria, Canada; added concept: Europe
  • 17. The power of knowledge graphs: Agility, flexibility, complexity
       Queries: “Show me all documents about European countries”; “Show me all documents about EU member countries”
       Traditional approach (tags per document): Europe, Norway | Europe, France | Europe, Austria | America, Canada
       Graph-based approach (concepts linked to documents): Norway, France, Austria, Canada; added concept: Europe
  • 18. The power of knowledge graphs: Agility, flexibility, complexity
       Queries: “Show me all documents about European countries”; “Show me all documents about EU member countries”
       Traditional approach (tags per document): Europe, Norway | E.U, Europe, France | E.U, Europe, Austria | America, Canada
       Graph-based approach (concepts linked to documents): Norway, France, Austria, Canada; added concepts: Europe, E.U
  • 19. The power of knowledge graphs: Agility, flexibility, complexity
       Queries: “Show me all documents about European countries”; “Show me all documents about EU member countries”; “French-speaking?”
       Traditional approach (tags per document): Europe, Norway | French, EU, Europe, France | EU, Europe, Austria | French, America, Canada
       Graph-based approach (concepts linked to documents): Norway, France, Austria, Canada; added concepts: Europe, EU, French-speaking
  • 20. The power of knowledge graphs: Agility, flexibility, complexity
       Queries: “Show me all documents about European countries”; “Show me all documents from EU member countries”; “French-speaking?”
       Traditional approach (tags per document): Europe, Norway | French, EU, Europe, France | EU, Europe, Austria | French, America, Canada
       Graph-based approach (concepts linked to documents): Norway, France, Austria, Canada; added concepts: Europe, EU, French-speaking
       Metadata per document:
       1. No or little network effects
       2. No reuse of metadata
       3. Metadata resides in silos
       4. Data quality hard to measure
       5. Not machine-readable
       Knowledge about metadata:
       1. Explicit knowledge models
       2. Reusable and measurable
       3. Metadata is machine-processable
       4. Standards-based metadata
       5. Linkable metadata opens silos
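
The contrast built up on slides 15 to 20 can be illustrated with a small Python sketch. It is a simplified model under stated assumptions, not PoolParty code: the document ids and the `relate` and `documents_about` helpers are hypothetical, and the country groupings are the ones shown on the slides. Each document carries a single country tag, while knowledge about the countries lives in a separate graph.

```python
from collections import defaultdict

# Each document is tagged with one country only (document ids are hypothetical).
DOCS = {"doc1": "Norway", "doc2": "France", "doc3": "Austria", "doc4": "Canada"}

# The knowledge graph stores what is known about each country, apart from the documents.
graph = defaultdict(set)


def relate(country, group):
    """Record in the graph that a country belongs to a group."""
    graph[country].add(group)


def documents_about(group):
    """Find documents whose country tag is related to the group in the graph."""
    return sorted(doc for doc, country in DOCS.items() if group in graph[country])


# First question: documents about European countries.
for country in ("Norway", "France", "Austria"):
    relate(country, "Europe")
relate("Canada", "America")
europe_docs = documents_about("Europe")

# A new question arrives later: only the graph changes, no document is re-tagged.
for country in ("France", "Austria"):
    relate(country, "EU")
eu_docs = documents_about("EU")

# Another new question: French-speaking countries.
for country in ("France", "Canada"):
    relate(country, "French-speaking")
french_docs = documents_about("French-speaking")
```

In the traditional approach every new question would require editing the tags on each affected document. Here each new question adds a few relations to the graph and leaves `DOCS` untouched.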
  • 22. Towards a Linked Data based search
  • 23. Bringing structure to text: PoolParty GraphSearch
  • 24. PoolParty GraphSearch = Semantic Search + Analytics
  • 25. Complex Queries based on SPARQL and Linked Data
       “I want to explore medical research trends in relation to regional prosperity.”
       SELECT DISTINCT ?personname ?picture ?countryname ?hdi
       WHERE {
         ?person skos:prefLabel ?personname .
         ?country skos:prefLabel ?countryname .
         ?person a dbpedia:Person .
         ?country a dbpedia:Country .
         ?person skos:related ?country .
         ?country <http://dbpedia.org/property/hdi> ?hdi .
         FILTER ( ?hdi < 0.6 )
         OPTIONAL { ?person foaf:depiction ?picture . }
       }
       ORDER BY DESC(?hdi)
  • 26. Organizing data in graphs using links: Graph nervous_system_diseases-abstracts; Graph en.dbpedia.org; Graph www.nlm.nih.gov/mesh; Graph www.geonames.org
  • 27. PoolParty Semantic Integrator System Architecture: Classified documents + Linked taxonomies + Knowledge graphs ● Dynamic filter criteria ● BI-like interface ● Large scale RDF store ● Fully RDF compatible ● All queries via SPARQL
  • 28. UnifiedViews as part of PoolParty Semantic Integrator. UnifiedViews differs from other ETL frameworks by natively supporting RDF data and ontologies. UnifiedViews has a graphical user interface for the administration, debugging, and monitoring of the ETL process.
  • 30. Success story: Healthdirect Australia. Over 120 information partners and sources; great variety of category and metadata systems; one central vocabulary hub: Australian Health Thesaurus (AHT); single point of access incl. harmonized search facets: http://www.healthdirect.gov.au/
  • 31. Clean Energy Data - Country Profiles
  • 32. sOnr webMining for Confluence
  • 33. Complex queries with SPARQL
       PREFIX mrv-schema: <http://gbpn.org/mrv-schema/>
       PREFIX qb: <http://purl.org/linked-data/cube#>
       SELECT DISTINCT * WHERE {
         GRAPH <http://gbpn.org/mrv> {
           ?observation mrv-schema:year ?year.
           ?observation mrv-schema:region ?region.
           ?observation mrv-schema:region <http://gbpn.org/mrv-thes/region/India>.
           ?observation mrv-schema:scenario ?scenario.
           ?observation mrv-schema:scenario <http://gbpn.org/mrv-thes/scenario/deep-efficiency>.
           {
             ?observation mrv-schema:urbanizationType ?urbanizationType.
             ?observation mrv-schema:urbanizationType <http://gbpn.org/mrv-thes/urbanization-type/urban>.
             ?observation mrv-schema:buildingType ?buildingType.
             ?observation mrv-schema:buildingType <http://gbpn.org/mrv-thes/building-type/MF>.
             ?observation mrv-schema:publicBuildingType ?publicBuildingType.
             ?observation mrv-schema:publicBuildingType <http://gbpn.org/mrv-thes/public-building-type/NO>.
           } UNION {
             ?observation mrv-schema:urbanizationType ?urbanizationType.
             ?observation mrv-schema:urbanizationType <http://gbpn.org/mrv-thes/urbanization-type/urban>.
             ?observation mrv-schema:buildingType ?buildingType.
             ?observation mrv-schema:buildingType <http://gbpn.org/mrv-thes/building-type/Slums>.
             ?observation mrv-schema:publicBuildingType ?publicBuildingType.
             ?observation mrv-schema:publicBuildingType <http://gbpn.org/mrv-thes/public-building-type/NO>.
           } UNION {
             …….
  • 34. More PoolParty Applications & Demos: Thesaurus Publishing, Business Intelligence, Content Recommendation, Semantic Expert Finder, Web Mining, Semantic Search, Linked Data Visualization, Symptom Checker
  • 36. Highly precise entity extraction: domain-specific extraction, highly performant, language-agnostic, disambiguation rules, REST API
  • 37. Providing context in the knowledge graph
  • 39. Semantic Records Management: Integration with Confluence Blueprints ⇒ Solution for Semantic Records Management
  • 40. Fully integrated web crawler. Make use of text corpus analysis: retrieve documents from various sources, like RSS or from websites
  • 41. Web Crawler extracts candidate terms from any website
  • 42. Extended ontology management & semantic reasoning. From SKOS taxonomies to full-blown ontologies: PoolParty supports various levels of knowledge modeling
  • 44. Further extension of PoolParty API (for developers) ● API method for skos:notes ● API method for skosxl:labels ● API methods for skos:collections ● API method to collect custom properties, attributes and types ● API method to read/write workflow status ● Retrieve history API method ● Retrieve SKOS subtree
  • 45. Get started with PoolParty. Try it out now! Get your PoolParty 5.1 Thesaurus Server & Entity Extractor trial: http://www.poolparty.biz/test-demo/
  • 46. Contact points & further information
       Andreas Blumauer, MSc IT
       a.blumauer@semantic-web.at
       https://www.linkedin.com/in/andreasblumauer
       Semantic Web Company GmbH, Mariahilfer Strasse 70/8, A-1070 Vienna, +43-1-4021235
       http://www.semantic-web.at
       http://www.poolparty-software.com
       Social Media Channels: http://slideshare.net/semwebcompany, http://youtube.com/semwebcompany, https://www.linkedin.com/groups?home=&gid=4059165