Using DBpedia for Thesaurus Creation and Management as well as Linked Open Data (LOD) Integration with PoolParty Semantic Suite (http://www.poolparty.biz) at Semantic Web Company (SWC, http://www.semantic-web.at).
Unleash Your Potential - Namagunga Girls Coding Club
Using DBpedia for Thesaurus Management and Linked Open Data Integration
1. @semwebcompany
@lod2project
Using DBpedia at Semantic Web Company
Work on enterprise taxonomies and Linked Open Data (LOD) integration
Martin Kaltenböck
Semantic Web Company
www.semantic-web.at
www.poolparty.biz
These slides are published under :
http://creativecommons.org/licenses/by/3.0
2. About Semantic Web Company
•
•
•
•
•
•
•
Semantic Web Company founded 2000 in Vienna, Austria
20 experts in strategy, coding, consulting, research & PM
Main Product: PoolParty Semantic Suite (launched 2009)
Focus: Semantic Information Management & LOD Services
Serving global 500 companies, GOV & NGOs worldwide
EU- & US-based consulting services
Intense involvement in / support of LOD community
Partner Network:
3. Some Customers we server…
• Pearson
• Daimler
• Wolters Kluwer • Ministry of Finance (AUT)
• GBPN
• Credit Suisse• Council of EU
• Education Services (AUS)
• World Bank • Roche
• Wood Mackenzie • REEEP
10. Use Case 2
Linked (Open) Data Integration
• Linked Open Data (LOD) Integration: MashUps
• LOD Integration: Data Analysis
• DBpedia based Entity Linking Approach
12. Application example #1:
Energy Market Intelligence
Scenario #1:
I am an energy market researcher
at the International Energy Agency
(IEA).
I inform policy makers about the
situation in specific renewable
energy areas to develop targeted
energy support programs.
For my research I need indicators
about utilisation levels of all
alternative forms of energy with
regards to geographical and
political categories.
http://integrator.poolparty.biz/report_renewable/
12
13. How does it work…
• Articles about Renewable Energy
– 72,018 documents
– From ~300 web sources
• reegle Thesaurus: ~3,000 concepts
– Traverse hierarchies below main categories (wind, solar, etc.)
and classify documents
• Geonames
– Annotate documents with regards to their geographical entities
• DBpedia
– Lookup several Yago classes to all extracted geographical
entities to assert additional categories, e.g.: EU-countries,
French-speaking countries etc.
13
14. How does it work…
Semantic
Search
Geospatial
Search
Data
Visualisation
SPARQL
PoolParty Semantic Integrator
….
14
15. Application example #2:
Health Care
Scenario #2:
I am an information officer at
the Global Health Observatory of
the World Health Organisation.
I inform policy makers about the
global situation in specific
disease areas to direct support
to the required health support
programs.
For my research I need data
about disease prevalence in
relation
to
socio-economic
factors.
http://integrator.poolparty.biz/report_medicine/
16. How does it work…
• PubMed Articles
– Cardiovascular Diseases: 39,911 documents
– Neoplasms: 69,937 documents
– Nervous System Diseases: 48,128 documents
• MeSH: 26,700 concepts / 346,600 triples
– Traverse hierarchies below disease main categories and classify
documents
• Geonames
– Annotate documents with regards to their geographical entities
• DBpedia
– Lookup HDI (The Human Development Index (HDI) is a
composite statistic of life expectancy, education, and income
indices used to rank countries into four tiers of human
development)
16
17. Enity Linking using Dbpedia…
Solving the issue of disambiguation and thereby enable
better textmining & categorisation (semantic tagging).
• Interestingly, Democrats in Congress
and Clinton approved of similar interim funding
measures during the last government shutdown.
• Clinton was traded by the Boston Red Sox to the Los
Angeles Angels for Lee Thomas.
• Clinton is a Power Station located near Clinton, Illinois,
USA.
17
18. Further Reading & Announcements
The publication Linked Open Data: The Essentials gives
decision makers a good overview of Open Government, Open
Government Data, Open Data and Linked Open Data (LOD).
See: http://www.semantic-web.at/LOD-TheEssentials.pdf
European Data Forum 2014 (EDF2014)
• 19 – 20 March 2014, Greece Athens
• Call for Exhibition still open!
• See: http://2014.data-forum.eu
21.03.2014, Athens
SEMANTICS Conference
4. & 5. September 2014 in Leipzig, Germany
September 2015 in Vienna, Austria
www.semantics.cc coming soon !
19. Contact
Martin Kaltenböck
CFO, Semantic Web Company
+43 1 4021235
m.kaltenboeck@semantic-web.at
www.semantic-web.at
www.poolparty.biz
These slides are published under :
http://creativecommons.org/licenses/by/3.0