The document provides information about the Digital Enterprise Research Institute (DERI) in Galway, Ireland. It discusses DERI's research areas including semantic web, social networks, and data mining. It also outlines DERI's funding sources and partners. The document then shifts to discussing linked open data, including its key components like RDF and vocabularies. Finally, it provides examples of linked open data projects by DERI and others.
2. DERI, NUI Galway
Digital Enterprise Research Institute www.deri.ie
Centre for Science, Engineering and Technology (CSET) established in
2003 with funding from the Science Foundation Ireland (SFI)
~130 researchers
Research Areas: Semantic Web, Web Science, Social Networks, Data
Mining, Information Systems
Application Areas: eGovernment, Bioinformatics, Security, eBusiness and
financial services, eHealth, and Green & Sustainable IT.
Enabling Networked Knowledge
3. DERI, NUI Galway
National funding from SFI, EI, IRCSET and industrial collaborations
EC Funding: FP6, FP7, etc.
DERI technology driving 100,000s of Websites (e.g. in Drupal)
DERI technology installed on countless desktops (e.g. in Linux)
~100 industry and public partners
Avaya, Alcatel-Lucent, Celtrak, Cisco, Ericsson, FBK, OpenLink, Storm Technology, etc.
> 1,000 peer-reviewed papers
Actively participate in 17 standardisation activities (W3C, OASIS)
4. Outline
What is Linked Open Data?
Open Data?
Linked Data Standards & Tools
Linked Open Data in Practice
5. Public Open Data
Difficult to find
Difficult to reuse
Difficult to integrate
6. What is Linked Open Data?
7. What is Linked Open Data?
Figure: IRELAND as a graph node with the relations hasRugbyTeam, hasCapital, hasGovernment, hasMusicGroup, hasUniversity, hasUnemployment
8. What is Linked Open Data?
Facilitating data integration through:
Common data model
Building relations
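A minimal sketch of how a common data model and shared identifiers make integration cheap. It assumes two hypothetical datasets about Ireland; every example.org URI, property name and value is an illustrative placeholder, not a real identifier.

```python
# Two independent datasets expressed in one common model:
# (subject, predicate, object) triples.
# All example.org URIs below are hypothetical placeholders.
IRELAND = "http://example.org/country/Ireland"

geography = {
    (IRELAND, "http://example.org/vocab/hasCapital",
     "http://example.org/city/Dublin"),
}
education = {
    (IRELAND, "http://example.org/vocab/hasUniversity",
     "http://example.org/org/NUI_Galway"),
}


def integrate(*datasets):
    """Merge datasets by set union; shared URIs join the facts automatically."""
    merged = set()
    for dataset in datasets:
        merged |= dataset
    return merged


def describe(graph, subject):
    """Return every (predicate, object) pair known about one subject."""
    return sorted((p, o) for s, p, o in graph if s == subject)


graph = integrate(geography, education)
for predicate, obj in describe(graph, IRELAND):
    print(predicate, "->", obj)
```

Because both datasets use the same URI for Ireland, no mapping step is needed before the facts can be read together.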
9. Two Key Ingredients
1. RDF – Resource Description Framework
(Graph based Data)
Identifies objects (URIs)
Interlinks information (Relationships)
2. Vocabularies (Ontologies)
Provide shared understanding of a domain
Organise knowledge in a machine-comprehensible way
Give an exploitable meaning to the data
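The two ingredients can be sketched together: RDF statements as triples, with the predicates taken from a shared vocabulary. The example below uses the FOAF vocabulary and the RDF type property, and serialises the result as N-Triples. The person URI is the one shown later in these slides; the name literal is an illustrative assumption, and the literal escaping is deliberately simplified.

```python
# Shared vocabulary terms (real namespaces).
RDF_TYPE = "http://www.w3.org/1999/02/22-rdf-syntax-ns#type"
FOAF = "http://xmlns.com/foaf/0.1/"


def uri(value):
    """Wrap a URI in angle brackets, as N-Triples requires."""
    return f"<{value}>"


def literal(value):
    """Quote a plain string literal (simplified: only \\ and \" are escaped)."""
    escaped = value.replace("\\", "\\\\").replace('"', '\\"')
    return f'"{escaped}"'


def to_ntriples(triples):
    """Serialise (subject, predicate, object) terms, one statement per line."""
    return "\n".join(f"{s} {p} {o} ." for s, p, o in triples)


person = "http://www.deri.ie/about/team/member/Deirdre_Lee"
triples = [
    (uri(person), uri(RDF_TYPE), uri(FOAF + "Person")),
    # The name value is an assumption made for illustration.
    (uri(person), uri(FOAF + "name"), literal("Deirdre Lee")),
]
print(to_ntriples(triples))
```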
11. TimBL's 5-Star Open Data
12. ★ On the Web, Open License
On the Web
Wide access
Google can index it
People can find it themselves
Open License
Regulates reuse of data
Helps maintain provenance
Strengthens business reuse
– http://opendefinition.org/licenses/
13. ★ ★ Structured Data
Machine-readable
14. Screenscraping
People use tools like ScraperWiki to get at data that isn't machine-readable
https://scraperwiki.com/tags/ireland
Scraping is problematic because:
It is expensive
It is brittle
It puts a strain on computing resources
15. Formats
Good:
MS Excel, CSV, XML, JSON, Microdata
Not so good:
Pure websites, MS Word
Bad:
PDF
Really bad:
Only charts/maps without numbers, images
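To illustrate why the "good" formats are good, the sketch below loads the same records from CSV and from JSON using only the Python standard library. The county figures are made-up sample values, not real statistics.

```python
import csv
import io
import json

# Made-up sample values, for illustration only.
CSV_TEXT = "county,population\nGalway,1000\nMayo,500\n"
JSON_TEXT = (
    '[{"county": "Galway", "population": 1000},'
    ' {"county": "Mayo", "population": 500}]'
)


def load_csv(text):
    """Parse CSV rows into dictionaries, converting the numeric column."""
    rows = csv.DictReader(io.StringIO(text))
    return [
        {"county": row["county"], "population": int(row["population"])}
        for row in rows
    ]


def load_json(text):
    """Parse a JSON array of records."""
    return json.loads(text)


# Both structured formats yield the same records with a few lines of code.
print(load_csv(CSV_TEXT) == load_json(JSON_TEXT))
```

No comparable few-line loader exists for a PDF or a chart image, which is the practical reason those formats rank lower.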
16. ★ ★ ★ Non-Proprietary Formats
Freedom to choose how to process, analyse and visualise data
Proprietary:
Word, Excel, PDF
Non-proprietary:
CSV, XML, JSON, Microdata, RDF
17. ★ ★ ★ ★ Use URIs
Unique identifiers enable others to point to the data.
<http://www.deri.ie/about/team/member/Deirdre_Lee>
<http://www.deri.ie/publications#uid_339>
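A small sketch of one way to mint stable URIs for records. The base namespace and the kind/label path pattern are hypothetical choices made for this example, not DERI's actual URI scheme.

```python
from urllib.parse import quote

BASE = "http://example.org/id/"  # hypothetical namespace


def mint_uri(kind, label):
    """Build a URI from a record type and a human-readable label.

    Spaces become underscores; any other unsafe character is percent-encoded.
    """
    slug = quote(label.strip().replace(" ", "_"), safe="")
    return f"{BASE}{kind}/{slug}"


print(mint_uri("person", "Deirdre Lee"))
```

Deriving the URI deterministically from the record means the same thing always gets the same identifier, which is what lets others point to it.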
18. ★ ★ ★ ★ ★ Linking Data
Link your data to other data to provide context
http://lod-cloud.net
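A link between datasets is itself just one more statement. The sketch below emits an owl:sameAs link in N-Triples form. The local URI is hypothetical, and the DBpedia URI is assumed to identify the same city.

```python
OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def link(local_uri, external_uri):
    """Return an N-Triples statement declaring two URIs to be the same thing."""
    return f"<{local_uri}> <{OWL_SAME_AS}> <{external_uri}> ."


print(link(
    "http://example.org/id/city/Galway",   # hypothetical local identifier
    "http://dbpedia.org/resource/Galway",  # assumed to denote the same city
))
```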
19. Outline
What is Linked Data
Linked Data Standards & Tools
Linked Open Data in Practice
20. Linked Data Standards
Government Linked Data (GLD) WG www.w3.org/2011/gld/
21. Linked Open Metadata
Data Catalog Vocabulary (DCAT)
http://www.w3.org/TR/vocab-dca
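As a rough sketch of what DCAT metadata looks like, the function below builds a Turtle description of one dataset and one downloadable distribution. The dcat: and dct: terms are real vocabulary terms; the dataset URI, title, download URL and the "/csv" suffix for the distribution are hypothetical. The title is assumed to contain no double quotes.

```python
PREFIXES = (
    "@prefix dcat: <http://www.w3.org/ns/dcat#> .\n"
    "@prefix dct: <http://purl.org/dc/terms/> .\n"
)


def dcat_dataset(dataset_uri, title, download_url):
    """Describe a dataset and its single distribution in Turtle."""
    # Naming the distribution "<dataset>/csv" is an arbitrary choice here.
    distribution_uri = dataset_uri + "/csv"
    return (
        f"{PREFIXES}\n"
        f"<{dataset_uri}> a dcat:Dataset ;\n"
        f'    dct:title "{title}" ;\n'
        f"    dcat:distribution <{distribution_uri}> .\n\n"
        f"<{distribution_uri}> a dcat:Distribution ;\n"
        f"    dcat:downloadURL <{download_url}> .\n"
    )


print(dcat_dataset(
    "http://example.org/dataset/1",      # hypothetical
    "Example dataset",                   # hypothetical
    "http://example.org/files/1.csv",    # hypothetical
))
```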
22. Integrating Linked Metadata Repositories
Shukair, G., et al., Integrating Linked Metadata Repositories in the Web of Data, in Third International Workshop on Consuming Linked Data (COLD 2012) at ISWC 2012: Boston, US.
23. Domain-Specific Vocabularies
JOINUP European Commission ISA Semantic Assets
Core Person Vocabulary
Core Location Vocabulary
Core Business Vocabulary
Core Public Service Vocabulary
– http://joinup.ec.europa.eu/
Data Cube Vocabulary
http://www.w3.org/2011/gld/wiki/Data_Cube_Vocabulary
Vocab Lists
http://vocab.deri.ie/
http://vocab.data.gov/
25. Outline
What is Linked Data
Linked Data Standards & Tools
Linked Open Data in Practice
26. Lets Do It Galway 2012
27. Galway Open Data Portal
29. County Rank
http://county-rank.data-gov.ie/
30. Fingal Fact Finder
http://vmsgov03.deri.ie:8080/data-cube-searcher/about.html
31. World Bank Linked Data
World Bank Indicators http://worldbank.270a.info
World Bank Finances
World Bank Projects and Operations
World Bank Climate Change
Sarven Capadisli
32. Europeana LOD Pilot
http://data.europeana.eu http://srvgal85.deri.ie/ab-app/
Fully open metadata
2.4 M objects
200 individual providers
15 countries
33. Linked Sensor Middleware (LSM)
Live data
http://lsm.deri.ie
34. Over 110,000 Live Data Sources
…and growing!!!
35. Figure: Railway Station, Traffic camera and Flight information update sources alongside RDF data
Flight information update
CallSign: EIN432. Latitude: 47.17525. Longitude: 8.61251. Altitude: 34000.0 (feet). Speed: 392 (kts). Departure: ARN. Destination: LHR
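The flight information update above is a useful case for showing how a raw sensor message could become RDF. The sketch below parses the message text and emits N-Triples with plain string literals. The vocabulary and identifier namespaces are hypothetical, and this is not a description of how LSM's own wrappers work.

```python
MESSAGE = (
    "CallSign: EIN432. Latitude: 47.17525. Longitude: 8.61251. "
    "Altitude: 34000.0 (feet). Speed: 392 (kts). Departure: ARN. Destination: LHR"
)
VOCAB = "http://example.org/flight#"  # hypothetical vocabulary
BASE = "http://example.org/flight/"   # hypothetical identifier namespace


def parse_fields(message):
    """Split 'Key: value. Key: value' text into a dictionary.

    Splitting on '. ' (dot plus space) leaves decimal points intact.
    """
    fields = {}
    for part in message.split(". "):
        key, value = part.split(": ", 1)
        fields[key] = value
    return fields


def to_triples(fields):
    """Emit one N-Triples statement per field, keyed on the call sign."""
    subject = f"<{BASE}{fields['CallSign']}>"
    return [
        f'{subject} <{VOCAB}{key[0].lower() + key[1:]}> "{value}" .'
        for key, value in fields.items()
    ]


for triple in to_triples(parse_fields(MESSAGE)):
    print(triple)
```

A fuller version would type the numeric values and carry the units as separate statements instead of leaving them inside the literal.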
36. Super Stream Collider
LSM Sensors
SPARQL Endpoint
http://superstreamcollider.org
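A SPARQL endpoint is queried over HTTP by passing the query text in a `query` parameter, as the SPARQL Protocol defines. The sketch below only builds such a request URL and sends nothing. The endpoint address and the hasValue property are hypothetical, since the slides do not give the real endpoint URL or schema.

```python
from urllib.parse import urlencode

ENDPOINT = "http://example.org/sparql"  # hypothetical endpoint address

# Hypothetical query: list ten sensors with their latest values.
QUERY = """
SELECT ?sensor ?value
WHERE {
  ?sensor <http://example.org/vocab#hasValue> ?value .
}
LIMIT 10
"""


def build_request_url(endpoint, query):
    """Encode a SPARQL query as a GET request URL (SPARQL Protocol style)."""
    return endpoint + "?" + urlencode({"query": query.strip()})


print(build_request_url(ENDPOINT, QUERY))
```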
37. Linked Data in Systems Biology
~20 000 genes
High-throughput technologies
~100 interesting genes/proteins
Computational statistics
~10 interesting pathways
Browse databases, Literature
~5 proteins testable in the lab
Linked Data: Hypothesis Generation
“I like to call it low-input, high-throughput, no-output biology.”
38. Data.gov.uk Linked Open Data
39. Data.gov Linked Open Data
Clinical Quality Linked Data on Health.data.gov
EPA's Facility Registry and Substance Registry
40. Norwegian National Master Data as LOD
Norwegian master data:
Business (Legal Entities)
Property (inc. map data)
Citizen
The Central Coordinating Register for Legal Entities (RLE)
~1 million companies, 40 attributes
Norwegian Semantic Repository of Electronic Services (SERES)
Metadata repository
Register of Company Accounts
Myrseth, P., et al., National Master Data as 5 Star Linked Open Data, in Electronic
Government (eGov2012). 2012, Trauner-Verlag: Kristiansand, Norway.
41. Fire Department Amsterdam-Amstelland
Bart van Leeuwen – Fire fighter & netage.nl
http://blog.resc.info/
Problem:
Masses of data
Navigation system didn’t work
Operational risks due to communication failure
Need for:
Structured incident information
Used by >15 Fire Stations in the greater Amsterdam area
All Linked Data published on Web
42. New York Times
http://data.nytimes.com/
43. How the BBC makes Websites
Develop a domain model
Populate your data model
Design URIs
Build pages
Apply layout and decor CSS
Test and iterate
Mike Atherton, ‘Beyond the Polar Bear’
http://www.slideshare.net/reduxd/beyond-the-polar-bear
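The first four steps listed above (domain model, data, URIs, pages) can be sketched in a few lines. This is an illustration of the order of work only, not the BBC's actual code; the Programme class, the "/programmes/" path pattern and the sample record are all hypothetical.

```python
from dataclasses import dataclass
from html import escape


# 1. Develop a domain model (hypothetical: a single Programme type).
@dataclass(frozen=True)
class Programme:
    slug: str
    title: str


# 2. Populate the model (made-up sample record).
programmes = [Programme("example-show", "Example Show")]


# 3. Design URIs: one predictable URI per thing in the model.
def uri_for(programme):
    return f"/programmes/{programme.slug}"


# 4. Build pages from the model; layout and CSS come afterwards.
def render_page(programme):
    return (
        f'<article id="{escape(programme.slug)}">'
        f'<h1><a href="{escape(uri_for(programme))}">'
        f"{escape(programme.title)}</a></h1>"
        "</article>"
    )


for programme in programmes:
    print(uri_for(programme), render_page(programme))
```

Starting from the model rather than from page layouts means each thing has exactly one URI, which is what makes the resulting site linkable as data.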
45. Open Data Publishing Pipeline (ODPP)
Difficulty with Publishing Open Data:
Remains quite a manual process
Modular Data Management System for publishing standard Open Data, based on Open Source components.
http://publishing-pipeline.com/
46. Open Data Publishing Pipeline (ODPP)
47. European Data Forum 2013
April 9th/10th, Dublin
Editor's notes
Interoperability solutions for European public administrations
All of the datasets that contained statistics at the time of writing (about 60)
Linked Data workshop at DRI’s Realising the Opportunities of Digital Humanities
LSM (Linked Sensor Middleware): a platform that brings together live, real-world sensor data and the Semantic Web. An LSM deployment is available at http://lsm.deri.ie/ . It provides functionality such as: i) wrappers for real-time data collection and publishing; ii) a web interface for data annotation and visualisation; and iii) a SPARQL endpoint for querying unified Linked Stream Data and Linked Data.
SCC (Super Stream Collider): Developed on top of LSM, SCC is a platform that provides a web-based interface and tools for building sophisticated mashups, combining semantically annotated Linked Stream and Linked Data sources into easy-to-use resources for applications.
Messages:
Finding the mathematics of biology: patterns and interrelatedness of biological entities
Biological data in computational formats: automating data analysis and annotation is a dream that has not yet been achieved
Technologies that could help make such a dream reality: transform the WWW into a computational platform where read and write operations are supported and boundaries between knowledge systems are erased
For years, RLE has offered online access via a web interface and web services. The main groups of users were government bodies and legal entities. The most common usage patterns are verifying the existence of a legal entity and listing the CEO, board, etc. There are also increasing requests for interoperability.
Understanding: no acknowledgement of information shared
Interpretation: terms not always used in the right context; non-aligned vocabularies between disciplines
Massive uptake in industry: ~400 suppliers
Enterprise Software: HP, IBM, Microsoft, Oracle, SAP, and Software AG
Search: Bing & Google (Freebase, Refine, Squared & Rich snippets)
Social: LinkedIn & Facebook
eCommerce: Best Buy & Overstock
Publishing: Thomson Reuters
Standards: OMG, ISO, W3C and OASIS
Linked Open Data: interdisciplinary data set of 50B facts, exponential growth