SlideShare une entreprise Scribd logo
1  sur  87
Linked Open Data
    NOKIOS 2012, 2012-10-30
    Lars Marius Garshol, larsga@bouvet.no, http://twitter.com/larsga

1
Agenda

    •   What and why?
    •   Examples of use
    •   How to publish
    •   Linked Data
    •   Problems
    •   Conclusion

2
What and why?



3
Linked Open Data
                                                foundation for all IT functionality
                                                no data – no service


                              published
                              license allowing reuse
                              machine-readable
    connected to other data
    self-documenting




4
5
US Government Works

    • US copyright law states that
        – "a work prepared by an officer or employee of the
          U.S. government as part of that person's official
          duties”
    • is
        – not entitled to domestic copyright protection




6   http://en.wikipedia.org/wiki/Copyright_status_of_work_by_the_U.S._government
7
8   http://www.digst.dk/Home/Servicemenu/English/Digitisation/~/media/Files/English/BasicDataUKweb20121008.ashx
9
http://www.regjeringen.no/nb/dep/fad/dok/rundskriv/2010/fellesforinger-
     tildelingsbrevene-for-201.html?id=624792




10
11
Why publish open data?

     • To empower other people to do things
       –   you can’t afford
       –   you don’t have the time to
       –   you don’t want to
       –   you couldn’t imagine
     • Again: no data, no service


12
Why is this so popular now?

     • Because the advent of
       – ubiquitous computing,
       – cheap hardware and disk space,
       – cheap and fast networks
     • has dramatically reduced the cost of
       distributing data
       – and building applications,
       – while dramatically increasing the value from such
         applications
     • Also, fashion


13
Why is access to data important?

     • It’s necessary for democracy to function
       properly
       – voters and activists cannot make the right
         decisions if they don’t have the necessary
         information
     • It’s important for economic growth
       – in a post-modern society, information is the life-
         blood of the economy
       – many companies make their living simply by
         selling repackaged information



14
Economic effects
     • Making information more easily available
       – levels the playing field
       – benefiting smaller companies
       – improves competition
     • Large companies have
       – advertising budgets,
       – money to extract data,
       – ...
     • Smaller entities don’t

16
Two kinds of availability (1)

     • Available to humans
       – this makes the raw data available to humans
       – only humans can then digest and process it, and
         ultimately pass it on to other humans




17
Two kinds of availability (2)

     • Available to machines
       – that is, make the data available in machine-
         processable form
       – people can then build many different kinds of
         services based on the data
       – allows many different kinds of access to the data




18
19
20
21
22
http://www.guardian.co.uk/
     news/datablog/2012/oct/09/
     devolution-national-
     statistics-data

23
24
Examples of use



25
http://www.dagblad
     et.no/kommunebor
     sen/




26
27   Research project by SINTEF and Computas
Must be at meeting at 1345. Three transport alternatives.




28   Research project by SINTEF and Computas
Data sources




     Research project by
     SINTEF and Computas

29
Bygninger.no




                    (Database is over free limit.)




30
German Federal Budget




31    http://bund.offenerhaushalt.de/
Social Security Ministry




32
Cleaning customer data

     • I did work deduplicating a customer
       database
     • We wanted to look at cleaning the data
       automatically
        – ie: picking which of the duplicate records to use
     • Decided at the time it couldn’t be done
       automatically
        – because company data not available online
     • Now it is available...


33
http://dbpedia.org/About

     • Not really an application
       – but a very important data set
     • Basically Wikipedia as Linked Open Data
       – Wikipedia fact boxes etc extracted as RDF
       – 400 million statements about 3.77 million things




34
35
How to publish



36
Ten Principles for Open Gov’t Data

     1. Completeness
     2. Primary
     3. Timeliness
     4. Ease of access
     5. Machine readable
     6. Non-discrimination
     7. Use open standards
     8. Licensing
     9. Permanence
     10. Usage cost

37
     http://sunlightfoundation.com/policy/documents/ten-open-data-principles/
5-star model

                            Available, open license

                            Machine-readable format

                            Non-proprietary format

                            URIs as identifiers

                            Linked to other data




38
              http://5stardata.info/
Data licenses
     • Necessary so users know what they are
       allowed/not allowed to do with the data
     • Open Data Commons has licenses you can
       reuse:
       – http://opendatacommons.org/licenses/
     • Such as
       – public domain (anything goes)
       – attribution license (must give credit)
       – attribution share-alike
     • Norwegian license
       – http://data.norge.no/nlod/en
       – Norwegian Licence for Open Government Data
39
Machine-readable
     Readable                        Not readable
     • CSV                           • Microsoft Word
     • XML                           • PDF
     • Microsoft Excel               • HTML
     • RDF                           • Flash
     • JSON
           >>> import csv
           >>> r = csv.reader(open('countries-mondial.csv'))
           >>> r.next()
           ['id', 'country', 'capital', 'area']
           >>> r.next()
           ['4202', 'Malta', 'Valletta', '320']
           >>> int(r.next()[3]) * 247.105
           3330975.4000000004 # area of next country in acres
40
Non-proprietary

     • Microsoft Excel is a proprietary format
       – it’s owned and controlled by Microsoft
       – it’s also very complicated to read
     • An open alternative is CSV
     • Open, standardized alternatives are
       – XML, RDF, JSON




41
Ways to publish

     • Download
       – this is the easiest
       – just put up static data files for download
       – vastly better than nothing
     • API
       – build an API people can use for interacting with the
         data
       – fashionable, but not really necessary
     • Stream
       – publish streams of changes, for easy syncing
       – using SDshare, for example
       – very useful, but can be costly

42
How hard is it to publish data?

     • Depends on how you do it
     • Dumping CSV from a relational database is
       trivial
       – can literally be as simple as just a few lines of code
     • The more ambitious you are, the more
       work it becomes
       –   filtering out sensitive records
       –   using better formats than CSV
       –   linking to other data
       –   documenting
       –   adding streaming
       –   ...
43
URIs as identifiers

• URIs are globally unique
  – thanks to their use of domain names
  – anyone with a domain name can make URIs
• Benefits
  – identifiers that can be reused anywhere, and still
    remain unique
  – can be resolved to an explanation of what they
    identify
Linking to other data (1)

     • Data becomes more valuable when it is
       connected to other data
       – because this reduces the cost of reusing and
         processing the data
     • Imagine reusing the data below
       – how do you connect with country data from other
         sources?

          ID          COUNTRY     CAPITAL        AREA
          4202        Malta       Valletta       320
          19654       Moldova     Chisinau       25333
          8715        Kazakstan   Almaty         2717300
                                             Data from Mondial
45
Linking to other data (2)

      • By using common URIs for your concepts
        you can make data reuse much easier

     ID                                       COUNTRY      CAPITAL    AREA
     http://dbpedia.org/resource/Malta        Malta        Valletta   316

     http://dbpedia.org/resource/Moldova      Moldova      Chişinău   33846
     http://dbpedia.org/resource/Kazakhstan   Kazakhstan   Almaty     2717300
                                                              Data from DBpedia




46
47   Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
Linked Data principles

     1. Use URIs as names for things

     1. Use HTTP URIs so that people can look up
        those names.

     1. When someone looks up a URI, provide
        useful information, using the standards
        (RDF*, SPARQL)

     1. Include links to other URIs so that they
        can discover more things.

48         http://www.w3.org/DesignIssues/LinkedData
An example
     <result>
       <heading>VEGMELDINGER - Viktigste</heading>
       <messages>
        <message>
         <heading>Oslo (Ulven) - Karihaugen, ved [41] Furuset</heading>
         <messagenumber>64952</messagenumber>
         <version>1</version>
         <ingress>To felt stengt på grunn av vedlikeholdsarbeid i periodene: Onsdag og torsdag fra
     22:00 til 05:30 (neste dag). Fare for kø.</ingress>
         <messageType>Redusert framkommelighet</messageType>
         <urgency>X</urgency>
         <roadType>Ev</roadType>
         <roadNumber>6</roadNumber>
         <validFrom>2012-10-24 22:00:00.0 CEST</validFrom>
         <validTo>2012-10-26 05:30:00.999 CEST</validTo>                  Are these helpful ways to
         <actualCounties>                                                 refer to route E6 and the
          <string>Oslo</string>
         </actualCounties>                                                county of Oslo?
         <coordinates>
          <crs>EPSG:4326</crs>
          <startPoint>
           <xCoord>10.889964</xCoord>
           <yCoord>59.937266</yCoord>
          </startPoint>
         </coordinates>




49
An alternative
     <result>
       <heading>VEGMELDINGER - Viktigste</heading>
       <messages>
        <message>
         <heading>Oslo (Ulven) - Karihaugen, ved [41] Furuset</heading>
         <messagenumber>64952</messagenumber>
         <version>1</version>
         <ingress>To felt stengt på grunn av vedlikeholdsarbeid i periodene: Onsdag og torsdag fra
     22:00 til 05:30 (neste dag). Fare for kø.</ingress>
         <messageType>Redusert framkommelighet</messageType>
         <urgency>X</urgency>
         <road>http://dbpedia.org/resource/European_route_E06</road>
         <validFrom>2012-10-24 22:00:00.0 CEST</validFrom>
         <validTo>2012-10-26 05:30:00.999 CEST</validTo>
         <actualCounties>
          <county>http://dbpedia.org/resource/Oslo</county>
         </actualCounties>
         <coordinates>
          <crs>EPSG:4326</crs>
          <startPoint>                              Now we’re using URIs as names for these
           <xCoord>10.889964</xCoord>
           <yCoord>59.937266</yCoord>               concepts. And, what’s more, the names
          </startPoint>
         </coordinates>                             resolve to more information.




50
HTML
     [larsga@Lars-Marius-Garshols-MacBook-Pro-6 ~]$ telnet dbpedia.org 80
     Trying 194.109.129.58...
     Connected to dbpedia.org (194.109.129.58).
     Escape character is '^]'.
     GET /page/European_route_E06 HTTP/1.0
                                                       This is the green type of DBpedia web
     HTTP/1.1 200 OK
     Date: Wed, 24 Oct 2012 07:16:21 GMT                    page you’ve already seen for Rælingen.
     Content-Type: text/html; charset=UTF-8
     Content-Length: 44484
     Connection: close
     Vary: Accept-Encoding
     Server: Virtuoso/06.04.3132 (Linux) x86_64-generic-linux-glibc212-64 VDB
     Accept-Ranges: bytes
     Expires: Wed, 31 Oct 2012 07:16:20 GMT
     Link: <http://dbpedia.org/data/European_route_E06.rdf>; rel="alternate";
     type="application/rdf+xml" ...
     <?xml version="1.0" encoding="UTF-8" ?>
     <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML+RDFa 1.0//EN"
     "http://www.w3.org/MarkUp/DTD/xhtml-rdfa-1.dtd">
     <html xmlns="http://www.w3.org/1999/xhtml"
       xmlns:dbpprop="http://dbpedia.org/property/"
       xmlns:foaf="http://xmlns.com/foaf/0.1/"
       version="XHTML+RDFa 1.0" xml:lang="en">




51
RDF/XML
     [larsga@Lars-Marius-Garshols-MacBook-Pro-6 ~]$ telnet dbpedia.org 80
     Trying 194.109.129.58...
     Connected to dbpedia.org (194.109.129.58).
     Escape character is '^]'.
     GET /data/European_route_E06.rdf HTTP/1.0
     HTTP/1.1 200 OK
     Date: Wed, 24 Oct 2012 07:36:03 GMT
     Content-Type: application/rdf+xml; charset=UTF-8
     Content-Length: 24377
     Connection: close
     Vary: Accept-Encoding
     Server: Virtuoso/06.04.3132 (Linux) x86_64-generic-linux-glibc212-64 VDB
     Accept-Ranges: bytes
     Expires: Wed, 31 Oct 2012 07:36:03 GMT
     Link: ...
     <?xml version="1.0" encoding="utf-8" ?>
     <rdf:RDF
             xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#"
             xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#"
             xmlns:owl="http://www.w3.org/2002/07/owl#"
             xmlns:foaf="http://xmlns.com/foaf/0.1/"
             xmlns:dbpedia-owl="http://dbpedia.org/ontology/"
             xmlns:dcterms="http://purl.org/dc/terms/"
             xmlns:dbpprop="http://dbpedia.org/property/"
             xmlns:ns7="http://www.w3.org/ns/prov#"
             xmlns:ns8="http://dbpedia.org/ontology/Infrastructure/" >
      <rdf:Description rdf:about="http://dbpedia.org/resource/European_route_E06">
       <rdf:type rdf:resource="http://dbpedia.org/ontology/Place" />
       <rdf:type rdf:resource="http://dbpedia.org/ontology/Infrastructure" />
       <rdf:type rdf:resource="http://schema.org/Place" />


52
Linked Data



53
Weaknesses in traditional formats

     • Cannot be imported directly
       – must always be translated/interpreted somehow
     • Cannot be automatically merged
       – no concept of identity
     • Schema information is not linked
       – types and properties not connected to other types
         and properties
     • Linked Data solves all of these problems
       – Linked Data ≈ RDF




54
How RDF works
     relational table ‘PERSON’
     ID NAME                     EMAIL
     1   Stian Danenbarger       stian.danenbarger@
     2   Lars Marius Garshol     larsga@bouvet.no
     3   Axel Borge              axel.borge@bouvet

             RDF-ized data
             SUBJECT                        PROPERTY   OBJECT
             http://example.com/person/1    rdf:type   ex:Person
             http://example.com/person/1    ex:name    Stian Danenbarger
             http://example.com/person/1    ex:email   stian.danenbarger@
             http://example.com/person/2    rdf:type   Person
             http://example.com/person/2    ex:name    Lars Marius Garshol
             ...                            ...        ...

55
Important RDF standards

     •   RDF/XML    Exchange format.
     •   Turtle     Human-readable format.
     •   NTriples   Braindead simple format.
     •   SPARQL     The query language.
     •   RDFS       Simple schema language.
     •   OWL        Extremely powerful schema
                    language.




56
Hafslund SESAM
• An archive system, really
• Automatically enriches metadata on documents
  when archived
• To do this, must collect data from enterprise
  systems




       http://www.slideshare.net/larsga/hafslund-sesam-semantic-integration-in-practice
High-level architecture

                     ERP      CRM         Intranet


                            SDshare
                                                          CMIS




  Search   SDshare
                            Virtuoso           SDshare   Archive
  engine                   triple store
Auto-tagging
                                        Manager




  Sent to archive           Project

                                        Customer

                    Work
                    order

                            Equipment


                                        Equipment
Data structure in triple store
  ERP

                sameAs

  Archive

                         sameAs


 CRM



 Intranet



                                  Triple store
Simplified core ontology
A simple example

     • We were building an internet service for
       DSS for use in the ministries
       – based on RDF
     • Were looking for open data which internal
       data at the ministries could be connected
       to
       – all for display in the intranet




62
RDF description of the data set,
using Dublin Core and VOID




 http://sws.ifi.uio.no/npd/page/
 LinkedOpenNPDFactPages
68
How hard would it be to use?

     • We loaded the data locally in a few minutes
     • We already have the ability to display
       arbitrary data from RDF
     • All that’s missing is connections from
       existing data
       – organization numbers is one way
       – statistical analysis is another




69
Linking data without common ids
     • It’s possible, using statistical techniques
        – in many cases not even that hard
        – http://code.google.com/p/duke/




70
Are there any tools out there?

Modelling



                                       pellet
Reasoners



                            Redland RDF Libraries
 APIs




Triple stores
Linked Schemas

     • Objects are identified by URI
       – but so are properties and classes
     • Means you can describe your properties
       and classes in terms of others
       – you can build on existing schemas
     • RDF provides very powerful ways to do this




72
RDFS and OWL
• Schema languages for RDF
  – used to describe classes and properties
  – in many ways like XML Schema or a database schema
• Represented in RDF
  – just like the data
  – means you can say anything you want about the data
• However, it doesn’t work like you expect
  – based on Open World Assumption
  – based on logical reasoning
Open World Assumption
• That nobody’s said it, doesn’t mean it’s not true
   – that we don’t have the date of death doesn’t mean the
     person is alive
   – that we have two different rows in the PERSON table
     doesn’t mean we have two different people
• In other words, data may be connected in
  unexpected ways
   – this usually doesn’t apply in single systems
   – but when you’re on the open web...
Open World Assumption
Rule: The value of dc:creator must be a person

                                                 rdfs:Range

                              dc:creator                 rdf:type

                                                                         person




                                                                            owl:disjointWith
                                                              rdf:type

                             dc:creator                  rdf:type

                                                                           sheep
Is Kyoto in Asia?
                                                 owl:Tr
                        locat
                                type-instance    ansitiv
                        ed-in                    eProp
                                                  erty




       Kyot
                 located-in        Japa
                                            located-in
                                                           Asia
        o                            n




              select ?c where { Kyoto located-in ?c . }
One model can extend another
                                                                       FOAF
           foaf:
                   foaf:depiction            foaf:
                                                                       (friend of a friend)
          Person                            Image




 foaf:knows


                   ph:depicted-in
                            owl:inverseOf
           ph:                               ph:                             ph:
          Person   ph:depiction-of          Photo                           Event


                     ph:taken-by



                                 ph:                    ph:            Photo ontology
                               Category                Place
                                                                       (not developed yet)
                                                               ph:contained-in


                              SKOS                   dbpedia
Serious logic
 • Bus drivers are people who drive buses
 • Drivers are people who drive vehicles
 • Therefore bus drivers are drivers!
                                                             ex:Bus           owl:subClassOf
                                              owl:someValuesFrom

            owl:intersectionOf               owl:restriction
ex:BusDriver
                                                                     owl:on

       owl:subClassOf                                          ex:drives
                                 ex:Person
                                                                   owl:on
               owl:intersectionOf            owl:restriction
ex:Driver

                                                owl:someValuesFrom

                                                               ex:Vehicle
            http://owl.man.ac.uk/2003/why/latest/
But ... you have to speak logic
                     • A serious challenge for
                       most people
                     • Must be very precise
                       about what you say
                     • People mostly use just
                       small fractions of OWL
Problems



80
Sensitive data

     • Some data is sensitive due to
       – privacy concerns
       – public security concerns
     • These concerns are real, and must be
       addressed
       – data may need to be filtered, or
       – in the worst case, not published at all
     • A rule of thumb
       – if it’s available on paper or in human-readable form,
         machine-readable should be OK, too



81
Will people understand the data?

     • People can and will misunderstand
       anything
       – this not your responsibility
       – but you can help
     • How to avoid
       – document the data
       – use self-describing linked data




82
Capacity issues

     • What if the data become too popular,
       could there be overloading issues?
       – yes, this happens
     • What to do
       – scale up (more hardware)
       – implement download restrictions, and charge for
         use above limit
       – let a data hotel host the data
          • http://data.norge.no/datahotellet




83
Data quality

     • What if the data is not 100% correct?
     • No data set is perfect
        – there are always problems with data
        – in the worst case you can add a disclaimer
        – all use is at own risk, in any case
     • Rules of thumb
        – if you take the trouble to use and maintain the data, it’s good
          enough for the public, too
        – if the data is too poor to publish, you should probably delete it

84
Media outrage

     • What if someone analyses the data and
       finds evidence of a scandal?
     • Does that mean it was wrong to publish
       the data?
     • Discuss!




85
The end



86
Conclusion

     • Open data is important
       – for democracy, and for the economy
     • Open data is Norwegian gov’t policy
       – (per letter from FAD in 2010)
     • There are different kinds of open
       – human-readable is good
       – machine-readable is easier, and often better
       – linked data is best (but not necessary)




87
Where to learn more

     • These slides
       – http://slideshare.net/larsga
     • FADs viderebruksveileder
       – http://no.wikibooks.org/wiki/Viderebruksveileder
     • Free ebook on Linked Open Data
       – http://linkeddatabook.com/editions/1.0/




88

Contenu connexe

Tendances

Designing and developing vocabularies in RDF
Designing and developing vocabularies in RDFDesigning and developing vocabularies in RDF
Designing and developing vocabularies in RDFOpen Data Support
 
Open Data Support - Service Description
Open Data Support - Service DescriptionOpen Data Support - Service Description
Open Data Support - Service DescriptionOpen Data Support
 
WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons LearnedWWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons LearnedStefan Dietze
 
The linked open government data and metadata lifecycle
The linked open government data and metadata lifecycleThe linked open government data and metadata lifecycle
The linked open government data and metadata lifecycleOpen Data Support
 
Keystone summer school_2015_miguel_antonio_ldcompression_4-joined
Keystone summer school_2015_miguel_antonio_ldcompression_4-joinedKeystone summer school_2015_miguel_antonio_ldcompression_4-joined
Keystone summer school_2015_miguel_antonio_ldcompression_4-joinedJoel Azzopardi
 
euclid_linkedup WWW tutorial (Besnik Fetahu)
euclid_linkedup WWW tutorial (Besnik Fetahu)euclid_linkedup WWW tutorial (Besnik Fetahu)
euclid_linkedup WWW tutorial (Besnik Fetahu)Besnik Fetahu
 
HDL - Towards A Harmonized Dataset Model for Open Data Portals
HDL - Towards A Harmonized Dataset Model for Open Data PortalsHDL - Towards A Harmonized Dataset Model for Open Data Portals
HDL - Towards A Harmonized Dataset Model for Open Data PortalsAhmad Assaf
 
Linked Energy Data Generation
Linked Energy Data GenerationLinked Energy Data Generation
Linked Energy Data GenerationFilip Radulovic
 
Design and manage persistent URIs
Design and manage persistent URIsDesign and manage persistent URIs
Design and manage persistent URIsOpen Data Support
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataSören Auer
 
Big Linked Data - Creating Training Curricula
Big Linked Data - Creating Training CurriculaBig Linked Data - Creating Training Curricula
Big Linked Data - Creating Training CurriculaEUCLID project
 
PoolParty Semantic Suite: Management Briefing and Functional Overview
PoolParty Semantic Suite: Management Briefing and Functional Overview PoolParty Semantic Suite: Management Briefing and Functional Overview
PoolParty Semantic Suite: Management Briefing and Functional Overview Martin Kaltenböck
 
Publish and use your data
Publish and use your dataPublish and use your data
Publish and use your dataLD4SC
 
Introduction to RDF & SPARQL
Introduction to RDF & SPARQLIntroduction to RDF & SPARQL
Introduction to RDF & SPARQLOpen Data Support
 
Standardizing for Open Data
Standardizing for Open DataStandardizing for Open Data
Standardizing for Open DataIvan Herman
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic WebTomek Pluskiewicz
 
Introduction to Metadata
Introduction to MetadataIntroduction to Metadata
Introduction to MetadataEUDAT
 
What can linked data do for digital libraries
What can linked data do for digital librariesWhat can linked data do for digital libraries
What can linked data do for digital librariesSören Auer
 

Tendances (20)

Designing and developing vocabularies in RDF
Designing and developing vocabularies in RDFDesigning and developing vocabularies in RDF
Designing and developing vocabularies in RDF
 
Open Data Support - Service Description
Open Data Support - Service DescriptionOpen Data Support - Service Description
Open Data Support - Service Description
 
WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons LearnedWWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
WWW2014 Tutorial: Online Learning & Linked Data - Lessons Learned
 
The linked open government data and metadata lifecycle
The linked open government data and metadata lifecycleThe linked open government data and metadata lifecycle
The linked open government data and metadata lifecycle
 
Keystone summer school_2015_miguel_antonio_ldcompression_4-joined
Keystone summer school_2015_miguel_antonio_ldcompression_4-joinedKeystone summer school_2015_miguel_antonio_ldcompression_4-joined
Keystone summer school_2015_miguel_antonio_ldcompression_4-joined
 
euclid_linkedup WWW tutorial (Besnik Fetahu)
euclid_linkedup WWW tutorial (Besnik Fetahu)euclid_linkedup WWW tutorial (Besnik Fetahu)
euclid_linkedup WWW tutorial (Besnik Fetahu)
 
Linked Data In Action
Linked Data In ActionLinked Data In Action
Linked Data In Action
 
HDL - Towards A Harmonized Dataset Model for Open Data Portals
HDL - Towards A Harmonized Dataset Model for Open Data PortalsHDL - Towards A Harmonized Dataset Model for Open Data Portals
HDL - Towards A Harmonized Dataset Model for Open Data Portals
 
Linked Energy Data Generation
Linked Energy Data GenerationLinked Energy Data Generation
Linked Energy Data Generation
 
Design and manage persistent URIs
Design and manage persistent URIsDesign and manage persistent URIs
Design and manage persistent URIs
 
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked DataIntroduction to the Data Web, DBpedia and the Life-cycle of Linked Data
Introduction to the Data Web, DBpedia and the Life-cycle of Linked Data
 
Big Linked Data - Creating Training Curricula
Big Linked Data - Creating Training CurriculaBig Linked Data - Creating Training Curricula
Big Linked Data - Creating Training Curricula
 
PoolParty Semantic Suite: Management Briefing and Functional Overview
PoolParty Semantic Suite: Management Briefing and Functional Overview PoolParty Semantic Suite: Management Briefing and Functional Overview
PoolParty Semantic Suite: Management Briefing and Functional Overview
 
Publish and use your data
Publish and use your dataPublish and use your data
Publish and use your data
 
Introduction to RDF & SPARQL
Introduction to RDF & SPARQLIntroduction to RDF & SPARQL
Introduction to RDF & SPARQL
 
Standardizing for Open Data
Standardizing for Open DataStandardizing for Open Data
Standardizing for Open Data
 
Introduction to the Semantic Web
Introduction to the Semantic WebIntroduction to the Semantic Web
Introduction to the Semantic Web
 
Introduction to Metadata
Introduction to MetadataIntroduction to Metadata
Introduction to Metadata
 
What can linked data do for digital libraries
What can linked data do for digital librariesWhat can linked data do for digital libraries
What can linked data do for digital libraries
 
Does metadata matter?
Does metadata matter?Does metadata matter?
Does metadata matter?
 

En vedette

Linked Open Data for Libraries
Linked Open Data for LibrariesLinked Open Data for Libraries
Linked Open Data for LibrariesLukas Koster
 
Madrid Building blocks of Linked Data
Madrid Building blocks of Linked DataMadrid Building blocks of Linked Data
Madrid Building blocks of Linked DataVictor de Boer
 
Linked Data and Images: Building Blocks for Cultural Heritage
Linked Data and Images: Building Blocks for Cultural HeritageLinked Data and Images: Building Blocks for Cultural Heritage
Linked Data and Images: Building Blocks for Cultural HeritageRobert Sanderson
 
Navigli sssw
Navigli ssswNavigli sssw
Navigli ssswSSSW
 
What is pattern recognition (lecture 4 of 6)
What is pattern recognition (lecture 4 of 6)What is pattern recognition (lecture 4 of 6)
What is pattern recognition (lecture 4 of 6)Randa Elanwar
 
Tutorial Cognition - Irene
Tutorial Cognition - IreneTutorial Cognition - Irene
Tutorial Cognition - IreneSSSW
 
Introduction to open data quality et
Introduction to open data quality etIntroduction to open data quality et
Introduction to open data quality etOpen Data Support
 
LOD(Linked Open Data) Recommendations
LOD(Linked Open Data) RecommendationsLOD(Linked Open Data) Recommendations
LOD(Linked Open Data) RecommendationsMyungjin Lee
 
Intro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsIntro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsJon Voss
 
Linked Open Data
Linked Open DataLinked Open Data
Linked Open DataDerilinx
 
Statistical classification: A review on some techniques
Statistical classification: A review on some techniquesStatistical classification: A review on some techniques
Statistical classification: A review on some techniquesGiorgos Bamparopoulos
 
EIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open DataEIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open DataAntoine Isaac
 
Seminar(Pattern Recognition)
Seminar(Pattern Recognition)Seminar(Pattern Recognition)
Seminar(Pattern Recognition)anurodhsinha
 
Artificial intelligence Pattern recognition system
Artificial intelligence Pattern recognition systemArtificial intelligence Pattern recognition system
Artificial intelligence Pattern recognition systemREHMAT ULLAH
 
Pattern Recognition
Pattern RecognitionPattern Recognition
Pattern RecognitionMaaz Hasan
 
Data Mining: Concepts and techniques classification _chapter 9 :advanced methods
Data Mining: Concepts and techniques classification _chapter 9 :advanced methodsData Mining: Concepts and techniques classification _chapter 9 :advanced methods
Data Mining: Concepts and techniques classification _chapter 9 :advanced methodsSalah Amean
 

En vedette (20)

Linked Open Data for Libraries
Linked Open Data for LibrariesLinked Open Data for Libraries
Linked Open Data for Libraries
 
Madrid Building blocks of Linked Data
Madrid Building blocks of Linked DataMadrid Building blocks of Linked Data
Madrid Building blocks of Linked Data
 
Linked (Open) Data
Linked (Open) DataLinked (Open) Data
Linked (Open) Data
 
Linked Data and Images: Building Blocks for Cultural Heritage
Linked Data and Images: Building Blocks for Cultural HeritageLinked Data and Images: Building Blocks for Cultural Heritage
Linked Data and Images: Building Blocks for Cultural Heritage
 
Navigli sssw
Navigli ssswNavigli sssw
Navigli sssw
 
What is pattern recognition (lecture 4 of 6)
What is pattern recognition (lecture 4 of 6)What is pattern recognition (lecture 4 of 6)
What is pattern recognition (lecture 4 of 6)
 
Tutorial Cognition - Irene
Tutorial Cognition - IreneTutorial Cognition - Irene
Tutorial Cognition - Irene
 
Introduction to open data quality et
Introduction to open data quality etIntroduction to open data quality et
Introduction to open data quality et
 
Pattern recognition
Pattern recognitionPattern recognition
Pattern recognition
 
LOD(Linked Open Data) Recommendations
LOD(Linked Open Data) RecommendationsLOD(Linked Open Data) Recommendations
LOD(Linked Open Data) Recommendations
 
Intro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsIntro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & Museums
 
Open Data and Linked Data
Open Data and Linked DataOpen Data and Linked Data
Open Data and Linked Data
 
Linked Open Data
Linked Open DataLinked Open Data
Linked Open Data
 
Linked Data Tutorial
Linked Data TutorialLinked Data Tutorial
Linked Data Tutorial
 
Statistical classification: A review on some techniques
Statistical classification: A review on some techniquesStatistical classification: A review on some techniques
Statistical classification: A review on some techniques
 
EIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open DataEIFL 2014 - Linked Open Data
EIFL 2014 - Linked Open Data
 
Seminar(Pattern Recognition)
Seminar(Pattern Recognition)Seminar(Pattern Recognition)
Seminar(Pattern Recognition)
 
Artificial intelligence Pattern recognition system
Artificial intelligence Pattern recognition systemArtificial intelligence Pattern recognition system
Artificial intelligence Pattern recognition system
 
Pattern Recognition
Pattern RecognitionPattern Recognition
Pattern Recognition
 
Data Mining: Concepts and techniques classification _chapter 9 :advanced methods
Data Mining: Concepts and techniques classification _chapter 9 :advanced methodsData Mining: Concepts and techniques classification _chapter 9 :advanced methods
Data Mining: Concepts and techniques classification _chapter 9 :advanced methods
 

Similaire à Linked Open Data

COMSODE networking session at ICT Lisbon 2015
COMSODE networking session at ICT Lisbon 2015COMSODE networking session at ICT Lisbon 2015
COMSODE networking session at ICT Lisbon 2015Comsode - FP7 project
 
Industry@RuleML2015 DataGraft
Industry@RuleML2015 DataGraftIndustry@RuleML2015 DataGraft
Industry@RuleML2015 DataGraftRuleML
 
Entity-Centric Data Management
Entity-Centric Data ManagementEntity-Centric Data Management
Entity-Centric Data ManagementeXascale Infolab
 
Data Science at Scale - The DevOps Approach
Data Science at Scale - The DevOps ApproachData Science at Scale - The DevOps Approach
Data Science at Scale - The DevOps ApproachMihai Criveti
 
Introduction to Cloud computing and Big Data-Hadoop
Introduction to Cloud computing and  Big Data-HadoopIntroduction to Cloud computing and  Big Data-Hadoop
Introduction to Cloud computing and Big Data-HadoopNagarjuna D.N
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked DataMarin Dimitrov
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked DataEUCLID project
 
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...Mihai Criveti
 
Data-as-a-Service: DataGraft
Data-as-a-Service: DataGraftData-as-a-Service: DataGraft
Data-as-a-Service: DataGraftdapaasproject
 
Myth Busters VII: I’m building a data mesh, so I don’t need data virtualization
Myth Busters VII: I’m building a data mesh, so I don’t need data virtualizationMyth Busters VII: I’m building a data mesh, so I don’t need data virtualization
Myth Busters VII: I’m building a data mesh, so I don’t need data virtualizationDenodo
 
Dublinked tech workshop_15_dec2011
Dublinked tech workshop_15_dec2011Dublinked tech workshop_15_dec2011
Dublinked tech workshop_15_dec2011Dublinked .
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareIMC Technologies
 
Large scale computing
Large scale computing Large scale computing
Large scale computing Bhupesh Bansal
 
To mesh or mess up your data organisation - Jochem van Grondelle (Prosus/OLX ...
To mesh or mess up your data organisation - Jochem van Grondelle (Prosus/OLX ...To mesh or mess up your data organisation - Jochem van Grondelle (Prosus/OLX ...
To mesh or mess up your data organisation - Jochem van Grondelle (Prosus/OLX ...Jochem van Grondelle
 
Introduction to APIs and Linked Data
Introduction to APIs and Linked DataIntroduction to APIs and Linked Data
Introduction to APIs and Linked DataAdrian Stevenson
 

Similaire à Linked Open Data (20)

COMSODE networking session at ICT Lisbon 2015
COMSODE networking session at ICT Lisbon 2015COMSODE networking session at ICT Lisbon 2015
COMSODE networking session at ICT Lisbon 2015
 
Industry@RuleML2015 DataGraft
Industry@RuleML2015 DataGraftIndustry@RuleML2015 DataGraft
Industry@RuleML2015 DataGraft
 
Entity-Centric Data Management
Entity-Centric Data ManagementEntity-Centric Data Management
Entity-Centric Data Management
 
Data Science at Scale - The DevOps Approach
Data Science at Scale - The DevOps ApproachData Science at Scale - The DevOps Approach
Data Science at Scale - The DevOps Approach
 
Introduction to Cloud computing and Big Data-Hadoop
Introduction to Cloud computing and  Big Data-HadoopIntroduction to Cloud computing and  Big Data-Hadoop
Introduction to Cloud computing and Big Data-Hadoop
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked Data
 
Methodology for the publication of Linked Open Data from small and medium siz...
Methodology for the publication of Linked Open Data from small and medium siz...Methodology for the publication of Linked Open Data from small and medium siz...
Methodology for the publication of Linked Open Data from small and medium siz...
 
Scaling up Linked Data
Scaling up Linked DataScaling up Linked Data
Scaling up Linked Data
 
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
DevOps for Data Engineers - Automate Your Data Science Pipeline with Ansible,...
 
Data-as-a-Service: DataGraft
Data-as-a-Service: DataGraftData-as-a-Service: DataGraft
Data-as-a-Service: DataGraft
 
Myth Busters VII: I’m building a data mesh, so I don’t need data virtualization
Myth Busters VII: I’m building a data mesh, so I don’t need data virtualizationMyth Busters VII: I’m building a data mesh, so I don’t need data virtualization
Myth Busters VII: I’m building a data mesh, so I don’t need data virtualization
 
Dublinked tech workshop_15_dec2011
Dublinked tech workshop_15_dec2011Dublinked tech workshop_15_dec2011
Dublinked tech workshop_15_dec2011
 
LOD2 Webinar Series: CubeViz
LOD2 Webinar Series: CubeViz LOD2 Webinar Series: CubeViz
LOD2 Webinar Series: CubeViz
 
KEDL DBpedia 2019
KEDL DBpedia  2019KEDL DBpedia  2019
KEDL DBpedia 2019
 
Here Comes Everything
Here Comes EverythingHere Comes Everything
Here Comes Everything
 
Linked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the SoftwareLinked Data for the Masses: The approach and the Software
Linked Data for the Masses: The approach and the Software
 
Large scale computing
Large scale computing Large scale computing
Large scale computing
 
To mesh or mess up your data organisation - Jochem van Grondelle (Prosus/OLX ...
To mesh or mess up your data organisation - Jochem van Grondelle (Prosus/OLX ...To mesh or mess up your data organisation - Jochem van Grondelle (Prosus/OLX ...
To mesh or mess up your data organisation - Jochem van Grondelle (Prosus/OLX ...
 
Introduction to APIs and Linked Data
Introduction to APIs and Linked DataIntroduction to APIs and Linked Data
Introduction to APIs and Linked Data
 
The Web of Data: The W3C Semantic Web Initiative
The Web of Data: The W3C Semantic Web InitiativeThe Web of Data: The W3C Semantic Web Initiative
The Web of Data: The W3C Semantic Web Initiative
 

Plus de Lars Marius Garshol

JSLT: JSON querying and transformation
JSLT: JSON querying and transformationJSLT: JSON querying and transformation
JSLT: JSON querying and transformationLars Marius Garshol
 
Data collection in AWS at Schibsted
Data collection in AWS at SchibstedData collection in AWS at Schibsted
Data collection in AWS at SchibstedLars Marius Garshol
 
NoSQL and Einstein's theory of relativity
NoSQL and Einstein's theory of relativityNoSQL and Einstein's theory of relativity
NoSQL and Einstein's theory of relativityLars Marius Garshol
 
Using the search engine as recommendation engine
Using the search engine as recommendation engineUsing the search engine as recommendation engine
Using the search engine as recommendation engineLars Marius Garshol
 
Linked Open Data for the Cultural Sector
Linked Open Data for the Cultural SectorLinked Open Data for the Cultural Sector
Linked Open Data for the Cultural SectorLars Marius Garshol
 
NoSQL databases, the CAP theorem, and the theory of relativity
NoSQL databases, the CAP theorem, and the theory of relativityNoSQL databases, the CAP theorem, and the theory of relativity
NoSQL databases, the CAP theorem, and the theory of relativityLars Marius Garshol
 
Introduction to Big Data/Machine Learning
Introduction to Big Data/Machine LearningIntroduction to Big Data/Machine Learning
Introduction to Big Data/Machine LearningLars Marius Garshol
 
Hafslund SESAM - Semantic integration in practice
Hafslund SESAM - Semantic integration in practiceHafslund SESAM - Semantic integration in practice
Hafslund SESAM - Semantic integration in practiceLars Marius Garshol
 
Experiments in genetic programming
Experiments in genetic programmingExperiments in genetic programming
Experiments in genetic programmingLars Marius Garshol
 

Plus de Lars Marius Garshol (20)

JSLT: JSON querying and transformation
JSLT: JSON querying and transformationJSLT: JSON querying and transformation
JSLT: JSON querying and transformation
 
Data collection in AWS at Schibsted
Data collection in AWS at SchibstedData collection in AWS at Schibsted
Data collection in AWS at Schibsted
 
Kveik - what is it?
Kveik - what is it?Kveik - what is it?
Kveik - what is it?
 
Nature-inspired algorithms
Nature-inspired algorithmsNature-inspired algorithms
Nature-inspired algorithms
 
Collecting 600M events/day
Collecting 600M events/dayCollecting 600M events/day
Collecting 600M events/day
 
History of writing
History of writingHistory of writing
History of writing
 
NoSQL and Einstein's theory of relativity
NoSQL and Einstein's theory of relativityNoSQL and Einstein's theory of relativity
NoSQL and Einstein's theory of relativity
 
Norwegian farmhouse ale
Norwegian farmhouse aleNorwegian farmhouse ale
Norwegian farmhouse ale
 
Archive integration with RDF
Archive integration with RDFArchive integration with RDF
Archive integration with RDF
 
The Euro crisis in 10 minutes
The Euro crisis in 10 minutesThe Euro crisis in 10 minutes
The Euro crisis in 10 minutes
 
Using the search engine as recommendation engine
Using the search engine as recommendation engineUsing the search engine as recommendation engine
Using the search engine as recommendation engine
 
Linked Open Data for the Cultural Sector
Linked Open Data for the Cultural SectorLinked Open Data for the Cultural Sector
Linked Open Data for the Cultural Sector
 
NoSQL databases, the CAP theorem, and the theory of relativity
NoSQL databases, the CAP theorem, and the theory of relativityNoSQL databases, the CAP theorem, and the theory of relativity
NoSQL databases, the CAP theorem, and the theory of relativity
 
Bitcoin - digital gold
Bitcoin - digital goldBitcoin - digital gold
Bitcoin - digital gold
 
Introduction to Big Data/Machine Learning
Introduction to Big Data/Machine LearningIntroduction to Big Data/Machine Learning
Introduction to Big Data/Machine Learning
 
Hops - the green gold
Hops - the green goldHops - the green gold
Hops - the green gold
 
Big data 101
Big data 101Big data 101
Big data 101
 
Hafslund SESAM - Semantic integration in practice
Hafslund SESAM - Semantic integration in practiceHafslund SESAM - Semantic integration in practice
Hafslund SESAM - Semantic integration in practice
 
Approximate string comparators
Approximate string comparatorsApproximate string comparators
Approximate string comparators
 
Experiments in genetic programming
Experiments in genetic programmingExperiments in genetic programming
Experiments in genetic programming
 

Dernier

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 

Dernier (20)

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 

Linked Open Data

  • 1. Linked Open Data NOKIOS 2012, 2012-10-30 Lars Marius Garshol, larsga@bouvet.no, http://twitter.com/larsga 1
  • 2. Agenda • What and why? • Examples of use • How to publish • Linked Data • Problems • Conclusion 2
  • 4. Linked Open Data foundation for all IT functionality no data – no service published license allowing reuse machine-readable connected to other data self-documenting 4
  • 5. 5
  • 6. US Government Works • US copyright law states that – "a work prepared by an officer or employee of the U.S. government as part of that person's official duties” • is – not entitled to domestic copyright protection 6 http://en.wikipedia.org/wiki/Copyright_status_of_work_by_the_U.S._government
  • 7. 7
  • 8. 8 http://www.digst.dk/Home/Servicemenu/English/Digitisation/~/media/Files/English/BasicDataUKweb20121008.ashx
  • 9. 9
  • 10. http://www.regjeringen.no/nb/dep/fad/dok/rundskriv/2010/fellesforinger- tildelingsbrevene-for-201.html?id=624792 10
  • 11. 11
  • 12. Why publish open data? • To empower other people to do things – you can’t afford – you don’t have the time to – you don’t want to – you couldn’t imagine • Again: no data, no service 12
  • 13. Why is this so popular now? • Because the advent of – ubiquitous computing, – cheap hardware and disk space, – cheap and fast networks • has dramatically reduced the cost of distributing data – and building applications, – while dramatically increasing the value from such applications • Also, fashion 13
  • 14. Why is access to data important? • It’s necessary for democracy to function properly – voters and activists cannot make the right decisions if they don’t have the necessary information • It’s important for economic growth – in a post-modern society, information is the life- blood of the economy – many companies make their living simply by selling repackaged information 14
  • 15. Economic effects • Making information more easily available – levels the playing field – benefiting smaller companies – improves competition • Large companies have – advertising budgets, – money to extract data, – ... • Smaller entities don’t 16
  • 16. Two kinds of availability (1) • Available to humans – this makes the raw data available to humans – only humans can then digest and process it, and ultimately pass it on to other humans 17
  • 17. Two kinds of availability (2) • Available to machines – that is, make the data available in machine- processable form – people can then build many different kinds of services based on the data – allows many different kinds of access to the data 18
  • 18. 19
  • 19. 20
  • 20. 21
  • 21. 22
  • 22. http://www.guardian.co.uk/ news/datablog/2012/oct/09/ devolution-national- statistics-data 23
  • 23. 24
  • 25. http://www.dagblad et.no/kommunebor sen/ 26
  • 26. 27 Research project by SINTEF and Computas
  • 27. Must be at meeting at 1345. Three transport alternatives. 28 Research project by SINTEF and Computas
  • 28. Data sources Research project by SINTEF and Computas 29
  • 29. Bygninger.no (Database is over free limit.) 30
  • 30. German Federal Budget 31 http://bund.offenerhaushalt.de/
  • 32. Cleaning customer data • I did work deduplicating a customer database • We wanted to look at cleaning the data automatically – ie: picking which of the duplicate records to use • Decided at the time it couldn’t be done automatically – because company data not available online • Now it is available... 33
  • 33. http://dbpedia.org/About • Not really an application – but a very important data set • Basically Wikipedia as Linked Open Data – Wikipedia fact boxes etc extracted as RDF – 400 million statements about 3.77 million things 34
  • 34. 35
  • 36. Ten Principles for Open Gov’t Data 1. Completeness 2. Primary 3. Timeliness 4. Ease of access 5. Machine readable 6. Non-discrimination 7. Use open standards 8. Licensing 9. Permanence 10. Usage cost 37 http://sunlightfoundation.com/policy/documents/ten-open-data-principles/
  • 37. 5-star model Available, open license Machine-readable format Non-proprietary format URIs as identifiers Linked to other data 38 http://5stardata.info/
  • 38. Data licenses • Necessary so users know what they are allowed/not allowed to do with the data • Open Data Commons has licenses you can reuse: – http://opendatacommons.org/licenses/ • Such as – public domain (anything goes) – attribution license (must give credit) – attribution share-alike • Norwegian license – http://data.norge.no/nlod/en – Norwegian Licence for Open Government Data 39
  • 39. Machine-readable Readable Not readable • CSV • Microsoft Word • XML • PDF • Microsoft Excel • HTML • RDF • Flash • JSON >>> import csv >>> r = csv.reader(open('countries-mondial.csv')) >>> r.next() ['id', 'country', 'capital', 'area'] >>> r.next() ['4202', 'Malta', 'Valletta', '320'] >>> int(r.next()[3]) * 247.105 3330975.4000000004 # area of next country in acres 40
  • 40. Non-proprietary • Microsoft Excel is a proprietary format – it’s owned and controlled by Microsoft – it’s also very complicated to read • An open alternative is CSV • Open, standardized alternatives are – XML, RDF, JSON 41
  • 41. Ways to publish • Download – this is the easiest – just put up static data files for download – vastly better than nothing • API – build an API people can use for interacting with the data – fashionable, but not really necessary • Stream – publish streams of changes, for easy syncing – using SDshare, for example – very useful, but can be costly 42
  • 42. How hard is it to publish data? • Depends on how you do it • Dumping CSV from a relational database is trivial – can literally be as simple as just a few lines of code • The more ambitious you are, the more work it becomes – filtering out sensitive records – using better formats than CSV – linking to other data – documenting – adding streaming – ... 43
  • 43. URIs as identifiers • URIs are globally unique – thanks to their use of domain names – anyone with a domain name can make URIs • Benefits – identifiers that can be reused anywhere, and still remain unique – can be resolved to an explanation of what they identify
  • 44. Linking to other data (1) • Data becomes more valuable when it is connected to other data – because this reduces the cost of reusing and processing the data • Imagine reusing the data below – how do you connect with country data from other sources? ID COUNTRY CAPITAL AREA 4202 Malta Valletta 320 19654 Moldova Chisinau 25333 8715 Kazakstan Almaty 2717300 Data from Mondial 45
  • 45. Linking to other data (2) • By using common URIs for your concepts you can make data reuse much easier ID COUNTRY CAPITAL AREA http://dbpedia.org/resource/Malta Malta Valletta 316 http://dbpedia.org/resource/Moldova Moldova Chişinău 33846 http://dbpedia.org/resource/Kazakhstan Kazakhstan Almaty 2717300 Data from DBpedia 46
  • 46. 47 Linking Open Data cloud diagram, by Richard Cyganiak and Anja Jentzsch. http://lod-cloud.net/
  • 47. Linked Data principles 1. Use URIs as names for things 1. Use HTTP URIs so that people can look up those names. 1. When someone looks up a URI, provide useful information, using the standards (RDF*, SPARQL) 1. Include links to other URIs so that they can discover more things. 48 http://www.w3.org/DesignIssues/LinkedData
  • 48. An example <result> <heading>VEGMELDINGER - Viktigste</heading> <messages> <message> <heading>Oslo (Ulven) - Karihaugen, ved [41] Furuset</heading> <messagenumber>64952</messagenumber> <version>1</version> <ingress>To felt stengt på grunn av vedlikeholdsarbeid i periodene: Onsdag og torsdag fra 22:00 til 05:30 (neste dag). Fare for kø.</ingress> <messageType>Redusert framkommelighet</messageType> <urgency>X</urgency> <roadType>Ev</roadType> <roadNumber>6</roadNumber> <validFrom>2012-10-24 22:00:00.0 CEST</validFrom> <validTo>2012-10-26 05:30:00.999 CEST</validTo> Are these helpful ways to <actualCounties> refer to route E6 and the <string>Oslo</string> </actualCounties> county of Oslo? <coordinates> <crs>EPSG:4326</crs> <startPoint> <xCoord>10.889964</xCoord> <yCoord>59.937266</yCoord> </startPoint> </coordinates> 49
  • 49. An alternative <result> <heading>VEGMELDINGER - Viktigste</heading> <messages> <message> <heading>Oslo (Ulven) - Karihaugen, ved [41] Furuset</heading> <messagenumber>64952</messagenumber> <version>1</version> <ingress>To felt stengt på grunn av vedlikeholdsarbeid i periodene: Onsdag og torsdag fra 22:00 til 05:30 (neste dag). Fare for kø.</ingress> <messageType>Redusert framkommelighet</messageType> <urgency>X</urgency> <road>http://dbpedia.org/resource/European_route_E06</road> <validFrom>2012-10-24 22:00:00.0 CEST</validFrom> <validTo>2012-10-26 05:30:00.999 CEST</validTo> <actualCounties> <county>http://dbpedia.org/resource/Oslo</county> </actualCounties> <coordinates> <crs>EPSG:4326</crs> <startPoint> Now we’re using URIs as names for these <xCoord>10.889964</xCoord> <yCoord>59.937266</yCoord> concepts. And, what’s more, the names </startPoint> </coordinates> resolve to more information. 50
  • 50. HTML [larsga@Lars-Marius-Garshols-MacBook-Pro-6 ~]$ telnet dbpedia.org 80 Trying 194.109.129.58... Connected to dbpedia.org (194.109.129.58). Escape character is '^]'. GET /page/European_route_E06 HTTP/1.0 This is the green type of DBpedia web HTTP/1.1 200 OK Date: Wed, 24 Oct 2012 07:16:21 GMT page you’ve already seen for Rælingen. Content-Type: text/html; charset=UTF-8 Content-Length: 44484 Connection: close Vary: Accept-Encoding Server: Virtuoso/06.04.3132 (Linux) x86_64-generic-linux-glibc212-64 VDB Accept-Ranges: bytes Expires: Wed, 31 Oct 2012 07:16:20 GMT Link: <http://dbpedia.org/data/European_route_E06.rdf>; rel="alternate"; type="application/rdf+xml" ... <?xml version="1.0" encoding="UTF-8" ?> <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML+RDFa 1.0//EN" "http://www.w3.org/MarkUp/DTD/xhtml-rdfa-1.dtd"> <html xmlns="http://www.w3.org/1999/xhtml" xmlns:dbpprop="http://dbpedia.org/property/" xmlns:foaf="http://xmlns.com/foaf/0.1/" version="XHTML+RDFa 1.0" xml:lang="en"> 51
  • 51. RDF/XML [larsga@Lars-Marius-Garshols-MacBook-Pro-6 ~]$ telnet dbpedia.org 80 Trying 194.109.129.58... Connected to dbpedia.org (194.109.129.58). Escape character is '^]'. GET /data/European_route_E06.rdf HTTP/1.0 HTTP/1.1 200 OK Date: Wed, 24 Oct 2012 07:36:03 GMT Content-Type: application/rdf+xml; charset=UTF-8 Content-Length: 24377 Connection: close Vary: Accept-Encoding Server: Virtuoso/06.04.3132 (Linux) x86_64-generic-linux-glibc212-64 VDB Accept-Ranges: bytes Expires: Wed, 31 Oct 2012 07:36:03 GMT Link: ... <?xml version="1.0" encoding="utf-8" ?> <rdf:RDF xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:rdfs="http://www.w3.org/2000/01/rdf-schema#" xmlns:owl="http://www.w3.org/2002/07/owl#" xmlns:foaf="http://xmlns.com/foaf/0.1/" xmlns:dbpedia-owl="http://dbpedia.org/ontology/" xmlns:dcterms="http://purl.org/dc/terms/" xmlns:dbpprop="http://dbpedia.org/property/" xmlns:ns7="http://www.w3.org/ns/prov#" xmlns:ns8="http://dbpedia.org/ontology/Infrastructure/" > <rdf:Description rdf:about="http://dbpedia.org/resource/European_route_E06"> <rdf:type rdf:resource="http://dbpedia.org/ontology/Place" /> <rdf:type rdf:resource="http://dbpedia.org/ontology/Infrastructure" /> <rdf:type rdf:resource="http://schema.org/Place" /> 52
  • 53. Weaknesses in traditional formats • Cannot be imported directly – must always be translated/interpreted somehow • Cannot be automatically merged – no concept of identity • Schema information is not linked – types and properties not connected to other types and properties • Linked Data solves all of these problems – Linked Data ≈ RDF 54
  • 54. How RDF works relational table ‘PERSON’ ID NAME EMAIL 1 Stian Danenbarger stian.danenbarger@ 2 Lars Marius Garshol larsga@bouvet.no 3 Axel Borge axel.borge@bouvet RDF-ized data SUBJECT PROPERTY OBJECT http://example.com/person/1 rdf:type ex:Person http://example.com/person/1 ex:name Stian Danenbarger http://example.com/person/1 ex:email stian.danenbarger@ http://example.com/person/2 rdf:type Person http://example.com/person/2 ex:name Lars Marius Garshol ... ... ... 55
  • 55. Important RDF standards • RDF/XML Exchange format. • Turtle Human-readable format. • NTriples Braindead simple format. • SPARQL The query language. • RDFS Simple schema language. • OWL Extremely powerful schema language. 56
  • 56. Hafslund SESAM • An archive system, really • Automatically enriches metadata on documents when archived • To do this, must collect data from enterprise systems http://www.slideshare.net/larsga/hafslund-sesam-semantic-integration-in-practice
  • 57. High-level architecture ERP CRM Intranet SDshare CMIS Search SDshare Virtuoso SDshare Archive engine triple store
  • 58. Auto-tagging Manager Sent to archive Project Customer Work order Equipment Equipment
  • 59. Data structure in triple store ERP sameAs Archive sameAs CRM Intranet Triple store
  • 61. A simple example • We were building an internet service for DSS for use in the ministries – based on RDF • Were looking for open data which internal data at the ministries could be connected to – all for display in the intranet 62
  • 62.
  • 63. RDF description of the data set, using Dublin Core and VOID http://sws.ifi.uio.no/npd/page/ LinkedOpenNPDFactPages
  • 64.
  • 65.
  • 66.
  • 67. 68
  • 68. How hard would it be to use? • We loaded the data locally in a few minutes • We already have the ability to display arbitrary data from RDF • All that’s missing is connections from existing data – organization numbers is one way – statistical analysis is another 69
  • 69. Linking data without common ids • It’s possible, using statistical techniques – in many cases not even that hard – http://code.google.com/p/duke/ 70
  • 70. Are there any tools out there? Modelling pellet Reasoners Redland RDF Libraries APIs Triple stores
  • 71. Linked Schemas • Objects are identified by URI – but so are properties and classes • Means you can describe your properties and classes in terms of others – you can build on existing schemas • RDF provides very powerful ways to do this 72
  • 72. RDFS and OWL • Schema languages for RDF – used to describe classes and properties – in many ways like XML Schema or a database schema • Represented in RDF – just like the data – means you can say anything you want about the data • However, it doesn’t work like you expect – based on Open World Assumption – based on logical reasoning
  • 73. Open World Assumption • That nobody’s said it, doesn’t mean it’s not true – that we don’t have the date of death doesn’t mean the person is alive – that we have two different rows in the PERSON table doesn’t mean we have two different people • In other words, data may be connected in unexpected ways – this usually doesn’t apply in single systems – but when you’re on the open web...
  • 74. Open World Assumption Rule: The value of dc:creator must be a person rdfs:Range dc:creator rdf:type person owl:disjointWith rdf:type dc:creator rdf:type sheep
  • 75. Is Kyoto in Asia? owl:Tr locat type-instance ansitiv ed-in eProp erty Kyot located-in Japa located-in Asia o n select ?c where { Kyoto located-in ?c . }
  • 76. One model can extend another FOAF foaf: foaf:depiction foaf: (friend of a friend) Person Image foaf:knows ph:depicted-in owl:inverseOf ph: ph: ph: Person ph:depiction-of Photo Event ph:taken-by ph: ph: Photo ontology Category Place (not developed yet) ph:contained-in SKOS dbpedia
  • 77. Serious logic • Bus drivers are people who drive buses • Drivers are people who drive vehicles • Therefore bus drivers are drivers! ex:Bus owl:subClassOf owl:someValuesFrom owl:intersectionOf owl:restriction ex:BusDriver owl:on owl:subClassOf ex:drives ex:Person owl:on owl:intersectionOf owl:restriction ex:Driver owl:someValuesFrom ex:Vehicle http://owl.man.ac.uk/2003/why/latest/
  • 78. But ... you have to speak logic • A serious challenge for most people • Must be very precise about what you say • People mostly use just small fractions of OWL
  • 80. Sensitive data • Some data is sensitive due to – privacy concerns – public security concerns • These concerns are real, and must be addressed – data may need to be filtered, or – in the worst case, not published at all • A rule of thumb – if it’s available on paper or in human-readable form, machine-readable should be OK, too 81
  • 81. Will people understand the data? • People can and will misunderstand anything – this not your responsibility – but you can help • How to avoid – document the data – use self-describing linked data 82
  • 82. Capacity issues • What if the data become too popular, could there be overloading issues? – yes, this happens • What to do – scale up (more hardware) – implement download restrictions, and charge for use above limit – let a data hotel host the data • http://data.norge.no/datahotellet 83
  • 83. Data quality • What if the data is not 100% correct? • No data set is perfect – there are always problems with data – in the worst case you can add a disclaimer – all use is at own risk, in any case • Rules of thumb – if you take the trouble to use and maintain the data, it’s good enough for the public, too – if the data is too poor to publish, you should probably delete it 84
  • 84. Media outrage • What if someone analyses the data and finds evidence of a scandal? • Does that mean it was wrong to publish the data? • Discuss! 85
  • 86. Conclusion • Open data is important – for democracy, and for the economy • Open data is Norwegian gov’t policy – (per letter from FAD in 2010) • There are different kinds of open – human-readable is good – machine-readable is easier, and often better – linked data is best (but not necessary) 87
  • 87. Where to learn more • These slides – http://slideshare.net/larsga • FADs viderebruksveileder – http://no.wikibooks.org/wiki/Viderebruksveileder • Free ebook on Linked Open Data – http://linkeddatabook.com/editions/1.0/ 88