SlideShare une entreprise Scribd logo
1  sur  87
Digital Enterprise Research Institute                        deri.ie




     Interlinking Personal Semantic Data
                                on the Desktop and the Web

                                           Laura Drǎgan
Outline
Digital Enterprise Research Institute     www.deri.ie



       // Introduction
           Background and motivation
           Research questions

       // Directions and results
           Within the Semantic desktop
           To the Web of Data
           A use case to rule them all

       // Conclusion
           Research answers
           Future work
                                                   1
Outline
Digital Enterprise Research Institute     www.deri.ie



       // Introduction
           Background and motivation
           Research questions

       // Directions and results
           Within the Semantic desktop
           To the Web of Data
           A use case to rule them all

       // Conclusion
           Research answers
           Future work
                                                   1
Outline
Digital Enterprise Research Institute     www.deri.ie



       // Introduction
           Background and motivation
           Research questions

       // Directions and results
           Within the Semantic desktop
           To the Web of Data
           A use case to rule them all

       // Conclusion
           Research answers
           Future work
                                                   1
Background
Digital Enterprise Research Institute    www.deri.ie



       Personal Information Management




                                                  2
Background
Digital Enterprise Research Institute                  www.deri.ie



       Personal Information Management


                                                      1962
                                                      1968




                                        1945



                                               1965

                                                                2
Background
Digital Enterprise Research Institute    www.deri.ie



       Personal Information Management
       Web




                                                  2
Background
Digital Enterprise Research Institute    www.deri.ie



       Personal Information Management
       Semantic Web




                                                  2
Background
Digital Enterprise Research Institute    www.deri.ie



       Personal Information Management
       Semantic Web

       Semantic Desktop




                                                  2
Background
Digital Enterprise Research Institute    www.deri.ie



       Personal Information Management
       Semantic Web

       Semantic Desktop




                                                  2
Motivation
Digital Enterprise Research Institute                            www.deri.ie




                             Use the framework provided by
                          the Semantic Desktop to build useful
                                applications and services




                                                                          3
Research questions
Digital Enterprise Research Institute                       www.deri.ie




       Q1.   How to build semantic applications and tools for
            the Semantic Desktop to provide the best
            experience for the users, while creating reusable
            semantic data?




                                                                     4
Research questions
Digital Enterprise Research Institute                       www.deri.ie




       Q1.   How to build semantic applications and tools for
            the Semantic Desktop to provide the best
            experience for the users, while creating reusable
            semantic data?




                                                                     4
Research questions
Digital Enterprise Research Institute                       www.deri.ie




       Q1. How to buildsemantic applications and tools for the
            Semantic Desktop?




                                                                     4
Research questions
Digital Enterprise Research Institute                       www.deri.ie




       Q1. How to buildsemantic applications and tools for the
            Semantic Desktop?

       Q2.   How to expand the scope of the Semantic Desktop
            into the realm of the Web of Data, to benefit the
            users and enhance their experience?




                                                                     4
Research questions
Digital Enterprise Research Institute                       www.deri.ie




       Q1. How to buildsemantic applications and tools for the
            Semantic Desktop?

       Q2.   How to expand the scope of the Semantic Desktop
            into the realm of the Web of Data, to benefit the
            users and enhance their experience?




                                                                     4
Research questions
Digital Enterprise Research Institute                       www.deri.ie




       Q1. How to buildsemantic applications and tools for the
            Semantic Desktop?

       Q2. How to   expand the scope of the Semantic Desktop
            into the Web of Data?




                                                                     4
Q1 sub-questions
Digital Enterprise Research Institute                                      www.deri.ie




       semantic applications for the Semantic Desktop

              Q1.1.  How to create semantic data that is complete, correct,
                  safe, and provides a high degree of interlinking with the
                  already existing network of semantic data on the desktop?
              Q1.2. How to reuse existing Semantic Desktop data in an
                  application?
              Q1.3. How to design the human-computer interaction in an
                  application for the Semantic Desktop?
              Q1.4.    How to correctly evaluate a semantic application?




                                                                                    5
Q1 sub-questions
Digital Enterprise Research Institute                                                       www.deri.ie




       semantic applications for the Semantic Desktop

              Q1.1. How to create semantic data that is complete, correct, safe, and
                provides a high degree of interlinking with the already existing network
                of semantic data on the desktop?
              Q1.2. How to              reuse existing Semantic Desktop data in an application?
              Q1.3. How to design the human-computer interaction in an application
                for the Semantic Desktop?
              Q1.4. How to correctly            evaluate a semantic application?




                                                                                                     5
Q2 sub-questions
Digital Enterprise Research Institute                                      www.deri.ie




       connect the Semantic Desktop with the Web of Data

              Q2.1. How to find Web instances representing the same real-
                  world thing described by a Semantic Desktop resource?
              Q2.2. How to use the Web information which is related to a
                  desktop resource?
              Q2.3.    How to make desktop data available online safely?




                                                                                    6
Q2 sub-questions
Digital Enterprise Research Institute                                                     www.deri.ie




       connect the Semantic Desktop with the Web of Data

              Q2.1. How to find Web instances representing the same real-world
                thing described by a Semantic Desktop resource?
              Q2.2. How to              use the Web information which is related to a desktop
                resource?
              Q2.3. How to make desktop data             available online safely?




                                                                                                   6
Directions
Digital Enterprise Research Institute   www.deri.ie




                                                 7
Directions
Digital Enterprise Research Institute        www.deri.ie




                                        1.




                                                      7
Directions
Digital Enterprise Research Institute   www.deri.ie




                                 1.




                                                 7
Directions
Digital Enterprise Research Institute   www.deri.ie




                                                 7
Directions
Digital Enterprise Research Institute        www.deri.ie




                                        2.




                                                      7
Directions
Digital Enterprise Research Institute        www.deri.ie




                                        2.
                                1.




                                                      7
Within the Semantic Desktop
Digital Enterprise Research Institute   www.deri.ie




                                                 8
SemNotes
Digital Enterprise Research Institute                         www.deri.ie



       Challenges described by Q1

                 create new semantic data
                     – Data representation
                     – Data management
                 reuse existing Semantic Desktop data
                     – Interlinking
                 design the human-computer interaction
                     – Visualisation
                 correctly evaluate a semantic application
                     – Task-based comparison to Evernote




                                                                       9
Data representation
Digital Enterprise Research Institute   www.deri.ie




       <nepomuk:/a_note>




                                                 10
Data representation
Digital Enterprise Research Institute     www.deri.ie




       <nepomuk:/a_note>
                          a pimo:Note ;




                                                   10
Data representation
Digital Enterprise Research Institute                       www.deri.ie




       <nepomuk:/a_note>
                          a pimo:Note ;
                          nao:prefLabel "holiday plans" ;




                                                                     10
Data representation
Digital Enterprise Research Institute                                         www.deri.ie




       <nepomuk:/a_note>
                          a pimo:Note ;
                          nao:prefLabel “holiday plans” ;
                          nao:created “2010-09-16T21:08:54.29Z”ˆˆxsd:dateTime ;
                          nao:lastModified “2010-09-17T10:59:01.58Z”ˆˆxsd:dateTime
       ;
                          nao:numericRating “9”ˆˆxsd:int ;




                                                                                       10
Data representation
Digital Enterprise Research Institute                                           www.deri.ie




       <nepomuk:/a_note>
                          a pimo:Note ;
                          nao:prefLabel “holiday plans” ;
                          nao:created “2010-09-16T21:08:54.29Z”ˆˆxsd:dateTime ;
                          nao:lastModified “2010-09-17T10:59:01.58Z”ˆˆxsd:dateTime
       ;
                          nao:numericRating “9”ˆˆxsd:int ;
                          nao:description “<html >... </ html>”ˆˆxsd:string ;



                                                                                         10
Data representation
Digital Enterprise Research Institute                                           www.deri.ie




       <nepomuk:/a_note>
                          a pimo:Note ;
                          nao:prefLabel “holiday plans” ;
                          nao:created “2010-09-16T21:08:54.29Z”ˆˆxsd:dateTime ;
                          nao:lastModified “2010-09-17T10:59:01.58Z”ˆˆxsd:dateTime
       ;
                          nao:numericRating “9”ˆˆxsd:int ;
                          nao:description “<html >... </ html>”ˆˆxsd:string ;
                          nao:hasTag <nepomuk:/res/travel> ;

                                                                                         10
Data representation
Digital Enterprise Research Institute                                           www.deri.ie




       <nepomuk:/a_note>
                          a pimo:Note ;
                          nao:prefLabel “holiday plans” ;
                          nao:created “2010-09-16T21:08:54.29Z”ˆˆxsd:dateTime ;
                          nao:lastModified “2010-09-17T10:59:01.58Z”ˆˆxsd:dateTime
       ;
                          nao:numericRating “9”ˆˆxsd:int ;
                          nao:description “<html >... </ html>”ˆˆxsd:string ;
                          nao:hasTag <nepomuk:/res/travel> ;
                 pimo:isRelated <nepomuk:/res/Rome>,
       <nepomuk:/res/Jane> .                                                             10
Interlinking
Digital Enterprise Research Institute                                www.deri.ie




       Annotation suggestions:
                 Based on the content of the note.
                 Certain types preferred.
                 Preference based on past use and matched length.


       “ ... brian ... “
                 Brian Davis
                 Brian Wall
       “ ... brian davis ... “
                 Brian Davis


                                                                              11
Interlinking algorithm
Digital Enterprise Research Institute                                        www.deri.ie




           Algorithm
                 scan text; identify possible entities
                 for each possible entity find a list of desktop resource
                  candidates
                     – compute score for each possible candidate
                     – filter list by score
                     – sort by score
                 present the candidates to the user
                 create the relation only if the user chooses a resource
Visualisation - HCI
Digital Enterprise Research Institute   www.deri.ie




                                                 12
Visualisation - versions
Digital Enterprise Research Institute   www.deri.ie




                                                 13
Visualisation - versions
Digital Enterprise Research Institute   www.deri.ie




                                                 13
Visualisation - HCI
Digital Enterprise Research Institute   www.deri.ie




                                                 13
Evaluation
Digital Enterprise Research Institute                                       www.deri.ie




         The effort of interlinking lower than the effort spent when searching.



                 Task-based experiment
                 Comparation of SemNotes to Evernote




                                                                                     14
Evaluation
Digital Enterprise Research Institute                                            www.deri.ie



       Experimental setup
                 20 participants
                     – 14 use note-taking regularly
                     – 5 use Evernote in their daily activity
                 Familiar data
                     – 130 contacts
                     – 20 scientific papers
                     – 50 notes
                 8 tasks
                     – 2 tasks - familiarise the participants with the dataset
                     – 6 tasks focused on note-taking, varying the complexity
                 Measurements
                     – Time spent
                     – Mouse clicks
                     – Keystrokes
                                                                                          15
Evaluation
Digital Enterprise Research Institute                                    www.deri.ie



       Tasks
              T1. Find notes tagged with “todo”
              T2. Find to-dos that are related to DERI
              T3. Find a to-do related to a presentation given by John
              T4. Take a note about planning a social event for your group
              T5. Find a note containing minutes from the last meeting
                  about the NICE project. Change the date of the next
                  meeting planned
              T6. Take a note for the action item assigned to you at the last
                  meeting
Evaluation
Digital Enterprise Research Institute                                      www.deri.ie



       Quantitative results
                 Time spent note-taking
                     – no significant differences
                 Time spent searching
                     – SemNotes significantly faster for complex queries
                     – no significant difference for simple queries




                                                                                    16
Evaluation
Digital Enterprise Research Institute                                      www.deri.ie



       Quantitative results
                 Time spent note-taking
                     – no significant differences
                 Time spent searching
                     – SemNotes significantly faster for complex queries
                     – no significant difference for simple queries


       Questionnaire results
                  Faster                Better




                                                                                    16
Evaluation
Digital Enterprise Research Institute                                              www.deri.ie



       Quantitative results


                                        Time                      Clicks
                 Task
                               Avg      Med       t      Avg      Med        t
                   T1           0.5      0      0.152    0.167      0      0.692
                   T2            -8      -8     -2.94    -0.333     -1     -0.48
                   T3         -0.125     1      -0.046   0.857      1      1.426
                   T4         0.063     0.016   0.486    6.067      8      2.026
                   T5        14.357      13     1.713    4.812      2      1.527
                   T6         0.249     0.243   1.004    20.8      12      3.08
But ...
Digital Enterprise Research Institute                       www.deri.ie



       The desktop is not any more the sole repository of
       personal information
                 Social networks
                 Mobile devices
                 Cloud services




                                                                     17
To the Web of Data
Digital Enterprise Research Institute                              www.deri.ie



       Challenges described by Q2 (Q2.1.)
                 find Web aliases of Semantic Desktop resources




                                                                            18
Finding Web Aliases
Digital Enterprise Research Institute                             www.deri.ie



       Web alias
                      = Web resource representing the same
                      real-world entity as the desktop resource




                                                                           19
Finding Web Aliases
Digital Enterprise Research Institute   www.deri.ie



       Different identifiers




                                                 19
Finding Web Aliases
Digital Enterprise Research Institute                              www.deri.ie



       Different identifiers



             nepomuk:/res/Angela



                                        http://angelaonthe.net/foaf/me




                                                                            19
Finding Web Aliases
Digital Enterprise Research Institute   www.deri.ie



       Different vocabularies




                                                 19
Finding Web Aliases
Digital Enterprise Research Institute      www.deri.ie



       The sheer size of the Web of Data




                                                    19
Finding Web Aliases
Digital Enterprise Research Institute      www.deri.ie



       The sheer size of the Web of Data




                                                    19
2 Step approach
Digital Enterprise Research Institute   www.deri.ie



       1. Candidate Selection




                                                 20
2 Step approach
Digital Enterprise Research Institute                       www.deri.ie



       1. Candidate Selection
                 Query various Web of Data sources
                 Identify candidate URIs
                 Retrieve data for each of the candidate




                                                                     20
2 Step approach
Digital Enterprise Research Institute                       www.deri.ie



       1. Candidate Selection
                 Query various Web of Data sources
                 Identify candidate URIs
                 Retrieve data for each of the candidate



       2. Candidate Filtering




                                                                     20
2 Step approach
Digital Enterprise Research Institute                       www.deri.ie



       1. Candidate Selection
                 Query various Web of Data sources
                 Identify candidate URIs
                 Retrieve data for each of the candidate



       2. Candidate Filtering
                 Compute similarity score.
                 Filter the candidates.




                                                                     20
Candidate Selection
Digital Enterprise Research Institute     www.deri.ie



       Determined set of sources
                 Specific requirements
                 Restricted domain



       Semantic search engine
                 Generic domain
                 Unknown data sources




                                                   21
Candidate Selection
Digital Enterprise Research Institute     www.deri.ie



       Determined set of sources
                 Specific requirements
                 Restricted domain



       Semantic search engine
                 Generic domain
                 Unknown data sources




                                                   21
Candidate Filtering
Digital Enterprise Research Institute                             www.deri.ie




          (local, web)




                                    1. Filter by type
                                    2. Compute similarity score
                                    3. Filter by score


                                                        return
                                                         score




                                                                           22
Matching Module
Digital Enterprise Research Institute                                          www.deri.ie




          (local, web)                     Type               No
                                                                    return 0
                                          matching


                                           Yes

                                        Compute score



                                           score ≥
                                          threshold            No

                                                        Yes         return
                                                                     score
Matching Parameters
Digital Enterprise Research Institute                                       www.deri.ie



       String matching (SM)
                 Exact matching versus approximate string matching
                 Koeln vs. Köln


       Weighted properties (WP)
                 Weighted participation of properties in the final score
                 Email address more exact than name


       Multi-valued properties (MVP)
                 All matching values for a property contribute              to
                  the score
                 e.g. Authors' names for a paper
Score Calculation
Digital Enterprise Research Institute                                      www.deri.ie



       Driven by the local data




                                    •weighted sum of matching props
          •score =
                                        •total sum of all weighted props
Evaluation
Digital Enterprise Research Institute            www.deri.ie



       Manually constructed gold standard
                 Data collection
                 Relevance judgements


       IR measures
                 Effect of parameter settings
                 Adjust thresholds




                                                          23
Data collection
Digital Enterprise Research Institute                                    www.deri.ie



       Desktop data
                 50 people – nco:PersonContact
                 50 music albums – nmo:MusicAlbum
                 50 publications – nfo:PaginatedTextDocument
                 11.917 triples


       Web data
                 20 candidates for each desktop resource -> 3000 URIs
                 1.530.686 triples




                                                                                  24
Relevance Judgements
Digital Enterprise Research Institute   www.deri.ie




                                                 25
Relevance Judgements
Digital Enterprise Research Institute   www.deri.ie




   3000 pairs x 3 experts

   Fleiss' K = 0.638  ± 0.214
   Average pairwise agreement 92.252%


                                                 25
IR Measures
Digital Enterprise Research Institute                         www.deri.ie



           MAP
           NDCG
           P@k (k=1,2,3,4,5)



       Baseline:
                 exact match
                 all properties count equally
                 single value considered for each property
Evaluation Results
Digital Enterprise Research Institute                               www.deri.ie



       Approximate string matching
                 improves results for albums and people
                 does not help for publications


        Weights and multiple values
                 when combined improve results for publications,
                  but not for the other types




                                                                             26
Merging the two directions
Digital Enterprise Research Institute             www.deri.ie




                                             2.


                                        1.




                                                           27
A use case
Digital Enterprise Research Institute                                       www.deri.ie




                                        Note              Blog post
            [Semantic] note-taking                    [Semantic] blogging



                                           [Preserve context]
                                           [Preserve privacy]




                                                                                     28
Steps
Digital Enterprise Research Institute                   www.deri.ie




       Transformation
                 On the local side
                 Extension to SemNotes


       Publication
                 On the server side
                 According to Linked Data principles




                                                                 29
Steps
Digital Enterprise Research Institute                   www.deri.ie



       (Note-taking & annotation)
       (Entity matching)

       Transformation
                 On the local side
                 Extension to SemNotes


       Publication
                 On the server side
                 According to Linked Data principles




                                                                 29
Levels and layers
Digital Enterprise Research Institute   www.deri.ie




                                                 30
Ontology level
Digital Enterprise Research Institute                                              www.deri.ie



           Local - Nepomuk ontologies
           Remote – SIOC, FOAF, DC, ...



         pimo:Note                 sioc:Post      nao:prefLabel      rdfs:label
         nao:Tag                   sioct:Tag      nao:created        dcterms:created
         pimo:Person               foaf:Person    nao:lastModified   dcterms:modified
         pimo:Project              doap:Project   nao:hasTag         sioc:topic
         pimo:Event                ical:Vevent    pimo:isRelated     sioc:related_to
Data level
Digital Enterprise Research Institute                          www.deri.ie



           Local – notes, desktop resources (tags included)
           Remote – blog posts, Web resources, tags

       http://semnotes.deri.ie/notes/note/id
       http://semnotes.deri.ie/notes/resource/id
       http://semnotes.deri.ie/notes/tag/label
Application level - local
Digital Enterprise Research Institute                                     www.deri.ie



           Plugin for SemNotes
                 Ask server for server URLs for the new note and resources
                 Replace desktop URIs with the server URLs in the note
                 Add RDFa to the note
                 Push the transformed note to the server
Application level - remote
Digital Enterprise Research Institute             www.deri.ie



           Web server with MySQL, PHP, ARC2
                 Create new URLs for resources
                 Receive and process the note
                 Publish the data online
Published data
Digital Enterprise Research Institute   www.deri.ie




                                                 31
Research answers
Digital Enterprise Research Institute                       www.deri.ie




       Q1. How to buildsemantic applications and tools for the
            Semantic Desktop?

       Q2. How to   expand the scope of the Semantic Desktop
            into the Web of Data?




                                                                     32
Research answers
Digital Enterprise Research Institute                       www.deri.ie




       Q1. How to buildsemantic applications and tools for the
            Semantic Desktop?
                 SemNotes
                     –   Create new data
                     –   Reuse existing data
                     –   HCI
                     –   Evaluation


       Q2. How to   expand the scope of the Semantic Desktop
            into the Web of Data?



                                                                     32
Research answers
Digital Enterprise Research Institute                       www.deri.ie




       Q1. How to buildsemantic applications and tools for the
            Semantic Desktop?
                 SemNotes
                     –   Create new data
                     –   Reuse existing data
                     –   HCI
                     –   Evaluation


       Q2. How to   expand the scope of the Semantic Desktop
            into the Web of Data?
                 Web aliases
                 Semantic blogging use case

                                                                     32
Future work
Digital Enterprise Research Institute                                     www.deri.ie



       Information Extraction algorithms and methods
                 create multiple types of relations based on the text
                 extract new entities from text
                 extract links between entities mentioned in the notes


       Explore visualisations
                 personal data browser


       Large scale user study of semantic personal
       information usage and behaviours



                                                                                   33
Summary
Digital Enterprise Research Institute                                     www.deri.ie




                                                            Web aliases
                                                       2.


                                        1.




                                             + semantic publishing use case

Contenu connexe

Tendances

Lessons and requirements from a decade of deployed Semantic Web apps
Lessons and requirements from a decade of deployed Semantic Web appsLessons and requirements from a decade of deployed Semantic Web apps
Lessons and requirements from a decade of deployed Semantic Web apps
Benjamin Heitmann
 
23625509 internetworking-technologies
23625509 internetworking-technologies23625509 internetworking-technologies
23625509 internetworking-technologies
jayaprakash
 
The Information Advantage - Information Access in Tomorrow's Enterprise
The Information Advantage - Information Access in Tomorrow's EnterpriseThe Information Advantage - Information Access in Tomorrow's Enterprise
The Information Advantage - Information Access in Tomorrow's Enterprise
Elizabeth Lupfer
 

Tendances (20)

Lessons and requirements from a decade of deployed Semantic Web apps
Lessons and requirements from a decade of deployed Semantic Web appsLessons and requirements from a decade of deployed Semantic Web apps
Lessons and requirements from a decade of deployed Semantic Web apps
 
Turning social disputes into knowledge representations DERI reading group 201...
Turning social disputes into knowledge representations DERI reading group 201...Turning social disputes into knowledge representations DERI reading group 201...
Turning social disputes into knowledge representations DERI reading group 201...
 
Implementing Semantic Web applications: reference architecture and challenges
Implementing Semantic Web applications:  reference architecture and challengesImplementing Semantic Web applications:  reference architecture and challenges
Implementing Semantic Web applications: reference architecture and challenges
 
Towards Patient Controlled Privacy
Towards Patient Controlled PrivacyTowards Patient Controlled Privacy
Towards Patient Controlled Privacy
 
Tips for a successful Digital workplace strategy
Tips for a successful Digital workplace strategyTips for a successful Digital workplace strategy
Tips for a successful Digital workplace strategy
 
Enabling Case-Based Reasoning on the Web of Data (How to create a Web of Exp...
Enabling Case-Based Reasoning  on the Web of Data (How to create a Web of Exp...Enabling Case-Based Reasoning  on the Web of Data (How to create a Web of Exp...
Enabling Case-Based Reasoning on the Web of Data (How to create a Web of Exp...
 
23625509 internetworking-technologies
23625509 internetworking-technologies23625509 internetworking-technologies
23625509 internetworking-technologies
 
Federating Distributed Social Data to Build an Interlinked Online Information...
Federating Distributed Social Data to Build an Interlinked Online Information...Federating Distributed Social Data to Build an Interlinked Online Information...
Federating Distributed Social Data to Build an Interlinked Online Information...
 
Caliber 2009 Tutorial Mgsree
Caliber 2009 Tutorial MgsreeCaliber 2009 Tutorial Mgsree
Caliber 2009 Tutorial Mgsree
 
The Information Advantage - Information Access in Tomorrow's Enterprise
The Information Advantage - Information Access in Tomorrow's EnterpriseThe Information Advantage - Information Access in Tomorrow's Enterprise
The Information Advantage - Information Access in Tomorrow's Enterprise
 
Using Semantics to Improve Corporate Online Communities
Using Semantics to Improve Corporate Online CommunitiesUsing Semantics to Improve Corporate Online Communities
Using Semantics to Improve Corporate Online Communities
 
Social network architecture - Part 3. Big data - Machine learning
Social network architecture - Part 3. Big data - Machine learningSocial network architecture - Part 3. Big data - Machine learning
Social network architecture - Part 3. Big data - Machine learning
 
1109 siit jfriedrich v02
1109 siit jfriedrich v021109 siit jfriedrich v02
1109 siit jfriedrich v02
 
The Gnowsis Semantic Desktop approach to Personal Information Management - Di...
The Gnowsis Semantic Desktopapproach to Personal InformationManagement - Di...The Gnowsis Semantic Desktopapproach to Personal InformationManagement - Di...
The Gnowsis Semantic Desktop approach to Personal Information Management - Di...
 
Lgd 2
Lgd 2Lgd 2
Lgd 2
 
Twitter and research impact
Twitter and research impactTwitter and research impact
Twitter and research impact
 
Digital Natives
Digital NativesDigital Natives
Digital Natives
 
Externalization Trend
Externalization TrendExternalization Trend
Externalization Trend
 
Intranet, Web 2.0, Enterprise 2.0
Intranet, Web 2.0, Enterprise 2.0Intranet, Web 2.0, Enterprise 2.0
Intranet, Web 2.0, Enterprise 2.0
 
Understanding Composite Web Applications with SharePoint 2010
Understanding Composite Web Applications with SharePoint 2010Understanding Composite Web Applications with SharePoint 2010
Understanding Composite Web Applications with SharePoint 2010
 

En vedette (7)

Six Sigma Presentation Storybd 07 Mar24
Six Sigma Presentation Storybd 07 Mar24Six Sigma Presentation Storybd 07 Mar24
Six Sigma Presentation Storybd 07 Mar24
 
Six Sigma - A Presentation by Akshay Anand
Six Sigma - A Presentation by Akshay AnandSix Sigma - A Presentation by Akshay Anand
Six Sigma - A Presentation by Akshay Anand
 
Lean manufacturing concepts and tools and quality management1
Lean manufacturing concepts and tools and quality management1Lean manufacturing concepts and tools and quality management1
Lean manufacturing concepts and tools and quality management1
 
Lean Process Improvement Techniques
Lean Process Improvement TechniquesLean Process Improvement Techniques
Lean Process Improvement Techniques
 
Lean Manufacturing PowerPoint Presentation Sample
Lean Manufacturing PowerPoint Presentation SampleLean Manufacturing PowerPoint Presentation Sample
Lean Manufacturing PowerPoint Presentation Sample
 
Lean presentation ppt
Lean presentation pptLean presentation ppt
Lean presentation ppt
 
Six Sigma the best ppt
Six Sigma the best pptSix Sigma the best ppt
Six Sigma the best ppt
 

Similaire à Interlinking Personal Semantic Data on the Semantic Desktop and the Web of Data

Manfred Linking the Real World
Manfred Linking the Real WorldManfred Linking the Real World
Manfred Linking the Real World
sssw2012
 
A Multidimensional Semantic Space for Data Model Independent Queries over RDF...
A Multidimensional Semantic Space for Data Model Independent Queries over RDF...A Multidimensional Semantic Space for Data Model Independent Queries over RDF...
A Multidimensional Semantic Space for Data Model Independent Queries over RDF...
Andre Freitas
 
Web 3.0: The Upcoming Revolution
Web 3.0: The Upcoming RevolutionWeb 3.0: The Upcoming Revolution
Web 3.0: The Upcoming Revolution
Nitin Godawat
 
Sap business objects BI4.0 reporting presentation
Sap business objects BI4.0 reporting presentationSap business objects BI4.0 reporting presentation
Sap business objects BI4.0 reporting presentation
shaktell2
 

Similaire à Interlinking Personal Semantic Data on the Semantic Desktop and the Web of Data (20)

Knowledge management on the desktop
Knowledge management on the desktopKnowledge management on the desktop
Knowledge management on the desktop
 
How to Publish Open Data
How to Publish Open DataHow to Publish Open Data
How to Publish Open Data
 
Future Careers
Future CareersFuture Careers
Future Careers
 
Towards Expertise Modelling for Routing Data Cleaning Tasks within a Communit...
Towards Expertise Modelling for Routing Data Cleaning Tasks within a Communit...Towards Expertise Modelling for Routing Data Cleaning Tasks within a Communit...
Towards Expertise Modelling for Routing Data Cleaning Tasks within a Communit...
 
From research to business: the Web of linked data
From research to business: the Web of linked dataFrom research to business: the Web of linked data
From research to business: the Web of linked data
 
Multi-Source Provenance-Aware User Interest Profiling on the Social Semantic Web
Multi-Source Provenance-Aware User Interest Profiling on the Social Semantic WebMulti-Source Provenance-Aware User Interest Profiling on the Social Semantic Web
Multi-Source Provenance-Aware User Interest Profiling on the Social Semantic Web
 
SharePoint Saturdays_ECM_SCN20_Webinar
SharePoint Saturdays_ECM_SCN20_WebinarSharePoint Saturdays_ECM_SCN20_Webinar
SharePoint Saturdays_ECM_SCN20_Webinar
 
Hello Open World - Semtech 2009
Hello Open World - Semtech 2009Hello Open World - Semtech 2009
Hello Open World - Semtech 2009
 
Computer Applications and Systems - Workshop V
Computer Applications and Systems - Workshop VComputer Applications and Systems - Workshop V
Computer Applications and Systems - Workshop V
 
Towards Social semantic journalism
Towards Social semantic journalismTowards Social semantic journalism
Towards Social semantic journalism
 
Share point user group
Share point user groupShare point user group
Share point user group
 
Service oriented cloud computing
Service oriented cloud computingService oriented cloud computing
Service oriented cloud computing
 
Manfred Linking the Real World
Manfred Linking the Real WorldManfred Linking the Real World
Manfred Linking the Real World
 
Reading UTC Launch Event
Reading UTC Launch EventReading UTC Launch Event
Reading UTC Launch Event
 
A Multidimensional Semantic Space for Data Model Independent Queries over RDF...
A Multidimensional Semantic Space for Data Model Independent Queries over RDF...A Multidimensional Semantic Space for Data Model Independent Queries over RDF...
A Multidimensional Semantic Space for Data Model Independent Queries over RDF...
 
Web 3.0: The Upcoming Revolution
Web 3.0: The Upcoming RevolutionWeb 3.0: The Upcoming Revolution
Web 3.0: The Upcoming Revolution
 
IT skills Important for CA
IT skills Important for CAIT skills Important for CA
IT skills Important for CA
 
Internet of Things, Cloud & Artificial Intelligence
Internet of Things, Cloud & Artificial IntelligenceInternet of Things, Cloud & Artificial Intelligence
Internet of Things, Cloud & Artificial Intelligence
 
Global ai night sept 2019 - Milwaukee
Global ai night sept 2019 - MilwaukeeGlobal ai night sept 2019 - Milwaukee
Global ai night sept 2019 - Milwaukee
 
Sap business objects BI4.0 reporting presentation
Sap business objects BI4.0 reporting presentationSap business objects BI4.0 reporting presentation
Sap business objects BI4.0 reporting presentation
 

Dernier

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Dernier (20)

Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 

Interlinking Personal Semantic Data on the Semantic Desktop and the Web of Data

  • 1. Digital Enterprise Research Institute deri.ie Interlinking Personal Semantic Data on the Desktop and the Web Laura Drǎgan
  • 2. Outline Digital Enterprise Research Institute www.deri.ie // Introduction  Background and motivation  Research questions // Directions and results  Within the Semantic desktop  To the Web of Data  A use case to rule them all // Conclusion  Research answers  Future work 1
  • 3. Outline Digital Enterprise Research Institute www.deri.ie // Introduction  Background and motivation  Research questions // Directions and results  Within the Semantic desktop  To the Web of Data  A use case to rule them all // Conclusion  Research answers  Future work 1
  • 4. Outline Digital Enterprise Research Institute www.deri.ie // Introduction  Background and motivation  Research questions // Directions and results  Within the Semantic desktop  To the Web of Data  A use case to rule them all // Conclusion  Research answers  Future work 1
  • 5. Background Digital Enterprise Research Institute www.deri.ie Personal Information Management 2
  • 6. Background Digital Enterprise Research Institute www.deri.ie Personal Information Management 1962 1968 1945 1965 2
  • 7. Background Digital Enterprise Research Institute www.deri.ie Personal Information Management Web 2
  • 8. Background Digital Enterprise Research Institute www.deri.ie Personal Information Management Semantic Web 2
  • 9. Background Digital Enterprise Research Institute www.deri.ie Personal Information Management Semantic Web Semantic Desktop 2
  • 10. Background Digital Enterprise Research Institute www.deri.ie Personal Information Management Semantic Web Semantic Desktop 2
  • 11. Motivation Digital Enterprise Research Institute www.deri.ie Use the framework provided by the Semantic Desktop to build useful applications and services 3
  • 12. Research questions Digital Enterprise Research Institute www.deri.ie Q1. How to build semantic applications and tools for the Semantic Desktop to provide the best experience for the users, while creating reusable semantic data? 4
  • 13. Research questions Digital Enterprise Research Institute www.deri.ie Q1. How to build semantic applications and tools for the Semantic Desktop to provide the best experience for the users, while creating reusable semantic data? 4
  • 14. Research questions Digital Enterprise Research Institute www.deri.ie Q1. How to buildsemantic applications and tools for the Semantic Desktop? 4
  • 15. Research questions Digital Enterprise Research Institute www.deri.ie Q1. How to buildsemantic applications and tools for the Semantic Desktop? Q2. How to expand the scope of the Semantic Desktop into the realm of the Web of Data, to benefit the users and enhance their experience? 4
  • 16. Research questions Digital Enterprise Research Institute www.deri.ie Q1. How to buildsemantic applications and tools for the Semantic Desktop? Q2. How to expand the scope of the Semantic Desktop into the realm of the Web of Data, to benefit the users and enhance their experience? 4
  • 17. Research questions Digital Enterprise Research Institute www.deri.ie Q1. How to buildsemantic applications and tools for the Semantic Desktop? Q2. How to expand the scope of the Semantic Desktop into the Web of Data? 4
  • 18. Q1 sub-questions Digital Enterprise Research Institute www.deri.ie semantic applications for the Semantic Desktop Q1.1. How to create semantic data that is complete, correct, safe, and provides a high degree of interlinking with the already existing network of semantic data on the desktop? Q1.2. How to reuse existing Semantic Desktop data in an application? Q1.3. How to design the human-computer interaction in an application for the Semantic Desktop? Q1.4. How to correctly evaluate a semantic application? 5
  • 19. Q1 sub-questions Digital Enterprise Research Institute www.deri.ie semantic applications for the Semantic Desktop Q1.1. How to create semantic data that is complete, correct, safe, and provides a high degree of interlinking with the already existing network of semantic data on the desktop? Q1.2. How to reuse existing Semantic Desktop data in an application? Q1.3. How to design the human-computer interaction in an application for the Semantic Desktop? Q1.4. How to correctly evaluate a semantic application? 5
  • 20. Q2 sub-questions Digital Enterprise Research Institute www.deri.ie connect the Semantic Desktop with the Web of Data Q2.1. How to find Web instances representing the same real- world thing described by a Semantic Desktop resource? Q2.2. How to use the Web information which is related to a desktop resource? Q2.3. How to make desktop data available online safely? 6
  • 21. Q2 sub-questions Digital Enterprise Research Institute www.deri.ie connect the Semantic Desktop with the Web of Data Q2.1. How to find Web instances representing the same real-world thing described by a Semantic Desktop resource? Q2.2. How to use the Web information which is related to a desktop resource? Q2.3. How to make desktop data available online safely? 6
  • 22. Directions Digital Enterprise Research Institute www.deri.ie 7
  • 23. Directions Digital Enterprise Research Institute www.deri.ie 1. 7
  • 24. Directions Digital Enterprise Research Institute www.deri.ie 1. 7
  • 25. Directions Digital Enterprise Research Institute www.deri.ie 7
  • 26. Directions Digital Enterprise Research Institute www.deri.ie 2. 7
  • 27. Directions Digital Enterprise Research Institute www.deri.ie 2. 1. 7
  • 28. Within the Semantic Desktop Digital Enterprise Research Institute www.deri.ie 8
  • 29. SemNotes Digital Enterprise Research Institute www.deri.ie Challenges described by Q1  create new semantic data – Data representation – Data management  reuse existing Semantic Desktop data – Interlinking  design the human-computer interaction – Visualisation  correctly evaluate a semantic application – Task-based comparison to Evernote 9
  • 30. Data representation Digital Enterprise Research Institute www.deri.ie <nepomuk:/a_note> 10
  • 31. Data representation Digital Enterprise Research Institute www.deri.ie <nepomuk:/a_note> a pimo:Note ; 10
  • 32. Data representation Digital Enterprise Research Institute www.deri.ie <nepomuk:/a_note> a pimo:Note ; nao:prefLabel "holiday plans" ; 10
  • 33. Data representation Digital Enterprise Research Institute www.deri.ie <nepomuk:/a_note> a pimo:Note ; nao:prefLabel “holiday plans” ; nao:created “2010-09-16T21:08:54.29Z”ˆˆxsd:dateTime ; nao:lastModified “2010-09-17T10:59:01.58Z”ˆˆxsd:dateTime ; nao:numericRating “9”ˆˆxsd:int ; 10
  • 34. Data representation Digital Enterprise Research Institute www.deri.ie <nepomuk:/a_note> a pimo:Note ; nao:prefLabel “holiday plans” ; nao:created “2010-09-16T21:08:54.29Z”ˆˆxsd:dateTime ; nao:lastModified “2010-09-17T10:59:01.58Z”ˆˆxsd:dateTime ; nao:numericRating “9”ˆˆxsd:int ; nao:description “<html >... </ html>”ˆˆxsd:string ; 10
  • 35. Data representation Digital Enterprise Research Institute www.deri.ie <nepomuk:/a_note> a pimo:Note ; nao:prefLabel “holiday plans” ; nao:created “2010-09-16T21:08:54.29Z”ˆˆxsd:dateTime ; nao:lastModified “2010-09-17T10:59:01.58Z”ˆˆxsd:dateTime ; nao:numericRating “9”ˆˆxsd:int ; nao:description “<html >... </ html>”ˆˆxsd:string ; nao:hasTag <nepomuk:/res/travel> ; 10
  • 36. Data representation Digital Enterprise Research Institute www.deri.ie <nepomuk:/a_note> a pimo:Note ; nao:prefLabel “holiday plans” ; nao:created “2010-09-16T21:08:54.29Z”ˆˆxsd:dateTime ; nao:lastModified “2010-09-17T10:59:01.58Z”ˆˆxsd:dateTime ; nao:numericRating “9”ˆˆxsd:int ; nao:description “<html >... </ html>”ˆˆxsd:string ; nao:hasTag <nepomuk:/res/travel> ; pimo:isRelated <nepomuk:/res/Rome>, <nepomuk:/res/Jane> . 10
  • 37. Interlinking Digital Enterprise Research Institute www.deri.ie Annotation suggestions:  Based on the content of the note.  Certain types preferred.  Preference based on past use and matched length. “ ... brian ... “  Brian Davis  Brian Wall “ ... brian davis ... “  Brian Davis 11
  • 38. Interlinking algorithm Digital Enterprise Research Institute www.deri.ie  Algorithm  scan text; identify possible entities  for each possible entity find a list of desktop resource candidates – compute score for each possible candidate – filter list by score – sort by score  present the candidates to the user  create the relation only if the user chooses a resource
  • 39. Visualisation - HCI Digital Enterprise Research Institute www.deri.ie 12
  • 40. Visualisation - versions Digital Enterprise Research Institute www.deri.ie 13
  • 41. Visualisation - versions Digital Enterprise Research Institute www.deri.ie 13
  • 42. Visualisation - HCI Digital Enterprise Research Institute www.deri.ie 13
  • 43. Evaluation Digital Enterprise Research Institute www.deri.ie The effort of interlinking lower than the effort spent when searching.  Task-based experiment  Comparation of SemNotes to Evernote 14
  • 44. Evaluation Digital Enterprise Research Institute www.deri.ie Experimental setup  20 participants – 14 use note-taking regularly – 5 use Evernote in their daily activity  Familiar data – 130 contacts – 20 scientific papers – 50 notes  8 tasks – 2 tasks - familiarise the participants with the dataset – 6 tasks focused on note-taking, varying the complexity  Measurements – Time spent – Mouse clicks – Keystrokes 15
  • 45. Evaluation Digital Enterprise Research Institute www.deri.ie Tasks T1. Find notes tagged with “todo” T2. Find to-dos that are related to DERI T3. Find a to-do related to a presentation given by John T4. Take a note about planning a social event for your group T5. Find a note containing minutes from the last meeting about the NICE project. Change the date of the next meeting planned T6. Take a note for the action item assigned to you at the last meeting
  • 46. Evaluation Digital Enterprise Research Institute www.deri.ie Quantitative results  Time spent note-taking – no significant differences  Time spent searching – SemNotes significantly faster for complex queries – no significant difference for simple queries 16
  • 47. Evaluation Digital Enterprise Research Institute www.deri.ie Quantitative results  Time spent note-taking – no significant differences  Time spent searching – SemNotes significantly faster for complex queries – no significant difference for simple queries Questionnaire results Faster Better 16
  • 48. Evaluation Digital Enterprise Research Institute www.deri.ie Quantitative results Time Clicks Task Avg Med t Avg Med t T1 0.5 0 0.152 0.167 0 0.692 T2 -8 -8 -2.94 -0.333 -1 -0.48 T3 -0.125 1 -0.046 0.857 1 1.426 T4 0.063 0.016 0.486 6.067 8 2.026 T5 14.357 13 1.713 4.812 2 1.527 T6 0.249 0.243 1.004 20.8 12 3.08
  • 49. But ... Digital Enterprise Research Institute www.deri.ie The desktop is not any more the sole repository of personal information  Social networks  Mobile devices  Cloud services 17
  • 50. To the Web of Data Digital Enterprise Research Institute www.deri.ie Challenges described by Q2 (Q2.1.)  find Web aliases of Semantic Desktop resources 18
  • 51. Finding Web Aliases Digital Enterprise Research Institute www.deri.ie Web alias = Web resource representing the same real-world entity as the desktop resource 19
  • 52. Finding Web Aliases Digital Enterprise Research Institute www.deri.ie Different identifiers 19
  • 53. Finding Web Aliases Digital Enterprise Research Institute www.deri.ie Different identifiers nepomuk:/res/Angela http://angelaonthe.net/foaf/me 19
  • 54. Finding Web Aliases Digital Enterprise Research Institute www.deri.ie Different vocabularies 19
  • 55. Finding Web Aliases Digital Enterprise Research Institute www.deri.ie The sheer size of the Web of Data 19
  • 56. Finding Web Aliases Digital Enterprise Research Institute www.deri.ie The sheer size of the Web of Data 19
  • 57. 2 Step approach Digital Enterprise Research Institute www.deri.ie 1. Candidate Selection 20
  • 58. 2 Step approach Digital Enterprise Research Institute www.deri.ie 1. Candidate Selection  Query various Web of Data sources  Identify candidate URIs  Retrieve data for each of the candidate 20
  • 59. 2 Step approach Digital Enterprise Research Institute www.deri.ie 1. Candidate Selection  Query various Web of Data sources  Identify candidate URIs  Retrieve data for each of the candidate 2. Candidate Filtering 20
  • 60. 2 Step approach Digital Enterprise Research Institute www.deri.ie 1. Candidate Selection  Query various Web of Data sources  Identify candidate URIs  Retrieve data for each of the candidate 2. Candidate Filtering  Compute similarity score.  Filter the candidates. 20
  • 61. Candidate Selection Digital Enterprise Research Institute www.deri.ie Determined set of sources  Specific requirements  Restricted domain Semantic search engine  Generic domain  Unknown data sources 21
  • 62. Candidate Selection Digital Enterprise Research Institute www.deri.ie Determined set of sources  Specific requirements  Restricted domain Semantic search engine  Generic domain  Unknown data sources 21
  • 63. Candidate Filtering Digital Enterprise Research Institute www.deri.ie (local, web) 1. Filter by type 2. Compute similarity score 3. Filter by score return score 22
  • 64. Matching Module Digital Enterprise Research Institute www.deri.ie (local, web) Type No return 0 matching Yes Compute score score ≥ threshold No Yes return score
  • 65. Matching Parameters Digital Enterprise Research Institute www.deri.ie String matching (SM)  Exact matching versus approximate string matching  Koeln vs. Köln Weighted properties (WP)  Weighted participation of properties in the final score  Email address more exact than name Multi-valued properties (MVP)  All matching values for a property contribute to the score  e.g. Authors' names for a paper
  • 66. Score Calculation Digital Enterprise Research Institute www.deri.ie Driven by the local data •weighted sum of matching props •score = •total sum of all weighted props
  • 67. Evaluation Digital Enterprise Research Institute www.deri.ie Manually constructed gold standard  Data collection  Relevance judgements IR measures  Effect of parameter settings  Adjust thresholds 23
  • 68. Data collection Digital Enterprise Research Institute www.deri.ie Desktop data  50 people – nco:PersonContact  50 music albums – nmo:MusicAlbum  50 publications – nfo:PaginatedTextDocument  11.917 triples Web data  20 candidates for each desktop resource -> 3000 URIs  1.530.686 triples 24
  • 69. Relevance Judgements Digital Enterprise Research Institute www.deri.ie 25
  • 70. Relevance Judgements Digital Enterprise Research Institute www.deri.ie 3000 pairs x 3 experts Fleiss' K = 0.638  ± 0.214 Average pairwise agreement 92.252% 25
  • 71. IR Measures Digital Enterprise Research Institute www.deri.ie  MAP  NDCG  P@k (k=1,2,3,4,5) Baseline:  exact match  all properties count equally  single value considered for each property
  • 72. Evaluation Results Digital Enterprise Research Institute www.deri.ie Approximate string matching  improves results for albums and people  does not help for publications Weights and multiple values  when combined improve results for publications, but not for the other types 26
  • 73. Merging the two directions Digital Enterprise Research Institute www.deri.ie 2. 1. 27
  • 74. A use case Digital Enterprise Research Institute www.deri.ie Note Blog post [Semantic] note-taking [Semantic] blogging [Preserve context] [Preserve privacy] 28
  • 75. Steps Digital Enterprise Research Institute www.deri.ie Transformation  On the local side  Extension to SemNotes Publication  On the server side  According to Linked Data principles 29
  • 76. Steps Digital Enterprise Research Institute www.deri.ie (Note-taking & annotation) (Entity matching) Transformation  On the local side  Extension to SemNotes Publication  On the server side  According to Linked Data principles 29
  • 77. Levels and layers Digital Enterprise Research Institute www.deri.ie 30
  • 78. Ontology level Digital Enterprise Research Institute www.deri.ie  Local - Nepomuk ontologies  Remote – SIOC, FOAF, DC, ... pimo:Note sioc:Post nao:prefLabel rdfs:label nao:Tag sioct:Tag nao:created dcterms:created pimo:Person foaf:Person nao:lastModified dcterms:modified pimo:Project doap:Project nao:hasTag sioc:topic pimo:Event ical:Vevent pimo:isRelated sioc:related_to
  • 79. Data level Digital Enterprise Research Institute www.deri.ie  Local – notes, desktop resources (tags included)  Remote – blog posts, Web resources, tags http://semnotes.deri.ie/notes/note/id http://semnotes.deri.ie/notes/resource/id http://semnotes.deri.ie/notes/tag/label
  • 80. Application level - local Digital Enterprise Research Institute www.deri.ie  Plugin for SemNotes  Ask server for server URLs for the new note and resources  Replace desktop URIs with the server URLs in the note  Add RDFa to the note  Push the transformed note to the server
  • 81. Application level - remote Digital Enterprise Research Institute www.deri.ie  Web server with MySQL, PHP, ARC2  Create new URLs for resources  Receive and process the note  Publish the data online
  • 82. Published data Digital Enterprise Research Institute www.deri.ie 31
  • 83. Research answers Digital Enterprise Research Institute www.deri.ie Q1. How to buildsemantic applications and tools for the Semantic Desktop? Q2. How to expand the scope of the Semantic Desktop into the Web of Data? 32
  • 84. Research answers Digital Enterprise Research Institute www.deri.ie Q1. How to buildsemantic applications and tools for the Semantic Desktop?  SemNotes – Create new data – Reuse existing data – HCI – Evaluation Q2. How to expand the scope of the Semantic Desktop into the Web of Data? 32
  • 85. Research answers Digital Enterprise Research Institute www.deri.ie Q1. How to buildsemantic applications and tools for the Semantic Desktop?  SemNotes – Create new data – Reuse existing data – HCI – Evaluation Q2. How to expand the scope of the Semantic Desktop into the Web of Data?  Web aliases  Semantic blogging use case 32
  • 86. Future work Digital Enterprise Research Institute www.deri.ie Information Extraction algorithms and methods  create multiple types of relations based on the text  extract new entities from text  extract links between entities mentioned in the notes Explore visualisations  personal data browser Large scale user study of semantic personal information usage and behaviours 33
  • 87. Summary Digital Enterprise Research Institute www.deri.ie Web aliases 2. 1. + semantic publishing use case

Notes de l'éditeur

  1. And then of course the 2 directions can and should and are combined
  2. But the semantic desktop, as efficient as it might become with semantic tools and interconnected data, is no longer the only repository or even the main one some would say of personal data.