SlideShare a Scribd company logo
1 of 18
Download to read offline
SAnno: a unifying framework for semantic annotation

Davide Eynard
IDSIA, 01/06/2010
2

Introduction

  •   S(emantic)Anno(tations)
        • … in Italy, “sanno” also means “they know”


  •   Basic principle: anyone should be able to say anything about
      anything else
        • Well, this should hold in general :-)
        • Actually, in our case it is “anything about any URI”
        • And we would like everyone to say that in a formal way


  •   But first, a little step back in time...




        IDSIA, 01/06/2010   Davide Eynard
3

Participation and semantics




                                               Data




                          Structure


      IDSIA, 01/06/2010        Davide Eynard
4

Sanno's grandfather: Speakinabout [1]

  •   Purpose: produce semantic annotations about named entities
        • When you read “Harry Potter”, is it the book or the movie?


  •   Plays with user gratifications
        • When users annotate a string as matching a specific concept, they
          are shown a list of services/search engines which are related to it

  •   Relies on user provided data:
        • Freebase types
        • User generated search templates, built inside a wiki system




        IDSIA, 01/06/2010   Davide Eynard
5

Sanno's grandfather: Speakinabout [1]




      IDSIA, 01/06/2010   Davide Eynard
6

Sanno's grandfather: Speakinabout [1]




      IDSIA, 01/06/2010   Davide Eynard
7

Sanno's father: RDFMonkey [2]

  •   Purpose: augment browsing experience by providing
      information/services related to the visited URL

  •   Relies on Freebase types
       • … as in SpeakinAbout, but without requiring user interaction
       • Types are found by searching backlinks in Freebase (which topics
         are linking the visited page)

  •   Related services as widgets inside a browser extension
       • The app could load widgets at runtime (from Freebase itself or
         another collaborative system)




       IDSIA, 01/06/2010   Davide Eynard
8

    Sanno's father: RDFMonkey [2]
                                                 Cities
Musical Artists




                                                 Books




             IDSIA, 01/06/2010   Davide Eynard
9

The problem

  •   We already have semantics on the annotation (i.e. Annotea), but
      how can we have semantics within the annotation?

  •   Good starting points:
        • Some participative systems already provide semi-structured
          information (i.e. infoboxes in Wikipedia)
        • Some communities of practice already built their own bottom-up
          way to structure information (i.e. machine tags)
        • Some (relatively new) systems allow, with some additional effort, to
          save information in a structured way almost without requiring users
          to know that (i.e. semantic wikis)


  •   Challenges
        • Provide a shared way to describe annotations coming from
          heterogeneous systems
        • Aggregate this information to provide something new and useful



        IDSIA, 01/06/2010   Davide Eynard
10

SAnno as a framework

  •   Sanno is built up of many different parts, which all together
      provide something (we consider) new and useful
        • An ontology to describe annotations (the “shells” that contain
          metadata about a resource)
        • An ontology describing the types of properties we are already able
          to aggregate
        • A set of conversion tools which are able to translate existing
          annotations from other systems into our notation
        • A system to show the results of the aggregation of different
          annotations
        • A system to manage provenance, authorship, and filters on
          incoming annotations




        IDSIA, 01/06/2010   Davide Eynard
11

The annotations ontology

  •   Every annotation can be considered as a “Post-it”, a piece of
      paper where something is written about something else
        • … you can say things about what is written there, but also about the
          Post-it itself


  •   The annotation is about a resource, it is created by someone in a
      specific date, it comes from a particular annotation system and
      might be connected to a specific community

  •   Main goal: do not reinvent everything from scratch
        • Reuse well-known ontologies such as DC, SIOC, etc.
        • Use named graphs as an alternative to reifications

  •   Start in an easy way: restriction to URLs
        • Also a way to provide instant gratification to users: show
          annotations while they are browsing a website


        IDSIA, 01/06/2010   Davide Eynard
12

The aggregation ontology

  •    Aggregation deals with the contents of the annotation (i.e. The
       triples found in the NG)

  •    Objectives
         • Avoid constraining users to a specific vocabulary for annotations
         • Find a way to collect different annotations and provide something
           new and interesting by aggregating them

       Our approach
         • Properties used inside annotations could be described as belonging
           to families we already know how to deal with
         • Examples: very specific (tags, ratings), more general (transitive
           relations)
         • Properties inside some external vocabulary are mapped as
           subproperties of ours
         • … by whom? High-experience users who have incentives to do this
           (think about users building templates in Wikipedia...)

         IDSIA, 01/06/2010   Davide Eynard
13

Conversion tools

  •     Our worst enemy: the bootstrap
          • who is going to annotate the first resources? I don't have time!


  •     Our best friends: already existing annotation systems
          • why don't we convert existing data to our notation and show the
            advantages of our approach?

       Different families of conversion tools
          • Easy: already existing APIs, with realtime search functionalities
            (i.e. del.icio.us)
          • Medium: conversions from existing structured repositories such as
            SPARQL endpoints (advantage: the conversion is very clean, you
            just need one tool and different CONSTRUCTs)
          • A little harder: Web scraping when no other sources are available




          IDSIA, 01/06/2010   Davide Eynard
14

Annotation client

  •   Actually, two possible clients in our mind:
        • a browser extension which shows annotations while users are
          browsing the Web
        • an independent service which is able to aggregate heterogeneous
          information related to similar resources (i.e. URLs marked as being
          MP3 files)

  •   Filter annotations according to author, date, originating system,
      and community
        • Users should be able to “subscribe” to some annotating
          communities and ignore others
        • System is thought as distributed, as data can come from different,
          unrelated sources




        IDSIA, 01/06/2010   Davide Eynard
15

The prototype

  •   Early annotation ontology
  •   Property families: tag, rating, generically related URI
  •   Conversions from SMW, Delicious
  •   Visualization as a web service + Firefox extension
  •   No subscriptions yet




        IDSIA, 01/06/2010   Davide Eynard
16

The prototype




      IDSIA, 01/06/2010   Davide Eynard
17

The end




                              Thank you! Questions?



 References:
 •   [0] D.Laniado, D.Eynard and M.Colombetti. Using WordNet to turn a folksonomy into a
     hierarchy of concepts. Semantic Web Application and Perspectives 192–201, 2007.
 •   [1] D.Eynard and M.Colombetti. Exploiting User Gratification for Collaborative Semantic
     Annotation. Proceedings of SWUI 2008. April 2008.
 •   [2] D.Eynard. Using semantics and user participation to customize personalization. HP Labs
     Technical Report HPL-2008-197. September 2008.
 •   [3] L.Mazzola, D.Eynard and R.Mazza. GVIS: a framework for graphical mashups of
     heterogeneous sources to support data interpretation. HSI 2010. May 2010.

          IDSIA, 01/06/2010      Davide Eynard
Contact                        Davide Eynard

                                 Tel. 02 2399 4010
                                 Fax 02 2399 3411

                               eynard@elet.polimi.it
                               http://davide.eynard.it
                 Project page @AIRLab: http://airwiki.elet.polimi.it

          Back

More Related Content

Similar to SAnno: a unifying framework for semantic annotation

IASSIT Kansa Presentation
IASSIT Kansa PresentationIASSIT Kansa Presentation
IASSIT Kansa Presentationekansa
 
Thesis summary-arguments-about-deleting-wikipedia-content-paris-2013-04-19
Thesis summary-arguments-about-deleting-wikipedia-content-paris-2013-04-19Thesis summary-arguments-about-deleting-wikipedia-content-paris-2013-04-19
Thesis summary-arguments-about-deleting-wikipedia-content-paris-2013-04-19jodischneider
 
Open source caqdas what is in the box and what is missing
Open source caqdas what is in the box and what is missingOpen source caqdas what is in the box and what is missing
Open source caqdas what is in the box and what is missingMerlien Institute
 
What is DITA? And Is It Right for Your Team or Project?
What is DITA? And Is It Right for Your Team or Project?What is DITA? And Is It Right for Your Team or Project?
What is DITA? And Is It Right for Your Team or Project?Toni Mantych, MA, PMP
 
AICPA Leadership Retreat - Technology
AICPA Leadership Retreat - TechnologyAICPA Leadership Retreat - Technology
AICPA Leadership Retreat - TechnologyASAE
 
OpenGLAM in museums: Linked Open Data and Wikipedia
OpenGLAM in museums: Linked Open Data and WikipediaOpenGLAM in museums: Linked Open Data and Wikipedia
OpenGLAM in museums: Linked Open Data and WikipediaGeorgina Goodlander
 
Geo-annotations in Semantic Digital Libraries
Geo-annotations in Semantic Digital Libraries Geo-annotations in Semantic Digital Libraries
Geo-annotations in Semantic Digital Libraries mdabrowski
 
The Social Semantic Web
The Social Semantic WebThe Social Semantic Web
The Social Semantic WebJohn Breslin
 
Identity - Past Solutions & Current Blockchain Landscape
 Identity - Past Solutions & Current Blockchain Landscape Identity - Past Solutions & Current Blockchain Landscape
Identity - Past Solutions & Current Blockchain Landscapeaatkin1971
 
Introduction to Metadata for IDAH Fellows
Introduction to Metadata for IDAH FellowsIntroduction to Metadata for IDAH Fellows
Introduction to Metadata for IDAH FellowsJenn Riley
 
Web Accessibility and Design
Web Accessibility and DesignWeb Accessibility and Design
Web Accessibility and Designcolinbdclark
 
Intro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsIntro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsJon Voss
 
Repository Federation: Towards Data Interoperability
Repository Federation: Towards Data InteroperabilityRepository Federation: Towards Data Interoperability
Repository Federation: Towards Data InteroperabilityRobert H. McDonald
 

Similar to SAnno: a unifying framework for semantic annotation (20)

IASSIT Kansa Presentation
IASSIT Kansa PresentationIASSIT Kansa Presentation
IASSIT Kansa Presentation
 
Open Science and Identifiers
Open Science and IdentifiersOpen Science and Identifiers
Open Science and Identifiers
 
Thesis summary-arguments-about-deleting-wikipedia-content-paris-2013-04-19
Thesis summary-arguments-about-deleting-wikipedia-content-paris-2013-04-19Thesis summary-arguments-about-deleting-wikipedia-content-paris-2013-04-19
Thesis summary-arguments-about-deleting-wikipedia-content-paris-2013-04-19
 
Open source caqdas what is in the box and what is missing
Open source caqdas what is in the box and what is missingOpen source caqdas what is in the box and what is missing
Open source caqdas what is in the box and what is missing
 
What is DITA? And Is It Right for Your Team or Project?
What is DITA? And Is It Right for Your Team or Project?What is DITA? And Is It Right for Your Team or Project?
What is DITA? And Is It Right for Your Team or Project?
 
AICPA Leadership Retreat - Technology
AICPA Leadership Retreat - TechnologyAICPA Leadership Retreat - Technology
AICPA Leadership Retreat - Technology
 
OpenGLAM in museums: Linked Open Data and Wikipedia
OpenGLAM in museums: Linked Open Data and WikipediaOpenGLAM in museums: Linked Open Data and Wikipedia
OpenGLAM in museums: Linked Open Data and Wikipedia
 
Geo-annotations in Semantic Digital Libraries
Geo-annotations in Semantic Digital Libraries Geo-annotations in Semantic Digital Libraries
Geo-annotations in Semantic Digital Libraries
 
Pln 101
Pln 101Pln 101
Pln 101
 
Creating Order Out of the Chaos
Creating Order Out of the ChaosCreating Order Out of the Chaos
Creating Order Out of the Chaos
 
The Social Semantic Web
The Social Semantic WebThe Social Semantic Web
The Social Semantic Web
 
Shaping our futures: the Social Semantic Web
Shaping our futures: the Social Semantic WebShaping our futures: the Social Semantic Web
Shaping our futures: the Social Semantic Web
 
Identity - Past Solutions & Current Blockchain Landscape
 Identity - Past Solutions & Current Blockchain Landscape Identity - Past Solutions & Current Blockchain Landscape
Identity - Past Solutions & Current Blockchain Landscape
 
C N I20080404
C N I20080404C N I20080404
C N I20080404
 
Torsten Reimer
Torsten ReimerTorsten Reimer
Torsten Reimer
 
Introduction to Metadata for IDAH Fellows
Introduction to Metadata for IDAH FellowsIntroduction to Metadata for IDAH Fellows
Introduction to Metadata for IDAH Fellows
 
Web Accessibility and Design
Web Accessibility and DesignWeb Accessibility and Design
Web Accessibility and Design
 
Intro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & MuseumsIntro to Linked Open Data in Libraries, Archives & Museums
Intro to Linked Open Data in Libraries, Archives & Museums
 
Repository Federation: Towards Data Interoperability
Repository Federation: Towards Data InteroperabilityRepository Federation: Towards Data Interoperability
Repository Federation: Towards Data Interoperability
 
Schema and Identity for Linked Data
Schema and Identity for Linked DataSchema and Identity for Linked Data
Schema and Identity for Linked Data
 

More from Davide Eynard

Building Compatible Bases on Graphs, Images, and Manifolds
Building Compatible Bases on Graphs, Images, and ManifoldsBuilding Compatible Bases on Graphs, Images, and Manifolds
Building Compatible Bases on Graphs, Images, and ManifoldsDavide Eynard
 
Laplacian Colormaps: a framework for structure-preserving color transformations
Laplacian Colormaps: a framework for structure-preserving color transformationsLaplacian Colormaps: a framework for structure-preserving color transformations
Laplacian Colormaps: a framework for structure-preserving color transformationsDavide Eynard
 
Notes on Spectral Clustering
Notes on Spectral ClusteringNotes on Spectral Clustering
Notes on Spectral ClusteringDavide Eynard
 
An integrated approach to discover tag semantics
An integrated approach to discover tag semanticsAn integrated approach to discover tag semantics
An integrated approach to discover tag semanticsDavide Eynard
 
A Virtuous Cycle of Semantics and Participation
A Virtuous Cycle of Semantics and ParticipationA Virtuous Cycle of Semantics and Participation
A Virtuous Cycle of Semantics and ParticipationDavide Eynard
 
ReSearch - Searching for Researchers
ReSearch - Searching for ResearchersReSearch - Searching for Researchers
ReSearch - Searching for ResearchersDavide Eynard
 
PhDLinux: A Linux Crash Course for PhD Students
PhDLinux: A Linux Crash Course for PhD StudentsPhDLinux: A Linux Crash Course for PhD Students
PhDLinux: A Linux Crash Course for PhD StudentsDavide Eynard
 
Exploiting user gratification for collaborative semantic annotation
Exploiting user gratification for collaborative semantic annotationExploiting user gratification for collaborative semantic annotation
Exploiting user gratification for collaborative semantic annotationDavide Eynard
 
Performance Attacks on Intrusion Detection Systems
Performance Attacks on Intrusion Detection SystemsPerformance Attacks on Intrusion Detection Systems
Performance Attacks on Intrusion Detection SystemsDavide Eynard
 
Cracking Codes With Genetic Algorithms
Cracking Codes With Genetic AlgorithmsCracking Codes With Genetic Algorithms
Cracking Codes With Genetic AlgorithmsDavide Eynard
 
Fast algorithms for large scale genome alignment and comparison
Fast algorithms for large scale genome alignment and comparisonFast algorithms for large scale genome alignment and comparison
Fast algorithms for large scale genome alignment and comparisonDavide Eynard
 
Unambiguous Recognizable Two-dimensional Languages
Unambiguous Recognizable Two-dimensional LanguagesUnambiguous Recognizable Two-dimensional Languages
Unambiguous Recognizable Two-dimensional LanguagesDavide Eynard
 
Research on collaborative information sharing systems
Research on collaborative information sharing systemsResearch on collaborative information sharing systems
Research on collaborative information sharing systemsDavide Eynard
 

More from Davide Eynard (15)

Building Compatible Bases on Graphs, Images, and Manifolds
Building Compatible Bases on Graphs, Images, and ManifoldsBuilding Compatible Bases on Graphs, Images, and Manifolds
Building Compatible Bases on Graphs, Images, and Manifolds
 
Laplacian Colormaps: a framework for structure-preserving color transformations
Laplacian Colormaps: a framework for structure-preserving color transformationsLaplacian Colormaps: a framework for structure-preserving color transformations
Laplacian Colormaps: a framework for structure-preserving color transformations
 
Notes on Spectral Clustering
Notes on Spectral ClusteringNotes on Spectral Clustering
Notes on Spectral Clustering
 
An integrated approach to discover tag semantics
An integrated approach to discover tag semanticsAn integrated approach to discover tag semantics
An integrated approach to discover tag semantics
 
A Virtuous Cycle of Semantics and Participation
A Virtuous Cycle of Semantics and ParticipationA Virtuous Cycle of Semantics and Participation
A Virtuous Cycle of Semantics and Participation
 
Talk Hpl
Talk HplTalk Hpl
Talk Hpl
 
ReSearch - Searching for Researchers
ReSearch - Searching for ResearchersReSearch - Searching for Researchers
ReSearch - Searching for Researchers
 
PhDLinux: A Linux Crash Course for PhD Students
PhDLinux: A Linux Crash Course for PhD StudentsPhDLinux: A Linux Crash Course for PhD Students
PhDLinux: A Linux Crash Course for PhD Students
 
Exploiting user gratification for collaborative semantic annotation
Exploiting user gratification for collaborative semantic annotationExploiting user gratification for collaborative semantic annotation
Exploiting user gratification for collaborative semantic annotation
 
Performance Attacks on Intrusion Detection Systems
Performance Attacks on Intrusion Detection SystemsPerformance Attacks on Intrusion Detection Systems
Performance Attacks on Intrusion Detection Systems
 
Cracking Codes With Genetic Algorithms
Cracking Codes With Genetic AlgorithmsCracking Codes With Genetic Algorithms
Cracking Codes With Genetic Algorithms
 
Rewire the Net
Rewire the NetRewire the Net
Rewire the Net
 
Fast algorithms for large scale genome alignment and comparison
Fast algorithms for large scale genome alignment and comparisonFast algorithms for large scale genome alignment and comparison
Fast algorithms for large scale genome alignment and comparison
 
Unambiguous Recognizable Two-dimensional Languages
Unambiguous Recognizable Two-dimensional LanguagesUnambiguous Recognizable Two-dimensional Languages
Unambiguous Recognizable Two-dimensional Languages
 
Research on collaborative information sharing systems
Research on collaborative information sharing systemsResearch on collaborative information sharing systems
Research on collaborative information sharing systems
 

Recently uploaded

"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelDeepika Singh
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWERMadyBayot
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyKhushali Kathiriya
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...apidays
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesrafiqahmad00786416
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbuapidays
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024The Digital Insurer
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Zilliz
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...apidays
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDropbox
 

Recently uploaded (20)

"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot ModelNavi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Navi Mumbai Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
Apidays New York 2024 - The Good, the Bad and the Governed by David O'Neill, ...
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu SubbuApidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
Apidays Singapore 2024 - Modernizing Securities Finance by Madhu Subbu
 
AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024AXA XL - Insurer Innovation Award Americas 2024
AXA XL - Insurer Innovation Award Americas 2024
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
Apidays Singapore 2024 - Scalable LLM APIs for AI and Generative AI Applicati...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 

SAnno: a unifying framework for semantic annotation

  • 1. SAnno: a unifying framework for semantic annotation Davide Eynard IDSIA, 01/06/2010
  • 2. 2 Introduction • S(emantic)Anno(tations) • … in Italy, “sanno” also means “they know” • Basic principle: anyone should be able to say anything about anything else • Well, this should hold in general :-) • Actually, in our case it is “anything about any URI” • And we would like everyone to say that in a formal way • But first, a little step back in time... IDSIA, 01/06/2010 Davide Eynard
  • 3. 3 Participation and semantics Data Structure IDSIA, 01/06/2010 Davide Eynard
  • 4. 4 Sanno's grandfather: Speakinabout [1] • Purpose: produce semantic annotations about named entities • When you read “Harry Potter”, is it the book or the movie? • Plays with user gratifications • When users annotate a string as matching a specific concept, they are shown a list of services/search engines which are related to it • Relies on user provided data: • Freebase types • User generated search templates, built inside a wiki system IDSIA, 01/06/2010 Davide Eynard
  • 5. 5 Sanno's grandfather: Speakinabout [1] IDSIA, 01/06/2010 Davide Eynard
  • 6. 6 Sanno's grandfather: Speakinabout [1] IDSIA, 01/06/2010 Davide Eynard
  • 7. 7 Sanno's father: RDFMonkey [2] • Purpose: augment browsing experience by providing information/services related to the visited URL • Relies on Freebase types • … as in SpeakinAbout, but without requiring user interaction • Types are found by searching backlinks in Freebase (which topics are linking the visited page) • Related services as widgets inside a browser extension • The app could load widgets at runtime (from Freebase itself or another collaborative system) IDSIA, 01/06/2010 Davide Eynard
  • 8. 8 Sanno's father: RDFMonkey [2] Cities Musical Artists Books IDSIA, 01/06/2010 Davide Eynard
  • 9. 9 The problem • We already have semantics on the annotation (i.e. Annotea), but how can we have semantics within the annotation? • Good starting points: • Some participative systems already provide semi-structured information (i.e. infoboxes in Wikipedia) • Some communities of practice already built their own bottom-up way to structure information (i.e. machine tags) • Some (relatively new) systems allow, with some additional effort, to save information in a structured way almost without requiring users to know that (i.e. semantic wikis) • Challenges • Provide a shared way to describe annotations coming from heterogeneous systems • Aggregate this information to provide something new and useful IDSIA, 01/06/2010 Davide Eynard
  • 10. 10 SAnno as a framework • Sanno is built up of many different parts, which all together provide something (we consider) new and useful • An ontology to describe annotations (the “shells” that contain metadata about a resource) • An ontology describing the types of properties we are already able to aggregate • A set of conversion tools which are able to translate existing annotations from other systems into our notation • A system to show the results of the aggregation of different annotations • A system to manage provenance, authorship, and filters on incoming annotations IDSIA, 01/06/2010 Davide Eynard
  • 11. 11 The annotations ontology • Every annotation can be considered as a “Post-it”, a piece of paper where something is written about something else • … you can say things about what is written there, but also about the Post-it itself • The annotation is about a resource, it is created by someone in a specific date, it comes from a particular annotation system and might be connected to a specific community • Main goal: do not reinvent everything from scratch • Reuse well-known ontologies such as DC, SIOC, etc. • Use named graphs as an alternative to reifications • Start in an easy way: restriction to URLs • Also a way to provide instant gratification to users: show annotations while they are browsing a website IDSIA, 01/06/2010 Davide Eynard
  • 12. 12 The aggregation ontology • Aggregation deals with the contents of the annotation (i.e. The triples found in the NG) • Objectives • Avoid constraining users to a specific vocabulary for annotations • Find a way to collect different annotations and provide something new and interesting by aggregating them  Our approach • Properties used inside annotations could be described as belonging to families we already know how to deal with • Examples: very specific (tags, ratings), more general (transitive relations) • Properties inside some external vocabulary are mapped as subproperties of ours • … by whom? High-experience users who have incentives to do this (think about users building templates in Wikipedia...) IDSIA, 01/06/2010 Davide Eynard
  • 13. 13 Conversion tools • Our worst enemy: the bootstrap • who is going to annotate the first resources? I don't have time! • Our best friends: already existing annotation systems • why don't we convert existing data to our notation and show the advantages of our approach?  Different families of conversion tools • Easy: already existing APIs, with realtime search functionalities (i.e. del.icio.us) • Medium: conversions from existing structured repositories such as SPARQL endpoints (advantage: the conversion is very clean, you just need one tool and different CONSTRUCTs) • A little harder: Web scraping when no other sources are available IDSIA, 01/06/2010 Davide Eynard
  • 14. 14 Annotation client • Actually, two possible clients in our mind: • a browser extension which shows annotations while users are browsing the Web • an independent service which is able to aggregate heterogeneous information related to similar resources (i.e. URLs marked as being MP3 files) • Filter annotations according to author, date, originating system, and community • Users should be able to “subscribe” to some annotating communities and ignore others • System is thought as distributed, as data can come from different, unrelated sources IDSIA, 01/06/2010 Davide Eynard
  • 15. 15 The prototype • Early annotation ontology • Property families: tag, rating, generically related URI • Conversions from SMW, Delicious • Visualization as a web service + Firefox extension • No subscriptions yet IDSIA, 01/06/2010 Davide Eynard
  • 16. 16 The prototype IDSIA, 01/06/2010 Davide Eynard
  • 17. 17 The end Thank you! Questions? References: • [0] D.Laniado, D.Eynard and M.Colombetti. Using WordNet to turn a folksonomy into a hierarchy of concepts. Semantic Web Application and Perspectives 192–201, 2007. • [1] D.Eynard and M.Colombetti. Exploiting User Gratification for Collaborative Semantic Annotation. Proceedings of SWUI 2008. April 2008. • [2] D.Eynard. Using semantics and user participation to customize personalization. HP Labs Technical Report HPL-2008-197. September 2008. • [3] L.Mazzola, D.Eynard and R.Mazza. GVIS: a framework for graphical mashups of heterogeneous sources to support data interpretation. HSI 2010. May 2010. IDSIA, 01/06/2010 Davide Eynard
  • 18. Contact Davide Eynard Tel. 02 2399 4010 Fax 02 2399 3411 eynard@elet.polimi.it http://davide.eynard.it Project page @AIRLab: http://airwiki.elet.polimi.it Back