SlideShare une entreprise Scribd logo
1  sur  15
Concept-based, semantic search




Andreas Blumauer
Semantic Web Company
www.semantic-web.at




                  © Semantic Web Company – http://www.semantic-web.at/   1
Content/agenda


1. What means „concept-based“?
2. Concept-tagging
3. Semantic search
  •   Faceted search
  •   Similarity search
4. Semantics as a means for
   ‚interpretation‘
5. Topic pages
6. Three levels of semantic search

             © Semantic Web Company – http://www.semantic-web.at/   2
What is a concept?
                The semiotic triangle



                                                                    Mental model
                                                                    of „A-Class“


                                  concept                              Another
                                                                     mental model
                                                                     of „A-Class“

another
 object



A-Class

A-Klasse   label                                               object
 W 176




             © Semantic Web Company – http://www.semantic-web.at/                   3
Concept-based enterprise vocabulary
                           http://voc.org.com/core/355           http://voc.org.com/core/54


     Vehicle           prefLabel                                                                     prefLabel
  manufacturing                                                                                                    compact car
    company




                                      broader




                                                                               broader
  Daimler-Benz                                                                                                       A-Class


                       prefLabel                           related                                prefLabel (de)
   Daimler AG                                                                                                       A-Klasse

                              http://voc.org.com/core/97         http://voc.org.com/core/176

                                                                                                                     W 176
                                       narrower




                                                                               narrower
 Mercedes-AMG


                       prefLabel                           related                                   prefLabel
       AMG                                                                                                         A 250 Sport

                           http://voc.org.com/core/77                http://voc.org.com/core/44


Each concept      has a unique URI and can have various multi-lingual labels. Additionaly, it can have various types of
semantic relations with other concepts. 4
                                        W3C´s SKOS standard describes a pre-defined set of semantic relations especially
for controlled vocabularies.

                                © Semantic Web Company – http://www.semantic-web.at/                                    4
Concept-tagging vs. Term-tagging


Concept-tagging is done on top             Enterprise vocabulary
of concepts which are already
part of the enterprise
vocabulary, thus contextualised                                                   ‚Term-tags„ become a ‚concept„
and linked to other concepts.                                                   as part of the enterprise vocabulary

Term-tagging means that tags
are extracted from text
(automatically via text mining)
which are not part of the                Concept Tagging
controlled vocabulary yet.
                                                                     --- ------ -                           Term Tagging
Term-tags can be inserted into
the enterprise vocabulary.                                          -- --- ---- -
This extends and refines the                                        ---- ---- ---
vocabulary more and more.
                                                                    ---- --- - --
                                                                    - --- ---- --
                                                                      --- ------
                                                                   Content from CMS


                                  © Semantic Web Company – http://www.semantic-web.at/                            5
Concept-tagging: pre-condition for
                        semantic search


                                                          W 176                     search



--- -- ----- --
                                                              prefLabel
------ ---- ---                                                                 A-Class
------ --- ----
    -- --- --
                                               narrower
                                                                                W 176
A 250 Sport ---- -
 ---- ---- ----
    ---- ---                                                      prefLabel
                                                                              A 250 Sport




                     © Semantic Web Company – http://www.semantic-web.at/                   6
Traditional search methods vs.
                                    semantic search

                                    W 176                                     search
                                                                                           Semantic:
                                                                prefLabel
                                                                                           Can the search phrase
                                                                               A-Class
                                                                                           be found analogously?
Traditional:




                                                     narrower
Can the search phrase                                                          W 176

be found literally
in the document?
                                                                prefLabel
                                                                             A 250 Sport

               --- -- ----- --                                                  --- -- -- --- -
               ------ ---- ---                                                  ----- --- -----
               ------ --- ----                                                   - --- ---- ---
               -- --- --- ----                                                  ----------A 250
               A 250 Sport ---- -                                               Sport ---- -----

                ---- ---- ----                                                  ---- ---- ----
                   ---- ---                                                            ---


                               © Semantic Web Company – http://www.semantic-web.at/                     7
Semantics as a means for
                                     interpretation

Semantics helps to make
different language levels or
                                                      W 176                                    search
various perspectives
comparable.
                                                                   prefLabel
                                                                                   A-Class
Example: Vendors and their
customers quite often talk




                                                        narrower
                                                                                   W 176
different languages. Wrong or
sometimes time-consuming
‚translations„ and                                                 prefLabel
                                                                                 A 250 Sport
interpretations have to be done
by the customers themselves.

Example: The state of
                                                                               ----- --- -----
knowledge of employees can be                                                  - --- ---- ----
quite divergent. Semantics as a                                                 --- --- -A 250
search assistant can serve
especially less experienced                                                    Sport ---- -----
colleagues.                                                                    ---- ---- ----
                                                                                      ---


                                  © Semantic Web Company – http://www.semantic-web.at/                  8
Concept-based high-precision facet
                               classification
#1     ---- --- -- --
      Daimler-Benz -----                                                           Synonyms and hidden labels:
                                                                                   #1 is also classified as ‚Daimler
      - --- ------ ---                                                             AG„ because ‚Daimler-Benz„ is
                                                                                   also (an old) name for ‚Daimler
      - ----- ---- ---                                                             AG„.
      - ---- ------ --
                                                                                   Transitivity:
                                                COMPANY                            #2 is categorized as ‚vehicle
                                                                                   manufacturer„ too, because in

#2 ----- ------ --
                                                                                   our thesaurus ‚AMG„ is narrower
                                                    Vehicle manufacturer (2)       (is part of) of ‚Daimler„ which is a
   - ------ -- ---                                                                 ‚vehicle manufacturer„.

   ---- ---- -----                                            Daimler AG (2)
    ---- ---- ----
        AMG --
                                                                     AMG (1)
   ---- --- ------
          --

Concept-/thesaurus-based facet classification of documents is as precise as the classification scheme
used by the enterprise thesaurus itself. In consideration of all different labels of concepts and their
transitive hierarchical relations, a more precise facet classification can be realised than with
traditional term-based methods.
                                    9

                            © Semantic Web Company – http://www.semantic-web.at/                      9
Similarity search: efficient re-use of
                                existing information

                   Mercedes-AMG                                                         --- -- AMG
                                                       http://voc.org.com/core/77
                                                                                      --- ------ ---
                                         prefLabel                                    ------ -- ----
                          AMG
                                                                                      -- ---- ----- -
                                                                                      -- --A 250 Sport -
                                                                                      --- ----- ----
                                                        http://voc.org.com/core/176
  --- ---- ----- -- ---
   --- -- --- ------ --                                                    A-Class    ---- ---- --- -
  Mercedes-AMG --------                                                               ----- --------
  --- -------- -- W 176                                                    W 176            ----

                                                        narrower
  ---- ----- ---- ----
         ---- ---

                                A 250 Sport

                                              http://voc.org.com/core/44



Content-authors as well as end-users can benefit from similarity search (content recommendation),
e.g. by ‚skim reading„ or by the avoidance of duplicated work. Even if two documents have no words in
common they can be classified as similar when using a concept-based text analysis.
                                    10

                           © Semantic Web Company – http://www.semantic-web.at/                        10
Topic Pages: Mashups for a
                 fast 360O view




                                                                              Articles (twitter, videos etc.) can be retrieved
      Short
                                                                     http:/




                                                                                        from various content sources
description
                                                                       /

   Related
  concepts
                                                                      CMS


Geo search

                                                                      API




                      11

              © Semantic Web Company – http://www.semantic-web.at/    11
Linked Data: complex queries on top of
                      standard technologies

Example: Find industry news which mention countries or regions, in which our export
volume increased by more than 10% over the last 5 years an which mention either one of
our products and/or a competitor.



                                (Federated) SPARQL Queries




             Industry
                                                                          Export statistics
               News



                           12

                   © Semantic Web Company – http://www.semantic-web.at/                       12
Conclusio 1: The three levels of
                                 semantic search


Year in which the    2014                        Semantics is explicitly available via linked knowledge models.
underlying                                       Content from various sources and deparments can be linked and
technology will
                       Linked Data               mashed on top of an explicit meta data layer. Complex queries
be/has been rolled     based search              which use data from many sources can be made by using the
out.                                             standard query language SPARQL.



             2011                     Semantics is explicitly available by using controlled vocabularies
                                      and thesauri. Thesauri are the basis for precise text analysis and
                 Concept-             to build a semantic index. Building knowledge models is
               based search           especially cost-efficient for larger organisations since a more
                                      precise search can be provided.


                            No Standards
    2005                    Semantics is calculated by text analysis. Example: Because
       Term-based           „Dieter Zetsche“ frequently occurs together with „Daimler AG“
                            in a text the algorithm assumes that those two phrases relate
         search             somehow to each other. Term-based methods are less precise
                            than the two from further above.




                              © Semantic Web Company – http://www.semantic-web.at/                         13
Conclusio 2: Explicit metadata layer



                          Data                                  Data
    Research                                                                   Production




                                               Metadata:
•    Stored and processed separately from data
•    Metadata management is part of the enterprise information management strategy




                               Data                            Data
      Marketing/Sales
                                                                                    HR


                        © Semantic Web Company – http://www.semantic-web.at/                14
“Thank you for your time and
                                           please forward any comments
                                           or questions to me to get more
                                           information on our product or
                                           linked data & vocabularies!”
Andreas Blumauer
Managing Partner
a.blumauer@semantic-web.at



Semantic Web Company GmbH                                http://www.semantic-web.at/
Mariahilfer Strasse 70/8                                          http://poolparty.biz
1070 Vienna
Austria                                       http://twitter.com/semwebcompany


                © Semantic Web Company – http://www.semantic-web.at/          15
                                                                              15

Contenu connexe

Similaire à Concept based semantic search

re:Invent CON320 Tracing and Debugging for Containerized Services
re:Invent CON320 Tracing and Debugging for Containerized Servicesre:Invent CON320 Tracing and Debugging for Containerized Services
re:Invent CON320 Tracing and Debugging for Containerized ServicesCalvin French-Owen
 
Build an ETL Pipeline to Analyze Customer Data (AIM416) - AWS re:Invent 2018
Build an ETL Pipeline to Analyze Customer Data (AIM416) - AWS re:Invent 2018Build an ETL Pipeline to Analyze Customer Data (AIM416) - AWS re:Invent 2018
Build an ETL Pipeline to Analyze Customer Data (AIM416) - AWS re:Invent 2018Amazon Web Services
 
Mcl345 re invent_sagemaker_dmbanga
Mcl345 re invent_sagemaker_dmbangaMcl345 re invent_sagemaker_dmbanga
Mcl345 re invent_sagemaker_dmbangaDan Romuald Mbanga
 
CI/CD Pipeline Security: Advanced Continuous Delivery Recommendations
CI/CD Pipeline Security: Advanced Continuous Delivery RecommendationsCI/CD Pipeline Security: Advanced Continuous Delivery Recommendations
CI/CD Pipeline Security: Advanced Continuous Delivery RecommendationsAmazon Web Services
 
NEW LAUNCH! Integrating Amazon SageMaker into your Enterprise - MCL345 - re:I...
NEW LAUNCH! Integrating Amazon SageMaker into your Enterprise - MCL345 - re:I...NEW LAUNCH! Integrating Amazon SageMaker into your Enterprise - MCL345 - re:I...
NEW LAUNCH! Integrating Amazon SageMaker into your Enterprise - MCL345 - re:I...Amazon Web Services
 
Add Intelligence to Applications with AWS ML Services
Add Intelligence to Applications with AWS ML ServicesAdd Intelligence to Applications with AWS ML Services
Add Intelligence to Applications with AWS ML ServicesAmazon Web Services
 
Add Intelligence to Applications with AWS ML Services: Machine Learning Week ...
Add Intelligence to Applications with AWS ML Services: Machine Learning Week ...Add Intelligence to Applications with AWS ML Services: Machine Learning Week ...
Add Intelligence to Applications with AWS ML Services: Machine Learning Week ...Amazon Web Services
 
AWS Machine Learning Week SF: Build Intelligent Applications with AWS ML Serv...
AWS Machine Learning Week SF: Build Intelligent Applications with AWS ML Serv...AWS Machine Learning Week SF: Build Intelligent Applications with AWS ML Serv...
AWS Machine Learning Week SF: Build Intelligent Applications with AWS ML Serv...Amazon Web Services
 
WordPress Optimization & Security - ThinkVisibility 2012, Leeds
WordPress Optimization & Security - ThinkVisibility 2012, LeedsWordPress Optimization & Security - ThinkVisibility 2012, Leeds
WordPress Optimization & Security - ThinkVisibility 2012, LeedsBastian Grimm
 
CI/CD for Your Machine Learning Pipeline with Amazon SageMaker (DVC303) - AWS...
CI/CD for Your Machine Learning Pipeline with Amazon SageMaker (DVC303) - AWS...CI/CD for Your Machine Learning Pipeline with Amazon SageMaker (DVC303) - AWS...
CI/CD for Your Machine Learning Pipeline with Amazon SageMaker (DVC303) - AWS...Amazon Web Services
 
Seo training-2010-100818134052-phpapp02
Seo training-2010-100818134052-phpapp02Seo training-2010-100818134052-phpapp02
Seo training-2010-100818134052-phpapp02Javier Romero Alonso
 
SKOS: Building taxonomies with minimum ontological commitment
SKOS: Building taxonomies  with minimum ontological commitmentSKOS: Building taxonomies  with minimum ontological commitment
SKOS: Building taxonomies with minimum ontological commitmentSzymon Klarman
 
ABD215_Serverless Data Prep with AWS Glue
ABD215_Serverless Data Prep with AWS GlueABD215_Serverless Data Prep with AWS Glue
ABD215_Serverless Data Prep with AWS GlueAmazon Web Services
 
ABD215_Serverless Data Prep with AWS Glue
ABD215_Serverless Data Prep with AWS GlueABD215_Serverless Data Prep with AWS Glue
ABD215_Serverless Data Prep with AWS GlueAmazon Web Services
 

Similaire à Concept based semantic search (17)

re:Invent CON320 Tracing and Debugging for Containerized Services
re:Invent CON320 Tracing and Debugging for Containerized Servicesre:Invent CON320 Tracing and Debugging for Containerized Services
re:Invent CON320 Tracing and Debugging for Containerized Services
 
Build an ETL Pipeline to Analyze Customer Data (AIM416) - AWS re:Invent 2018
Build an ETL Pipeline to Analyze Customer Data (AIM416) - AWS re:Invent 2018Build an ETL Pipeline to Analyze Customer Data (AIM416) - AWS re:Invent 2018
Build an ETL Pipeline to Analyze Customer Data (AIM416) - AWS re:Invent 2018
 
Mcl345 re invent_sagemaker_dmbanga
Mcl345 re invent_sagemaker_dmbangaMcl345 re invent_sagemaker_dmbanga
Mcl345 re invent_sagemaker_dmbanga
 
Guidelines HTML5 & CSS3 - Atlogys (2018)
Guidelines HTML5 & CSS3 - Atlogys (2018)Guidelines HTML5 & CSS3 - Atlogys (2018)
Guidelines HTML5 & CSS3 - Atlogys (2018)
 
CI/CD Pipeline Security: Advanced Continuous Delivery Recommendations
CI/CD Pipeline Security: Advanced Continuous Delivery RecommendationsCI/CD Pipeline Security: Advanced Continuous Delivery Recommendations
CI/CD Pipeline Security: Advanced Continuous Delivery Recommendations
 
NEW LAUNCH! Integrating Amazon SageMaker into your Enterprise - MCL345 - re:I...
NEW LAUNCH! Integrating Amazon SageMaker into your Enterprise - MCL345 - re:I...NEW LAUNCH! Integrating Amazon SageMaker into your Enterprise - MCL345 - re:I...
NEW LAUNCH! Integrating Amazon SageMaker into your Enterprise - MCL345 - re:I...
 
Add Intelligence to Applications with AWS ML Services
Add Intelligence to Applications with AWS ML ServicesAdd Intelligence to Applications with AWS ML Services
Add Intelligence to Applications with AWS ML Services
 
Add Intelligence to Applications with AWS ML Services: Machine Learning Week ...
Add Intelligence to Applications with AWS ML Services: Machine Learning Week ...Add Intelligence to Applications with AWS ML Services: Machine Learning Week ...
Add Intelligence to Applications with AWS ML Services: Machine Learning Week ...
 
AWS Machine Learning Week SF: Build Intelligent Applications with AWS ML Serv...
AWS Machine Learning Week SF: Build Intelligent Applications with AWS ML Serv...AWS Machine Learning Week SF: Build Intelligent Applications with AWS ML Serv...
AWS Machine Learning Week SF: Build Intelligent Applications with AWS ML Serv...
 
WordPress Optimization & Security - ThinkVisibility 2012, Leeds
WordPress Optimization & Security - ThinkVisibility 2012, LeedsWordPress Optimization & Security - ThinkVisibility 2012, Leeds
WordPress Optimization & Security - ThinkVisibility 2012, Leeds
 
CI/CD for Your Machine Learning Pipeline with Amazon SageMaker (DVC303) - AWS...
CI/CD for Your Machine Learning Pipeline with Amazon SageMaker (DVC303) - AWS...CI/CD for Your Machine Learning Pipeline with Amazon SageMaker (DVC303) - AWS...
CI/CD for Your Machine Learning Pipeline with Amazon SageMaker (DVC303) - AWS...
 
Seo training-2010-100818134052-phpapp02
Seo training-2010-100818134052-phpapp02Seo training-2010-100818134052-phpapp02
Seo training-2010-100818134052-phpapp02
 
Seo training
Seo trainingSeo training
Seo training
 
SEO Introduction
SEO IntroductionSEO Introduction
SEO Introduction
 
SKOS: Building taxonomies with minimum ontological commitment
SKOS: Building taxonomies  with minimum ontological commitmentSKOS: Building taxonomies  with minimum ontological commitment
SKOS: Building taxonomies with minimum ontological commitment
 
ABD215_Serverless Data Prep with AWS Glue
ABD215_Serverless Data Prep with AWS GlueABD215_Serverless Data Prep with AWS Glue
ABD215_Serverless Data Prep with AWS Glue
 
ABD215_Serverless Data Prep with AWS Glue
ABD215_Serverless Data Prep with AWS GlueABD215_Serverless Data Prep with AWS Glue
ABD215_Serverless Data Prep with AWS Glue
 

Plus de Semantic Web Company

How Enterprise Architecture & Knowledge Graph Technologies Can Scale Business...
How Enterprise Architecture & Knowledge Graph Technologies Can Scale Business...How Enterprise Architecture & Knowledge Graph Technologies Can Scale Business...
How Enterprise Architecture & Knowledge Graph Technologies Can Scale Business...Semantic Web Company
 
Introduction to Knowledge Graphs and Semantic AI
Introduction to Knowledge Graphs and Semantic AIIntroduction to Knowledge Graphs and Semantic AI
Introduction to Knowledge Graphs and Semantic AISemantic Web Company
 
Deep Text Analytics - How to extract hidden information and aboutness from text
Deep Text Analytics - How to extract hidden information and aboutness from textDeep Text Analytics - How to extract hidden information and aboutness from text
Deep Text Analytics - How to extract hidden information and aboutness from textSemantic Web Company
 
Leveraging Knowledge Graphs in your Enterprise Knowledge Management System
Leveraging Knowledge Graphs in your Enterprise Knowledge Management SystemLeveraging Knowledge Graphs in your Enterprise Knowledge Management System
Leveraging Knowledge Graphs in your Enterprise Knowledge Management SystemSemantic Web Company
 
Linking SharePoint Documents with Structured Data
Linking SharePoint Documents with Structured DataLinking SharePoint Documents with Structured Data
Linking SharePoint Documents with Structured DataSemantic Web Company
 
The Fast Track to Knowledge Engineering
The Fast Track to Knowledge EngineeringThe Fast Track to Knowledge Engineering
The Fast Track to Knowledge EngineeringSemantic Web Company
 
Leveraging Taxonomy Management with Machine Learning
Leveraging Taxonomy Management with Machine LearningLeveraging Taxonomy Management with Machine Learning
Leveraging Taxonomy Management with Machine LearningSemantic Web Company
 
PoolParty GraphSearch - The Fusion of Search, Recommendation and Analytics
PoolParty GraphSearch - The Fusion of Search, Recommendation and AnalyticsPoolParty GraphSearch - The Fusion of Search, Recommendation and Analytics
PoolParty GraphSearch - The Fusion of Search, Recommendation and AnalyticsSemantic Web Company
 
Semantics as the Basis of Advanced Cognitive Computing
Semantics as the Basis of Advanced Cognitive ComputingSemantics as the Basis of Advanced Cognitive Computing
Semantics as the Basis of Advanced Cognitive ComputingSemantic Web Company
 
PoolParty 6.0 - Climbing the Semantic Ladder
PoolParty 6.0 - Climbing the Semantic LadderPoolParty 6.0 - Climbing the Semantic Ladder
PoolParty 6.0 - Climbing the Semantic LadderSemantic Web Company
 
PoolParty Semantic Suite - Release 6.0 (Technical Overview)
PoolParty Semantic Suite - Release 6.0 (Technical Overview)PoolParty Semantic Suite - Release 6.0 (Technical Overview)
PoolParty Semantic Suite - Release 6.0 (Technical Overview)Semantic Web Company
 
Taxonomies and Ontologies – The Yin and Yang of Knowledge Modelling
Taxonomies and Ontologies – The Yin and Yang of Knowledge ModellingTaxonomies and Ontologies – The Yin and Yang of Knowledge Modelling
Taxonomies and Ontologies – The Yin and Yang of Knowledge ModellingSemantic Web Company
 
PROPEL . Austrian's Roadmap for Enterprise Linked Data
PROPEL . Austrian's Roadmap for Enterprise Linked DataPROPEL . Austrian's Roadmap for Enterprise Linked Data
PROPEL . Austrian's Roadmap for Enterprise Linked DataSemantic Web Company
 

Plus de Semantic Web Company (20)

How Enterprise Architecture & Knowledge Graph Technologies Can Scale Business...
How Enterprise Architecture & Knowledge Graph Technologies Can Scale Business...How Enterprise Architecture & Knowledge Graph Technologies Can Scale Business...
How Enterprise Architecture & Knowledge Graph Technologies Can Scale Business...
 
Introduction to Knowledge Graphs and Semantic AI
Introduction to Knowledge Graphs and Semantic AIIntroduction to Knowledge Graphs and Semantic AI
Introduction to Knowledge Graphs and Semantic AI
 
Deep Text Analytics - How to extract hidden information and aboutness from text
Deep Text Analytics - How to extract hidden information and aboutness from textDeep Text Analytics - How to extract hidden information and aboutness from text
Deep Text Analytics - How to extract hidden information and aboutness from text
 
Leveraging Knowledge Graphs in your Enterprise Knowledge Management System
Leveraging Knowledge Graphs in your Enterprise Knowledge Management SystemLeveraging Knowledge Graphs in your Enterprise Knowledge Management System
Leveraging Knowledge Graphs in your Enterprise Knowledge Management System
 
Linking SharePoint Documents with Structured Data
Linking SharePoint Documents with Structured DataLinking SharePoint Documents with Structured Data
Linking SharePoint Documents with Structured Data
 
The Fast Track to Knowledge Engineering
The Fast Track to Knowledge EngineeringThe Fast Track to Knowledge Engineering
The Fast Track to Knowledge Engineering
 
Semantic AI
Semantic AISemantic AI
Semantic AI
 
BrightTALK - Semantic AI
BrightTALK - Semantic AI BrightTALK - Semantic AI
BrightTALK - Semantic AI
 
PoolParty Semantic Classifier
PoolParty Semantic ClassifierPoolParty Semantic Classifier
PoolParty Semantic Classifier
 
Leveraging Taxonomy Management with Machine Learning
Leveraging Taxonomy Management with Machine LearningLeveraging Taxonomy Management with Machine Learning
Leveraging Taxonomy Management with Machine Learning
 
Taxonomies put in the right place
Taxonomies put in the right placeTaxonomies put in the right place
Taxonomies put in the right place
 
PoolParty GraphSearch - The Fusion of Search, Recommendation and Analytics
PoolParty GraphSearch - The Fusion of Search, Recommendation and AnalyticsPoolParty GraphSearch - The Fusion of Search, Recommendation and Analytics
PoolParty GraphSearch - The Fusion of Search, Recommendation and Analytics
 
Semantics as the Basis of Advanced Cognitive Computing
Semantics as the Basis of Advanced Cognitive ComputingSemantics as the Basis of Advanced Cognitive Computing
Semantics as the Basis of Advanced Cognitive Computing
 
Structured Content Meets Taxonomy
Structured Content Meets TaxonomyStructured Content Meets Taxonomy
Structured Content Meets Taxonomy
 
PoolParty 6.0 - Climbing the Semantic Ladder
PoolParty 6.0 - Climbing the Semantic LadderPoolParty 6.0 - Climbing the Semantic Ladder
PoolParty 6.0 - Climbing the Semantic Ladder
 
PoolParty Semantic Suite - Release 6.0 (Technical Overview)
PoolParty Semantic Suite - Release 6.0 (Technical Overview)PoolParty Semantic Suite - Release 6.0 (Technical Overview)
PoolParty Semantic Suite - Release 6.0 (Technical Overview)
 
Taxonomies and Ontologies – The Yin and Yang of Knowledge Modelling
Taxonomies and Ontologies – The Yin and Yang of Knowledge ModellingTaxonomies and Ontologies – The Yin and Yang of Knowledge Modelling
Taxonomies and Ontologies – The Yin and Yang of Knowledge Modelling
 
PROPEL . Austrian's Roadmap for Enterprise Linked Data
PROPEL . Austrian's Roadmap for Enterprise Linked DataPROPEL . Austrian's Roadmap for Enterprise Linked Data
PROPEL . Austrian's Roadmap for Enterprise Linked Data
 
Taxonomy Quality Assessment
Taxonomy Quality AssessmentTaxonomy Quality Assessment
Taxonomy Quality Assessment
 
Taxonomy-Driven UX
Taxonomy-Driven UXTaxonomy-Driven UX
Taxonomy-Driven UX
 

Concept based semantic search

  • 1. Concept-based, semantic search Andreas Blumauer Semantic Web Company www.semantic-web.at © Semantic Web Company – http://www.semantic-web.at/ 1
  • 2. Content/agenda 1. What means „concept-based“? 2. Concept-tagging 3. Semantic search • Faceted search • Similarity search 4. Semantics as a means for ‚interpretation‘ 5. Topic pages 6. Three levels of semantic search © Semantic Web Company – http://www.semantic-web.at/ 2
  • 3. What is a concept? The semiotic triangle Mental model of „A-Class“ concept Another mental model of „A-Class“ another object A-Class A-Klasse label object W 176 © Semantic Web Company – http://www.semantic-web.at/ 3
  • 4. Concept-based enterprise vocabulary http://voc.org.com/core/355 http://voc.org.com/core/54 Vehicle prefLabel prefLabel manufacturing compact car company broader broader Daimler-Benz A-Class prefLabel related prefLabel (de) Daimler AG A-Klasse http://voc.org.com/core/97 http://voc.org.com/core/176 W 176 narrower narrower Mercedes-AMG prefLabel related prefLabel AMG A 250 Sport http://voc.org.com/core/77 http://voc.org.com/core/44 Each concept has a unique URI and can have various multi-lingual labels. Additionaly, it can have various types of semantic relations with other concepts. 4 W3C´s SKOS standard describes a pre-defined set of semantic relations especially for controlled vocabularies. © Semantic Web Company – http://www.semantic-web.at/ 4
  • 5. Concept-tagging vs. Term-tagging Concept-tagging is done on top Enterprise vocabulary of concepts which are already part of the enterprise vocabulary, thus contextualised ‚Term-tags„ become a ‚concept„ and linked to other concepts. as part of the enterprise vocabulary Term-tagging means that tags are extracted from text (automatically via text mining) which are not part of the Concept Tagging controlled vocabulary yet. --- ------ - Term Tagging Term-tags can be inserted into the enterprise vocabulary. -- --- ---- - This extends and refines the ---- ---- --- vocabulary more and more. ---- --- - -- - --- ---- -- --- ------ Content from CMS © Semantic Web Company – http://www.semantic-web.at/ 5
  • 6. Concept-tagging: pre-condition for semantic search W 176 search --- -- ----- -- prefLabel ------ ---- --- A-Class ------ --- ---- -- --- -- narrower W 176 A 250 Sport ---- - ---- ---- ---- ---- --- prefLabel A 250 Sport © Semantic Web Company – http://www.semantic-web.at/ 6
  • 7. Traditional search methods vs. semantic search W 176 search Semantic: prefLabel Can the search phrase A-Class be found analogously? Traditional: narrower Can the search phrase W 176 be found literally in the document? prefLabel A 250 Sport --- -- ----- -- --- -- -- --- - ------ ---- --- ----- --- ----- ------ --- ---- - --- ---- --- -- --- --- ---- ----------A 250 A 250 Sport ---- - Sport ---- ----- ---- ---- ---- ---- ---- ---- ---- --- --- © Semantic Web Company – http://www.semantic-web.at/ 7
  • 8. Semantics as a means for interpretation Semantics helps to make different language levels or W 176 search various perspectives comparable. prefLabel A-Class Example: Vendors and their customers quite often talk narrower W 176 different languages. Wrong or sometimes time-consuming ‚translations„ and prefLabel A 250 Sport interpretations have to be done by the customers themselves. Example: The state of ----- --- ----- knowledge of employees can be - --- ---- ---- quite divergent. Semantics as a --- --- -A 250 search assistant can serve especially less experienced Sport ---- ----- colleagues. ---- ---- ---- --- © Semantic Web Company – http://www.semantic-web.at/ 8
  • 9. Concept-based high-precision facet classification #1 ---- --- -- -- Daimler-Benz ----- Synonyms and hidden labels: #1 is also classified as ‚Daimler - --- ------ --- AG„ because ‚Daimler-Benz„ is also (an old) name for ‚Daimler - ----- ---- --- AG„. - ---- ------ -- Transitivity: COMPANY #2 is categorized as ‚vehicle manufacturer„ too, because in #2 ----- ------ -- our thesaurus ‚AMG„ is narrower Vehicle manufacturer (2) (is part of) of ‚Daimler„ which is a - ------ -- --- ‚vehicle manufacturer„. ---- ---- ----- Daimler AG (2) ---- ---- ---- AMG -- AMG (1) ---- --- ------ -- Concept-/thesaurus-based facet classification of documents is as precise as the classification scheme used by the enterprise thesaurus itself. In consideration of all different labels of concepts and their transitive hierarchical relations, a more precise facet classification can be realised than with traditional term-based methods. 9 © Semantic Web Company – http://www.semantic-web.at/ 9
  • 10. Similarity search: efficient re-use of existing information Mercedes-AMG --- -- AMG http://voc.org.com/core/77 --- ------ --- prefLabel ------ -- ---- AMG -- ---- ----- - -- --A 250 Sport - --- ----- ---- http://voc.org.com/core/176 --- ---- ----- -- --- --- -- --- ------ -- A-Class ---- ---- --- - Mercedes-AMG -------- ----- -------- --- -------- -- W 176 W 176 ---- narrower ---- ----- ---- ---- ---- --- A 250 Sport http://voc.org.com/core/44 Content-authors as well as end-users can benefit from similarity search (content recommendation), e.g. by ‚skim reading„ or by the avoidance of duplicated work. Even if two documents have no words in common they can be classified as similar when using a concept-based text analysis. 10 © Semantic Web Company – http://www.semantic-web.at/ 10
  • 11. Topic Pages: Mashups for a fast 360O view Articles (twitter, videos etc.) can be retrieved Short http:/ from various content sources description / Related concepts CMS Geo search API 11 © Semantic Web Company – http://www.semantic-web.at/ 11
  • 12. Linked Data: complex queries on top of standard technologies Example: Find industry news which mention countries or regions, in which our export volume increased by more than 10% over the last 5 years an which mention either one of our products and/or a competitor. (Federated) SPARQL Queries Industry Export statistics News 12 © Semantic Web Company – http://www.semantic-web.at/ 12
  • 13. Conclusio 1: The three levels of semantic search Year in which the 2014 Semantics is explicitly available via linked knowledge models. underlying Content from various sources and deparments can be linked and technology will Linked Data mashed on top of an explicit meta data layer. Complex queries be/has been rolled based search which use data from many sources can be made by using the out. standard query language SPARQL. 2011 Semantics is explicitly available by using controlled vocabularies and thesauri. Thesauri are the basis for precise text analysis and Concept- to build a semantic index. Building knowledge models is based search especially cost-efficient for larger organisations since a more precise search can be provided. No Standards 2005 Semantics is calculated by text analysis. Example: Because Term-based „Dieter Zetsche“ frequently occurs together with „Daimler AG“ in a text the algorithm assumes that those two phrases relate search somehow to each other. Term-based methods are less precise than the two from further above. © Semantic Web Company – http://www.semantic-web.at/ 13
  • 14. Conclusio 2: Explicit metadata layer Data Data Research Production Metadata: • Stored and processed separately from data • Metadata management is part of the enterprise information management strategy Data Data Marketing/Sales HR © Semantic Web Company – http://www.semantic-web.at/ 14
  • 15. “Thank you for your time and please forward any comments or questions to me to get more information on our product or linked data & vocabularies!” Andreas Blumauer Managing Partner a.blumauer@semantic-web.at Semantic Web Company GmbH http://www.semantic-web.at/ Mariahilfer Strasse 70/8 http://poolparty.biz 1070 Vienna Austria http://twitter.com/semwebcompany © Semantic Web Company – http://www.semantic-web.at/ 15 15