SlideShare une entreprise Scribd logo
1  sur  64
• Barbara Starr ( ) 
– Basics of What semantic search is, what tools 
and techniques are used 
• Bill Slawski ( ) 
– Strategy for SEO 
– Case based examples and analysis
• Pursued a doctorate in Artificial Intelligence from 
South Africa in the 80's. 
• Recruited to build intelligent/predictive trading 
systems on Wall Street 
• Migrated to government-based contracts, several 
of which turned into real world products like 
– SIRI (PAL from DARPA) 
– WATSON (Acquaint - IBM Watson Labs was 
a team member) 
• From the vantage of a semantic technologist, I 
keenly watched the evolution of the Semantic Web. 
• “Shocked into the real world” when working as a 
consultant @ Overstock. 
– Rdfa on 900,000 item pages 2 days before Google adopted it 
– UPC and identifier “miner” 
• Today – Consultant for companies such as GS1 
US, Columnist, Strategist, …
• Primitive UI – Hunt and Peck
Primarily Stochastic in nature
• Based on concept of “citations” and very easily gamed 
• Probabilistic or Statistical (Not Symbolic) 
• Keyword Based Search Engine (Not Concept Based or 
Ontology Based) 
• “link juice” ? 
• Other odd vernacular that 
became standard jargon in the 
“SEO” community
SIRI 
“Amazing fact: same amount 
of computing to answer one 
Google Search query as all the 
computing done – 
in flight and on the ground 
-- for the entire Apollo program!” 
“Moore's law is the observation 
that, over the history of 
computing hardware, the 
number of transistors in a 
dense integrated circuit doubles 
approximately every two years”” 
Source: Wikipedia
“A new form of Web 
content that is meaningful 
to computers will unleash a 
revolution of new 
possibilities” 
• Tim Berners Lee 
• James Hendler 
• Ora Lassila 
http://www.cs.umd.edu/~golbeck/LBSC690/SemanticWeb.html
What they want 
When they want it (Now) 
Accurate (Reliable & Informative) 
Available 
Search engines must satisfy consumer needs, else:
“Def. Semantic Search is any retrieval method where 
– User intent and resources are represented in a semantic model 
• A set of concepts or topics that generalize over tokens/phrases 
• Additional structure such as a hierarchy among concepts, relationships among 
concepts etc. 
– Semantic representations of the query and the user intent are exploited 
in some part of the retrieval process” 
Peter Mika, Sr. Research Scientist, Yahoo Labs ⎪ June 19, 2014
Inevitable passage of 
Semantic Web adoption 
(or some version thereof) 
– culminating in 
schema.org 
http://semanticweb.com/semtech-2011-coverage-the-rdfaseo-wave-how-to-catch-it-and-why_b20458
“Things” not” strings” -May 16 2012 
Understanding “things” helps Google 
understand what things are in the world 
and what users are searching for 
June 2012 –Twitter announces Twitter Cards Pinterest 
Rich Pins
• Directly extracting on page metadata to create enhanced displays 
• Searching directly on consumed metadata 
• Provide direct answers to queries by searching on consumed, verified and validated 
information 
RICH SNIPPETS 2009 
Searchmonkey 2008 
• Aggregate answers or deduce them (like a timeline of events) 
• Expose more relevant answers in the long tail of search 
• Assist in interpreting a user query 
• Detect relevancy signals: i.e what content to show to what audience 
• Use it in conjunction with machine learning techniques- to eg. Train other components 
• … 
tiles 
Long tail: 
Peanut Butter 
and Jelly in 
stripes ?
Search is changing 
• Semantic, Predictive, Personalised, Conversational 
– Search over documents 
– Search over Data 
• Rise of Answer Engines (Direct answers proliferating) 
• Data Quality is imperative 
Becoming Less like a search Engine 
and more like a personal Assistant
SIRI 
Google Now 
Cortana 
AiAgents 
(create your own) 
Runs cross platform
“Answer 
box” 
Organic 
Search 
Results 
Search 
Over Data 
Knowledge 
Panel 
Search 
Over 
Documents
Synonymous with the migration to “Answer Engines “ & “Search Over Data”
Crawling & 
Indexing 
Query 
Interpretation 
Indexing and 
Ranking 
Results 
Presentation 
Indexed 
information
Means of preprocessing documents to speed 
up search (serving results in real time)
• Microsoft has given a fairly concise definition of the entity 
recognition and disambiguation process: 
– The objective of an Entity Recognition and Disambiguation 
system is to recognize mentions of entities in a given text, 
disambiguate them, and map them to the entities in a given 
entity collection or knowledge base. 
• In Google’s case, that means recognizing entities on web 
pages or web documents and mapping them back to 
specific entities in their Knowledge Graph
Implicit entity graph derived/inferred 
from the text on a web page 
Explicit entities obtained from 
structured markup on a web page 
May need to map to 
external Ontologies like 
schema.org or some 
other ontology 
Technology – NLP or IR or … Technology – Semantic Web
Make it Search Engine/Machine Friendly & tell them (explicitly) 
what “things” are on your web page 
• Make it (your information on your website) available to Google (and the major search and social 
engines), ensure you make it easy for computers to read and discover your stuff. 
• With schema.org (and/or the preferred vocabulary/ontology of the search social engine you are 
optimizing for, e.g for Facebook use rdfa & Opengraph). Google, Yahoo, Bing, Yandex => 
Schema.org 
• Pick a markup format (syntax) and stick with it 
– Microdata 
– Microformat 
– Rdfa 
– Rdfa lite 
– JSON-LD
• Recall some of Google’s Mission/Objective Statements or goals 
– “Organizing the worlds information to make it universally accessible and useful” 
– “To help with that we have built the knowledge graph” 
– Give an identity to every “thing” in the world 
• The knowledge graph 
– Contains information and entities and their relationships 
– Helps in Resolving ambiguities when processing queries 
You can explicitly disambiguate your content by providing a freebase mid – 
machine identifier - (in your markup)
Ref: Google I/O 2013
Google plus in “Enhanced Displays and 
the knowledge Graph 
• Authorship 
• Local businesses 
• Knowledge Carousel 
• ………
With Schema.org (and JSON-LD in this case) 
• Note the sameAs statement 
• mid makes it easier to match or reconcile the “thing” 
https://www.youtube.com/watch?v=W9pRpSW_KqA&src_vid=0oOwrBEeQss&feature=iv&annotation_id=annotation_1139520055 Ref: Google I/O 2014
The Knowledge Graph Powers: 
• Rich snippets in Events 
• Event listings in Google Maps 
• Notifications in Google Now 
https://www.youtube.com/watch?v=XXw8g-FbemI Ref: Google I/O 2014
https://www.youtube.com/watch?v=XXw8g-FbemI Ref: Google I/O 2014
http://youtu.be/pkrxhefQIBs
Rich snippets make your data more visible in Search Engine Results Pages 
Which would you rather click on? 
No Rich Snippets With Rich Snippets 
Lower Bounce Rate
32 
More Visibility in 
verticals, recipes 
& images via 
markup 
In Search Engine Results Pages 
Your product is not visible 
if no “color” attribute is 
populated 
& 
Search Verticals
You want peanut 
butter and jelly in 
stripes ? 
Allows unique and interesting content to surface
“Google 
Plus” 
Key Point - 
Corollary: If you don’t exist as an entity you do not exist in the knowledge graph or in “Search Over Data” 
The cost of that: Anonymity and Irrelevance!
http://www.socialmediaexaminer.com/rich-pins-on-pinterest/ 
Twitter Cards & Deep Linking 
Pinterest Pins 
Facebook 
Opengraph 
• Drive Brand awareness 
• Diversify Revenue Sources 
(Reduce Dependence on 
Google) 
• Increase Lift & Conversions
Google’s Structured Markup Helper 
• Generates JSON-LD or microdata 
• E-mail and web page markup 
Data Highlighter 
https://support.google.com/webmasters/answer/99170?hl=en&ref_topic=1088472 
“Google can present your data more attractively 
-- and in new ways -- in search results and in other 
products such as the Google Knowledge Graph.” 
List provided on schema.rdfs.org 
Wordpress plugin and html code http://schema.rdfs.org/tools.html
Make sure 
to enable 
Microdata
• Microdata reveal 
· JSON-LD sniffer 
· Semantic inspector 
· META SEO inspector 
· Green Turtle RDFa 
List maintained by Aaron Bradley: 
http://www.seoskeptic.com/structured-data-markup-validation-testing-tools/ 
Written Explanation of Walkthrough 
http://searchengineland.com/see-entities-web-page-tools-help-194710 
GRUFF
• Alchemyapi (with freebase mappings of entities since July 2013) 
• Opencalais 
• Semantic Verses 
• Aylien which was launched in Feb 2014, provides mappings to freebase and schema.org. 
• Smartlogic 
• lexalytics 
• Text-Processing 
• Stanford’s Ner 
• Textrazor
The following information 
MUST MATCH!
Ensure sure you supply rich, high quality data, 
mapped to search filters for maximum visibility 
Not visible if no “color” 
attribute populated 
Fill in The 
Gaps
• Ensure to supply rich, consistent data in any 
format you submit and ensure it is validated, 
verified and fresh 
• Send Consistent signals 
• Provide global identifiers whenever possible
Rich 
Product 
information 
with GTIN
• Implicit (content and Bill) also tools I have
• “Query logs record the actual usage of search systems and their analysis has proven critical to 
improving search engine functionality. Yet, despite the deluge of information, query log analysis 
often suffers from the sparsity of the query space. 
we propose a new model for query log data called the entity-aware 
click graph. In this representation, we decompose queries into entities and modifiers, and 
measure their association with clicked pages. We demonstrate the benefits of this approach on 
the crucial task of understanding which websites fulfill similar user needs, showing that using this 
representation we can achieve a higher precision than other query log-based approaches ” 
Measuring website similarity using an entity-aware click graph 
2012 publication: Peter Mika, Hugo Zaragoza, Pablo N Mendes, RoI Blanco 
http://dl.acm.org/citation.cfm?id=2398500
Need to understand the question in order to answer it 
• Entity Mention Queries: Common structure to entity mention queries: 
query = <entity> + <intent> 
• Queries that return facts as an answer 
• What form does the question take? (Question forms) 
Where was X born? 
When was X born? 
Who invented X? 
Where was X invented? 
What is the X of Y? 
Flights from ?x to ?y 
Visit old problems/solutions with scale (Parameterized Queries, Form Based Queries, 
Query Template, Template Based Query) 
Takeaway: Create Content that will provide great answers to these kinds of questions 
(for entities relevant to your audience)
• Social Graphs 
• Interest Graphs 
• Mobile Social graphs 
• Attraction graphs 
• Engagement graphs 
• Attention Graphs 
• Intent graph 
• User Query Graph 
• ……..
Takeaway: Write engaging content around your audiences interests 
(Find ways – “Big Data” - to determine their interests)
Anatomy of a Google Search 
Results Page (Revisited) 
Search 
Over Data 
Search 
Over 
Documents
• Slide:3 https://www.flickr.com/photos/67262490@N04/6151466225/ 
• Slide 5 https://www.flickr.com/photos/outsourcetechndu/8241430872/ 
• Slide 9: https://www.flickr.com/photos/drs2biz/197524395/ 
• Slide 3: https://www.flickr.com/photos/106426559@N03/10448641806/ 
• Slide 3: https://www.flickr.com/photos/amynkassam/2866419139/ 
• Slide 5 https://www.flickr.com/photos/legocy/8291983493/in/photolist 
• slide 4: https://www.flickr.com/photos/mekz/2389113709/in/photolist

Contenu connexe

Tendances

Semantic Search tutorial at SemTech 2012
Semantic Search tutorial at SemTech 2012Semantic Search tutorial at SemTech 2012
Semantic Search tutorial at SemTech 2012
Peter Mika
 
Henry stewart dam2010_taxonomicsearch_markohurst
Henry stewart dam2010_taxonomicsearch_markohurstHenry stewart dam2010_taxonomicsearch_markohurst
Henry stewart dam2010_taxonomicsearch_markohurst
WIKOLO
 
Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...
Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...
Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...
Koray Tugberk GUBUR
 
Relational Navigation Brings Social Computing and Semantic Technology Computi...
Relational Navigation Brings Social Computing and Semantic Technology Computi...Relational Navigation Brings Social Computing and Semantic Technology Computi...
Relational Navigation Brings Social Computing and Semantic Technology Computi...
Bradley Allen
 

Tendances (20)

Knowledge Panels, Rich Snippets and Semantic Markup
Knowledge Panels, Rich Snippets and Semantic MarkupKnowledge Panels, Rich Snippets and Semantic Markup
Knowledge Panels, Rich Snippets and Semantic Markup
 
Keyword Research and Topic Modeling in a Semantic Web
Keyword Research and Topic Modeling in a Semantic WebKeyword Research and Topic Modeling in a Semantic Web
Keyword Research and Topic Modeling in a Semantic Web
 
Semantic Search at Yahoo
Semantic Search at YahooSemantic Search at Yahoo
Semantic Search at Yahoo
 
Making things findable
Making things findableMaking things findable
Making things findable
 
Bill Slawski SEO and the New Search Results
Bill Slawski   SEO and the New Search ResultsBill Slawski   SEO and the New Search Results
Bill Slawski SEO and the New Search Results
 
Semantic Search tutorial at SemTech 2012
Semantic Search tutorial at SemTech 2012Semantic Search tutorial at SemTech 2012
Semantic Search tutorial at SemTech 2012
 
Smx advanced-william-slawski-final
Smx advanced-william-slawski-finalSmx advanced-william-slawski-final
Smx advanced-william-slawski-final
 
Semantic seo and the evolution of queries
Semantic seo and the evolution of queriesSemantic seo and the evolution of queries
Semantic seo and the evolution of queries
 
SemTech 2011 Semantic Search tutorial
SemTech 2011 Semantic Search tutorialSemTech 2011 Semantic Search tutorial
SemTech 2011 Semantic Search tutorial
 
Implementing Semantic Search
Implementing Semantic SearchImplementing Semantic Search
Implementing Semantic Search
 
Henry stewart dam2010_taxonomicsearch_markohurst
Henry stewart dam2010_taxonomicsearch_markohurstHenry stewart dam2010_taxonomicsearch_markohurst
Henry stewart dam2010_taxonomicsearch_markohurst
 
Slawskiwilliam thegrowthofdirectanswers
Slawskiwilliam thegrowthofdirectanswersSlawskiwilliam thegrowthofdirectanswers
Slawskiwilliam thegrowthofdirectanswers
 
What happened to the Semantic Web?
What happened to the Semantic Web?What happened to the Semantic Web?
What happened to the Semantic Web?
 
Seo; Cutting Through The Noise
Seo; Cutting Through The NoiseSeo; Cutting Through The Noise
Seo; Cutting Through The Noise
 
Knowledge Integration in Practice
Knowledge Integration in PracticeKnowledge Integration in Practice
Knowledge Integration in Practice
 
Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...
Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...
Semantic Search Engine: Semantic Search and Query Parsing with Phrases and En...
 
Understanding Queries through Entities
Understanding Queries through EntitiesUnderstanding Queries through Entities
Understanding Queries through Entities
 
Relational Navigation Brings Social Computing and Semantic Technology Computi...
Relational Navigation Brings Social Computing and Semantic Technology Computi...Relational Navigation Brings Social Computing and Semantic Technology Computi...
Relational Navigation Brings Social Computing and Semantic Technology Computi...
 
Related Entity Finding on the Web
Related Entity Finding on the WebRelated Entity Finding on the Web
Related Entity Finding on the Web
 
Semantic search: from document retrieval to virtual assistants
Semantic search: from document retrieval to virtual assistantsSemantic search: from document retrieval to virtual assistants
Semantic search: from document retrieval to virtual assistants
 

En vedette

Db research e invoicing 8-2009
Db research e invoicing 8-2009Db research e invoicing 8-2009
Db research e invoicing 8-2009
ECR Community
 
Universidad nacional de chimborazo.pptx gaby
Universidad nacional de chimborazo.pptx gabyUniversidad nacional de chimborazo.pptx gaby
Universidad nacional de chimborazo.pptx gaby
GabyYungan
 
MANITOS_Area técnico-laboral
MANITOS_Area técnico-laboralMANITOS_Area técnico-laboral
MANITOS_Area técnico-laboral
manitosgumiel
 

En vedette (20)

Linked Data Lessons from Digital Humanities
Linked Data Lessons from Digital HumanitiesLinked Data Lessons from Digital Humanities
Linked Data Lessons from Digital Humanities
 
Brands, packaging, and other product feature
Brands, packaging, and other product featureBrands, packaging, and other product feature
Brands, packaging, and other product feature
 
Db research e invoicing 8-2009
Db research e invoicing 8-2009Db research e invoicing 8-2009
Db research e invoicing 8-2009
 
Media Trends in America. Past, Present and Future--Duane "DJ" Sprague
Media Trends in America. Past, Present and Future--Duane "DJ" SpragueMedia Trends in America. Past, Present and Future--Duane "DJ" Sprague
Media Trends in America. Past, Present and Future--Duane "DJ" Sprague
 
Pasta: Vote for Enriched Energy
Pasta: Vote for Enriched EnergyPasta: Vote for Enriched Energy
Pasta: Vote for Enriched Energy
 
Integration of Micronutrient-rich Small Fish in Aquaculture Systems for Incre...
Integration of Micronutrient-rich Small Fish in Aquaculture Systems for Incre...Integration of Micronutrient-rich Small Fish in Aquaculture Systems for Incre...
Integration of Micronutrient-rich Small Fish in Aquaculture Systems for Incre...
 
User Flows
User FlowsUser Flows
User Flows
 
Universidad nacional de chimborazo.pptx gaby
Universidad nacional de chimborazo.pptx gabyUniversidad nacional de chimborazo.pptx gaby
Universidad nacional de chimborazo.pptx gaby
 
Infidelity Checklist | Baldwin Legal Investigations
Infidelity Checklist | Baldwin Legal InvestigationsInfidelity Checklist | Baldwin Legal Investigations
Infidelity Checklist | Baldwin Legal Investigations
 
Nepse Technical Analysis April 17 - April 21, 2016
Nepse Technical Analysis April 17 - April 21, 2016Nepse Technical Analysis April 17 - April 21, 2016
Nepse Technical Analysis April 17 - April 21, 2016
 
Challenge Us! 2
Challenge Us! 2Challenge Us! 2
Challenge Us! 2
 
Distribion Targeting Solutions Sales Deck
Distribion Targeting Solutions Sales DeckDistribion Targeting Solutions Sales Deck
Distribion Targeting Solutions Sales Deck
 
Alex Manchester Pria 08 Slideshare
Alex Manchester Pria 08 SlideshareAlex Manchester Pria 08 Slideshare
Alex Manchester Pria 08 Slideshare
 
Катя Микула – Сложности работы с удалённой командой при матричной структуре ...
Катя Микула – Сложности  работы с удалённой командой при матричной структуре ...Катя Микула – Сложности  работы с удалённой командой при матричной структуре ...
Катя Микула – Сложности работы с удалённой командой при матричной структуре ...
 
תמי תמיר - תורת המשחקים האלגוריתמית
תמי תמיר - תורת המשחקים האלגוריתמיתתמי תמיר - תורת המשחקים האלגוריתמית
תמי תמיר - תורת המשחקים האלגוריתמית
 
Time Has An End
Time Has An EndTime Has An End
Time Has An End
 
Sandals case study
Sandals case studySandals case study
Sandals case study
 
MANITOS_Area técnico-laboral
MANITOS_Area técnico-laboralMANITOS_Area técnico-laboral
MANITOS_Area técnico-laboral
 
HAWK - Prospecção Comercial B2B
HAWK - Prospecção Comercial B2BHAWK - Prospecção Comercial B2B
HAWK - Prospecção Comercial B2B
 
Planificacion de mi tiempo
Planificacion de mi tiempoPlanificacion de mi tiempo
Planificacion de mi tiempo
 

Similaire à Semtech bizsemanticsearchtutorial

Web search engines and search technology
Web search engines and search technologyWeb search engines and search technology
Web search engines and search technology
Stefanos Anastasiadis
 
SPLive Orlando - Beyond the Search Center - Application or Solution?
SPLive Orlando - Beyond the Search Center - Application or Solution?SPLive Orlando - Beyond the Search Center - Application or Solution?
SPLive Orlando - Beyond the Search Center - Application or Solution?
Agnes Molnar
 
Building Effective Frameworks for Social Media Analysis
Building Effective Frameworks for Social Media AnalysisBuilding Effective Frameworks for Social Media Analysis
Building Effective Frameworks for Social Media Analysis
Open Analytics
 

Similaire à Semtech bizsemanticsearchtutorial (20)

Search Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By DesignSearch Solutions 2011: Successful Enterprise Search By Design
Search Solutions 2011: Successful Enterprise Search By Design
 
Leveraging the semantic web meetup, Semantic Search, Schema.org and more
Leveraging the semantic web meetup, Semantic Search, Schema.org and moreLeveraging the semantic web meetup, Semantic Search, Schema.org and more
Leveraging the semantic web meetup, Semantic Search, Schema.org and more
 
Bioschemas Workshop
Bioschemas WorkshopBioschemas Workshop
Bioschemas Workshop
 
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v12017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
2017 01-11 intelligent search and intranet - chihuahuas vs muffins v1
 
Brave new search world
Brave new search worldBrave new search world
Brave new search world
 
Web search engines and search technology
Web search engines and search technologyWeb search engines and search technology
Web search engines and search technology
 
CS6007 information retrieval - 5 units notes
CS6007   information retrieval - 5 units notesCS6007   information retrieval - 5 units notes
CS6007 information retrieval - 5 units notes
 
SPLive Orlando - Beyond the Search Center - Application or Solution?
SPLive Orlando - Beyond the Search Center - Application or Solution?SPLive Orlando - Beyond the Search Center - Application or Solution?
SPLive Orlando - Beyond the Search Center - Application or Solution?
 
Not Your Mom's SEO
Not Your Mom's SEONot Your Mom's SEO
Not Your Mom's SEO
 
Basic SEO by Andrea H. Berberich @webpresenceopti
Basic SEO by Andrea H. Berberich @webpresenceoptiBasic SEO by Andrea H. Berberich @webpresenceopti
Basic SEO by Andrea H. Berberich @webpresenceopti
 
Search Analytics for Content Strategists
Search Analytics for Content StrategistsSearch Analytics for Content Strategists
Search Analytics for Content Strategists
 
Digital Marketing & Discoverability for the Performing Arts
Digital Marketing & Discoverability for the Performing ArtsDigital Marketing & Discoverability for the Performing Arts
Digital Marketing & Discoverability for the Performing Arts
 
Building Effective Frameworks for Social Media Analysis
Building Effective Frameworks for Social Media AnalysisBuilding Effective Frameworks for Social Media Analysis
Building Effective Frameworks for Social Media Analysis
 
Social Media Data Collection & Analysis
Social Media Data Collection & AnalysisSocial Media Data Collection & Analysis
Social Media Data Collection & Analysis
 
Building Effective Frameworks for Social Media Analysis
Building Effective Frameworks for Social Media AnalysisBuilding Effective Frameworks for Social Media Analysis
Building Effective Frameworks for Social Media Analysis
 
Building Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the CloudBuilding Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the Cloud
 
Semantic Search
Semantic SearchSemantic Search
Semantic Search
 
Data Scientist Toolbox
Data Scientist ToolboxData Scientist Toolbox
Data Scientist Toolbox
 
How Oracle Uses CrowdFlower For Sentiment Analysis
How Oracle Uses CrowdFlower For Sentiment AnalysisHow Oracle Uses CrowdFlower For Sentiment Analysis
How Oracle Uses CrowdFlower For Sentiment Analysis
 
Meetup SF - Amundsen
Meetup SF  -  AmundsenMeetup SF  -  Amundsen
Meetup SF - Amundsen
 

Plus de Barbara Starr

Knowledge intensive query processing copy
Knowledge intensive query processing copyKnowledge intensive query processing copy
Knowledge intensive query processing copy
Barbara Starr
 
Semantic Search, Question Answering systems, inferencing
Semantic Search, Question Answering systems, inferencingSemantic Search, Question Answering systems, inferencing
Semantic Search, Question Answering systems, inferencing
Barbara Starr
 
Aquaint kickoff-overview-prange
Aquaint kickoff-overview-prangeAquaint kickoff-overview-prange
Aquaint kickoff-overview-prange
Barbara Starr
 

Plus de Barbara Starr (20)

Kdd14 t2-bordes-gabrilovich (3)
Kdd14 t2-bordes-gabrilovich (3)Kdd14 t2-bordes-gabrilovich (3)
Kdd14 t2-bordes-gabrilovich (3)
 
Kdd 2014 tutorial bringing structure to text - chi
Kdd 2014 tutorial   bringing structure to text - chiKdd 2014 tutorial   bringing structure to text - chi
Kdd 2014 tutorial bringing structure to text - chi
 
Smx west Barbara Starr Mac Version - Schema 201 for Real world Succes
Smx west Barbara Starr Mac Version - Schema 201 for Real world SuccesSmx west Barbara Starr Mac Version - Schema 201 for Real world Succes
Smx west Barbara Starr Mac Version - Schema 201 for Real world Succes
 
Smxeastbarbarastarr2012
Smxeastbarbarastarr2012Smxeastbarbarastarr2012
Smxeastbarbarastarr2012
 
Event templates for Question answering
Event templates for Question answeringEvent templates for Question answering
Event templates for Question answering
 
Event templatesfor qa2
Event templatesfor qa2Event templatesfor qa2
Event templatesfor qa2
 
RDFa, SEO wave
RDFa, SEO waveRDFa, SEO wave
RDFa, SEO wave
 
SAIC System architecture
SAIC System architectureSAIC System architecture
SAIC System architecture
 
Event templates for improved narrative understanding in Question Answering sy...
Event templates for improved narrative understanding in Question Answering sy...Event templates for improved narrative understanding in Question Answering sy...
Event templates for improved narrative understanding in Question Answering sy...
 
Semantic alignment paper
Semantic alignment paperSemantic alignment paper
Semantic alignment paper
 
Knowledge intensive query processing copy
Knowledge intensive query processing copyKnowledge intensive query processing copy
Knowledge intensive query processing copy
 
Knowledge intensive query Processing
Knowledge intensive query ProcessingKnowledge intensive query Processing
Knowledge intensive query Processing
 
Semantic Search, Question Answering systems, inferencing
Semantic Search, Question Answering systems, inferencingSemantic Search, Question Answering systems, inferencing
Semantic Search, Question Answering systems, inferencing
 
Proceedings
ProceedingsProceedings
Proceedings
 
Proceedings
ProceedingsProceedings
Proceedings
 
Saic aqua summary
Saic aqua summarySaic aqua summary
Saic aqua summary
 
Aquaint kickoff-overview-prange
Aquaint kickoff-overview-prangeAquaint kickoff-overview-prange
Aquaint kickoff-overview-prange
 
Saic aqua summary
Saic aqua summarySaic aqua summary
Saic aqua summary
 
Saic aqua
Saic aquaSaic aqua
Saic aqua
 
Hpkb year 1 results
Hpkb   year 1 resultsHpkb   year 1 results
Hpkb year 1 results
 

Dernier

Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876
Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876
Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876
dlhescort
 
Call Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
amitlee9823
 
Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al MizharAl Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
allensay1
 
Call Girls From Raj Nagar Extension Ghaziabad❤️8448577510 ⊹Best Escorts Servi...
Call Girls From Raj Nagar Extension Ghaziabad❤️8448577510 ⊹Best Escorts Servi...Call Girls From Raj Nagar Extension Ghaziabad❤️8448577510 ⊹Best Escorts Servi...
Call Girls From Raj Nagar Extension Ghaziabad❤️8448577510 ⊹Best Escorts Servi...
lizamodels9
 
Call Girls In Noida 959961⊹3876 Independent Escort Service Noida
Call Girls In Noida 959961⊹3876 Independent Escort Service NoidaCall Girls In Noida 959961⊹3876 Independent Escort Service Noida
Call Girls In Noida 959961⊹3876 Independent Escort Service Noida
dlhescort
 
Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...
Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...
Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...
daisycvs
 
The Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai Kuwait
The Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai KuwaitThe Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai Kuwait
The Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai Kuwait
daisycvs
 
Call Girls Kengeri Satellite Town Just Call 👗 7737669865 👗 Top Class Call Gir...
Call Girls Kengeri Satellite Town Just Call 👗 7737669865 👗 Top Class Call Gir...Call Girls Kengeri Satellite Town Just Call 👗 7737669865 👗 Top Class Call Gir...
Call Girls Kengeri Satellite Town Just Call 👗 7737669865 👗 Top Class Call Gir...
amitlee9823
 

Dernier (20)

How to Get Started in Social Media for Art League City
How to Get Started in Social Media for Art League CityHow to Get Started in Social Media for Art League City
How to Get Started in Social Media for Art League City
 
Whitefield CALL GIRL IN 98274*61493 ❤CALL GIRLS IN ESCORT SERVICE❤CALL GIRL
Whitefield CALL GIRL IN 98274*61493 ❤CALL GIRLS IN ESCORT SERVICE❤CALL GIRLWhitefield CALL GIRL IN 98274*61493 ❤CALL GIRLS IN ESCORT SERVICE❤CALL GIRL
Whitefield CALL GIRL IN 98274*61493 ❤CALL GIRLS IN ESCORT SERVICE❤CALL GIRL
 
Famous Olympic Siblings from the 21st Century
Famous Olympic Siblings from the 21st CenturyFamous Olympic Siblings from the 21st Century
Famous Olympic Siblings from the 21st Century
 
Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876
Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876
Call Girls in Delhi, Escort Service Available 24x7 in Delhi 959961-/-3876
 
Call Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service BangaloreCall Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
Call Girls Hebbal Just Call 👗 7737669865 👗 Top Class Call Girl Service Bangalore
 
Falcon Invoice Discounting: The best investment platform in india for investors
Falcon Invoice Discounting: The best investment platform in india for investorsFalcon Invoice Discounting: The best investment platform in india for investors
Falcon Invoice Discounting: The best investment platform in india for investors
 
Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al MizharAl Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
Al Mizhar Dubai Escorts +971561403006 Escorts Service In Al Mizhar
 
Call Girls From Raj Nagar Extension Ghaziabad❤️8448577510 ⊹Best Escorts Servi...
Call Girls From Raj Nagar Extension Ghaziabad❤️8448577510 ⊹Best Escorts Servi...Call Girls From Raj Nagar Extension Ghaziabad❤️8448577510 ⊹Best Escorts Servi...
Call Girls From Raj Nagar Extension Ghaziabad❤️8448577510 ⊹Best Escorts Servi...
 
Organizational Transformation Lead with Culture
Organizational Transformation Lead with CultureOrganizational Transformation Lead with Culture
Organizational Transformation Lead with Culture
 
Dr. Admir Softic_ presentation_Green Club_ENG.pdf
Dr. Admir Softic_ presentation_Green Club_ENG.pdfDr. Admir Softic_ presentation_Green Club_ENG.pdf
Dr. Admir Softic_ presentation_Green Club_ENG.pdf
 
PHX May 2024 Corporate Presentation Final
PHX May 2024 Corporate Presentation FinalPHX May 2024 Corporate Presentation Final
PHX May 2024 Corporate Presentation Final
 
Call Girls In Noida 959961⊹3876 Independent Escort Service Noida
Call Girls In Noida 959961⊹3876 Independent Escort Service NoidaCall Girls In Noida 959961⊹3876 Independent Escort Service Noida
Call Girls In Noida 959961⊹3876 Independent Escort Service Noida
 
Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...
Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...
Quick Doctor In Kuwait +2773`7758`557 Kuwait Doha Qatar Dubai Abu Dhabi Sharj...
 
Call Girls Service In Old Town Dubai ((0551707352)) Old Town Dubai Call Girl ...
Call Girls Service In Old Town Dubai ((0551707352)) Old Town Dubai Call Girl ...Call Girls Service In Old Town Dubai ((0551707352)) Old Town Dubai Call Girl ...
Call Girls Service In Old Town Dubai ((0551707352)) Old Town Dubai Call Girl ...
 
Unveiling Falcon Invoice Discounting: Leading the Way as India's Premier Bill...
Unveiling Falcon Invoice Discounting: Leading the Way as India's Premier Bill...Unveiling Falcon Invoice Discounting: Leading the Way as India's Premier Bill...
Unveiling Falcon Invoice Discounting: Leading the Way as India's Premier Bill...
 
The Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai Kuwait
The Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai KuwaitThe Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai Kuwait
The Abortion pills for sale in Qatar@Doha [+27737758557] []Deira Dubai Kuwait
 
Call Girls Kengeri Satellite Town Just Call 👗 7737669865 👗 Top Class Call Gir...
Call Girls Kengeri Satellite Town Just Call 👗 7737669865 👗 Top Class Call Gir...Call Girls Kengeri Satellite Town Just Call 👗 7737669865 👗 Top Class Call Gir...
Call Girls Kengeri Satellite Town Just Call 👗 7737669865 👗 Top Class Call Gir...
 
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service AvailableCall Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service Available
Call Girls Ludhiana Just Call 98765-12871 Top Class Call Girl Service Available
 
Cheap Rate Call Girls In Noida Sector 62 Metro 959961乂3876
Cheap Rate Call Girls In Noida Sector 62 Metro 959961乂3876Cheap Rate Call Girls In Noida Sector 62 Metro 959961乂3876
Cheap Rate Call Girls In Noida Sector 62 Metro 959961乂3876
 
The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Commun...
The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Commun...The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Commun...
The Path to Product Excellence: Avoiding Common Pitfalls and Enhancing Commun...
 

Semtech bizsemanticsearchtutorial

  • 1. • Barbara Starr ( ) – Basics of What semantic search is, what tools and techniques are used • Bill Slawski ( ) – Strategy for SEO – Case based examples and analysis
  • 2. • Pursued a doctorate in Artificial Intelligence from South Africa in the 80's. • Recruited to build intelligent/predictive trading systems on Wall Street • Migrated to government-based contracts, several of which turned into real world products like – SIRI (PAL from DARPA) – WATSON (Acquaint - IBM Watson Labs was a team member) • From the vantage of a semantic technologist, I keenly watched the evolution of the Semantic Web. • “Shocked into the real world” when working as a consultant @ Overstock. – Rdfa on 900,000 item pages 2 days before Google adopted it – UPC and identifier “miner” • Today – Consultant for companies such as GS1 US, Columnist, Strategist, …
  • 3. • Primitive UI – Hunt and Peck
  • 5. • Based on concept of “citations” and very easily gamed • Probabilistic or Statistical (Not Symbolic) • Keyword Based Search Engine (Not Concept Based or Ontology Based) • “link juice” ? • Other odd vernacular that became standard jargon in the “SEO” community
  • 6. SIRI “Amazing fact: same amount of computing to answer one Google Search query as all the computing done – in flight and on the ground -- for the entire Apollo program!” “Moore's law is the observation that, over the history of computing hardware, the number of transistors in a dense integrated circuit doubles approximately every two years”” Source: Wikipedia
  • 7. “A new form of Web content that is meaningful to computers will unleash a revolution of new possibilities” • Tim Berners Lee • James Hendler • Ora Lassila http://www.cs.umd.edu/~golbeck/LBSC690/SemanticWeb.html
  • 8. What they want When they want it (Now) Accurate (Reliable & Informative) Available Search engines must satisfy consumer needs, else:
  • 9.
  • 10. “Def. Semantic Search is any retrieval method where – User intent and resources are represented in a semantic model • A set of concepts or topics that generalize over tokens/phrases • Additional structure such as a hierarchy among concepts, relationships among concepts etc. – Semantic representations of the query and the user intent are exploited in some part of the retrieval process” Peter Mika, Sr. Research Scientist, Yahoo Labs ⎪ June 19, 2014
  • 11. Inevitable passage of Semantic Web adoption (or some version thereof) – culminating in schema.org http://semanticweb.com/semtech-2011-coverage-the-rdfaseo-wave-how-to-catch-it-and-why_b20458
  • 12. “Things” not” strings” -May 16 2012 Understanding “things” helps Google understand what things are in the world and what users are searching for June 2012 –Twitter announces Twitter Cards Pinterest Rich Pins
  • 13. • Directly extracting on page metadata to create enhanced displays • Searching directly on consumed metadata • Provide direct answers to queries by searching on consumed, verified and validated information RICH SNIPPETS 2009 Searchmonkey 2008 • Aggregate answers or deduce them (like a timeline of events) • Expose more relevant answers in the long tail of search • Assist in interpreting a user query • Detect relevancy signals: i.e what content to show to what audience • Use it in conjunction with machine learning techniques- to eg. Train other components • … tiles Long tail: Peanut Butter and Jelly in stripes ?
  • 14. Search is changing • Semantic, Predictive, Personalised, Conversational – Search over documents – Search over Data • Rise of Answer Engines (Direct answers proliferating) • Data Quality is imperative Becoming Less like a search Engine and more like a personal Assistant
  • 15. SIRI Google Now Cortana AiAgents (create your own) Runs cross platform
  • 16. “Answer box” Organic Search Results Search Over Data Knowledge Panel Search Over Documents
  • 17. Synonymous with the migration to “Answer Engines “ & “Search Over Data”
  • 18. Crawling & Indexing Query Interpretation Indexing and Ranking Results Presentation Indexed information
  • 19. Means of preprocessing documents to speed up search (serving results in real time)
  • 20. • Microsoft has given a fairly concise definition of the entity recognition and disambiguation process: – The objective of an Entity Recognition and Disambiguation system is to recognize mentions of entities in a given text, disambiguate them, and map them to the entities in a given entity collection or knowledge base. • In Google’s case, that means recognizing entities on web pages or web documents and mapping them back to specific entities in their Knowledge Graph
  • 21. Implicit entity graph derived/inferred from the text on a web page Explicit entities obtained from structured markup on a web page May need to map to external Ontologies like schema.org or some other ontology Technology – NLP or IR or … Technology – Semantic Web
  • 22. Make it Search Engine/Machine Friendly & tell them (explicitly) what “things” are on your web page • Make it (your information on your website) available to Google (and the major search and social engines), ensure you make it easy for computers to read and discover your stuff. • With schema.org (and/or the preferred vocabulary/ontology of the search social engine you are optimizing for, e.g for Facebook use rdfa & Opengraph). Google, Yahoo, Bing, Yandex => Schema.org • Pick a markup format (syntax) and stick with it – Microdata – Microformat – Rdfa – Rdfa lite – JSON-LD
  • 23. • Recall some of Google’s Mission/Objective Statements or goals – “Organizing the worlds information to make it universally accessible and useful” – “To help with that we have built the knowledge graph” – Give an identity to every “thing” in the world • The knowledge graph – Contains information and entities and their relationships – Helps in Resolving ambiguities when processing queries You can explicitly disambiguate your content by providing a freebase mid – machine identifier - (in your markup)
  • 25. Google plus in “Enhanced Displays and the knowledge Graph • Authorship • Local businesses • Knowledge Carousel • ………
  • 26. With Schema.org (and JSON-LD in this case) • Note the sameAs statement • mid makes it easier to match or reconcile the “thing” https://www.youtube.com/watch?v=W9pRpSW_KqA&src_vid=0oOwrBEeQss&feature=iv&annotation_id=annotation_1139520055 Ref: Google I/O 2014
  • 27. The Knowledge Graph Powers: • Rich snippets in Events • Event listings in Google Maps • Notifications in Google Now https://www.youtube.com/watch?v=XXw8g-FbemI Ref: Google I/O 2014
  • 30.
  • 31. Rich snippets make your data more visible in Search Engine Results Pages Which would you rather click on? No Rich Snippets With Rich Snippets Lower Bounce Rate
  • 32. 32 More Visibility in verticals, recipes & images via markup In Search Engine Results Pages Your product is not visible if no “color” attribute is populated & Search Verticals
  • 33. You want peanut butter and jelly in stripes ? Allows unique and interesting content to surface
  • 34. “Google Plus” Key Point - Corollary: If you don’t exist as an entity you do not exist in the knowledge graph or in “Search Over Data” The cost of that: Anonymity and Irrelevance!
  • 35. http://www.socialmediaexaminer.com/rich-pins-on-pinterest/ Twitter Cards & Deep Linking Pinterest Pins Facebook Opengraph • Drive Brand awareness • Diversify Revenue Sources (Reduce Dependence on Google) • Increase Lift & Conversions
  • 36.
  • 37. Google’s Structured Markup Helper • Generates JSON-LD or microdata • E-mail and web page markup Data Highlighter https://support.google.com/webmasters/answer/99170?hl=en&ref_topic=1088472 “Google can present your data more attractively -- and in new ways -- in search results and in other products such as the Google Knowledge Graph.” List provided on schema.rdfs.org Wordpress plugin and html code http://schema.rdfs.org/tools.html
  • 38.
  • 39.
  • 40.
  • 41.
  • 42.
  • 43.
  • 44. Make sure to enable Microdata
  • 45.
  • 46.
  • 47. • Microdata reveal · JSON-LD sniffer · Semantic inspector · META SEO inspector · Green Turtle RDFa List maintained by Aaron Bradley: http://www.seoskeptic.com/structured-data-markup-validation-testing-tools/ Written Explanation of Walkthrough http://searchengineland.com/see-entities-web-page-tools-help-194710 GRUFF
  • 48.
  • 49. • Alchemyapi (with freebase mappings of entities since July 2013) • Opencalais • Semantic Verses • Aylien which was launched in Feb 2014, provides mappings to freebase and schema.org. • Smartlogic • lexalytics • Text-Processing • Stanford’s Ner • Textrazor
  • 51. Ensure sure you supply rich, high quality data, mapped to search filters for maximum visibility Not visible if no “color” attribute populated Fill in The Gaps
  • 52. • Ensure to supply rich, consistent data in any format you submit and ensure it is validated, verified and fresh • Send Consistent signals • Provide global identifiers whenever possible
  • 54.
  • 55. • Implicit (content and Bill) also tools I have
  • 56.
  • 57. • “Query logs record the actual usage of search systems and their analysis has proven critical to improving search engine functionality. Yet, despite the deluge of information, query log analysis often suffers from the sparsity of the query space. we propose a new model for query log data called the entity-aware click graph. In this representation, we decompose queries into entities and modifiers, and measure their association with clicked pages. We demonstrate the benefits of this approach on the crucial task of understanding which websites fulfill similar user needs, showing that using this representation we can achieve a higher precision than other query log-based approaches ” Measuring website similarity using an entity-aware click graph 2012 publication: Peter Mika, Hugo Zaragoza, Pablo N Mendes, RoI Blanco http://dl.acm.org/citation.cfm?id=2398500
  • 58. Need to understand the question in order to answer it • Entity Mention Queries: Common structure to entity mention queries: query = <entity> + <intent> • Queries that return facts as an answer • What form does the question take? (Question forms) Where was X born? When was X born? Who invented X? Where was X invented? What is the X of Y? Flights from ?x to ?y Visit old problems/solutions with scale (Parameterized Queries, Form Based Queries, Query Template, Template Based Query) Takeaway: Create Content that will provide great answers to these kinds of questions (for entities relevant to your audience)
  • 59.
  • 60. • Social Graphs • Interest Graphs • Mobile Social graphs • Attraction graphs • Engagement graphs • Attention Graphs • Intent graph • User Query Graph • ……..
  • 61. Takeaway: Write engaging content around your audiences interests (Find ways – “Big Data” - to determine their interests)
  • 62. Anatomy of a Google Search Results Page (Revisited) Search Over Data Search Over Documents
  • 63.
  • 64. • Slide:3 https://www.flickr.com/photos/67262490@N04/6151466225/ • Slide 5 https://www.flickr.com/photos/outsourcetechndu/8241430872/ • Slide 9: https://www.flickr.com/photos/drs2biz/197524395/ • Slide 3: https://www.flickr.com/photos/106426559@N03/10448641806/ • Slide 3: https://www.flickr.com/photos/amynkassam/2866419139/ • Slide 5 https://www.flickr.com/photos/legocy/8291983493/in/photolist • slide 4: https://www.flickr.com/photos/mekz/2389113709/in/photolist