SlideShare une entreprise Scribd logo
1  sur  46
JR Oakes | @jroakes #TechSEOBoost
Fun with Machines. How Machine
Learning is Shaping Google and
Technical SEO
JR Oakes | @jroakes #TechSEOBoost
About Me
• Studied Industrial Design at NCSU
• Worked as an architectural glass
artist for 10 years.
• Was Lead Developer and then
Director of Strategy for medium-
sized agency with 100+ clients
worldwide.
• Work as Director, Technical SEO
for Adapt.
JR Oakes | @jroakes #TechSEOBoost
I have a problem with tf-idf
JR Oakes | @jroakes #TechSEOBoost
About TF-IDF
TF-IDF is very hand-wavy and
sounds very fancy, but is not the
magic elixir to DOMINATING ON
GOOGLE.
JR Oakes | @jroakes #TechSEOBoost
About TF-IDF
It is actually not even the best IR
algorithm.
BM25 takes into account document
length in addition to other factors in
various iterations.
JR Oakes | @jroakes #TechSEOBoost
About TF-IDF
https://wikimedia-research.github.io/Discovery-Search-Test-BM25/
https://wikimedia-research.github.io/Discovery-Search-Test-InterleavedLTR/
JR Oakes | @jroakes #TechSEOBoost
About TF-IDF
The Search Platform Team has been working on improving search on
Wikimedia projects with machine learning. Machine learned-ranking (MLR)
enables us to rank relevance of pages using a model trained on implicit and
explicit judgements. In the first test of the learning-to-rank (LTR) project, we
evaluated the performance of a click-based model on users searching English
Wikipedia. We found that users were slightly more likely to engage with MLR-
provided results than with BM25 results (assessed via the clickthrough rate
and a preference statistic). We also found that users with machine learning-
ranked results were statistically significantly more likely to click on the first
search result first than users with BM25-ranked results, which indicates that
we are onto something. The next step for us is to evaluate the model’s
performance on Wikipedia in other languages.
JR Oakes | @jroakes #TechSEOBoost
About TF-IDF
JR Oakes | @jroakes #TechSEOBoost
About TF-IDF
Wikimedia Research released their first model on Github last month.
MjoLniR – our Python and Spark-based library for handling the
backend data processing for Machine Learned Ranking at
Wikimedia.
https://github.com/wikimedia/search-MjoLniR/tree/master/mjolnir
JR Oakes | @jroakes #TechSEOBoost
About TF-IDF
We are WAY beyond TF-IDF. TF-IDF seems to work because it causes you to
look for related phrases, but it is not a very good relevance metric. It is a
keyword frequency metric.
JR Oakes | @jroakes #TechSEOBoost
How is Google Using Machine
Learning?
JR Oakes | @jroakes #TechSEOBoost
Was Larry Kim right?
JR Oakes | @jroakes #TechSEOBoost
CTR As A Ranking Factor
JR Oakes | @jroakes #TechSEOBoost
CTR As A Ranking Factor
JR Oakes | @jroakes #TechSEOBoost
CTR As A Ranking Factor
Potentially:
• Clicks - For our click model we use a generalization of the PositionBased
Model (PBM) [9], at the core of which lies an examination hypothesis,
stating that in order to be clicked a document has to be examined and
attractive:
• Attention – What if users get the information that they need directly from
the SERP (Answer boxes), without a click, how do we know they were
satisfied?
• Satisfaction – “While looking at the reasons specified by the raters we
found out that 42% of the raters who said that they would click through on
a SERP, indicated that their goal was “to confirm information already
present in the summary” So additional clicks don’t necessarily mean a
poor initial result.
JR Oakes | @jroakes #TechSEOBoost
CTR As A Ranking Factor
JR Oakes | @jroakes #TechSEOBoost
CTR As A Ranking Factor
Machine Learning in its simplest form takes:
1. Input features
2. An algorithm that processes the features (most often) in a linear, non-
linear, or tree-based way to make a prediction.
3. And an evaluation metric that compares the prediction to your “ground
truth” data.
It is technically possible that CTR and / or Quality Rater data provides the
ground truth.
JR Oakes | @jroakes #TechSEOBoost
CTR As A Ranking Factor
The problem is:
We don’t have the ground truth, we don’t know the features, and we sure as
hell have no idea what is in here:
JR Oakes | @jroakes #TechSEOBoost
CTR As A Ranking Factor
We know that it probably depends on:
• Click-through-rate
• Context models
• Ground-truth quality (Quality Rater’s Guidelines)
• And other standard factors.
JR Oakes | @jroakes #TechSEOBoost
Storytelling
JR Oakes | @jroakes #TechSEOBoost
Storytelling
JR Oakes | @jroakes #TechSEOBoost
Storytelling
Using Generative Adversarial Networks to train machines how to see the
storylines in news events.
https://www.ijcai.org/proceedings/2017/0554.pdf
JR Oakes | @jroakes #TechSEOBoost
LSTMs
JR Oakes | @jroakes #TechSEOBoost
LSTMs
We would also guess that LSTMs (with attention) play some role in Rankbrain
based on its state-of-the-art ability to pick up referential information in texts
well beyond traditional BOW models.
http://colah.github.io/posts/2015-08-Understanding-LSTMs/
JR Oakes | @jroakes #TechSEOBoost
What should we focus on?
JR Oakes | @jroakes #TechSEOBoost
Query Disambiguation
JR Oakes | @jroakes #TechSEOBoost
Query Disambiguation
Very little information in the query and
a lot of information in the possible
results.
JR Oakes | @jroakes #TechSEOBoost
Query Disambiguation
Google tries to give us a nudge.
JR Oakes | @jroakes #TechSEOBoost
Query Disambiguation
What a strong hint to
consider when thinking about
what needs to be included
on a page discussing:
Lipton Tea
Also a very strong hint at
potential navigation.
JR Oakes | @jroakes #TechSEOBoost
Query Disambiguation
AT&T does an amazing job
at this.
JR Oakes | @jroakes #TechSEOBoost
Semantic Relevance
JR Oakes | @jroakes #TechSEOBoost
Semantic Relevance
Bill Slawski (as always) is spot on.
JR Oakes | @jroakes #TechSEOBoost
Semantic Relevance
Going back to the patent from Google
in 2014 (Integrated external related
phrase information into a phrase-
based indexing information retrieval
system), we see that there is an
marked gain in the significance of
phrases in a page based on
additional semantically related
qualifying phrases.
JR Oakes | @jroakes #TechSEOBoost
Semantic Relevance
There are many ways to handle this on a
page level.
JR Oakes | @jroakes #TechSEOBoost
Semantic Relevance
But, this really starts much sooner by
trying to discover content / intent
categories that your site is relevant
for to even start the process of
building out relevant content
categories for your visitors.
https://anaconda.org/jroakes/cluster-
share/notebook
JR Oakes | @jroakes #TechSEOBoost
Semantic Relevance
The prior notebook ingests your
keywords, models them to vector
space, and then runs k-means to
group the keywords into relevance
clusters.
JR Oakes | @jroakes #TechSEOBoost
Semantic Relevance
JR Oakes | @jroakes #TechSEOBoost
Semantic Relevance
Note this goes well beyond term-
frequency.
JR Oakes | @jroakes #TechSEOBoost
Semantic Relevance
Using skip-gram models impart
probability of cooccurrence across
large corpuses which is much closer
to what Google does than simple tf-
idf.
JR Oakes | @jroakes #TechSEOBoost
We should also care about click
satisfaction.
JR Oakes | @jroakes #TechSEOBoost
Click Satisfaction
JR Oakes | @jroakes #TechSEOBoost
Click Satisfaction
Working hard to ensure that your pages get the clicks. H/T to @fighto for the
excellent article here:
https://searchengineland.com/alert-abnormal-organic-ctr-detected-automatic-
detection-poorly-performing-meta-data-280290
https://anaconda.org/jroakes/ctr_anamolies_share/notebook
JR Oakes | @jroakes #TechSEOBoost
We should also care about content
deduplication.
JR Oakes | @jroakes #TechSEOBoost
Content Deduplication
https://anaconda.org/jroakes/duplicate_detection
_with_shingling_share/notebook
JR Oakes | @jroakes #TechSEOBoost
Wrapping Up
It is very difficult to gain intuition into how Google works based on solely external
data. The reality is that context, machine learning, and click data allows for the
building of models that humans cannot understand easily.
We wanted to move the conversation away from simplistic keyword mechanisms
and towards an understanding that there semantics and context are much more
valuable to ranking.
JR Oakes | @jroakes #TechSEOBoost

Contenu connexe

Tendances

The inbounder London - 2. May 2017 Tom Anthony
The inbounder London - 2. May 2017  Tom Anthony The inbounder London - 2. May 2017  Tom Anthony
The inbounder London - 2. May 2017 Tom Anthony We Are Marketing
 
TechSEO Boost 2018: The Statelessness of Technical SEO
TechSEO Boost 2018: The Statelessness of Technical SEOTechSEO Boost 2018: The Statelessness of Technical SEO
TechSEO Boost 2018: The Statelessness of Technical SEOCatalyst
 
TechSEO Boost 2017: The State of Technical SEO
TechSEO Boost 2017: The State of Technical SEOTechSEO Boost 2017: The State of Technical SEO
TechSEO Boost 2017: The State of Technical SEOCatalyst
 
NLP for SEO
NLP for SEONLP for SEO
NLP for SEOCatalyst
 
Scaling Keyword Research to Find Content Gaps
Scaling Keyword Research to Find Content GapsScaling Keyword Research to Find Content Gaps
Scaling Keyword Research to Find Content GapsHamlet Batista
 
Crawl Budget Conqueror - Take Control of Your Crawl Budget
Crawl Budget Conqueror - Take Control of Your Crawl BudgetCrawl Budget Conqueror - Take Control of Your Crawl Budget
Crawl Budget Conqueror - Take Control of Your Crawl BudgetCatalyst
 
MeasureFest July 2021 - Session Segmentation with Machine Learning
MeasureFest July 2021 - Session Segmentation with Machine LearningMeasureFest July 2021 - Session Segmentation with Machine Learning
MeasureFest July 2021 - Session Segmentation with Machine LearningRichard Lawrence
 
What I Learned Building a Toy Example to Crawl & Render like Google
What I Learned Building a Toy Example to Crawl & Render like GoogleWhat I Learned Building a Toy Example to Crawl & Render like Google
What I Learned Building a Toy Example to Crawl & Render like GoogleCatalyst
 
Technical Content Optimization
Technical Content OptimizationTechnical Content Optimization
Technical Content OptimizationMichael King
 
SearchLove Boston 2016 | Mary Bowling | Local Search Experience Optimization
SearchLove Boston 2016 | Mary Bowling | Local Search Experience OptimizationSearchLove Boston 2016 | Mary Bowling | Local Search Experience Optimization
SearchLove Boston 2016 | Mary Bowling | Local Search Experience OptimizationDistilled
 
Using Competitive Gap Analyses to Discover Low-Hanging Fruit
Using Competitive Gap Analyses to Discover Low-Hanging FruitUsing Competitive Gap Analyses to Discover Low-Hanging Fruit
Using Competitive Gap Analyses to Discover Low-Hanging FruitKeith Goode
 
SearchLeeds 2018 - Steve Chambers - Stickyeyes - How not to F**K up a Migration
SearchLeeds 2018 - Steve Chambers - Stickyeyes - How not to F**K up a Migration SearchLeeds 2018 - Steve Chambers - Stickyeyes - How not to F**K up a Migration
SearchLeeds 2018 - Steve Chambers - Stickyeyes - How not to F**K up a Migration Branded3
 
Breaking Down NLP for SEOs - SMX Advanced Europe 2019 - Paul Shapiro
Breaking Down NLP for SEOs - SMX Advanced Europe 2019 - Paul ShapiroBreaking Down NLP for SEOs - SMX Advanced Europe 2019 - Paul Shapiro
Breaking Down NLP for SEOs - SMX Advanced Europe 2019 - Paul ShapiroPaul Shapiro
 
Max Prin - MnSearch Summit 2018 - SEO for the Current Mobile Landscape
Max Prin - MnSearch Summit 2018 - SEO for the Current Mobile LandscapeMax Prin - MnSearch Summit 2018 - SEO for the Current Mobile Landscape
Max Prin - MnSearch Summit 2018 - SEO for the Current Mobile LandscapeMax Prin
 
Conflicting Website Signals & Confused Search Engines - Rachel Costello, Tech...
Conflicting Website Signals & Confused Search Engines - Rachel Costello, Tech...Conflicting Website Signals & Confused Search Engines - Rachel Costello, Tech...
Conflicting Website Signals & Confused Search Engines - Rachel Costello, Tech...Rachel Costello
 
The New Renaissance of JavaScript
The New Renaissance of JavaScriptThe New Renaissance of JavaScript
The New Renaissance of JavaScriptHamlet Batista
 
Software Testing for SEO
Software Testing for SEOSoftware Testing for SEO
Software Testing for SEOMichael King
 
Schema and Open Graph 101 - SMX Munich
Schema and Open Graph 101 - SMX MunichSchema and Open Graph 101 - SMX Munich
Schema and Open Graph 101 - SMX MunichMatthew Brown
 

Tendances (20)

The inbounder London - 2. May 2017 Tom Anthony
The inbounder London - 2. May 2017  Tom Anthony The inbounder London - 2. May 2017  Tom Anthony
The inbounder London - 2. May 2017 Tom Anthony
 
TechSEO Boost 2018: The Statelessness of Technical SEO
TechSEO Boost 2018: The Statelessness of Technical SEOTechSEO Boost 2018: The Statelessness of Technical SEO
TechSEO Boost 2018: The Statelessness of Technical SEO
 
TechSEO Boost 2017: The State of Technical SEO
TechSEO Boost 2017: The State of Technical SEOTechSEO Boost 2017: The State of Technical SEO
TechSEO Boost 2017: The State of Technical SEO
 
NLP for SEO
NLP for SEONLP for SEO
NLP for SEO
 
Scaling Keyword Research to Find Content Gaps
Scaling Keyword Research to Find Content GapsScaling Keyword Research to Find Content Gaps
Scaling Keyword Research to Find Content Gaps
 
Crawl Budget Conqueror - Take Control of Your Crawl Budget
Crawl Budget Conqueror - Take Control of Your Crawl BudgetCrawl Budget Conqueror - Take Control of Your Crawl Budget
Crawl Budget Conqueror - Take Control of Your Crawl Budget
 
MeasureFest July 2021 - Session Segmentation with Machine Learning
MeasureFest July 2021 - Session Segmentation with Machine LearningMeasureFest July 2021 - Session Segmentation with Machine Learning
MeasureFest July 2021 - Session Segmentation with Machine Learning
 
What I Learned Building a Toy Example to Crawl & Render like Google
What I Learned Building a Toy Example to Crawl & Render like GoogleWhat I Learned Building a Toy Example to Crawl & Render like Google
What I Learned Building a Toy Example to Crawl & Render like Google
 
Technical Content Optimization
Technical Content OptimizationTechnical Content Optimization
Technical Content Optimization
 
SearchLove Boston 2016 | Mary Bowling | Local Search Experience Optimization
SearchLove Boston 2016 | Mary Bowling | Local Search Experience OptimizationSearchLove Boston 2016 | Mary Bowling | Local Search Experience Optimization
SearchLove Boston 2016 | Mary Bowling | Local Search Experience Optimization
 
Using Competitive Gap Analyses to Discover Low-Hanging Fruit
Using Competitive Gap Analyses to Discover Low-Hanging FruitUsing Competitive Gap Analyses to Discover Low-Hanging Fruit
Using Competitive Gap Analyses to Discover Low-Hanging Fruit
 
SearchLeeds 2018 - Steve Chambers - Stickyeyes - How not to F**K up a Migration
SearchLeeds 2018 - Steve Chambers - Stickyeyes - How not to F**K up a Migration SearchLeeds 2018 - Steve Chambers - Stickyeyes - How not to F**K up a Migration
SearchLeeds 2018 - Steve Chambers - Stickyeyes - How not to F**K up a Migration
 
Breaking Down NLP for SEOs - SMX Advanced Europe 2019 - Paul Shapiro
Breaking Down NLP for SEOs - SMX Advanced Europe 2019 - Paul ShapiroBreaking Down NLP for SEOs - SMX Advanced Europe 2019 - Paul Shapiro
Breaking Down NLP for SEOs - SMX Advanced Europe 2019 - Paul Shapiro
 
MnSearch Summit 2018 - Rob Ousbey – The Evolution of SEO: Split-Testing for S...
MnSearch Summit 2018 - Rob Ousbey – The Evolution of SEO: Split-Testing for S...MnSearch Summit 2018 - Rob Ousbey – The Evolution of SEO: Split-Testing for S...
MnSearch Summit 2018 - Rob Ousbey – The Evolution of SEO: Split-Testing for S...
 
Max Prin - MnSearch Summit 2018 - SEO for the Current Mobile Landscape
Max Prin - MnSearch Summit 2018 - SEO for the Current Mobile LandscapeMax Prin - MnSearch Summit 2018 - SEO for the Current Mobile Landscape
Max Prin - MnSearch Summit 2018 - SEO for the Current Mobile Landscape
 
Conflicting Website Signals & Confused Search Engines - Rachel Costello, Tech...
Conflicting Website Signals & Confused Search Engines - Rachel Costello, Tech...Conflicting Website Signals & Confused Search Engines - Rachel Costello, Tech...
Conflicting Website Signals & Confused Search Engines - Rachel Costello, Tech...
 
The New Renaissance of JavaScript
The New Renaissance of JavaScriptThe New Renaissance of JavaScript
The New Renaissance of JavaScript
 
Software Testing for SEO
Software Testing for SEOSoftware Testing for SEO
Software Testing for SEO
 
Schema and Open Graph 101 - SMX Munich
Schema and Open Graph 101 - SMX MunichSchema and Open Graph 101 - SMX Munich
Schema and Open Graph 101 - SMX Munich
 
MnSearch Summit 2018 - Paul Shapiro – Start Building SEO Efficiencies with Au...
MnSearch Summit 2018 - Paul Shapiro – Start Building SEO Efficiencies with Au...MnSearch Summit 2018 - Paul Shapiro – Start Building SEO Efficiencies with Au...
MnSearch Summit 2018 - Paul Shapiro – Start Building SEO Efficiencies with Au...
 

Similaire à Machine Learning Shaping Google and Technical SEO

You Don't Know SEO
You Don't Know SEOYou Don't Know SEO
You Don't Know SEOMichael King
 
ChatGPT-and-Generative-AI-Landscape Working of generative ai search
ChatGPT-and-Generative-AI-Landscape Working of generative ai searchChatGPT-and-Generative-AI-Landscape Working of generative ai search
ChatGPT-and-Generative-AI-Landscape Working of generative ai searchrohitcse52
 
Seo company in chennai
Seo company in chennaiSeo company in chennai
Seo company in chennaiNissyMary
 
TechSEO Boost 2018: Research Competition
TechSEO Boost 2018: Research CompetitionTechSEO Boost 2018: Research Competition
TechSEO Boost 2018: Research CompetitionCatalyst
 
TechSEO Boost 2019: Research Competition
TechSEO Boost 2019: Research CompetitionTechSEO Boost 2019: Research Competition
TechSEO Boost 2019: Research CompetitionCatalyst
 
TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...
TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...
TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...Catalyst
 
Croud Presents: How to Build a Data-driven SEO Strategy Using NLP
Croud Presents: How to Build a Data-driven SEO Strategy Using NLPCroud Presents: How to Build a Data-driven SEO Strategy Using NLP
Croud Presents: How to Build a Data-driven SEO Strategy Using NLPDaniel Liddle
 
Search engine optimization Company
Search engine optimization CompanySearch engine optimization Company
Search engine optimization Companytirumala375
 
seo company in chennai
seo company in chennai seo company in chennai
seo company in chennai Vinoth Dlk
 
Module 9: Natural Language Processing Part 2
Module 9:  Natural Language Processing Part 2Module 9:  Natural Language Processing Part 2
Module 9: Natural Language Processing Part 2Sara Hooker
 
Birst Webinar Slides: "Build vs. Buy - Making the Right Choice for a Great Da...
Birst Webinar Slides: "Build vs. Buy - Making the Right Choice for a Great Da...Birst Webinar Slides: "Build vs. Buy - Making the Right Choice for a Great Da...
Birst Webinar Slides: "Build vs. Buy - Making the Right Choice for a Great Da...Birst
 
skillshare organic search strategy
skillshare organic search strategyskillshare organic search strategy
skillshare organic search strategyAlexis Sanders
 
Machine Learning and Python For Marketing Automation | MKGO October 2019 | Ru...
Machine Learning and Python For Marketing Automation | MKGO October 2019 | Ru...Machine Learning and Python For Marketing Automation | MKGO October 2019 | Ru...
Machine Learning and Python For Marketing Automation | MKGO October 2019 | Ru...Ruth Everett
 
Trusting Your Ingredients - What Building Software And Cheesecake Have In Common
Trusting Your Ingredients - What Building Software And Cheesecake Have In CommonTrusting Your Ingredients - What Building Software And Cheesecake Have In Common
Trusting Your Ingredients - What Building Software And Cheesecake Have In CommonLeon Stigter
 
TechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - Trivadis
TechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - TrivadisTechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - Trivadis
TechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - TrivadisTrivadis
 
Technical SEO Training Day | Igoo
Technical SEO Training Day | Igoo Technical SEO Training Day | Igoo
Technical SEO Training Day | Igoo Charlie Whitworth
 
Making Reddit Search Relevant and Scalable - Anupama Joshi & Jerry Bao, Reddit
Making Reddit Search Relevant and Scalable - Anupama Joshi & Jerry Bao, RedditMaking Reddit Search Relevant and Scalable - Anupama Joshi & Jerry Bao, Reddit
Making Reddit Search Relevant and Scalable - Anupama Joshi & Jerry Bao, RedditLucidworks
 
A Deep Dive Into SEO Tactics For Modern Javascript Frameworks
A Deep Dive Into SEO Tactics For Modern Javascript FrameworksA Deep Dive Into SEO Tactics For Modern Javascript Frameworks
A Deep Dive Into SEO Tactics For Modern Javascript FrameworksHamlet Batista
 

Similaire à Machine Learning Shaping Google and Technical SEO (20)

You Don't Know SEO
You Don't Know SEOYou Don't Know SEO
You Don't Know SEO
 
ChatGPT-and-Generative-AI-Landscape Working of generative ai search
ChatGPT-and-Generative-AI-Landscape Working of generative ai searchChatGPT-and-Generative-AI-Landscape Working of generative ai search
ChatGPT-and-Generative-AI-Landscape Working of generative ai search
 
Seo company in chennai
Seo company in chennaiSeo company in chennai
Seo company in chennai
 
TechSEO Boost 2018: Research Competition
TechSEO Boost 2018: Research CompetitionTechSEO Boost 2018: Research Competition
TechSEO Boost 2018: Research Competition
 
TechSEO Boost 2019: Research Competition
TechSEO Boost 2019: Research CompetitionTechSEO Boost 2019: Research Competition
TechSEO Boost 2019: Research Competition
 
TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...
TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...
TechSEO Boost 2021 - Rendering Strategies: Measuring the Devil’s Details in C...
 
Croud Presents: How to Build a Data-driven SEO Strategy Using NLP
Croud Presents: How to Build a Data-driven SEO Strategy Using NLPCroud Presents: How to Build a Data-driven SEO Strategy Using NLP
Croud Presents: How to Build a Data-driven SEO Strategy Using NLP
 
Search engine optimization Company
Search engine optimization CompanySearch engine optimization Company
Search engine optimization Company
 
seo company in chennai
seo company in chennai seo company in chennai
seo company in chennai
 
Module 9: Natural Language Processing Part 2
Module 9:  Natural Language Processing Part 2Module 9:  Natural Language Processing Part 2
Module 9: Natural Language Processing Part 2
 
Birst Webinar Slides: "Build vs. Buy - Making the Right Choice for a Great Da...
Birst Webinar Slides: "Build vs. Buy - Making the Right Choice for a Great Da...Birst Webinar Slides: "Build vs. Buy - Making the Right Choice for a Great Da...
Birst Webinar Slides: "Build vs. Buy - Making the Right Choice for a Great Da...
 
Demo day
Demo dayDemo day
Demo day
 
skillshare organic search strategy
skillshare organic search strategyskillshare organic search strategy
skillshare organic search strategy
 
Machine Learning and Python For Marketing Automation | MKGO October 2019 | Ru...
Machine Learning and Python For Marketing Automation | MKGO October 2019 | Ru...Machine Learning and Python For Marketing Automation | MKGO October 2019 | Ru...
Machine Learning and Python For Marketing Automation | MKGO October 2019 | Ru...
 
Trusting Your Ingredients - What Building Software And Cheesecake Have In Common
Trusting Your Ingredients - What Building Software And Cheesecake Have In CommonTrusting Your Ingredients - What Building Software And Cheesecake Have In Common
Trusting Your Ingredients - What Building Software And Cheesecake Have In Common
 
TechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - Trivadis
TechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - TrivadisTechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - Trivadis
TechEvent 2019: Artificial Intelligence in Dev & Ops; Martin Luckow - Trivadis
 
Technical SEO Training Day | Igoo
Technical SEO Training Day | Igoo Technical SEO Training Day | Igoo
Technical SEO Training Day | Igoo
 
Orchestration, the conductor's score
Orchestration, the conductor's scoreOrchestration, the conductor's score
Orchestration, the conductor's score
 
Making Reddit Search Relevant and Scalable - Anupama Joshi & Jerry Bao, Reddit
Making Reddit Search Relevant and Scalable - Anupama Joshi & Jerry Bao, RedditMaking Reddit Search Relevant and Scalable - Anupama Joshi & Jerry Bao, Reddit
Making Reddit Search Relevant and Scalable - Anupama Joshi & Jerry Bao, Reddit
 
A Deep Dive Into SEO Tactics For Modern Javascript Frameworks
A Deep Dive Into SEO Tactics For Modern Javascript FrameworksA Deep Dive Into SEO Tactics For Modern Javascript Frameworks
A Deep Dive Into SEO Tactics For Modern Javascript Frameworks
 

Plus de Catalyst

Closing the Gap: Adopting Omnichannel Strategies for Stronger Brand-Consumer ...
Closing the Gap: Adopting Omnichannel Strategies for Stronger Brand-Consumer ...Closing the Gap: Adopting Omnichannel Strategies for Stronger Brand-Consumer ...
Closing the Gap: Adopting Omnichannel Strategies for Stronger Brand-Consumer ...Catalyst
 
TechSEO Boost 2021 - Cultivating a Product Mindset for Success
TechSEO Boost 2021 - Cultivating a Product Mindset for SuccessTechSEO Boost 2021 - Cultivating a Product Mindset for Success
TechSEO Boost 2021 - Cultivating a Product Mindset for SuccessCatalyst
 
TechSEO Boost 2021 - SEO Experimentation
TechSEO Boost 2021 - SEO ExperimentationTechSEO Boost 2021 - SEO Experimentation
TechSEO Boost 2021 - SEO ExperimentationCatalyst
 
TechSEO Boost 2021 - The Future Is The Past: Tagging And Tracking Through The...
TechSEO Boost 2021 - The Future Is The Past: Tagging And Tracking Through The...TechSEO Boost 2021 - The Future Is The Past: Tagging And Tracking Through The...
TechSEO Boost 2021 - The Future Is The Past: Tagging And Tracking Through The...Catalyst
 
10 Trends Changing Programmatic
10 Trends Changing Programmatic10 Trends Changing Programmatic
10 Trends Changing ProgrammaticCatalyst
 
New Commerce Conference: Charting a Course to Success with Your Retail Media ...
New Commerce Conference: Charting a Course to Success with Your Retail Media ...New Commerce Conference: Charting a Course to Success with Your Retail Media ...
New Commerce Conference: Charting a Course to Success with Your Retail Media ...Catalyst
 
The New Commerce Conference: The Omni-channel Imperative
The New Commerce Conference: The Omni-channel ImperativeThe New Commerce Conference: The Omni-channel Imperative
The New Commerce Conference: The Omni-channel ImperativeCatalyst
 
New Commerce Commerce: All Things Instacart
New Commerce Commerce: All Things InstacartNew Commerce Commerce: All Things Instacart
New Commerce Commerce: All Things InstacartCatalyst
 
The Power of SEO: Protect Your Bottom Line & Future Proof Your Brand
The Power of SEO: Protect Your Bottom Line & Future Proof Your BrandThe Power of SEO: Protect Your Bottom Line & Future Proof Your Brand
The Power of SEO: Protect Your Bottom Line & Future Proof Your BrandCatalyst
 
The Era of Omni-Commerce: New Insights for Dominating the Digital Shelf and B...
The Era of Omni-Commerce: New Insights for Dominating the Digital Shelf and B...The Era of Omni-Commerce: New Insights for Dominating the Digital Shelf and B...
The Era of Omni-Commerce: New Insights for Dominating the Digital Shelf and B...Catalyst
 
Reignite Your Business with Performance Marketing: 4 Ways to Fuel Your Reopening
Reignite Your Business with Performance Marketing: 4 Ways to Fuel Your ReopeningReignite Your Business with Performance Marketing: 4 Ways to Fuel Your Reopening
Reignite Your Business with Performance Marketing: 4 Ways to Fuel Your ReopeningCatalyst
 
Reignite Your Business with Performance Marketing: 4 Ways to Dial-Up Brand In...
Reignite Your Business with Performance Marketing: 4 Ways to Dial-Up Brand In...Reignite Your Business with Performance Marketing: 4 Ways to Dial-Up Brand In...
Reignite Your Business with Performance Marketing: 4 Ways to Dial-Up Brand In...Catalyst
 
Evolve Your Social Commerce Strategy: Thinking Beyond Facebook
Evolve Your Social Commerce Strategy: Thinking Beyond FacebookEvolve Your Social Commerce Strategy: Thinking Beyond Facebook
Evolve Your Social Commerce Strategy: Thinking Beyond FacebookCatalyst
 
B2B SEO: Increase Traffic & Leads in 2020
B2B SEO: Increase Traffic & Leads in 2020B2B SEO: Increase Traffic & Leads in 2020
B2B SEO: Increase Traffic & Leads in 2020Catalyst
 
Keynote: Bias in Search and Recommender Systems
Keynote: Bias in Search and Recommender SystemsKeynote: Bias in Search and Recommender Systems
Keynote: Bias in Search and Recommender SystemsCatalyst
 
Generating Qualitative Content with GPT-2 in All Languages
Generating Qualitative Content with GPT-2 in All LanguagesGenerating Qualitative Content with GPT-2 in All Languages
Generating Qualitative Content with GPT-2 in All LanguagesCatalyst
 
Automate, Create Tools, & Test Ideas Quickly with Google Apps Script
Automate, Create Tools, & Test Ideas Quickly with Google Apps ScriptAutomate, Create Tools, & Test Ideas Quickly with Google Apps Script
Automate, Create Tools, & Test Ideas Quickly with Google Apps ScriptCatalyst
 
The User is The Query: The Rise of Predictive Proactive Search
The User is The Query: The Rise of Predictive Proactive SearchThe User is The Query: The Rise of Predictive Proactive Search
The User is The Query: The Rise of Predictive Proactive SearchCatalyst
 
The Ultimate Pagination for SEO
The Ultimate Pagination for SEOThe Ultimate Pagination for SEO
The Ultimate Pagination for SEOCatalyst
 
Ranking Factors Going Causal: Regressions, Machine Learning, and Neural Networks
Ranking Factors Going Causal: Regressions, Machine Learning, and Neural NetworksRanking Factors Going Causal: Regressions, Machine Learning, and Neural Networks
Ranking Factors Going Causal: Regressions, Machine Learning, and Neural NetworksCatalyst
 

Plus de Catalyst (20)

Closing the Gap: Adopting Omnichannel Strategies for Stronger Brand-Consumer ...
Closing the Gap: Adopting Omnichannel Strategies for Stronger Brand-Consumer ...Closing the Gap: Adopting Omnichannel Strategies for Stronger Brand-Consumer ...
Closing the Gap: Adopting Omnichannel Strategies for Stronger Brand-Consumer ...
 
TechSEO Boost 2021 - Cultivating a Product Mindset for Success
TechSEO Boost 2021 - Cultivating a Product Mindset for SuccessTechSEO Boost 2021 - Cultivating a Product Mindset for Success
TechSEO Boost 2021 - Cultivating a Product Mindset for Success
 
TechSEO Boost 2021 - SEO Experimentation
TechSEO Boost 2021 - SEO ExperimentationTechSEO Boost 2021 - SEO Experimentation
TechSEO Boost 2021 - SEO Experimentation
 
TechSEO Boost 2021 - The Future Is The Past: Tagging And Tracking Through The...
TechSEO Boost 2021 - The Future Is The Past: Tagging And Tracking Through The...TechSEO Boost 2021 - The Future Is The Past: Tagging And Tracking Through The...
TechSEO Boost 2021 - The Future Is The Past: Tagging And Tracking Through The...
 
10 Trends Changing Programmatic
10 Trends Changing Programmatic10 Trends Changing Programmatic
10 Trends Changing Programmatic
 
New Commerce Conference: Charting a Course to Success with Your Retail Media ...
New Commerce Conference: Charting a Course to Success with Your Retail Media ...New Commerce Conference: Charting a Course to Success with Your Retail Media ...
New Commerce Conference: Charting a Course to Success with Your Retail Media ...
 
The New Commerce Conference: The Omni-channel Imperative
The New Commerce Conference: The Omni-channel ImperativeThe New Commerce Conference: The Omni-channel Imperative
The New Commerce Conference: The Omni-channel Imperative
 
New Commerce Commerce: All Things Instacart
New Commerce Commerce: All Things InstacartNew Commerce Commerce: All Things Instacart
New Commerce Commerce: All Things Instacart
 
The Power of SEO: Protect Your Bottom Line & Future Proof Your Brand
The Power of SEO: Protect Your Bottom Line & Future Proof Your BrandThe Power of SEO: Protect Your Bottom Line & Future Proof Your Brand
The Power of SEO: Protect Your Bottom Line & Future Proof Your Brand
 
The Era of Omni-Commerce: New Insights for Dominating the Digital Shelf and B...
The Era of Omni-Commerce: New Insights for Dominating the Digital Shelf and B...The Era of Omni-Commerce: New Insights for Dominating the Digital Shelf and B...
The Era of Omni-Commerce: New Insights for Dominating the Digital Shelf and B...
 
Reignite Your Business with Performance Marketing: 4 Ways to Fuel Your Reopening
Reignite Your Business with Performance Marketing: 4 Ways to Fuel Your ReopeningReignite Your Business with Performance Marketing: 4 Ways to Fuel Your Reopening
Reignite Your Business with Performance Marketing: 4 Ways to Fuel Your Reopening
 
Reignite Your Business with Performance Marketing: 4 Ways to Dial-Up Brand In...
Reignite Your Business with Performance Marketing: 4 Ways to Dial-Up Brand In...Reignite Your Business with Performance Marketing: 4 Ways to Dial-Up Brand In...
Reignite Your Business with Performance Marketing: 4 Ways to Dial-Up Brand In...
 
Evolve Your Social Commerce Strategy: Thinking Beyond Facebook
Evolve Your Social Commerce Strategy: Thinking Beyond FacebookEvolve Your Social Commerce Strategy: Thinking Beyond Facebook
Evolve Your Social Commerce Strategy: Thinking Beyond Facebook
 
B2B SEO: Increase Traffic & Leads in 2020
B2B SEO: Increase Traffic & Leads in 2020B2B SEO: Increase Traffic & Leads in 2020
B2B SEO: Increase Traffic & Leads in 2020
 
Keynote: Bias in Search and Recommender Systems
Keynote: Bias in Search and Recommender SystemsKeynote: Bias in Search and Recommender Systems
Keynote: Bias in Search and Recommender Systems
 
Generating Qualitative Content with GPT-2 in All Languages
Generating Qualitative Content with GPT-2 in All LanguagesGenerating Qualitative Content with GPT-2 in All Languages
Generating Qualitative Content with GPT-2 in All Languages
 
Automate, Create Tools, & Test Ideas Quickly with Google Apps Script
Automate, Create Tools, & Test Ideas Quickly with Google Apps ScriptAutomate, Create Tools, & Test Ideas Quickly with Google Apps Script
Automate, Create Tools, & Test Ideas Quickly with Google Apps Script
 
The User is The Query: The Rise of Predictive Proactive Search
The User is The Query: The Rise of Predictive Proactive SearchThe User is The Query: The Rise of Predictive Proactive Search
The User is The Query: The Rise of Predictive Proactive Search
 
The Ultimate Pagination for SEO
The Ultimate Pagination for SEOThe Ultimate Pagination for SEO
The Ultimate Pagination for SEO
 
Ranking Factors Going Causal: Regressions, Machine Learning, and Neural Networks
Ranking Factors Going Causal: Regressions, Machine Learning, and Neural NetworksRanking Factors Going Causal: Regressions, Machine Learning, and Neural Networks
Ranking Factors Going Causal: Regressions, Machine Learning, and Neural Networks
 

Dernier

Infographics about SEO strategies and uses
Infographics about SEO strategies and usesInfographics about SEO strategies and uses
Infographics about SEO strategies and usesbhavanirupeshmoksha
 
Understanding the Affiliate Marketing Channel; the short guide
Understanding the Affiliate Marketing Channel; the short guideUnderstanding the Affiliate Marketing Channel; the short guide
Understanding the Affiliate Marketing Channel; the short guidePartnercademy
 
Codes and Conventions of Film Magazine Covers.pptx
Codes and Conventions of Film Magazine Covers.pptxCodes and Conventions of Film Magazine Covers.pptx
Codes and Conventions of Film Magazine Covers.pptxGeorgeCulica
 
DIGITAL MARKETING STRATEGY_INFOGRAPHIC IMAGE.pdf
DIGITAL MARKETING STRATEGY_INFOGRAPHIC IMAGE.pdfDIGITAL MARKETING STRATEGY_INFOGRAPHIC IMAGE.pdf
DIGITAL MARKETING STRATEGY_INFOGRAPHIC IMAGE.pdfmayanksharma0441
 
Research and Discovery Tools for Experimentation - 17 Apr 2024 - v 2.3 (1).pdf
Research and Discovery Tools for Experimentation - 17 Apr 2024 - v 2.3 (1).pdfResearch and Discovery Tools for Experimentation - 17 Apr 2024 - v 2.3 (1).pdf
Research and Discovery Tools for Experimentation - 17 Apr 2024 - v 2.3 (1).pdfVWO
 
Influencer Marketing Power point presentation
Influencer Marketing  Power point presentationInfluencer Marketing  Power point presentation
Influencer Marketing Power point presentationdgtivemarketingagenc
 
Miss Immigrant USA Activity Pageant Program.pdf
Miss Immigrant USA Activity Pageant Program.pdfMiss Immigrant USA Activity Pageant Program.pdf
Miss Immigrant USA Activity Pageant Program.pdfMagdalena Kulisz
 
What I learned from auditing over 1,000,000 websites - SERP Conf 2024 Patrick...
What I learned from auditing over 1,000,000 websites - SERP Conf 2024 Patrick...What I learned from auditing over 1,000,000 websites - SERP Conf 2024 Patrick...
What I learned from auditing over 1,000,000 websites - SERP Conf 2024 Patrick...Ahrefs
 
From Chance to Choice - Tactical Link Building for International SEO
From Chance to Choice - Tactical Link Building for International SEOFrom Chance to Choice - Tactical Link Building for International SEO
From Chance to Choice - Tactical Link Building for International SEOSzymon Słowik
 
Digital Marketing Courses In Pune- school Of Internet Marketing
Digital Marketing Courses In Pune- school Of Internet MarketingDigital Marketing Courses In Pune- school Of Internet Marketing
Digital Marketing Courses In Pune- school Of Internet MarketingShauryaBadaya
 
Prezentare Brandfluence 2023 - Social Media Trends
Prezentare Brandfluence 2023 - Social Media TrendsPrezentare Brandfluence 2023 - Social Media Trends
Prezentare Brandfluence 2023 - Social Media TrendsCristian Manafu
 
Master the Art of Digital Recruitment in Asia.pdf
Master the Art of Digital Recruitment in Asia.pdfMaster the Art of Digital Recruitment in Asia.pdf
Master the Art of Digital Recruitment in Asia.pdfHigher Education Marketing
 
5 Digital Marketing Tips | Devherds Software Solutions
5 Digital Marketing Tips | Devherds Software Solutions5 Digital Marketing Tips | Devherds Software Solutions
5 Digital Marketing Tips | Devherds Software SolutionsDevherds Software Solutions
 
Michael Kors marketing assignment swot analysis
Michael Kors marketing assignment swot analysisMichael Kors marketing assignment swot analysis
Michael Kors marketing assignment swot analysisjunaid794917
 
The 10 Most Influential CMO's Leading the Way of Success, 2024 (Final file) (...
The 10 Most Influential CMO's Leading the Way of Success, 2024 (Final file) (...The 10 Most Influential CMO's Leading the Way of Success, 2024 (Final file) (...
The 10 Most Influential CMO's Leading the Way of Success, 2024 (Final file) (...CIO Business World
 
Common Culture: Paul Willis Symbolic Creativity
Common Culture: Paul Willis Symbolic CreativityCommon Culture: Paul Willis Symbolic Creativity
Common Culture: Paul Willis Symbolic CreativityMonishka Adhikari
 
SEO and Digital PR - How to Connect Your Teams to Maximise Success
SEO and Digital PR - How to Connect Your Teams to Maximise SuccessSEO and Digital PR - How to Connect Your Teams to Maximise Success
SEO and Digital PR - How to Connect Your Teams to Maximise SuccessLiv Day
 
Most Impressive Construction Leaders in Tech, Making Waves in the Industry, 2...
Most Impressive Construction Leaders in Tech, Making Waves in the Industry, 2...Most Impressive Construction Leaders in Tech, Making Waves in the Industry, 2...
Most Impressive Construction Leaders in Tech, Making Waves in the Industry, 2...CIO Business World
 
Introduction to marketing Management Notes
Introduction to marketing Management NotesIntroduction to marketing Management Notes
Introduction to marketing Management NotesKiranTiwari42
 
What’s the difference between Affiliate Marketing and Brand Partnerships?
What’s the difference between Affiliate Marketing and Brand Partnerships?What’s the difference between Affiliate Marketing and Brand Partnerships?
What’s the difference between Affiliate Marketing and Brand Partnerships?Partnercademy
 

Dernier (20)

Infographics about SEO strategies and uses
Infographics about SEO strategies and usesInfographics about SEO strategies and uses
Infographics about SEO strategies and uses
 
Understanding the Affiliate Marketing Channel; the short guide
Understanding the Affiliate Marketing Channel; the short guideUnderstanding the Affiliate Marketing Channel; the short guide
Understanding the Affiliate Marketing Channel; the short guide
 
Codes and Conventions of Film Magazine Covers.pptx
Codes and Conventions of Film Magazine Covers.pptxCodes and Conventions of Film Magazine Covers.pptx
Codes and Conventions of Film Magazine Covers.pptx
 
DIGITAL MARKETING STRATEGY_INFOGRAPHIC IMAGE.pdf
DIGITAL MARKETING STRATEGY_INFOGRAPHIC IMAGE.pdfDIGITAL MARKETING STRATEGY_INFOGRAPHIC IMAGE.pdf
DIGITAL MARKETING STRATEGY_INFOGRAPHIC IMAGE.pdf
 
Research and Discovery Tools for Experimentation - 17 Apr 2024 - v 2.3 (1).pdf
Research and Discovery Tools for Experimentation - 17 Apr 2024 - v 2.3 (1).pdfResearch and Discovery Tools for Experimentation - 17 Apr 2024 - v 2.3 (1).pdf
Research and Discovery Tools for Experimentation - 17 Apr 2024 - v 2.3 (1).pdf
 
Influencer Marketing Power point presentation
Influencer Marketing  Power point presentationInfluencer Marketing  Power point presentation
Influencer Marketing Power point presentation
 
Miss Immigrant USA Activity Pageant Program.pdf
Miss Immigrant USA Activity Pageant Program.pdfMiss Immigrant USA Activity Pageant Program.pdf
Miss Immigrant USA Activity Pageant Program.pdf
 
What I learned from auditing over 1,000,000 websites - SERP Conf 2024 Patrick...
What I learned from auditing over 1,000,000 websites - SERP Conf 2024 Patrick...What I learned from auditing over 1,000,000 websites - SERP Conf 2024 Patrick...
What I learned from auditing over 1,000,000 websites - SERP Conf 2024 Patrick...
 
From Chance to Choice - Tactical Link Building for International SEO
From Chance to Choice - Tactical Link Building for International SEOFrom Chance to Choice - Tactical Link Building for International SEO
From Chance to Choice - Tactical Link Building for International SEO
 
Digital Marketing Courses In Pune- school Of Internet Marketing
Digital Marketing Courses In Pune- school Of Internet MarketingDigital Marketing Courses In Pune- school Of Internet Marketing
Digital Marketing Courses In Pune- school Of Internet Marketing
 
Prezentare Brandfluence 2023 - Social Media Trends
Prezentare Brandfluence 2023 - Social Media TrendsPrezentare Brandfluence 2023 - Social Media Trends
Prezentare Brandfluence 2023 - Social Media Trends
 
Master the Art of Digital Recruitment in Asia.pdf
Master the Art of Digital Recruitment in Asia.pdfMaster the Art of Digital Recruitment in Asia.pdf
Master the Art of Digital Recruitment in Asia.pdf
 
5 Digital Marketing Tips | Devherds Software Solutions
5 Digital Marketing Tips | Devherds Software Solutions5 Digital Marketing Tips | Devherds Software Solutions
5 Digital Marketing Tips | Devherds Software Solutions
 
Michael Kors marketing assignment swot analysis
Michael Kors marketing assignment swot analysisMichael Kors marketing assignment swot analysis
Michael Kors marketing assignment swot analysis
 
The 10 Most Influential CMO's Leading the Way of Success, 2024 (Final file) (...
The 10 Most Influential CMO's Leading the Way of Success, 2024 (Final file) (...The 10 Most Influential CMO's Leading the Way of Success, 2024 (Final file) (...
The 10 Most Influential CMO's Leading the Way of Success, 2024 (Final file) (...
 
Common Culture: Paul Willis Symbolic Creativity
Common Culture: Paul Willis Symbolic CreativityCommon Culture: Paul Willis Symbolic Creativity
Common Culture: Paul Willis Symbolic Creativity
 
SEO and Digital PR - How to Connect Your Teams to Maximise Success
SEO and Digital PR - How to Connect Your Teams to Maximise SuccessSEO and Digital PR - How to Connect Your Teams to Maximise Success
SEO and Digital PR - How to Connect Your Teams to Maximise Success
 
Most Impressive Construction Leaders in Tech, Making Waves in the Industry, 2...
Most Impressive Construction Leaders in Tech, Making Waves in the Industry, 2...Most Impressive Construction Leaders in Tech, Making Waves in the Industry, 2...
Most Impressive Construction Leaders in Tech, Making Waves in the Industry, 2...
 
Introduction to marketing Management Notes
Introduction to marketing Management NotesIntroduction to marketing Management Notes
Introduction to marketing Management Notes
 
What’s the difference between Affiliate Marketing and Brand Partnerships?
What’s the difference between Affiliate Marketing and Brand Partnerships?What’s the difference between Affiliate Marketing and Brand Partnerships?
What’s the difference between Affiliate Marketing and Brand Partnerships?
 

Machine Learning Shaping Google and Technical SEO

  • 1. JR Oakes | @jroakes #TechSEOBoost Fun with Machines. How Machine Learning is Shaping Google and Technical SEO
  • 2. JR Oakes | @jroakes #TechSEOBoost About Me • Studied Industrial Design at NCSU • Worked as an architectural glass artist for 10 years. • Was Lead Developer and then Director of Strategy for medium- sized agency with 100+ clients worldwide. • Work as Director, Technical SEO for Adapt.
  • 3. JR Oakes | @jroakes #TechSEOBoost I have a problem with tf-idf
  • 4. JR Oakes | @jroakes #TechSEOBoost About TF-IDF TF-IDF is very hand-wavy and sounds very fancy, but is not the magic elixir to DOMINATING ON GOOGLE.
  • 5. JR Oakes | @jroakes #TechSEOBoost About TF-IDF It is actually not even the best IR algorithm. BM25 takes into account document length in addition to other factors in various iterations.
  • 6. JR Oakes | @jroakes #TechSEOBoost About TF-IDF https://wikimedia-research.github.io/Discovery-Search-Test-BM25/ https://wikimedia-research.github.io/Discovery-Search-Test-InterleavedLTR/
  • 7. JR Oakes | @jroakes #TechSEOBoost About TF-IDF The Search Platform Team has been working on improving search on Wikimedia projects with machine learning. Machine learned-ranking (MLR) enables us to rank relevance of pages using a model trained on implicit and explicit judgements. In the first test of the learning-to-rank (LTR) project, we evaluated the performance of a click-based model on users searching English Wikipedia. We found that users were slightly more likely to engage with MLR- provided results than with BM25 results (assessed via the clickthrough rate and a preference statistic). We also found that users with machine learning- ranked results were statistically significantly more likely to click on the first search result first than users with BM25-ranked results, which indicates that we are onto something. The next step for us is to evaluate the model’s performance on Wikipedia in other languages.
  • 8. JR Oakes | @jroakes #TechSEOBoost About TF-IDF
  • 9. JR Oakes | @jroakes #TechSEOBoost About TF-IDF Wikimedia Research released their first model on Github last month. MjoLniR – our Python and Spark-based library for handling the backend data processing for Machine Learned Ranking at Wikimedia. https://github.com/wikimedia/search-MjoLniR/tree/master/mjolnir
  • 10. JR Oakes | @jroakes #TechSEOBoost About TF-IDF We are WAY beyond TF-IDF. TF-IDF seems to work because it causes you to look for related phrases, but it is not a very good relevance metric. It is a keyword frequency metric.
  • 11. JR Oakes | @jroakes #TechSEOBoost How is Google Using Machine Learning?
  • 12. JR Oakes | @jroakes #TechSEOBoost Was Larry Kim right?
  • 13. JR Oakes | @jroakes #TechSEOBoost CTR As A Ranking Factor
  • 14. JR Oakes | @jroakes #TechSEOBoost CTR As A Ranking Factor
  • 15. JR Oakes | @jroakes #TechSEOBoost CTR As A Ranking Factor Potentially: • Clicks - For our click model we use a generalization of the PositionBased Model (PBM) [9], at the core of which lies an examination hypothesis, stating that in order to be clicked a document has to be examined and attractive: • Attention – What if users get the information that they need directly from the SERP (Answer boxes), without a click, how do we know they were satisfied? • Satisfaction – “While looking at the reasons specified by the raters we found out that 42% of the raters who said that they would click through on a SERP, indicated that their goal was “to confirm information already present in the summary” So additional clicks don’t necessarily mean a poor initial result.
  • 16. JR Oakes | @jroakes #TechSEOBoost CTR As A Ranking Factor
  • 17. JR Oakes | @jroakes #TechSEOBoost CTR As A Ranking Factor Machine Learning in its simplest form takes: 1. Input features 2. An algorithm that processes the features (most often) in a linear, non- linear, or tree-based way to make a prediction. 3. And an evaluation metric that compares the prediction to your “ground truth” data. It is technically possible that CTR and / or Quality Rater data provides the ground truth.
  • 18. JR Oakes | @jroakes #TechSEOBoost CTR As A Ranking Factor The problem is: We don’t have the ground truth, we don’t know the features, and we sure as hell have no idea what is in here:
  • 19. JR Oakes | @jroakes #TechSEOBoost CTR As A Ranking Factor We know that it probably depends on: • Click-through-rate • Context models • Ground-truth quality (Quality Rater’s Guidelines) • And other standard factors.
  • 20. JR Oakes | @jroakes #TechSEOBoost Storytelling
  • 21. JR Oakes | @jroakes #TechSEOBoost Storytelling
  • 22. JR Oakes | @jroakes #TechSEOBoost Storytelling Using Generative Adversarial Networks to train machines how to see the storylines in news events. https://www.ijcai.org/proceedings/2017/0554.pdf
  • 23. JR Oakes | @jroakes #TechSEOBoost LSTMs
  • 24. JR Oakes | @jroakes #TechSEOBoost LSTMs We would also guess that LSTMs (with attention) play some role in Rankbrain based on its state-of-the-art ability to pick up referential information in texts well beyond traditional BOW models. http://colah.github.io/posts/2015-08-Understanding-LSTMs/
  • 25. JR Oakes | @jroakes #TechSEOBoost What should we focus on?
  • 26. JR Oakes | @jroakes #TechSEOBoost Query Disambiguation
  • 27. JR Oakes | @jroakes #TechSEOBoost Query Disambiguation Very little information in the query and a lot of information in the possible results.
  • 28. JR Oakes | @jroakes #TechSEOBoost Query Disambiguation Google tries to give us a nudge.
  • 29. JR Oakes | @jroakes #TechSEOBoost Query Disambiguation What a strong hint to consider when thinking about what needs to be included on a page discussing: Lipton Tea Also a very strong hint at potential navigation.
  • 30. JR Oakes | @jroakes #TechSEOBoost Query Disambiguation AT&T does an amazing job at this.
  • 31. JR Oakes | @jroakes #TechSEOBoost Semantic Relevance
  • 32. JR Oakes | @jroakes #TechSEOBoost Semantic Relevance Bill Slawski (as always) is spot on.
  • 33. JR Oakes | @jroakes #TechSEOBoost Semantic Relevance Going back to the patent from Google in 2014 (Integrated external related phrase information into a phrase- based indexing information retrieval system), we see that there is an marked gain in the significance of phrases in a page based on additional semantically related qualifying phrases.
  • 34. JR Oakes | @jroakes #TechSEOBoost Semantic Relevance There are many ways to handle this on a page level.
  • 35. JR Oakes | @jroakes #TechSEOBoost Semantic Relevance But, this really starts much sooner by trying to discover content / intent categories that your site is relevant for to even start the process of building out relevant content categories for your visitors. https://anaconda.org/jroakes/cluster- share/notebook
  • 36. JR Oakes | @jroakes #TechSEOBoost Semantic Relevance The prior notebook ingests your keywords, models them to vector space, and then runs k-means to group the keywords into relevance clusters.
  • 37. JR Oakes | @jroakes #TechSEOBoost Semantic Relevance
  • 38. JR Oakes | @jroakes #TechSEOBoost Semantic Relevance Note this goes well beyond term- frequency.
  • 39. JR Oakes | @jroakes #TechSEOBoost Semantic Relevance Using skip-gram models impart probability of cooccurrence across large corpuses which is much closer to what Google does than simple tf- idf.
  • 40. JR Oakes | @jroakes #TechSEOBoost We should also care about click satisfaction.
  • 41. JR Oakes | @jroakes #TechSEOBoost Click Satisfaction
  • 42. JR Oakes | @jroakes #TechSEOBoost Click Satisfaction Working hard to ensure that your pages get the clicks. H/T to @fighto for the excellent article here: https://searchengineland.com/alert-abnormal-organic-ctr-detected-automatic- detection-poorly-performing-meta-data-280290 https://anaconda.org/jroakes/ctr_anamolies_share/notebook
  • 43. JR Oakes | @jroakes #TechSEOBoost We should also care about content deduplication.
  • 44. JR Oakes | @jroakes #TechSEOBoost Content Deduplication https://anaconda.org/jroakes/duplicate_detection _with_shingling_share/notebook
  • 45. JR Oakes | @jroakes #TechSEOBoost Wrapping Up It is very difficult to gain intuition into how Google works based on solely external data. The reality is that context, machine learning, and click data allows for the building of models that humans cannot understand easily. We wanted to move the conversation away from simplistic keyword mechanisms and towards an understanding that there semantics and context are much more valuable to ranking.
  • 46. JR Oakes | @jroakes #TechSEOBoost