Doing SEO for large websites.
Working on large websites, or a large number of websites? Let’s
talk about SEO at scale, in the enterprise.
31 m
1.8m & 220kg
17x larger
4,913x heavier
1,084 T
SLOWER DIFFICULT TO WORK WITH
Working in a large organisation
Working with data
Technical Foundation
Minimising Risk
Scaling Content
Reporting
Templates
Getting (& processing) data
Finding technical issues
Preventing technical issues
Templates
I would like 1,000 problems, please.
“Please fix all 18,304 pages”
LIES
LIES
5
6
Category
Home page
Product
Contact Us
Obviously different
Small product number Main category page
Out of stock product
Extremes
Facet category page Reviews Page 2
Same page different URL
Country
County
City
Area/District
Street
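One way to turn "18,304 pages" into a handful of templates is to classify URLs by pattern and count them. The sketch below is only an illustration: the URL patterns (`/product/`, `/category/`, `/contact-us`) and the names `TEMPLATE_RULES`, `classify` and `count_templates` are assumptions, and a real site will need its own rules.

```python
import re
from collections import Counter

# Hypothetical URL patterns. The first matching rule wins, so the most
# specific rules go first.
TEMPLATE_RULES = [
    ("home", re.compile(r"^/$")),
    ("category_paginated", re.compile(r"^/category/[^/]+/page/\d+/?$")),
    ("category", re.compile(r"^/category/[^/]+/?$")),
    ("product", re.compile(r"^/product/[^/]+/?$")),
    ("contact", re.compile(r"^/contact-us/?$")),
]

def classify(path: str) -> str:
    """Return the template name for a URL path, or 'other' if nothing matches."""
    for name, pattern in TEMPLATE_RULES:
        if pattern.search(path):
            return name
    return "other"

def count_templates(paths):
    """Count how many URLs fall into each template."""
    return Counter(classify(path) for path in paths)

paths = ["/", "/category/shoes", "/category/shoes/page/2",
         "/product/big-brown-shoe", "/contact-us", "/about"]
template_counts = count_templates(paths)
print(template_counts)
```

A large "other" bucket is usually the sign that a template is missing from the rules.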
Getting (& processing) data
Impressions week by week for new content
Pre change Post change
Clicks pre and post change for site sections
Competing pages for a set of terms
SLOWER DIFFICULT TO WORK WITH
SAMPLING
LIMITS (1,000 rows at a time)
LAG
SEGMENTATION
Search Console properties for a large brand.
Register all the things.
5 sub-folders provided
260% more keywords
Part 1: Data Studio
Part 2: Day by day data
Part 3: Python
Part 4: Data warehousing
Get
Get, Analyse
Get, Store, Analyse, Report
Part 1: Data Studio
Data Studio for extracting data
● Add a data source.
● Create a table for it.
● Download the table.
With both GA & GSC, you’ll get everything in the table, no paginating.
Part 2: Day by day data
Day by day data
To get even more data we have to get it day by day.
● bit.ly/search-console-data-downloader
This bit is Search Console only.
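A rough sketch of the day-by-day idea in Python: request one day at a time and page through each day with `startRow` until a short page comes back. The request fields (`startDate`, `endDate`, `dimensions`, `rowLimit`, `startRow`) are Search Console API fields; `run_query` is a hypothetical stand-in for whatever actually calls the API (with google-api-python-client that would be something like `service.searchanalytics().query(siteUrl=..., body=body).execute()`), and `fake_query` exists only so the example runs without credentials.

```python
from datetime import date, timedelta

def daily_ranges(start: date, end: date):
    """Yield each day from start to end (inclusive) as an ISO date string."""
    day = start
    while day <= end:
        yield day.isoformat()
        day += timedelta(days=1)

def fetch_day_by_day(run_query, start: date, end: date, row_limit: int = 25000):
    """Query one day at a time, paging with startRow until a short page."""
    rows = []
    for day in daily_ranges(start, end):
        start_row = 0
        while True:
            body = {
                "startDate": day,
                "endDate": day,
                "dimensions": ["page", "query"],
                "rowLimit": row_limit,
                "startRow": start_row,
            }
            # The API can omit "rows" when there is no data, hence .get().
            page = run_query(body).get("rows", [])
            rows.extend({"date": day, **row} for row in page)
            if len(page) < row_limit:
                break
            start_row += row_limit
    return rows

def fake_query(body):
    """Hypothetical stand-in for the API: three rows per day, served in pages."""
    all_rows = [{"keys": ["/p", f"q{i}"], "clicks": i} for i in range(3)]
    first = body["startRow"]
    return {"rows": all_rows[first:first + body["rowLimit"]]}

rows = fetch_day_by_day(fake_query, date(2021, 1, 1), date(2021, 1, 2), row_limit=2)
print(len(rows))
```

Tagging every row with its date means the results can be stacked into one table later.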
Part 3: Python
Getting data from APIs
Pull down your analytics data.
● Daily_google_analytics_v3
● Getting search console data from the API
Getting started with pandas:
● Pandas tutorial with ranking data
As a workflow I’d highly recommend Jupyter notebooks for getting started.
● Why use Jupyter notebooks?
● SearchLove Video (paid)
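As a flavour of what pandas makes easy, here is a small sketch of the "clicks pre and post change for site sections" analysis mentioned earlier. The data frame is made up, the column names (`date`, `page`, `clicks`) are assumptions about how an export might look, and `change_date` is a hypothetical release date.

```python
import pandas as pd

# Made-up daily export: one row per page per day.
df = pd.DataFrame({
    "date": pd.to_datetime(["2021-03-01", "2021-03-01", "2021-03-08", "2021-03-08"]),
    "page": ["/product/a", "/category/x", "/product/a", "/category/x"],
    "clicks": [10, 20, 15, 18],
})

change_date = pd.Timestamp("2021-03-05")  # hypothetical release date

# Site section = first folder of the path, e.g. "/product/a" -> "product".
df["section"] = df["page"].str.split("/").str[1]
# Label each row as before or after the change.
df["period"] = (df["date"] >= change_date).map({False: "pre", True: "post"})

# One row per section, one column per period, summing clicks.
summary = df.pivot_table(index="section", columns="period",
                         values="clicks", aggfunc="sum")
summary["change"] = summary["post"] - summary["pre"]
print(summary)
```

In a notebook the `summary` table renders inline, which is a large part of why Jupyter works well for this kind of exploration.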
SEO Pythonistas
A memorial to Hamlet Batista, and soon to be a
collection of his excellent work.
SEO Pythonistas - In loving
memory of Hamlet Batista
@DataChaz
Part 4: Data warehousing
Analyse
Store data
Get data
Report
Takes time & space.
A developer could do it.
Rolling your own
JC Chouinard has built a series
of excellent granular tutorials
which walk you through setting
up one on your own machine.
Link.
Off the shelf
Get in touch with me!
I run Piped Out which is
software for building SEO data
warehouses.
Finding technical issues
Part 1: Templates
Part 2: Logs
Part 3: Crawling Big
Part 2: Logs
Not the same fields as a crawl.
No page title for example.
● Crawling & indexing problems
● Measuring freshness (e.g. time until article crawled)
● Prioritisation
● Monitoring website changes (e.g. migrations)
[Chart: Status codes in product pages (200, 301, 302), Apr ‘19 to Oct ‘19]
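A chart like this can be built from raw logs with very little code. The sketch below assumes the common "combined" access log format and a `/product/` URL prefix, both of which are assumptions to adjust for your own setup. It also filters on the user agent string only, which is easy to spoof; proper Googlebot verification needs the client IP.

```python
import re
from collections import Counter
from datetime import datetime

# Assumes the "combined" access log format; adjust for your server or CDN.
LOG_LINE = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) [^"]*" (?P<status>\d{3}) \S+ '
    r'"[^"]*" "(?P<agent>[^"]*)"'
)

def status_by_month(lines, path_prefix="/product/"):
    """Count (month, status code) pairs for Googlebot hits under path_prefix."""
    counts = Counter()
    for line in lines:
        match = LOG_LINE.match(line)
        if not match:
            continue  # skip lines that are not in the expected format
        if "Googlebot" not in match["agent"]:
            continue  # user agent check only, not a verified bot
        if not match["path"].startswith(path_prefix):
            continue
        when = datetime.strptime(match["time"], "%d/%b/%Y:%H:%M:%S %z")
        counts[(when.strftime("%Y-%m"), int(match["status"]))] += 1
    return counts

# Made-up sample lines for illustration.
sample = [
    '66.249.66.1 - - [10/Apr/2019:10:00:00 +0000] "GET /product/big-brown-shoe HTTP/1.1" 200 512 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"',
    '66.249.66.1 - - [12/Jun/2019:11:00:00 +0000] "GET /product/old-shoe HTTP/1.1" 301 0 "-" "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"',
    '203.0.113.9 - - [12/Jun/2019:11:05:00 +0000] "GET /product/old-shoe HTTP/1.1" 200 512 "-" "Mozilla/5.0"',
]
counts = status_by_month(sample)
print(counts)
```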
ELK
● Crawling & indexing problems
● Measuring freshness
● Prioritisation
● Monitoring website changes (e.g. migrations)
● Debugging
Hi x
I’m {x} from {y}, and we’ve been asked to do some log analysis to better understand how Google is behaving on the website. I was hoping you could help with some questions about the log set-up (as well as with getting the logs!).
What time period do we want?
What we’d ideally like is 3-6 months of historical logs for the website. Our goal is to look at all the different pages search engines are crawling on our website, discover where they’re spending their time, the status code errors they’re finding, etc.
We can absolutely do analysis with a month or so (we've even done it with just a week or two), but it means we lose historical context, and we're more likely to miss things on a larger site.
There are also some things that are really helpful for us to know when getting logs.
Do the logs have any personal information in?
We’re just concerned about the various search crawler bots like Google and Bing, we don’t need any logs from users, so any logs with emails, or telephone numbers etc. can be removed.
Can we get logs from as close to the edge as possible?
It's pretty likely you've got a couple of different layers of your network that might log. Ideally we want logs from as close to the edge as possible. This prevents a couple of issues:
● If you've got caching going on, like a CDN or Varnish, and we get logs from behind it, we won't see any of the requests the cache answers.
● If you've got a load balancer distributing to several servers, sometimes the external IP gets lost (perhaps X-Forwarded-For isn't working), and we need that IP to verify Googlebot. We could also accidentally only get logs from a couple of servers.
Are there any sub parts of your site which log to a different place?
Have you got anything like an embedded WordPress blog which logs to a different location? If so then we’ll need those logs as well. (Although of course if you're sending us CDN logs this won't matter.)
How do you log hostname and protocol?
It's very helpful for us to be able to see hostname & protocol. How do you distinguish those in the log files?
Do you log HTTP & HTTPS to separate files? Do you log hostname at all?
This is one of the problems that's often solved by getting logs closer to the edge: while many servers won't give you those by default, load balancers and CDNs often will.
Where would we like the logs?
In an ideal world, they would be files in an S3 bucket and we can draw them down from there. If possible, we'd also ask that multiple files aren't zipped together for upload, because that makes processing harder. (Compressed logs are no problem, just not multiple log files zipped into a single archive.)
Is there anything else we should know?
Best,
{x}
Part 1: Templates
Part 2: Logs
Part 3: Crawling Big
Sampling your crawl
● Limit your crawl to a percentage per template, e.g.:
● 20% to product pages
● 30% to category pages
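If your crawler can take a URL list, the same idea can be sketched in a few lines of Python: give each template a percentage of the total crawl limit and draw a random sample from it. The function name `sample_crawl`, the URL lists and the crawl limit of 100 are all made up for illustration.

```python
import random

def sample_crawl(urls_by_template, percent_by_template, crawl_limit, seed=0):
    """Pick URLs so each template gets its percentage of crawl_limit."""
    rng = random.Random(seed)  # fixed seed keeps the sample repeatable
    sample = {}
    for template, percent in percent_by_template.items():
        urls = urls_by_template.get(template, [])
        # Integer maths avoids float rounding; never ask for more URLs than exist.
        quota = min(len(urls), crawl_limit * percent // 100)
        sample[template] = rng.sample(urls, quota)
    return sample

# Made-up URL lists, e.g. taken from sitemaps or a previous crawl.
urls_by_template = {
    "product": [f"/product/{i}" for i in range(1000)],
    "category": [f"/category/{i}" for i in range(50)],
}
sample = sample_crawl(urls_by_template, {"product": 20, "category": 30}, crawl_limit=100)
print({name: len(urls) for name, urls in sample.items()})
```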
Low memory crawler
Runs locally on your machine
and allows you to crawl with a
very low memory footprint.
It doesn’t render JS or process
data, however.
Run SF in the cloud
You can purchase a super high
memory computer in the cloud,
install SF on it and run it at
maximum speed.
Preventing technical issues
Search Console properties for a large brand.
Part 1: Manually crawling
Part 2: Automating assertions
Part 3: Unit testing
Change detection with SF
Part 1: Manually crawling change detection
Part 2: Automating assertions
Part 3: Unit testing
<meta name="robots" content="noindex">
<meta name="robots" content="noindex,nofollow">
<meta name="robots" content="noindex">
Is it different?
Is it the value I want?
Is it different?
<meta name="robots" content="noindex,nofollow">
<meta name="robots" content="noindex">
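The "is it different?" question is plain change detection: compare the value from the last crawl with the value from this one. A minimal sketch, assuming each crawl has been reduced to a dictionary of URL to meta robots value (the dictionaries and the name `diff_crawls` are hypothetical):

```python
def diff_crawls(before, after):
    """Return {url: (old, new)} for every URL whose value changed."""
    changes = {}
    for url in sorted(set(before) | set(after)):
        old, new = before.get(url), after.get(url)
        if old != new:
            changes[url] = (old, new)
    return changes

# Made-up meta robots values from two crawls; None means no tag was found.
yesterday = {"/product/big-brown-shoe": "noindex", "/category/shoes": None}
today = {"/product/big-brown-shoe": "noindex,nofollow", "/category/shoes": None}

changes = diff_crawls(yesterday, today)
print(changes)
```

This flags that something changed, but not whether either value was correct. Answering "is it the value I want?" needs an assertion against an expected value.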
Element | Equals
Title | Big Brown Shoe - £12.99 - Example.com
Status Code | 200
H1 | Big Brown Shoe
Canonical | <link rel="canonical" href="https://example.com/product/big-brown-shoe" />
CSS Selector: #review-counter | Any number
CSS Selector: #product-data | { "@context": "https://schema.org/", "@type": "Product", "name": "Big Brown Shoe", "description": "The biggest brownest shoe you can find.", "sku": "0446310786", "mpn": "925872" }
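An assertion table like this can be checked automatically. The sketch below uses only Python's standard library `html.parser`; the class `PageFacts`, the function `check_page` and the sample HTML are illustrative assumptions, and it covers only the title, H1, canonical and meta robots rows (status code and CSS selector checks would need an HTTP client and a fuller HTML library).

```python
from html.parser import HTMLParser

class PageFacts(HTMLParser):
    """Collect the title, first H1, canonical URL and meta robots of a page."""

    def __init__(self):
        super().__init__()
        self.facts = {}
        self._capture = None  # tag whose text we are currently reading

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag in ("title", "h1") and tag not in self.facts:
            self._capture = tag
            self.facts[tag] = ""
        elif tag == "link" and attrs.get("rel") == "canonical":
            self.facts["canonical"] = attrs.get("href")
        elif tag == "meta" and attrs.get("name") == "robots":
            self.facts["robots"] = attrs.get("content")

    def handle_data(self, data):
        if self._capture:
            self.facts[self._capture] += data

    def handle_endtag(self, tag):
        if tag == self._capture:
            self._capture = None

def check_page(html, expected):
    """Return (element, expected, actual) for every assertion that fails."""
    parser = PageFacts()
    parser.feed(html)
    return [(key, want, parser.facts.get(key))
            for key, want in expected.items()
            if parser.facts.get(key) != want]

# Made-up page that has picked up a stray noindex.
html = """<html><head>
<title>Big Brown Shoe - £12.99 - Example.com</title>
<link rel="canonical" href="https://example.com/product/big-brown-shoe" />
<meta name="robots" content="noindex">
</head><body><h1>Big Brown Shoe</h1></body></html>"""

expected = {
    "title": "Big Brown Shoe - £12.99 - Example.com",
    "h1": "Big Brown Shoe",
    "canonical": "https://example.com/product/big-brown-shoe",
    "robots": "index,follow",
}
failures = check_page(html, expected)
print(failures)
```

Running one check like this per template against a staging site is a cheap way to catch a stray noindex before it is deployed.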
Asserting with Google Sheets
Part 3: Unit testing
Unit tests
Create code, Test code, Deployment
All our hard work.
endtest.io
Conclusions
@dom_woodman
bit.ly/seo-for-large-websites
www.pipedout.com

 
Fueling A_B experiments with behavioral insights (1).pdf
Fueling A_B experiments with behavioral insights (1).pdfFueling A_B experiments with behavioral insights (1).pdf
Fueling A_B experiments with behavioral insights (1).pdfVWO
 
Common Culture: Paul Willis Symbolic Creativity
Common Culture: Paul Willis Symbolic CreativityCommon Culture: Paul Willis Symbolic Creativity
Common Culture: Paul Willis Symbolic CreativityMonishka Adhikari
 
From Chance to Choice - Tactical Link Building for International SEO
From Chance to Choice - Tactical Link Building for International SEOFrom Chance to Choice - Tactical Link Building for International SEO
From Chance to Choice - Tactical Link Building for International SEOSzymon Słowik
 
定制(ULV毕业证书)拉文大学毕业证成绩单原版一比一
定制(ULV毕业证书)拉文大学毕业证成绩单原版一比一定制(ULV毕业证书)拉文大学毕业证成绩单原版一比一
定制(ULV毕业证书)拉文大学毕业证成绩单原版一比一s SS
 
The 10 Most Inspirational Leaders LEADING THE WAY TO SUCCESS, 2024
The 10 Most Inspirational Leaders LEADING THE WAY TO SUCCESS, 2024The 10 Most Inspirational Leaders LEADING THE WAY TO SUCCESS, 2024
The 10 Most Inspirational Leaders LEADING THE WAY TO SUCCESS, 2024CIO Business World
 
Word Count for Writers: Examples of Word Counts for Sample Genres
Word Count for Writers: Examples of Word Counts for Sample GenresWord Count for Writers: Examples of Word Counts for Sample Genres
Word Count for Writers: Examples of Word Counts for Sample GenresLisa M. Masiello
 
DGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdf
DGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdfDGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdf
DGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdfDemandbase
 

Dernier (20)

The power of SEO-driven market intelligence
The power of SEO-driven market intelligenceThe power of SEO-driven market intelligence
The power of SEO-driven market intelligence
 
Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝
Call Girls in Lajpat Nagar Delhi 💯Call Us 🔝8264348440🔝
 
pptx.marketing strategy of tanishq. pptx
pptx.marketing strategy of tanishq. pptxpptx.marketing strategy of tanishq. pptx
pptx.marketing strategy of tanishq. pptx
 
Most Impressive Construction Leaders in Tech, Making Waves in the Industry, 2...
Most Impressive Construction Leaders in Tech, Making Waves in the Industry, 2...Most Impressive Construction Leaders in Tech, Making Waves in the Industry, 2...
Most Impressive Construction Leaders in Tech, Making Waves in the Industry, 2...
 
VIP Call Girls In Green Park 9654467111 Escorts Service
VIP Call Girls In Green Park 9654467111 Escorts ServiceVIP Call Girls In Green Park 9654467111 Escorts Service
VIP Call Girls In Green Park 9654467111 Escorts Service
 
Master the Art of Digital Recruitment in Asia.pdf
Master the Art of Digital Recruitment in Asia.pdfMaster the Art of Digital Recruitment in Asia.pdf
Master the Art of Digital Recruitment in Asia.pdf
 
Talent Management for mba 3rd sem useful
Talent Management for mba 3rd sem usefulTalent Management for mba 3rd sem useful
Talent Management for mba 3rd sem useful
 
2024 SEO Trends for Business Success (WSA)
2024 SEO Trends for Business Success (WSA)2024 SEO Trends for Business Success (WSA)
2024 SEO Trends for Business Success (WSA)
 
Snapshot of Consumer Behaviors of March 2024-EOLiSurvey (EN).pdf
Snapshot of Consumer Behaviors of March 2024-EOLiSurvey (EN).pdfSnapshot of Consumer Behaviors of March 2024-EOLiSurvey (EN).pdf
Snapshot of Consumer Behaviors of March 2024-EOLiSurvey (EN).pdf
 
What are the 4 characteristics of CTAs that convert?
What are the 4 characteristics of CTAs that convert?What are the 4 characteristics of CTAs that convert?
What are the 4 characteristics of CTAs that convert?
 
The Pitfalls of Keyword Stuffing in SEO Copywriting
The Pitfalls of Keyword Stuffing in SEO CopywritingThe Pitfalls of Keyword Stuffing in SEO Copywriting
The Pitfalls of Keyword Stuffing in SEO Copywriting
 
Exploring Web 3.0 Growth marketing: Navigating the Future of the Internet
Exploring Web 3.0 Growth marketing: Navigating the Future of the InternetExploring Web 3.0 Growth marketing: Navigating the Future of the Internet
Exploring Web 3.0 Growth marketing: Navigating the Future of the Internet
 
Influencer Marketing Power point presentation
Influencer Marketing  Power point presentationInfluencer Marketing  Power point presentation
Influencer Marketing Power point presentation
 
Fueling A_B experiments with behavioral insights (1).pdf
Fueling A_B experiments with behavioral insights (1).pdfFueling A_B experiments with behavioral insights (1).pdf
Fueling A_B experiments with behavioral insights (1).pdf
 
Common Culture: Paul Willis Symbolic Creativity
Common Culture: Paul Willis Symbolic CreativityCommon Culture: Paul Willis Symbolic Creativity
Common Culture: Paul Willis Symbolic Creativity
 
From Chance to Choice - Tactical Link Building for International SEO
From Chance to Choice - Tactical Link Building for International SEOFrom Chance to Choice - Tactical Link Building for International SEO
From Chance to Choice - Tactical Link Building for International SEO
 
定制(ULV毕业证书)拉文大学毕业证成绩单原版一比一
定制(ULV毕业证书)拉文大学毕业证成绩单原版一比一定制(ULV毕业证书)拉文大学毕业证成绩单原版一比一
定制(ULV毕业证书)拉文大学毕业证成绩单原版一比一
 
The 10 Most Inspirational Leaders LEADING THE WAY TO SUCCESS, 2024
The 10 Most Inspirational Leaders LEADING THE WAY TO SUCCESS, 2024The 10 Most Inspirational Leaders LEADING THE WAY TO SUCCESS, 2024
The 10 Most Inspirational Leaders LEADING THE WAY TO SUCCESS, 2024
 
Word Count for Writers: Examples of Word Counts for Sample Genres
Word Count for Writers: Examples of Word Counts for Sample GenresWord Count for Writers: Examples of Word Counts for Sample Genres
Word Count for Writers: Examples of Word Counts for Sample Genres
 
DGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdf
DGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdfDGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdf
DGR_Digital Advertising Strategies for a Cookieless World_Presentation.pdf
 

SEO for Large/Enterprise Websites - Data & Tech Side

  • 1. Doing SEO for large websites. Working on large websites, or a large number of websites. Let’s talk about SEO at scale, with the enterprise.
  • 13. x2
  • 14. x2 x2
  • 23. SLOWER DIFFICULT TO WORK WITH
  • 25. Working in a large organisation Working with data Technical Foundation Minimising Risk Scaling Content Reporting
  • 26. Working in a large organisation Scaling Content Reporting Working with data Technical Foundation Minimising Risk
  • 27. Working in a large organisation Scaling Content Working with data Technical Foundation Reporting Minimising Risk
  • 37. I would like a 1000 problems please.
  • 38. “Please fix all 18,304 pages”
  • 44. LIES
  • 48. Small product number Main category page Out of stock product Extremes
  • 49. Facet category page Reviews Page 2 Same page different URL
  • 53. Impressions week by week for new content
  • 54. Pre change Post change Clicks pre and post change for site sections
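
A pre/post comparison like the one on this slide can be sketched in a few lines. This is a minimal illustration, not the speaker's method: the `(date, url, clicks)` row shape, the first-path-segment definition of a "site section", and the example dates are all assumptions.

```python
from collections import defaultdict
from datetime import date
from urllib.parse import urlparse


def site_section(url: str) -> str:
    """Return the first path segment, e.g. '/blog' (a hypothetical segmentation rule)."""
    segments = [s for s in urlparse(url).path.split("/") if s]
    return "/" + segments[0] if segments else "/"


def clicks_pre_post(rows, change_date):
    """Sum clicks per site section before, and on or after, change_date.

    rows: iterable of (date, url, clicks) tuples, e.g. from a Search Console export.
    """
    totals = defaultdict(lambda: {"pre": 0, "post": 0})
    for day, url, clicks in rows:
        period = "pre" if day < change_date else "post"
        totals[site_section(url)][period] += clicks
    return dict(totals)


# Made-up rows purely for illustration.
rows = [
    (date(2021, 3, 1), "https://example.com/blog/a", 10),
    (date(2021, 3, 2), "https://example.com/product/b", 5),
    (date(2021, 3, 9), "https://example.com/blog/a", 14),
    (date(2021, 3, 9), "https://example.com/product/b", 4),
]
summary = clicks_pre_post(rows, change_date=date(2021, 3, 8))
```

In practice the two periods should cover the same number of days, otherwise the totals are not comparable.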
  • 55. Competing pages for a set of terms
  • 56. SLOWER DIFFICULT TO WORK WITH SAMPLING
  • 60. SLOWER DIFFICULT TO WORK WITH SAMPLING LIMITS
  • 61. 1,000 rows at a time
  • 63. SLOWER DIFFICULT TO WORK WITH SAMPLING LIMITS LAG
  • 65. SLOWER DIFFICULT TO WORK WITH SAMPLING LIMITS LAG SEGMENTATION
  • 66. Search console properties for a large brand.
  • 67. Register all the things.
  • 69. Part 1: Data Studio Part 2: Day by day data Part 3: Python Part 4: Data warehousing Get Get, Analyse Get, Store, Analyse, Report
  • 70. Part 1: Data Studio Part 2: Day by day data Part 3: Python Part 4: Data warehousing
  • 71. Data studio for extracting data ● Add a data source ● Create a table for it. ● Download the table. With both GA & GSC, you’ll get everything in the table, no paginating.
  • 72. Part 1: Data Studio Part 2: Day by day data Part 3: Python Part 4: Data warehousing
  • 73. Day by day data To get even more data we have to get it day by day. ● bit.ly/search-console-data-downloader This bit is Search Console only.
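
The day-by-day idea can be sketched as a loop that issues one query per date and pages through the results with `startRow`. This is a rough sketch under assumptions: `run_query` is a hypothetical callable you supply (with google-api-python-client it would typically wrap `service.searchanalytics().query(siteUrl=..., body=body).execute()`), and the `page`/`query` dimensions are just an example. The request fields `startDate`, `endDate`, `dimensions`, `rowLimit` and `startRow` are those of the Search Console Search Analytics API.

```python
from datetime import date, timedelta


def daterange(start, end):
    """Yield each date from start to end, inclusive."""
    for offset in range((end - start).days + 1):
        yield start + timedelta(days=offset)


def fetch_day_by_day(run_query, start, end, row_limit=25000):
    """Query one day at a time, paging with startRow until a short page comes back."""
    all_rows = []
    for day in daterange(start, end):
        start_row = 0
        while True:
            body = {
                "startDate": day.isoformat(),
                "endDate": day.isoformat(),
                "dimensions": ["page", "query"],
                "rowLimit": row_limit,
                "startRow": start_row,
            }
            rows = run_query(body).get("rows", [])
            # Tag each row with its day, since the date is not a requested dimension.
            all_rows.extend({"date": day.isoformat(), **row} for row in rows)
            if len(rows) < row_limit:
                break
            start_row += row_limit
    return all_rows


def fake_run_query(body):
    """Stand-in for the real API call: five rows per day, served in pages."""
    rows = [{"keys": [f"/page-{i}", "term"], "clicks": i} for i in range(5)]
    return {"rows": rows[body["startRow"]: body["startRow"] + body["rowLimit"]]}


rows = fetch_day_by_day(fake_run_query, date(2021, 1, 1), date(2021, 1, 3), row_limit=2)
```

Injecting the query function keeps the paging logic testable without credentials or network access.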
  • 74. Part 1: Data Studio Part 2: Day by day data Part 3: Python Part 4: Data warehousing
  • 77. Getting data from APIs Pull down your analytics data. ● Daily_google_analytics_v3 ● Getting search console data from the API Getting started with pandas: ● Pandas tutorial with ranking data As a workflow I’d highly recommend Jupyter notebooks for getting started. ● Why use jupyter notebooks? ● SearchLove Video (paid)
  • 78. SEO Pythonistas A memorial and soon to be collection of Hamlet’s excellent work. SEO Pythonistas - In loving memory of Hamlet Batista @DataChaz
  • 79. Part 1: Data Studio Part 2: Day by day data Part 3: Python Part 4: Data warehousing
  • 86. Rolling your own JC Chouinard has built a series of excellent granular tutorials which walk you through setting up a data warehouse on your own machine.
  • 87. Off the shelf Get in touch with me! I run Piped Out which is software for building SEO data warehouses.
  • 89. Part 1: Templates Part 2: Logs Part 3: Crawling Big
  • 91. Part 1: Templates Part 2: Logs Part 3: Crawling Big
  • 93. Logs don’t have the same fields as a crawl. There is no page title, for example.
  • 95. ● Crawling & indexing problems
  • 96. ● Crawling & indexing problems ● Measuring freshness
  • 98. ● Crawling & indexing problems ● Measuring freshness ● Prioritisation
  • 100. ● Crawling & indexing problems ● Measuring freshness ● Prioritisation ● Monitoring website changes (e.g. migrations)
  • 101. [Chart: status codes (200, 301, 302) in product pages, Apr ’19 to Oct ’19]
  • 102. [Chart: status codes (200, 301, 302) in product pages, Apr ’19 to Oct ’19] ELK
  • 103. ● Crawling & indexing problems ● Measuring freshness ● Prioritisation ● Monitoring website changes (e.g. migrations) ● Debugging
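
As an illustration of the monitoring use case, the sketch below counts the status codes Googlebot received on product pages, grouped by month. It is only a sketch under assumptions: the logs are taken to be in the common "combined" access log format, `/product/` is a hypothetical path prefix, and bots are matched on the user-agent string alone, which can be spoofed.

```python
import re
from collections import Counter
from datetime import datetime

# Combined log format: ip ident user [time] "request" status bytes "referer" "agent"
LOG_PATTERN = re.compile(
    r'(?P<ip>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<method>\S+) (?P<path>\S+) [^"]*" (?P<status>\d{3}) \S+ '
    r'"[^"]*" "(?P<agent>[^"]*)"'
)


def status_codes_by_month(lines, path_prefix="/product/"):
    """Count (month, status) pairs for Googlebot requests under path_prefix."""
    counts = Counter()
    for line in lines:
        match = LOG_PATTERN.match(line)
        if not match:
            continue  # Skip lines that are not in the assumed format.
        if "Googlebot" not in match["agent"]:
            continue
        if not match["path"].startswith(path_prefix):
            continue
        when = datetime.strptime(match["time"], "%d/%b/%Y:%H:%M:%S %z")
        counts[(when.strftime("%Y-%m"), match["status"])] += 1
    return counts


GOOGLEBOT = "Mozilla/5.0 (compatible; Googlebot/2.1; +http://www.google.com/bot.html)"
# Made-up log lines purely for illustration.
lines = [
    f'66.249.66.1 - - [12/Apr/2019:10:00:00 +0000] "GET /product/big-brown-shoe HTTP/1.1" 200 5120 "-" "{GOOGLEBOT}"',
    f'66.249.66.1 - - [03/Jun/2019:11:30:00 +0000] "GET /product/old-shoe HTTP/1.1" 301 0 "-" "{GOOGLEBOT}"',
    '203.0.113.9 - - [03/Jun/2019:11:31:00 +0000] "GET /product/old-shoe HTTP/1.1" 200 5120 "-" "Mozilla/5.0 (Windows NT 10.0)"',
    f'66.249.66.1 - - [04/Jun/2019:09:00:00 +0000] "GET /category/shoes HTTP/1.1" 200 900 "-" "{GOOGLEBOT}"',
]
counts = status_codes_by_month(lines)
```

A sudden rise in 301s or 302s for a template in one month is the kind of change this grouping makes visible.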
  • 104. Hi x,

    I’m {x} from {y} and we’ve been asked to do some log analysis to better understand how Google is behaving on the website. I was hoping you could help with some questions about the log set-up (as well as with getting the logs!).

    What time period do we want?
    What we’d ideally like is 3-6 months of historical logs for the website. Our goal is to look at all the different pages search engines are crawling on our website, discover where they’re spending their time, the status code errors they’re finding etc. We can absolutely do analysis with a month or so (we've even done it with just a week or two), but it means we lose historical context and obviously we're more likely to miss things on a larger site.

    There are also some things that are really helpful for us to know when getting logs.

    Do the logs have any personal information in?
    We’re just concerned about the various search crawler bots like Google and Bing. We don’t need any logs from users, so any logs with emails, telephone numbers etc. can be removed.

    Can we get logs from as close to the edge as possible?
    It's pretty likely you've got a couple of different layers of your network that might log. Ideally we want those from as close to the edge as possible. This prevents a couple of issues:
    ● If you've got caching going on, like a CDN or Varnish, then if we get logs from after them, we won't see any of the requests they answer.
    ● If you've got a load balancer distributing to several servers, sometimes the external IP gets lost (perhaps X-Forwarded-For isn't working), which we need to verify Googlebot, or we accidentally only get logs from a couple of servers.

    Are there any sub parts of your site which log to a different place?
    Have you got anything like an embedded WordPress blog which logs to a different location? If so then we’ll need those logs as well. (Although of course if you're sending us CDN logs this won't matter.)

    How do you log hostname and protocol?
    It's very helpful for us to be able to see hostname & protocol. How do you distinguish those in the log files? Do you log HTTP & HTTPS to separate files? Do you log hostname at all? This is one of the problems that's often solved by getting logs closer to the edge: while many servers won't give you those by default, load balancers and CDNs often will.

    Where would we like the logs?
    In an ideal world, they would be files in an S3 bucket and we can pull them down from there. If possible, we'd also ask that multiple files aren't zipped together for upload, because that makes processing harder. (There's no problem with compressed logs, just with zipping multiple log files into a single archive.)

    Is there anything else we should know?

    Best,
    {x}
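
The email asks for the external IP because it is needed to verify Googlebot. One common way to do that is a reverse DNS lookup followed by a forward lookup that must resolve back to the same IP. The sketch below assumes that approach; the hostname suffixes are an assumption to check against Google's current documentation, and the lookup functions are injectable so the logic can be exercised without network access.

```python
import socket


def is_verified_googlebot(ip, reverse_lookup=None, forward_lookup=None):
    """Reverse-resolve the IP, check the domain, then forward-resolve to confirm.

    The default lookups use the socket module and only handle IPv4.
    """
    reverse_lookup = reverse_lookup or (lambda addr: socket.gethostbyaddr(addr)[0])
    forward_lookup = forward_lookup or (lambda host: socket.gethostbyname_ex(host)[2])
    try:
        hostname = reverse_lookup(ip)
    except (OSError, KeyError):
        return False  # No reverse DNS record for this address.
    # Assumed suffix list; confirm against Google's published guidance.
    if not hostname.endswith((".googlebot.com", ".google.com")):
        return False
    try:
        return ip in forward_lookup(hostname)
    except OSError:
        return False


# Fake DNS tables so the example runs offline (values are illustrative).
fake_reverse = {
    "66.249.66.1": "crawl-66-249-66-1.googlebot.com",
    "203.0.113.9": "host.example.net",
}
fake_forward = {"crawl-66-249-66-1.googlebot.com": ["66.249.66.1"]}

genuine = is_verified_googlebot(
    "66.249.66.1", fake_reverse.__getitem__, lambda host: fake_forward.get(host, [])
)
spoofed = is_verified_googlebot(
    "203.0.113.9", fake_reverse.__getitem__, lambda host: fake_forward.get(host, [])
)
```

If the load balancer has replaced the client IP with its own, every request fails this check, which is why logs from close to the edge matter.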
  • 106. Part 1: Templates Part 2: Logs Part 3: Crawling Big
  • 107. Sampling your crawl ● Limit your crawl percentage per template, e.g. ● 20% to product pages ● 30% to category pages
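
One way to read the slide is as a crawl budget split across templates. The sketch below takes that reading: it classifies URLs by template using regular expressions and draws a random sample per template. The URL patterns, the `crawl_budget` parameter and the function names are hypothetical; the 20% and 30% shares come from the slide.

```python
import random
import re

# Hypothetical URL patterns; replace with the site's real templates.
TEMPLATES = [
    ("product", re.compile(r"^/product/")),
    ("category", re.compile(r"^/category/")),
]
# Share of the total crawl given to each template (figures from the slide).
SHARES = {"product": 0.20, "category": 0.30}


def classify(path):
    """Return the name of the first template whose pattern matches the path."""
    for name, pattern in TEMPLATES:
        if pattern.search(path):
            return name
    return "other"


def sample_crawl(paths, crawl_budget, shares=SHARES, seed=0):
    """Pick up to crawl_budget * share random URLs for each template."""
    rng = random.Random(seed)  # Seeded so the sample is reproducible.
    by_template = {}
    for path in paths:
        by_template.setdefault(classify(path), []).append(path)
    sample = []
    for name, share in shares.items():
        candidates = by_template.get(name, [])
        quota = min(len(candidates), round(crawl_budget * share))
        sample.extend(rng.sample(candidates, quota))
    return sample


paths = (
    [f"/product/{i}" for i in range(100)]
    + [f"/category/{i}" for i in range(100)]
    + [f"/help/{i}" for i in range(50)]
)
sample = sample_crawl(paths, crawl_budget=50)
```

In this example the remaining half of the budget is left unallocated; it could be assigned to further templates in the same way.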
  • 108. Low memory crawler Runs locally on your machine and allows you to crawl with a very low memory footprint. Doesn’t render JS or process data however.
  • 109. Run SF in the cloud You can purchase a super high memory computer in the cloud, install SF on it and run it at maximum speed.
  • 111. Search console properties for a large brand.
  • 116. Part 1: Manually crawling Part 2: Automating assertions Part 3: Unit testing
  • 120. Part 1: Manually crawling change detection Part 2: Automating assertions Part 3: Unit testing
  • 122. <meta name="robots" content="noindex,nofollow"> <meta name="robots" content="noindex">
  • 124. Is it the value I want? Is it different?
  • 125. <meta name="robots" content="noindex,nofollow"> <meta name="robots" content="noindex">
  • 126. Element | Equals
    Title | Big Brown Shoe - £12.99 - Example.com
    Status Code | 200
    H1 | Big Brown Shoe
    Canonical | <link rel="canonical" href="https://example.com/product/big-brown-shoe" />
    CSS Selector: #review-counter | Any number
    CSS Selector: #product-data | { "@context": "https://schema.org/", "@type": "Product", "name": "Big Brown Shoe", "description": "The biggest brownest shoe you can find.", "sku": "0446310786", "mpn": "925872" }
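
An assertion table like this can be automated with a small checker. The sketch below is one possible shape, not the tool used in the talk: it extracts the title, H1, meta robots and canonical using only the standard library's `html.parser`, then compares them with expected values. A rule can be an exact value or a function, which covers loose checks such as "any number". The class and function names are hypothetical.

```python
from html.parser import HTMLParser


class SeoElements(HTMLParser):
    """Collect title, h1, meta robots and canonical from an HTML document."""

    def __init__(self):
        super().__init__()
        self.found = {}
        self._capture = None  # Tag whose text is currently being collected.

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag in ("title", "h1"):
            self._capture = tag
            self.found.setdefault(tag, "")
        elif tag == "meta" and (attrs.get("name") or "").lower() == "robots":
            self.found["robots"] = attrs.get("content") or ""
        elif tag == "link" and (attrs.get("rel") or "").lower() == "canonical":
            self.found["canonical"] = attrs.get("href") or ""

    def handle_endtag(self, tag):
        if tag == self._capture:
            self._capture = None

    def handle_data(self, data):
        if self._capture:
            self.found[self._capture] += data


def extract(html):
    """Parse the HTML and return the collected elements with whitespace trimmed."""
    parser = SeoElements()
    parser.feed(html)
    return {key: value.strip() for key, value in parser.found.items()}


def check(found, expected):
    """Return (element, actual) for every rule that fails."""
    failures = []
    for element, rule in expected.items():
        actual = found.get(element)
        passed = rule(actual) if callable(rule) else actual == rule
        if not passed:
            failures.append((element, actual))
    return failures


page = """<html><head><title>Big Brown Shoe - £12.99 - Example.com</title>
<meta name="robots" content="noindex,nofollow">
<link rel="canonical" href="https://example.com/product/big-brown-shoe" />
</head><body><h1>Big Brown Shoe</h1></body></html>"""

expected = {
    "title": "Big Brown Shoe - £12.99 - Example.com",
    "h1": "Big Brown Shoe",
    "canonical": "https://example.com/product/big-brown-shoe",
    # Fail if the page carries a noindex directive.
    "robots": lambda value: value is None or "noindex" not in value,
}
failures = check(extract(page), expected)
```

Here the only failure is the stray `noindex,nofollow`, the kind of accidental change this check is meant to catch before release. Running `extract` on two crawls of the same URL and comparing the results answers the "is it different?" question in the same way.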
  • 129. Part 1: Manually crawling Part 2: Automating assertions Part 3: Unit testing
  • 134. Create code Test code Deployment
  • 135. Create code Test code Deployment All our hard work.
  • 136. Create code Test code Deployment All our hard work.
  • 137. Create code Test code Deployment