SlideShare a Scribd company logo
1 of 22
Download to read offline
<   SEO     FOR   web developers       />
          Universidad CEU San Pablo
             Madrid, 26 febrero 2013
                  Ruben Martinez




             Paradigma | Javahispano
WHAT IS SEo ?

SEO is everything that helps a website generate more
           revenues from search engines.




       Technical                     Off
          seo                       page
Why is technical SEO
            important


1   Helps close the gap between web servers and
    search engines




2   Helps close the gap between search engines and
    websites
www search flow
                                                                                   OPTIMIZE

                   User          UNDERSTAND

                                                               Web developer




resources     World Wide Web   Search engines    Servers       Site architecture     Page      Author/s



Bottlenecks     Connections       Crawl           Speed           Structure         Content    Content
                                  Index         Availability                       relevance   purpose
                                   Rank




 SEO deals with the bottlenecks
           in the information flow
What can SEO do for a
               web developer?
Save time               Organize
and energy              functionalities



                        Intermediate
Detect                  the expectations
unknown                 of UX, design and
bugs early              web developers
How does an experienced SEO
      audit a web site
                           1 Crawl
                           2 Filter
                             $ head crawl.txt

       $ cut -f1,2 crawl.txt | sed -e 's/http://www.{domain}.
 {tld}//g' -e 's/t/,/g' |grep -v ".jpg|http:|.css|.js" >filtered.csv

                           $ head -5 filtered.csv


    3    Visualize the network and analyze

                           4    Gephi
variables for audit
        Backlinks

    Targeted keywords

    Content inventory

     Site architecture

       Site health

      Engagement
Measurement                  tools
Backlinks           Ahrefs, OpenSiteExplorer
PageRank            Google Toolbar
Competition         Adword’s Keyword Estimator Tool
Rankings            Google Webmasters Tools
Content inventory   Xenu, Screaming Frog
Duplicate content   Copyscape
Pages indexed       Operator “site:” on Google
Site architecture   Gephi
Server logs         Apache Log Viewer, Splunk
Crawler reports     Google Webmasters Tools
Engagement          Web analytics providers
Link graph
                          Example www.bigdataspain.org


     speakers.php



                                                        terms-and-conditions.pdf


                                                               /2012/program.php




                                                                                   program.pdf
program.php                                             en-index.php




              venue.php
                                                 hashtag-traking-live.php
What SEO should developers
        carry out?



             Content is king.
  Make sure that you have great content.
1 Findable content
  Reach out to publishers
  Upload your content
  Upload sitemaps to search engines

    Image    <?xml version="1.0" encoding="UTF-8"?>
              <urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9"
      XML      xmlns:image="http://www.google.com/schemas/sitemap-image/1.1">
              <url>
 sitemaps       <loc>http://example.com/sample.html</loc>
                <image:image>
                 <image:loc>http://example.com/image.jpg</image:loc>
                </image:image>
                <image:image>
                 <image:loc>http://example.com/photo.jpg</image:loc>
                </image:image>
              </url>
             </urlset>
2 Accessible content
  Host your content in an easy-to-reach reliable server
  Design a simple site architecture

  Link your internal pages sensibly

  Curate broken links

  URL structure

  Avoid frames and flash              $ w3m –dump “http://www.ft.com/” less
3 Clear content
  Determine the canonical page.

  Pagination and canonicalization

        <link rel="canonical" href="http://www.example.com/article
        story=abc&page=2"/>



        <link rel="prev" href="http://www.example.com/article?
        story=abc&page=1&sessionid=123" />



         <link rel="next" href="http://www.example.com/article?
         story=abc&page=3&sessionid=123" />
4 Controllable content
  Use robots.txt
  Block bots of spammers and scrapers
  Avoid cloaking
  Use the metatags noindex, noarchive

  Submit URLs you want to remove from Google’s index

  Monitor your site for hacked content

  Set the crawling rate of Googlebot
  Administer your PageRank budget
5 Valuable content
  Write a content management protocol to deal with
  obsolete content.



                 minimise 404 errors and provide a useful 404 page


  learn the differences between 301 and 302 redirects and use preferably 301 codes


            use the 410 HTTP status code in some cases for empty pages
6 Measurable content
  Get data from:


  Server logs                log <- getURL("sftp://user:password@host:
                             /path/to/apache/accesslog.log")

  Libraries of tags

  Google Analytics

  Split tests or tests A/B
google webmasters   Crawl Erros
Crawl Status
Index Status
Search Queries
Advanced SEO

Setting URL parameters on Google Webmasters Tools


         Latent Dirichleet Allocation (LDA)


        International and multilingual SEO


      Prediction of traffic – valuation of traffic
thank you
  Follow
@rubenmartinezs
@paradigmate
@javahispano

More Related Content

More from Paradigma Digital

More from Paradigma Digital (20)

Linkerd a fondo
Linkerd a fondoLinkerd a fondo
Linkerd a fondo
 
Horneando apis
Horneando apisHorneando apis
Horneando apis
 
Java 8 time to join the future
Java 8  time to join the futureJava 8  time to join the future
Java 8 time to join the future
 
Programación Reactiva con Spring WebFlux
Programación Reactiva con Spring WebFluxProgramación Reactiva con Spring WebFlux
Programación Reactiva con Spring WebFlux
 
Orquestando microservicios como lo hace Netflix
Orquestando microservicios como lo hace NetflixOrquestando microservicios como lo hace Netflix
Orquestando microservicios como lo hace Netflix
 
Meetup microservicios: API Management
Meetup microservicios: API ManagementMeetup microservicios: API Management
Meetup microservicios: API Management
 
Meetup de kubernetes, conceptos básicos.
Meetup  de kubernetes, conceptos básicos.Meetup  de kubernetes, conceptos básicos.
Meetup de kubernetes, conceptos básicos.
 
Docker, kubernetes, openshift y openstack, para mi abuela. techfest 2017.pptx
Docker, kubernetes, openshift y openstack, para mi abuela. techfest 2017.pptxDocker, kubernetes, openshift y openstack, para mi abuela. techfest 2017.pptx
Docker, kubernetes, openshift y openstack, para mi abuela. techfest 2017.pptx
 
Implementando microservicios
Implementando microserviciosImplementando microservicios
Implementando microservicios
 
Equipo de Marketing de Paradigma Digital
Equipo de Marketing de Paradigma DigitalEquipo de Marketing de Paradigma Digital
Equipo de Marketing de Paradigma Digital
 
¿Cómo se despliega y autoescala Couchbase en Cloud? ¡Aprende de manera práctica!
¿Cómo se despliega y autoescala Couchbase en Cloud? ¡Aprende de manera práctica!¿Cómo se despliega y autoescala Couchbase en Cloud? ¡Aprende de manera práctica!
¿Cómo se despliega y autoescala Couchbase en Cloud? ¡Aprende de manera práctica!
 
Overview atlas (1)
Overview atlas (1)Overview atlas (1)
Overview atlas (1)
 
Cómo usar google analytics
Cómo usar google analyticsCómo usar google analytics
Cómo usar google analytics
 
Transformación Digital
Transformación DigitalTransformación Digital
Transformación Digital
 
Manuel Hurtado. Couchbase paradigma4oct
Manuel Hurtado. Couchbase paradigma4octManuel Hurtado. Couchbase paradigma4oct
Manuel Hurtado. Couchbase paradigma4oct
 
Programación Reactiva con RxJava
Programación Reactiva con RxJavaProgramación Reactiva con RxJava
Programación Reactiva con RxJava
 
¿Cómo vencer a los dragones digitales?
¿Cómo vencer a los dragones digitales?¿Cómo vencer a los dragones digitales?
¿Cómo vencer a los dragones digitales?
 
Python y Flink
Python y FlinkPython y Flink
Python y Flink
 
Kafka y python
Kafka y pythonKafka y python
Kafka y python
 
Cultura Digital Paradigma
Cultura Digital ParadigmaCultura Digital Paradigma
Cultura Digital Paradigma
 

Recently uploaded

Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Recently uploaded (20)

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 

SEO FOR WEB DEVELOPERS by Ruben Martinez for Paradigma and Javahispano

  • 1. < SEO FOR web developers /> Universidad CEU San Pablo Madrid, 26 febrero 2013 Ruben Martinez Paradigma | Javahispano
  • 2. WHAT IS SEo ? SEO is everything that helps a website generate more revenues from search engines. Technical Off seo page
  • 3. Why is technical SEO important 1 Helps close the gap between web servers and search engines 2 Helps close the gap between search engines and websites
  • 4. www search flow OPTIMIZE User UNDERSTAND Web developer resources World Wide Web Search engines Servers Site architecture Page Author/s Bottlenecks Connections Crawl Speed Structure Content Content Index Availability relevance purpose Rank SEO deals with the bottlenecks in the information flow
  • 5. What can SEO do for a web developer? Save time Organize and energy functionalities Intermediate Detect the expectations unknown of UX, design and bugs early web developers
  • 6. How does an experienced SEO audit a web site 1 Crawl 2 Filter $ head crawl.txt $ cut -f1,2 crawl.txt | sed -e 's/http://www.{domain}. {tld}//g' -e 's/t/,/g' |grep -v ".jpg|http:|.css|.js" >filtered.csv $ head -5 filtered.csv 3 Visualize the network and analyze 4 Gephi
  • 7. variables for audit Backlinks Targeted keywords Content inventory Site architecture Site health Engagement
  • 8. Measurement tools Backlinks Ahrefs, OpenSiteExplorer PageRank Google Toolbar Competition Adword’s Keyword Estimator Tool Rankings Google Webmasters Tools Content inventory Xenu, Screaming Frog Duplicate content Copyscape Pages indexed Operator “site:” on Google Site architecture Gephi Server logs Apache Log Viewer, Splunk Crawler reports Google Webmasters Tools Engagement Web analytics providers
  • 9. Link graph Example www.bigdataspain.org speakers.php terms-and-conditions.pdf /2012/program.php program.pdf program.php en-index.php venue.php hashtag-traking-live.php
  • 10. What SEO should developers carry out? Content is king. Make sure that you have great content.
  • 11. 1 Findable content Reach out to publishers Upload your content Upload sitemaps to search engines Image <?xml version="1.0" encoding="UTF-8"?> <urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9" XML xmlns:image="http://www.google.com/schemas/sitemap-image/1.1"> <url> sitemaps <loc>http://example.com/sample.html</loc> <image:image> <image:loc>http://example.com/image.jpg</image:loc> </image:image> <image:image> <image:loc>http://example.com/photo.jpg</image:loc> </image:image> </url> </urlset>
  • 12. 2 Accessible content Host your content in an easy-to-reach reliable server Design a simple site architecture Link your internal pages sensibly Curate broken links URL structure Avoid frames and flash $ w3m –dump “http://www.ft.com/” less
  • 13. 3 Clear content Determine the canonical page. Pagination and canonicalization <link rel="canonical" href="http://www.example.com/article story=abc&page=2"/> <link rel="prev" href="http://www.example.com/article? story=abc&page=1&sessionid=123" /> <link rel="next" href="http://www.example.com/article? story=abc&page=3&sessionid=123" />
  • 14. 4 Controllable content Use robots.txt Block bots of spammers and scrapers Avoid cloaking Use the metatags noindex, noarchive Submit URLs you want to remove from Google’s index Monitor your site for hacked content Set the crawling rate of Googlebot Administer your PageRank budget
  • 15. 5 Valuable content Write a content management protocol to deal with obsolete content. minimise 404 errors and provide a useful 404 page learn the differences between 301 and 302 redirects and use preferably 301 codes use the 410 HTTP status code in some cases for empty pages
  • 16. 6 Measurable content Get data from: Server logs log <- getURL("sftp://user:password@host: /path/to/apache/accesslog.log") Libraries of tags Google Analytics Split tests or tests A/B
  • 17. google webmasters Crawl Erros
  • 21. Advanced SEO Setting URL parameters on Google Webmasters Tools Latent Dirichleet Allocation (LDA) International and multilingual SEO Prediction of traffic – valuation of traffic
  • 22. thank you Follow @rubenmartinezs @paradigmate @javahispano