SlideShare une entreprise Scribd logo
1  sur  29
Télécharger pour lire hors ligne
Google PageRank
Prof. Beat Signer
<bsigner@vub.ac.be>

Department of Computer Science
Vrije Universiteit Brussel

http://www.beatsigner.com




                                 December 9, 2008
Overview
 History of PageRank
 PageRank algorithm
 Examples
 Implications for website development




December 9, 2008    Beat Signer, signer@inf.ethz.ch   2
History of PageRank
 Developed as part of an academic
     project at Stanford University
           research platform to aid under-            Larry Page Sergey Brin
            standing of large-scale web data
            and enable researches to easily
            experiment with new search technologies
           Larry Page and Sergey Brin worked on the project
            about a new kind of search engine (1995-1998) which
            finally led to a functional prototype called Google


December 9, 2008                  Beat Signer, signer@inf.ethz.ch           3
Web Search Until 1998
 Find all documents using a query term
           use information retrieval (IR) solutions
           ranking based on "on-page factors"
             problem: poor quality of search results (order)
 Page and Brin proposed to compute the
     absolute qualtity of a page (PageRank)
           based on the number and quality of pages
            linking to a page (votes)


December 9, 2008               Beat Signer, signer@inf.ethz.ch   4
PageRank                          P1                        P5        P7   P8
                                  R1                        R5        R7   R8


                             P2                  P3              P4        P6
                             R2                  R3              R4        R6


 A page has a high PageRank R if
           there are many pages linking to it
           or, if there are some pages with a high PageRank
            linking to it
 Total score = IR score x PageRank
December 9, 2008              Beat Signer, signer@inf.ethz.ch                   5
PageRank Algorithm
                            R( Pj )
R( Pi )           
                   Pj Bi     Lj
                                                                        P1    P2

                                                                        1.5
                                                                         1    1.5
                                                                               1

 where
           Bi is the set of pages
            that link to page Pi                                              P3

           Lj is the number of                                               0.75
                                                                               1
            outgoing links for page Pj




December 9, 2008                      Beat Signer, signer@inf.ethz.ch                6
Matrix Representation
 Let us define a hyperlink                            P1   P2
     matrix H
       1 L j if Pj  Bi
H ij  
        0    otherwise
                                           0 1 2 1 
and R  RPi 
                                                            P3
                                       H  1 0 0
                                                   
 R  HR                                   0 1 2 0 
                                                   
R is an eigenvector of H
with eigenvalue 1
December 9, 2008     Beat Signer, signer@inf.ethz.ch             7
Matrix Representation ...
 We can use the power method to find R
     t 1
R            HR   t


                    0 1 2 1 
For our example H  1 0 0
                            
                    0 1 2 0 
                            

this results in R  2 2 1 or                           0.4   0.4 0.2

December 9, 2008       Beat Signer, signer@inf.ethz.ch                     8
Dangling Pages
 Problem with pages                                    P1       P2
                                                             C
     that have no outbound
     links (P2)

  0 0                                                          C

H     and R  0 0
  1 0
  0 1 2                  0 1 2
C       and S  H  C       
   0 1 2                 1 1 2
December 9, 2008      Beat Signer, signer@inf.ethz.ch                 9
Strongly Connected Pages (Graph)
                                                                       1-d
 Add new transition                                              P1         P2
     probabilities between
     all pages
           with probability d we follow
                                                                  P4         P3
            the hyperlink structure S
           with probability 1-d we
            choose a random page
                                                                       P5
                                                                 1-d         1-d
G  1  d  1  dS
            1
                                    R  GR
            n
December 9, 2008               Beat Signer, signer@inf.ethz.ch                     10
G  1  d  1  dS
                                                                        1
Examples                                                                n


                   A2

                   0.37



      A1           A3

    0.26           0.37




December 9, 2008          Beat Signer, signer@inf.ethz.ch                 11
G  1  d  1  dS
                                                                                          1
Examples ...                                                                              n


                         A2                                                    B2

                    0.185                                                     0.185



      A1                 A3                                     B1             B3

    0.13            0.185                                  0.13               0.185


           P A  0.5                                               PB   0.5




December 9, 2008              Beat Signer, signer@inf.ethz.ch                               12
G  1  d  1  dS
                                                                                        1
Examples ...                                                                            n


                      A2                                                     B2

                     0.14                                                   0.20



      A1              A3                                      B1             B3

    0.10             0.14                                0.22               0.20


           P A  0.38                                            PB   0.62




December 9, 2008            Beat Signer, signer@inf.ethz.ch                               13
G  1  d  1  dS
                                                                                        1
Examples ...                                                                            n


                      A2                                                     B2

                     0.23                                                   0.095



      A1              A3                                      B1             B3

     0.3             0.18                                0.10               0.095


           P A  0.71                                            PB   0.29




December 9, 2008            Beat Signer, signer@inf.ethz.ch                               14
G  1  d  1  dS
                                                                                        1
Examples ...                                                                            n


                      A2                                                     B2

                     0.24                                                   0.07



      A1              A3                                      B1             B3

    0.35             0.18                                0.09               0.07


           P A  0.77                                            PB   0.23




December 9, 2008            Beat Signer, signer@inf.ethz.ch                               15
G  1  d  1  dS
                                                                                       1
Examples ...                                                                           n


                    A2                                                      B2

                   0.17                                                    0.06



      A1            A3                                       B1             B3

    0.33           0.175                                0.08               0.06



                    A4                                            PB   0.20

P A  0.80
                   0.125



December 9, 2008           Beat Signer, signer@inf.ethz.ch                               16
Implications for Website Development
 First make sure that your page gets indexed
           on-page factors
 Think about your site's internal link structure
           create many internal links for important pages
           be "careful" about where to put outgoing links
 Increase the number of pages
 Ensure that webpages are addressed consistently
           http://www.vub.ac.be  http://www.vub.ac.be/index.php
 Make sure that you get links from good websites
December 9, 2008               Beat Signer, signer@inf.ethz.ch   17
Consistent Addressing of Webpages




December 9, 2008   Beat Signer, signer@inf.ethz.ch   18
Search Engine Optimisations (SEO)
 Internet marketing has become a big business
           white hat and black hat optimisations
 Bad ranking or removal from index can cost a
     company a lot of money
           e.g. supplemental index ("Google hell")




December 9, 2008              Beat Signer, signer@inf.ethz.ch   19
Black Hat Optimisations (Don'ts)
 Link farms
 Spamdexing in guestbooks, Wikipedia etc.
           "solution": <a rel="nofollow" href="…">…</a>
 Doorway pages (cloaking)
           e.g. BMW Germany and Ricoh Germany banned in
            February 2006
 Selling/buying links
 ...
December 9, 2008            Beat Signer, signer@inf.ethz.ch   20
On-Page Factors (Speculative)
 It is assumed that there are over 200 on-page
     and off-page factors
 Positive factors
           keyword in title tag
           keyword in URL
        keyword in domain name
        quality of HTML code
        page freshness (occasional changes)
        …

December 9, 2008                   Beat Signer, signer@inf.ethz.ch
On-Page Factors (Speculative) …
 Negative factors
           links to "bad neighbourhood"
           over optimisation penalty (keyword stuffing)
           text with same colour as background (hidden content)
           automatic redirects via the refresh meta tag
           any copyright violations
           …




December 9, 2008              Beat Signer, signer@inf.ethz.ch
Off-Page Factors (Speculative)
 Positive factors
           high PageRank
           anchor text of inbound links
           links from authority sites (Hilltop algorithm)
           listed in DMOZ (ODP) and Yahoo directories
           site age (stability)
           domain expiration date
           …



December 9, 2008               Beat Signer, signer@inf.ethz.ch
Off-Page Factors (Speculative) …
 Negative factors
           link buying (fast increasing number of inbound links)
           link farms
           cloaking (different pages for spider and user)
           limited (temporal) availability of site
           links from bad neighbourhood?
           competitor attack (e.g. duplicate content)?
           …



December 9, 2008               Beat Signer, signer@inf.ethz.ch
Tools
 Google toolbar
           PageRank information not frequently updated
 Google webmaster tools
           meta description issues
           title tag issues
        non-indexable content issues
        number and URLs of indexed pages
        number and URLs of inbound/outbound links
        ...

December 9, 2008              Beat Signer, signer@inf.ethz.ch   25
Questions
 Is PageRank fair?
 What about Google's power and influence?




December 9, 2008      Beat Signer, signer@inf.ethz.ch   26
Conclusions
 PageRank algorithm
           absolute quality of a page based on incoming links
           random surfer model
           computed as eigenvector of Google matrix G
 Implications for website development and SEO
 PageRank is just one (important) factor



December 9, 2008              Beat Signer, signer@inf.ethz.ch    27
References
 The PageRank Citation Ranking: Bringing Order
     to the Web, L. Page, S. Brin, R. Motwani and
     T. Winograd, January 1998
 The Anatomy of a Large-Scale Hypertextual
     Web Search Engine, S. Brin and L. Page,
     Computer Networks and ISDN Systems, 30(1-7),
     April 1998


December 9, 2008       Beat Signer, signer@inf.ethz.ch
References …
 PageRank Uncovered, C. Ridings and
     M. Shishigin, September 2002
 PageRank Calculator,
     http://www.webworkshop.net/pagerank_
     calculator.php




December 9, 2008      Beat Signer, signer@inf.ethz.ch   29

Contenu connexe

Tendances

On-Site SEO Audit Example
On-Site SEO Audit ExampleOn-Site SEO Audit Example
On-Site SEO Audit ExampleJames Allen
 
page ranking algorithm
page ranking algorithmpage ranking algorithm
page ranking algorithmJaved Khan
 
Page-Rank Algorithm Final
Page-Rank Algorithm FinalPage-Rank Algorithm Final
Page-Rank Algorithm FinalWilliam Keene
 
Generative AI and SEO
Generative AI and SEOGenerative AI and SEO
Generative AI and SEOJason Packer
 
SEO Website Analysis - example report
SEO Website Analysis - example reportSEO Website Analysis - example report
SEO Website Analysis - example reportLynn Holley III
 
Website Analysis Report
Website Analysis ReportWebsite Analysis Report
Website Analysis ReportAuroIN
 
Page rank
Page rankPage rank
Page rankCarlos
 
Google page rank
Google page rankGoogle page rank
Google page rankYifan Li
 
HITS + Pagerank
HITS + PagerankHITS + Pagerank
HITS + Pagerankajkt
 
Monthly SEO Report By calipus.com
Monthly SEO Report By calipus.comMonthly SEO Report By calipus.com
Monthly SEO Report By calipus.comSaurabh Choudhary
 
SEO TRENDS 2023: 5 Things to Look Out For
SEO TRENDS 2023: 5 Things to Look Out ForSEO TRENDS 2023: 5 Things to Look Out For
SEO TRENDS 2023: 5 Things to Look Out ForJomer Gregorio
 
An introduction to Search Engine Optimization (SEO) and web analytics on fao.org
An introduction to Search Engine Optimization (SEO) and web analytics on fao.orgAn introduction to Search Engine Optimization (SEO) and web analytics on fao.org
An introduction to Search Engine Optimization (SEO) and web analytics on fao.orgFAO
 
SEO, PPC and AI in 2023 and Beyond
SEO, PPC and AI in 2023 and BeyondSEO, PPC and AI in 2023 and Beyond
SEO, PPC and AI in 2023 and BeyondLily Ray
 
What is the best steps for seo ? ppt
What is the best steps for seo ? pptWhat is the best steps for seo ? ppt
What is the best steps for seo ? ppte-Definers Technology
 
Basics of Search Engine Optimisation
Basics of Search Engine OptimisationBasics of Search Engine Optimisation
Basics of Search Engine OptimisationWordCamp Sydney
 

Tendances (20)

On-Site SEO Audit Example
On-Site SEO Audit ExampleOn-Site SEO Audit Example
On-Site SEO Audit Example
 
Page rank
Page rankPage rank
Page rank
 
page ranking algorithm
page ranking algorithmpage ranking algorithm
page ranking algorithm
 
Page-Rank Algorithm Final
Page-Rank Algorithm FinalPage-Rank Algorithm Final
Page-Rank Algorithm Final
 
Generative AI and SEO
Generative AI and SEOGenerative AI and SEO
Generative AI and SEO
 
SEO Website Analysis - example report
SEO Website Analysis - example reportSEO Website Analysis - example report
SEO Website Analysis - example report
 
Intelligent agents
Intelligent agentsIntelligent agents
Intelligent agents
 
Website Analysis Report
Website Analysis ReportWebsite Analysis Report
Website Analysis Report
 
Google analytics
Google analyticsGoogle analytics
Google analytics
 
Page rank
Page rankPage rank
Page rank
 
Google page rank
Google page rankGoogle page rank
Google page rank
 
HITS + Pagerank
HITS + PagerankHITS + Pagerank
HITS + Pagerank
 
Monthly SEO Report By calipus.com
Monthly SEO Report By calipus.comMonthly SEO Report By calipus.com
Monthly SEO Report By calipus.com
 
Seo
SeoSeo
Seo
 
Iterative deepening search
Iterative deepening searchIterative deepening search
Iterative deepening search
 
SEO TRENDS 2023: 5 Things to Look Out For
SEO TRENDS 2023: 5 Things to Look Out ForSEO TRENDS 2023: 5 Things to Look Out For
SEO TRENDS 2023: 5 Things to Look Out For
 
An introduction to Search Engine Optimization (SEO) and web analytics on fao.org
An introduction to Search Engine Optimization (SEO) and web analytics on fao.orgAn introduction to Search Engine Optimization (SEO) and web analytics on fao.org
An introduction to Search Engine Optimization (SEO) and web analytics on fao.org
 
SEO, PPC and AI in 2023 and Beyond
SEO, PPC and AI in 2023 and BeyondSEO, PPC and AI in 2023 and Beyond
SEO, PPC and AI in 2023 and Beyond
 
What is the best steps for seo ? ppt
What is the best steps for seo ? pptWhat is the best steps for seo ? ppt
What is the best steps for seo ? ppt
 
Basics of Search Engine Optimisation
Basics of Search Engine OptimisationBasics of Search Engine Optimisation
Basics of Search Engine Optimisation
 

En vedette

Pagerank Algorithm Explained
Pagerank Algorithm ExplainedPagerank Algorithm Explained
Pagerank Algorithm Explainedjdhaar
 
The Google Pagerank algorithm - How does it work?
The Google Pagerank algorithm - How does it work?The Google Pagerank algorithm - How does it work?
The Google Pagerank algorithm - How does it work?Kundan Bhaduri
 
PageRank Algorithm In data mining
PageRank Algorithm In data miningPageRank Algorithm In data mining
PageRank Algorithm In data miningMai Mustafa
 
Page rank talk at NTU-EE
Page rank talk at NTU-EEPage rank talk at NTU-EE
Page rank talk at NTU-EEPing Yeh
 
How Google Search Engine Algorithm Works ??
How Google Search Engine Algorithm Works ??How Google Search Engine Algorithm Works ??
How Google Search Engine Algorithm Works ??Viral Shah
 
Clinical Cases from Resource Limited Settings: David Roesel
Clinical Cases from Resource Limited Settings: David RoeselClinical Cases from Resource Limited Settings: David Roesel
Clinical Cases from Resource Limited Settings: David RoeselUWGlobalHealth
 
Implementing page rank algorithm using hadoop map reduce
Implementing page rank algorithm using hadoop map reduceImplementing page rank algorithm using hadoop map reduce
Implementing page rank algorithm using hadoop map reduceFarzan Hajian
 
Giới thiệu tổng quan về SEO
Giới thiệu tổng quan về SEOGiới thiệu tổng quan về SEO
Giới thiệu tổng quan về SEODuy, Vo Hoang
 
PageRank and Related Methods
PageRank and Related MethodsPageRank and Related Methods
PageRank and Related MethodsJohn Breslin
 
A hadoop implementation of pagerank
A hadoop implementation of pagerankA hadoop implementation of pagerank
A hadoop implementation of pagerankChengeng Ma
 
Understanding search engine algorithms
Understanding search engine algorithmsUnderstanding search engine algorithms
Understanding search engine algorithmsVijay Sankar
 
Hadoop implementation for algorithms apriori, pcy, son
Hadoop implementation for algorithms apriori, pcy, sonHadoop implementation for algorithms apriori, pcy, son
Hadoop implementation for algorithms apriori, pcy, sonChengeng Ma
 

En vedette (20)

Pagerank Algorithm Explained
Pagerank Algorithm ExplainedPagerank Algorithm Explained
Pagerank Algorithm Explained
 
The Google Pagerank algorithm - How does it work?
The Google Pagerank algorithm - How does it work?The Google Pagerank algorithm - How does it work?
The Google Pagerank algorithm - How does it work?
 
PageRank Algorithm In data mining
PageRank Algorithm In data miningPageRank Algorithm In data mining
PageRank Algorithm In data mining
 
Lec5 Pagerank
Lec5 PagerankLec5 Pagerank
Lec5 Pagerank
 
Page rank talk at NTU-EE
Page rank talk at NTU-EEPage rank talk at NTU-EE
Page rank talk at NTU-EE
 
Google algorithim’s
Google  algorithim’sGoogle  algorithim’s
Google algorithim’s
 
How Google Search Engine Algorithm Works ??
How Google Search Engine Algorithm Works ??How Google Search Engine Algorithm Works ??
How Google Search Engine Algorithm Works ??
 
Clinical Cases from Resource Limited Settings: David Roesel
Clinical Cases from Resource Limited Settings: David RoeselClinical Cases from Resource Limited Settings: David Roesel
Clinical Cases from Resource Limited Settings: David Roesel
 
Samana m
Samana mSamana m
Samana m
 
Implementing page rank algorithm using hadoop map reduce
Implementing page rank algorithm using hadoop map reduceImplementing page rank algorithm using hadoop map reduce
Implementing page rank algorithm using hadoop map reduce
 
Giới thiệu tổng quan về SEO
Giới thiệu tổng quan về SEOGiới thiệu tổng quan về SEO
Giới thiệu tổng quan về SEO
 
Google pagerank là gì
Google pagerank là gìGoogle pagerank là gì
Google pagerank là gì
 
Hadoop Futures
Hadoop FuturesHadoop Futures
Hadoop Futures
 
Graphs
GraphsGraphs
Graphs
 
PageRank and Related Methods
PageRank and Related MethodsPageRank and Related Methods
PageRank and Related Methods
 
A hadoop implementation of pagerank
A hadoop implementation of pagerankA hadoop implementation of pagerank
A hadoop implementation of pagerank
 
Link Analysis (RBY)
Link Analysis (RBY)Link Analysis (RBY)
Link Analysis (RBY)
 
Understanding search engine algorithms
Understanding search engine algorithmsUnderstanding search engine algorithms
Understanding search engine algorithms
 
Hadoop implementation for algorithms apriori, pcy, son
Hadoop implementation for algorithms apriori, pcy, sonHadoop implementation for algorithms apriori, pcy, son
Hadoop implementation for algorithms apriori, pcy, son
 
Pagerank and hits
Pagerank and hitsPagerank and hits
Pagerank and hits
 

Plus de Beat Signer

Introduction - Lecture 1 - Human-Computer Interaction (1023841ANR)
Introduction - Lecture 1 - Human-Computer Interaction (1023841ANR)Introduction - Lecture 1 - Human-Computer Interaction (1023841ANR)
Introduction - Lecture 1 - Human-Computer Interaction (1023841ANR)Beat Signer
 
Indoor Positioning Using the OpenHPS Framework
Indoor Positioning Using the OpenHPS FrameworkIndoor Positioning Using the OpenHPS Framework
Indoor Positioning Using the OpenHPS FrameworkBeat Signer
 
Personalised Learning Environments Based on Knowledge Graphs and the Zone of ...
Personalised Learning Environments Based on Knowledge Graphs and the Zone of ...Personalised Learning Environments Based on Knowledge Graphs and the Zone of ...
Personalised Learning Environments Based on Knowledge Graphs and the Zone of ...Beat Signer
 
Cross-Media Technologies and Applications - Future Directions for Personal In...
Cross-Media Technologies and Applications - Future Directions for Personal In...Cross-Media Technologies and Applications - Future Directions for Personal In...
Cross-Media Technologies and Applications - Future Directions for Personal In...Beat Signer
 
Bridging the Gap: Managing and Interacting with Information Across Media Boun...
Bridging the Gap: Managing and Interacting with Information Across Media Boun...Bridging the Gap: Managing and Interacting with Information Across Media Boun...
Bridging the Gap: Managing and Interacting with Information Across Media Boun...Beat Signer
 
Codeschool in a Box: A Low-Barrier Approach to Packaging Programming Curricula
Codeschool in a Box: A Low-Barrier Approach to Packaging Programming CurriculaCodeschool in a Box: A Low-Barrier Approach to Packaging Programming Curricula
Codeschool in a Box: A Low-Barrier Approach to Packaging Programming CurriculaBeat Signer
 
The RSL Hypermedia Metamodel and Its Application in Cross-Media Solutions
The RSL Hypermedia Metamodel and Its Application in Cross-Media Solutions The RSL Hypermedia Metamodel and Its Application in Cross-Media Solutions
The RSL Hypermedia Metamodel and Its Application in Cross-Media Solutions Beat Signer
 
Case Studies and Course Review - Lecture 12 - Information Visualisation (4019...
Case Studies and Course Review - Lecture 12 - Information Visualisation (4019...Case Studies and Course Review - Lecture 12 - Information Visualisation (4019...
Case Studies and Course Review - Lecture 12 - Information Visualisation (4019...Beat Signer
 
Dashboards - Lecture 11 - Information Visualisation (4019538FNR)
Dashboards - Lecture 11 - Information Visualisation (4019538FNR)Dashboards - Lecture 11 - Information Visualisation (4019538FNR)
Dashboards - Lecture 11 - Information Visualisation (4019538FNR)Beat Signer
 
Interaction - Lecture 10 - Information Visualisation (4019538FNR)
Interaction - Lecture 10 - Information Visualisation (4019538FNR)Interaction - Lecture 10 - Information Visualisation (4019538FNR)
Interaction - Lecture 10 - Information Visualisation (4019538FNR)Beat Signer
 
View Manipulation and Reduction - Lecture 9 - Information Visualisation (4019...
View Manipulation and Reduction - Lecture 9 - Information Visualisation (4019...View Manipulation and Reduction - Lecture 9 - Information Visualisation (4019...
View Manipulation and Reduction - Lecture 9 - Information Visualisation (4019...Beat Signer
 
Visualisation Techniques - Lecture 8 - Information Visualisation (4019538FNR)
Visualisation Techniques - Lecture 8 - Information Visualisation (4019538FNR)Visualisation Techniques - Lecture 8 - Information Visualisation (4019538FNR)
Visualisation Techniques - Lecture 8 - Information Visualisation (4019538FNR)Beat Signer
 
Design Guidelines and Principles - Lecture 7 - Information Visualisation (401...
Design Guidelines and Principles - Lecture 7 - Information Visualisation (401...Design Guidelines and Principles - Lecture 7 - Information Visualisation (401...
Design Guidelines and Principles - Lecture 7 - Information Visualisation (401...Beat Signer
 
Data Processing and Visualisation Frameworks - Lecture 6 - Information Visual...
Data Processing and Visualisation Frameworks - Lecture 6 - Information Visual...Data Processing and Visualisation Frameworks - Lecture 6 - Information Visual...
Data Processing and Visualisation Frameworks - Lecture 6 - Information Visual...Beat Signer
 
Data Presentation - Lecture 5 - Information Visualisation (4019538FNR)
Data Presentation - Lecture 5 - Information Visualisation (4019538FNR)Data Presentation - Lecture 5 - Information Visualisation (4019538FNR)
Data Presentation - Lecture 5 - Information Visualisation (4019538FNR)Beat Signer
 
Analysis and Validation - Lecture 4 - Information Visualisation (4019538FNR)
Analysis and Validation - Lecture 4 - Information Visualisation (4019538FNR)Analysis and Validation - Lecture 4 - Information Visualisation (4019538FNR)
Analysis and Validation - Lecture 4 - Information Visualisation (4019538FNR)Beat Signer
 
Data Representation - Lecture 3 - Information Visualisation (4019538FNR)
Data Representation - Lecture 3 - Information Visualisation (4019538FNR)Data Representation - Lecture 3 - Information Visualisation (4019538FNR)
Data Representation - Lecture 3 - Information Visualisation (4019538FNR)Beat Signer
 
Human Perception and Colour Theory - Lecture 2 - Information Visualisation (4...
Human Perception and Colour Theory - Lecture 2 - Information Visualisation (4...Human Perception and Colour Theory - Lecture 2 - Information Visualisation (4...
Human Perception and Colour Theory - Lecture 2 - Information Visualisation (4...Beat Signer
 
Introduction - Lecture 1 - Information Visualisation (4019538FNR)
Introduction - Lecture 1 - Information Visualisation (4019538FNR)Introduction - Lecture 1 - Information Visualisation (4019538FNR)
Introduction - Lecture 1 - Information Visualisation (4019538FNR)Beat Signer
 
Towards a Framework for Dynamic Data Physicalisation
Towards a Framework for Dynamic Data PhysicalisationTowards a Framework for Dynamic Data Physicalisation
Towards a Framework for Dynamic Data PhysicalisationBeat Signer
 

Plus de Beat Signer (20)

Introduction - Lecture 1 - Human-Computer Interaction (1023841ANR)
Introduction - Lecture 1 - Human-Computer Interaction (1023841ANR)Introduction - Lecture 1 - Human-Computer Interaction (1023841ANR)
Introduction - Lecture 1 - Human-Computer Interaction (1023841ANR)
 
Indoor Positioning Using the OpenHPS Framework
Indoor Positioning Using the OpenHPS FrameworkIndoor Positioning Using the OpenHPS Framework
Indoor Positioning Using the OpenHPS Framework
 
Personalised Learning Environments Based on Knowledge Graphs and the Zone of ...
Personalised Learning Environments Based on Knowledge Graphs and the Zone of ...Personalised Learning Environments Based on Knowledge Graphs and the Zone of ...
Personalised Learning Environments Based on Knowledge Graphs and the Zone of ...
 
Cross-Media Technologies and Applications - Future Directions for Personal In...
Cross-Media Technologies and Applications - Future Directions for Personal In...Cross-Media Technologies and Applications - Future Directions for Personal In...
Cross-Media Technologies and Applications - Future Directions for Personal In...
 
Bridging the Gap: Managing and Interacting with Information Across Media Boun...
Bridging the Gap: Managing and Interacting with Information Across Media Boun...Bridging the Gap: Managing and Interacting with Information Across Media Boun...
Bridging the Gap: Managing and Interacting with Information Across Media Boun...
 
Codeschool in a Box: A Low-Barrier Approach to Packaging Programming Curricula
Codeschool in a Box: A Low-Barrier Approach to Packaging Programming CurriculaCodeschool in a Box: A Low-Barrier Approach to Packaging Programming Curricula
Codeschool in a Box: A Low-Barrier Approach to Packaging Programming Curricula
 
The RSL Hypermedia Metamodel and Its Application in Cross-Media Solutions
The RSL Hypermedia Metamodel and Its Application in Cross-Media Solutions The RSL Hypermedia Metamodel and Its Application in Cross-Media Solutions
The RSL Hypermedia Metamodel and Its Application in Cross-Media Solutions
 
Case Studies and Course Review - Lecture 12 - Information Visualisation (4019...
Case Studies and Course Review - Lecture 12 - Information Visualisation (4019...Case Studies and Course Review - Lecture 12 - Information Visualisation (4019...
Case Studies and Course Review - Lecture 12 - Information Visualisation (4019...
 
Dashboards - Lecture 11 - Information Visualisation (4019538FNR)
Dashboards - Lecture 11 - Information Visualisation (4019538FNR)Dashboards - Lecture 11 - Information Visualisation (4019538FNR)
Dashboards - Lecture 11 - Information Visualisation (4019538FNR)
 
Interaction - Lecture 10 - Information Visualisation (4019538FNR)
Interaction - Lecture 10 - Information Visualisation (4019538FNR)Interaction - Lecture 10 - Information Visualisation (4019538FNR)
Interaction - Lecture 10 - Information Visualisation (4019538FNR)
 
View Manipulation and Reduction - Lecture 9 - Information Visualisation (4019...
View Manipulation and Reduction - Lecture 9 - Information Visualisation (4019...View Manipulation and Reduction - Lecture 9 - Information Visualisation (4019...
View Manipulation and Reduction - Lecture 9 - Information Visualisation (4019...
 
Visualisation Techniques - Lecture 8 - Information Visualisation (4019538FNR)
Visualisation Techniques - Lecture 8 - Information Visualisation (4019538FNR)Visualisation Techniques - Lecture 8 - Information Visualisation (4019538FNR)
Visualisation Techniques - Lecture 8 - Information Visualisation (4019538FNR)
 
Design Guidelines and Principles - Lecture 7 - Information Visualisation (401...
Design Guidelines and Principles - Lecture 7 - Information Visualisation (401...Design Guidelines and Principles - Lecture 7 - Information Visualisation (401...
Design Guidelines and Principles - Lecture 7 - Information Visualisation (401...
 
Data Processing and Visualisation Frameworks - Lecture 6 - Information Visual...
Data Processing and Visualisation Frameworks - Lecture 6 - Information Visual...Data Processing and Visualisation Frameworks - Lecture 6 - Information Visual...
Data Processing and Visualisation Frameworks - Lecture 6 - Information Visual...
 
Data Presentation - Lecture 5 - Information Visualisation (4019538FNR)
Data Presentation - Lecture 5 - Information Visualisation (4019538FNR)Data Presentation - Lecture 5 - Information Visualisation (4019538FNR)
Data Presentation - Lecture 5 - Information Visualisation (4019538FNR)
 
Analysis and Validation - Lecture 4 - Information Visualisation (4019538FNR)
Analysis and Validation - Lecture 4 - Information Visualisation (4019538FNR)Analysis and Validation - Lecture 4 - Information Visualisation (4019538FNR)
Analysis and Validation - Lecture 4 - Information Visualisation (4019538FNR)
 
Data Representation - Lecture 3 - Information Visualisation (4019538FNR)
Data Representation - Lecture 3 - Information Visualisation (4019538FNR)Data Representation - Lecture 3 - Information Visualisation (4019538FNR)
Data Representation - Lecture 3 - Information Visualisation (4019538FNR)
 
Human Perception and Colour Theory - Lecture 2 - Information Visualisation (4...
Human Perception and Colour Theory - Lecture 2 - Information Visualisation (4...Human Perception and Colour Theory - Lecture 2 - Information Visualisation (4...
Human Perception and Colour Theory - Lecture 2 - Information Visualisation (4...
 
Introduction - Lecture 1 - Information Visualisation (4019538FNR)
Introduction - Lecture 1 - Information Visualisation (4019538FNR)Introduction - Lecture 1 - Information Visualisation (4019538FNR)
Introduction - Lecture 1 - Information Visualisation (4019538FNR)
 
Towards a Framework for Dynamic Data Physicalisation
Towards a Framework for Dynamic Data PhysicalisationTowards a Framework for Dynamic Data Physicalisation
Towards a Framework for Dynamic Data Physicalisation
 

Dernier

How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxhariprasad279825
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii SoldatenkoFwdays
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 

Dernier (20)

DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Artificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptxArtificial intelligence in cctv survelliance.pptx
Artificial intelligence in cctv survelliance.pptx
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko"Debugging python applications inside k8s environment", Andrii Soldatenko
"Debugging python applications inside k8s environment", Andrii Soldatenko
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 

Google PageRank

  • 1. Google PageRank Prof. Beat Signer <bsigner@vub.ac.be> Department of Computer Science Vrije Universiteit Brussel http://www.beatsigner.com December 9, 2008
  • 2. Overview  History of PageRank  PageRank algorithm  Examples  Implications for website development December 9, 2008 Beat Signer, signer@inf.ethz.ch 2
  • 3. History of PageRank  Developed as part of an academic project at Stanford University  research platform to aid under- Larry Page Sergey Brin standing of large-scale web data and enable researches to easily experiment with new search technologies  Larry Page and Sergey Brin worked on the project about a new kind of search engine (1995-1998) which finally led to a functional prototype called Google December 9, 2008 Beat Signer, signer@inf.ethz.ch 3
  • 4. Web Search Until 1998  Find all documents using a query term  use information retrieval (IR) solutions  ranking based on "on-page factors"  problem: poor quality of search results (order)  Page and Brin proposed to compute the absolute qualtity of a page (PageRank)  based on the number and quality of pages linking to a page (votes) December 9, 2008 Beat Signer, signer@inf.ethz.ch 4
  • 5. PageRank P1 P5 P7 P8 R1 R5 R7 R8 P2 P3 P4 P6 R2 R3 R4 R6  A page has a high PageRank R if  there are many pages linking to it  or, if there are some pages with a high PageRank linking to it  Total score = IR score x PageRank December 9, 2008 Beat Signer, signer@inf.ethz.ch 5
  • 6. PageRank Algorithm R( Pj ) R( Pi )   Pj Bi Lj P1 P2 1.5 1 1.5 1  where  Bi is the set of pages that link to page Pi P3  Lj is the number of 0.75 1 outgoing links for page Pj December 9, 2008 Beat Signer, signer@inf.ethz.ch 6
  • 7. Matrix Representation  Let us define a hyperlink P1 P2 matrix H 1 L j if Pj  Bi H ij    0 otherwise 0 1 2 1  and R  RPi  P3 H  1 0 0    R  HR 0 1 2 0    R is an eigenvector of H with eigenvalue 1 December 9, 2008 Beat Signer, signer@inf.ethz.ch 7
  • 8. Matrix Representation ...  We can use the power method to find R t 1 R  HR t 0 1 2 1  For our example H  1 0 0   0 1 2 0    this results in R  2 2 1 or 0.4 0.4 0.2 December 9, 2008 Beat Signer, signer@inf.ethz.ch 8
  • 9. Dangling Pages  Problem with pages P1 P2 C that have no outbound links (P2) 0 0  C H  and R  0 0 1 0 0 1 2  0 1 2 C  and S  H  C     0 1 2 1 1 2 December 9, 2008 Beat Signer, signer@inf.ethz.ch 9
  • 10. Strongly Connected Pages (Graph) 1-d  Add new transition P1 P2 probabilities between all pages  with probability d we follow P4 P3 the hyperlink structure S  with probability 1-d we choose a random page P5 1-d 1-d G  1  d  1  dS 1 R  GR n December 9, 2008 Beat Signer, signer@inf.ethz.ch 10
  • 11. G  1  d  1  dS 1 Examples n A2 0.37 A1 A3 0.26 0.37 December 9, 2008 Beat Signer, signer@inf.ethz.ch 11
  • 12. G  1  d  1  dS 1 Examples ... n A2 B2 0.185 0.185 A1 A3 B1 B3 0.13 0.185 0.13 0.185 P A  0.5 PB   0.5 December 9, 2008 Beat Signer, signer@inf.ethz.ch 12
  • 13. G  1  d  1  dS 1 Examples ... n A2 B2 0.14 0.20 A1 A3 B1 B3 0.10 0.14 0.22 0.20 P A  0.38 PB   0.62 December 9, 2008 Beat Signer, signer@inf.ethz.ch 13
  • 14. G  1  d  1  dS 1 Examples ... n A2 B2 0.23 0.095 A1 A3 B1 B3 0.3 0.18 0.10 0.095 P A  0.71 PB   0.29 December 9, 2008 Beat Signer, signer@inf.ethz.ch 14
  • 15. G  1  d  1  dS 1 Examples ... n A2 B2 0.24 0.07 A1 A3 B1 B3 0.35 0.18 0.09 0.07 P A  0.77 PB   0.23 December 9, 2008 Beat Signer, signer@inf.ethz.ch 15
  • 16. G  1  d  1  dS 1 Examples ... n A2 B2 0.17 0.06 A1 A3 B1 B3 0.33 0.175 0.08 0.06 A4 PB   0.20 P A  0.80 0.125 December 9, 2008 Beat Signer, signer@inf.ethz.ch 16
  • 17. Implications for Website Development  First make sure that your page gets indexed  on-page factors  Think about your site's internal link structure  create many internal links for important pages  be "careful" about where to put outgoing links  Increase the number of pages  Ensure that webpages are addressed consistently  http://www.vub.ac.be  http://www.vub.ac.be/index.php  Make sure that you get links from good websites December 9, 2008 Beat Signer, signer@inf.ethz.ch 17
  • 18. Consistent Addressing of Webpages December 9, 2008 Beat Signer, signer@inf.ethz.ch 18
  • 19. Search Engine Optimisations (SEO)  Internet marketing has become a big business  white hat and black hat optimisations  Bad ranking or removal from index can cost a company a lot of money  e.g. supplemental index ("Google hell") December 9, 2008 Beat Signer, signer@inf.ethz.ch 19
  • 20. Black Hat Optimisations (Don'ts)  Link farms  Spamdexing in guestbooks, Wikipedia etc.  "solution": <a rel="nofollow" href="…">…</a>  Doorway pages (cloaking)  e.g. BMW Germany and Ricoh Germany banned in February 2006  Selling/buying links  ... December 9, 2008 Beat Signer, signer@inf.ethz.ch 20
  • 21. On-Page Factors (Speculative)  It is assumed that there are over 200 on-page and off-page factors  Positive factors  keyword in title tag  keyword in URL  keyword in domain name  quality of HTML code  page freshness (occasional changes)  … December 9, 2008 Beat Signer, signer@inf.ethz.ch
  • 22. On-Page Factors (Speculative) …  Negative factors  links to "bad neighbourhood"  over optimisation penalty (keyword stuffing)  text with same colour as background (hidden content)  automatic redirects via the refresh meta tag  any copyright violations  … December 9, 2008 Beat Signer, signer@inf.ethz.ch
  • 23. Off-Page Factors (Speculative)  Positive factors  high PageRank  anchor text of inbound links  links from authority sites (Hilltop algorithm)  listed in DMOZ (ODP) and Yahoo directories  site age (stability)  domain expiration date  … December 9, 2008 Beat Signer, signer@inf.ethz.ch
  • 24. Off-Page Factors (Speculative) …  Negative factors  link buying (fast increasing number of inbound links)  link farms  cloaking (different pages for spider and user)  limited (temporal) availability of site  links from bad neighbourhood?  competitor attack (e.g. duplicate content)?  … December 9, 2008 Beat Signer, signer@inf.ethz.ch
  • 25. Tools  Google toolbar  PageRank information not frequently updated  Google webmaster tools  meta description issues  title tag issues  non-indexable content issues  number and URLs of indexed pages  number and URLs of inbound/outbound links  ... December 9, 2008 Beat Signer, signer@inf.ethz.ch 25
  • 26. Questions  Is PageRank fair?  What about Google's power and influence? December 9, 2008 Beat Signer, signer@inf.ethz.ch 26
  • 27. Conclusions  PageRank algorithm  absolute quality of a page based on incoming links  random surfer model  computed as eigenvector of Google matrix G  Implications for website development and SEO  PageRank is just one (important) factor December 9, 2008 Beat Signer, signer@inf.ethz.ch 27
  • 28. References  The PageRank Citation Ranking: Bringing Order to the Web, L. Page, S. Brin, R. Motwani and T. Winograd, January 1998  The Anatomy of a Large-Scale Hypertextual Web Search Engine, S. Brin and L. Page, Computer Networks and ISDN Systems, 30(1-7), April 1998 December 9, 2008 Beat Signer, signer@inf.ethz.ch
  • 29. References …  PageRank Uncovered, C. Ridings and M. Shishigin, September 2002  PageRank Calculator, http://www.webworkshop.net/pagerank_ calculator.php December 9, 2008 Beat Signer, signer@inf.ethz.ch 29