SlideShare une entreprise Scribd logo
1  sur  41
Télécharger pour lire hors ligne
Markov Chains as methodology used by PageRank to
rank the Web Pages on Internet.
Sergio S. Guirreri - www.guirreri.host22.com
Google Technology User Group (GTUG) of Palermo.
5th March 2010
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 1 / 14
Overview
1 Concepts on Markov-Chains.
2 The idea of the PageRank algorithm.
3 The PageRank algorithm.
4 Solving the PageRank algorithm.
5 Conclusions.
6 Bibliography.
7 Internet web sites.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 2 / 14
Concepts on Markov-Chains.
Stochastic Process and Markov-Chains.
Let assume the following stochastic process
{Xn; n = 0, 1, 2, . . . }
with values in a set E, called the state space, while its elements are called
state of the process.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
Concepts on Markov-Chains.
Stochastic Process and Markov-Chains.
Let assume the following stochastic process
{Xn; n = 0, 1, 2, . . . }
with values in a set E, called the state space, while its elements are called
state of the process.
Let assume the set E is finite or countable.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
Concepts on Markov-Chains.
Stochastic Process and Markov-Chains.
Let assume the following stochastic process
{Xn; n = 0, 1, 2, . . . }
with values in a set E, called the state space, while its elements are called
state of the process.
Let assume the set E is finite or countable.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
Concepts on Markov-Chains.
Stochastic Process and Markov-Chains.
Let assume the following stochastic process
{Xn; n = 0, 1, 2, . . . }
with values in a set E, called the state space, while its elements are called
state of the process.
Let assume the set E is finite or countable.
Definition
A Markov Chain is a stochastic process Xn that hold the following feature:
Prob{Xn+1 = j|Xn = i, Xn−1 = in−1, . . . , X0 = i0} =
= Prob{Xn+1 = j|Xn = i} = pij(n)
where E is the state space set and j, i, in−1, . . . , i0 ∈ E, n ∈ N.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
Concepts on Markov-Chains.
Stochastic Process and Markov-Chains.
Let assume the following stochastic process
{Xn; n = 0, 1, 2, . . . }
with values in a set E, called the state space, while its elements are called
state of the process.
Let assume the set E is finite or countable.
Definition
A Markov Chain is a stochastic process Xn that hold the following feature:
Prob{Xn+1 = j|Xn = i, Xn−1 = in−1, . . . , X0 = i0} =
= Prob{Xn+1 = j|Xn = i} = pij(n)
where E is the state space set and j, i, in−1, . . . , i0 ∈ E, n ∈ N.
The transition probability matrix P of the process Xn is composed of pij,
∀i, j ∈ E.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
The idea of the PageRank algorithm.
PageRank’s idea.
The idea behind the PageRank algorithm is similar to the idea of the impact
factor index used to rank the Journals [Page et al.(1999)]
[Brin and Page(1998)] [Langville et al.(2008)].
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 4 / 14
The idea of the PageRank algorithm.
PageRank’s idea.
The idea behind the PageRank algorithm is similar to the idea of the impact
factor index used to rank the Journals [Page et al.(1999)]
[Brin and Page(1998)] [Langville et al.(2008)].
PageRank the impact factor of Internet.
The impact factor of a journal is defined as the average number of citations
per recently published papers in that journal.
By regarding each web page as a journal, this idea was then extended to
measure the importance of the web page in the PageRank Algorithm.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 4 / 14
The idea of the PageRank algorithm.
Elements of the PageRank.
To illustrate the PageRank algorithm I define the following variables
[Ching and Ng(2006)]:
let be N the total number of web pages in the web.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
The idea of the PageRank algorithm.
Elements of the PageRank.
To illustrate the PageRank algorithm I define the following variables
[Ching and Ng(2006)]:
let be N the total number of web pages in the web.
let be k the outgoing links of web page j.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
The idea of the PageRank algorithm.
Elements of the PageRank.
To illustrate the PageRank algorithm I define the following variables
[Ching and Ng(2006)]:
let be N the total number of web pages in the web.
let be k the outgoing links of web page j.
let be Q the so called hyperlink matrix with elements:
Qij =



1
k if web page i is an outgoing link of web page j;
0 otherwise;
Qi,i > 0 ∀i.
(1)
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
The idea of the PageRank algorithm.
Elements of the PageRank.
To illustrate the PageRank algorithm I define the following variables
[Ching and Ng(2006)]:
let be N the total number of web pages in the web.
let be k the outgoing links of web page j.
let be Q the so called hyperlink matrix with elements:
Qij =



1
k if web page i is an outgoing link of web page j;
0 otherwise;
Qi,i > 0 ∀i.
(1)
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
The idea of the PageRank algorithm.
Elements of the PageRank.
To illustrate the PageRank algorithm I define the following variables
[Ching and Ng(2006)]:
let be N the total number of web pages in the web.
let be k the outgoing links of web page j.
let be Q the so called hyperlink matrix with elements:
Qij =



1
k if web page i is an outgoing link of web page j;
0 otherwise;
Qi,i > 0 ∀i.
(1)
The hyperlink matrix Q can be regarded as a transition probability matrix of
a Markov chain.
One may regard a surfer on the net as a random walker and the web pages as
the states of the Markov chain.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
The PageRank algorithm.
The PageRank with irreducible Markov Chain.
Assuming that the Markov chain is irreduciblea
and aperiodicb
then the
steady-state probability distribution (p1, p2, . . . , pN )T
of the states (web
pages) exists.
aA Markov chain is irreducible if all states communicate with each other.
bA chain is periodic if there exists k > 1 such that the interval between two visits to some
state s is always a multiple of k. Therefore a chain is aperiodic if k=1.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
The PageRank algorithm.
The PageRank with irreducible Markov Chain.
Assuming that the Markov chain is irreduciblea
and aperiodicb
then the
steady-state probability distribution (p1, p2, . . . , pN )T
of the states (web
pages) exists.
aA Markov chain is irreducible if all states communicate with each other.
bA chain is periodic if there exists k > 1 such that the interval between two visits to some
state s is always a multiple of k. Therefore a chain is aperiodic if k=1.
The PageRank
Each pi is the proportion of time that the surfer visiting the web page i.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
The PageRank algorithm.
The PageRank with irreducible Markov Chain.
Assuming that the Markov chain is irreduciblea
and aperiodicb
then the
steady-state probability distribution (p1, p2, . . . , pN )T
of the states (web
pages) exists.
aA Markov chain is irreducible if all states communicate with each other.
bA chain is periodic if there exists k > 1 such that the interval between two visits to some
state s is always a multiple of k. Therefore a chain is aperiodic if k=1.
The PageRank
Each pi is the proportion of time that the surfer visiting the web page i.
The higher the value of pi is, the more important web page i will be.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
The PageRank algorithm.
The PageRank with irreducible Markov Chain.
Assuming that the Markov chain is irreduciblea
and aperiodicb
then the
steady-state probability distribution (p1, p2, . . . , pN )T
of the states (web
pages) exists.
aA Markov chain is irreducible if all states communicate with each other.
bA chain is periodic if there exists k > 1 such that the interval between two visits to some
state s is always a multiple of k. Therefore a chain is aperiodic if k=1.
The PageRank
Each pi is the proportion of time that the surfer visiting the web page i.
The higher the value of pi is, the more important web page i will be.
The PageRank of web page i is then defined as pi.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
The PageRank algorithm.
The PageRank with reducible Markov Chain
Since the matrix Q can be reducible to ensure that the steady-state
probability exists and is unique the following matrix P must be considered:
P = α




Q11 Q12 . . . Q1N
Q21 Q22 . . . Q2N
. . . . . . . . . . . .
QN1 QN2 . . . QNN



 +
(1 − α)
N




1 1 . . . 1
1 1 . . . 1
. . . . . . . . . . . .
1 1 . . . 1



 (2)
Where 0 < α < 1 and the most popular values of α are 0.85 and (1 − 1/N).
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 7 / 14
The PageRank algorithm.
The PageRank with reducible Markov Chain
Since the matrix Q can be reducible to ensure that the steady-state
probability exists and is unique the following matrix P must be considered:
P = α




Q11 Q12 . . . Q1N
Q21 Q22 . . . Q2N
. . . . . . . . . . . .
QN1 QN2 . . . QNN



 +
(1 − α)
N




1 1 . . . 1
1 1 . . . 1
. . . . . . . . . . . .
1 1 . . . 1



 (2)
Where 0 < α < 1 and the most popular values of α are 0.85 and (1 − 1/N).
Interpretation of PageRank
The idea of the PageRank (2) is that, for a network of N web pages, each web
page has an inherent importance of (1 − α)/N.
If a page Pi has an importance of pi, then it will contribute an importance of
α pi which is shared among the web pages that it points to.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 7 / 14
The PageRank algorithm.
The PageRank with reducible Markov Chain
Solving the following linear system of equations subject to the normalization
constraint one can obtain the importance of web page Pi :





p1
p2
...
pN





= α




Q11 Q12 . . . Q1N
Q21 Q22 . . . Q2N
. . . . . . . . . . . .
QN1 QN2 . . . QNN









p1
p2
...
pN





+
(1 − α)
N





1
1
...
1





(3)
Since
N
i=1
pi = 1
the (3) can be rewritten as
(p1, p2, . . . , pN )T
= P(p1, p2, . . . , pN )T
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 8 / 14
Solving the PageRank algorithm.
The power method.
The power method is an iterative method for solving the dominant eigenvalue
and its corresponding eigenvectors of a matrix.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
Solving the PageRank algorithm.
The power method.
The power method is an iterative method for solving the dominant eigenvalue
and its corresponding eigenvectors of a matrix.
Given an n × n matrix A, the hypothesis of power method are:
there is a single dominant eigenvalue. The eigenvalues can be sorted:
|λ1| > |λ2| ≥ |λ3| ≥ . . . |λn|
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
Solving the PageRank algorithm.
The power method.
The power method is an iterative method for solving the dominant eigenvalue
and its corresponding eigenvectors of a matrix.
Given an n × n matrix A, the hypothesis of power method are:
there is a single dominant eigenvalue. The eigenvalues can be sorted:
|λ1| > |λ2| ≥ |λ3| ≥ . . . |λn|
there is a linearly independent set of n eigenvectors:
{u(1)
, u(2)
, . . . , u(n)
}
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
Solving the PageRank algorithm.
The power method.
The power method is an iterative method for solving the dominant eigenvalue
and its corresponding eigenvectors of a matrix.
Given an n × n matrix A, the hypothesis of power method are:
there is a single dominant eigenvalue. The eigenvalues can be sorted:
|λ1| > |λ2| ≥ |λ3| ≥ . . . |λn|
there is a linearly independent set of n eigenvectors:
{u(1)
, u(2)
, . . . , u(n)
}
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
Solving the PageRank algorithm.
The power method.
The power method is an iterative method for solving the dominant eigenvalue
and its corresponding eigenvectors of a matrix.
Given an n × n matrix A, the hypothesis of power method are:
there is a single dominant eigenvalue. The eigenvalues can be sorted:
|λ1| > |λ2| ≥ |λ3| ≥ . . . |λn|
there is a linearly independent set of n eigenvectors:
{u(1)
, u(2)
, . . . , u(n)
}
so that
Au(i)
= λiu(i)
, i = 1, . . . , n.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
Solving the PageRank algorithm.
The power method.
The initial vector x0 can be wrote:
x(0)
= a1u(1)
+ a2u(2)
+ · · · + anu(n)
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
Solving the PageRank algorithm.
The power method.
The initial vector x0 can be wrote:
x(0)
= a1u(1)
+ a2u(2)
+ · · · + anu(n)
iterating the initial vector with the A matrix:
Ak
x(0)
= a1Ak
u(1)
+ a2Ak
u(2)
+ · · · + anAk
u(n)
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
Solving the PageRank algorithm.
The power method.
The initial vector x0 can be wrote:
x(0)
= a1u(1)
+ a2u(2)
+ · · · + anu(n)
iterating the initial vector with the A matrix:
Ak
x(0)
= a1Ak
u(1)
+ a2Ak
u(2)
+ · · · + anAk
u(n)
= a1λk
1u(1)
+ a2λk
2u(2)
+ · · · + anλk
nu(n)
.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
Solving the PageRank algorithm.
The power method.
The initial vector x0 can be wrote:
x(0)
= a1u(1)
+ a2u(2)
+ · · · + anu(n)
iterating the initial vector with the A matrix:
Ak
x(0)
= a1Ak
u(1)
+ a2Ak
u(2)
+ · · · + anAk
u(n)
= a1λk
1u(1)
+ a2λk
2u(2)
+ · · · + anλk
nu(n)
.
dividing by λk
1
Ak
x(0)
λk
1
= a1u(1)
+ a2
λ2
λ1
k
u(2)
+ · · · + an
λn
λ1
k
u(n)
,
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
Solving the PageRank algorithm.
The power method.
The initial vector x0 can be wrote:
x(0)
= a1u(1)
+ a2u(2)
+ · · · + anu(n)
iterating the initial vector with the A matrix:
Ak
x(0)
= a1Ak
u(1)
+ a2Ak
u(2)
+ · · · + anAk
u(n)
= a1λk
1u(1)
+ a2λk
2u(2)
+ · · · + anλk
nu(n)
.
dividing by λk
1
Ak
x(0)
λk
1
= a1u(1)
+ a2
λ2
λ1
k
u(2)
+ · · · + an
λn
λ1
k
u(n)
,
Since
|λi|
|λ1|
< 1 →
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
Solving the PageRank algorithm.
The power method.
The initial vector x0 can be wrote:
x(0)
= a1u(1)
+ a2u(2)
+ · · · + anu(n)
iterating the initial vector with the A matrix:
Ak
x(0)
= a1Ak
u(1)
+ a2Ak
u(2)
+ · · · + anAk
u(n)
= a1λk
1u(1)
+ a2λk
2u(2)
+ · · · + anλk
nu(n)
.
dividing by λk
1
Ak
x(0)
λk
1
= a1u(1)
+ a2
λ2
λ1
k
u(2)
+ · · · + an
λn
λ1
k
u(n)
,
Since
|λi|
|λ1|
< 1 → lim
k→∞
|λi|k
|λ1|k
= 0 →
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
Solving the PageRank algorithm.
The power method.
The initial vector x0 can be wrote:
x(0)
= a1u(1)
+ a2u(2)
+ · · · + anu(n)
iterating the initial vector with the A matrix:
Ak
x(0)
= a1Ak
u(1)
+ a2Ak
u(2)
+ · · · + anAk
u(n)
= a1λk
1u(1)
+ a2λk
2u(2)
+ · · · + anλk
nu(n)
.
dividing by λk
1
Ak
x(0)
λk
1
= a1u(1)
+ a2
λ2
λ1
k
u(2)
+ · · · + an
λn
λ1
k
u(n)
,
Since
|λi|
|λ1|
< 1 → lim
k→∞
|λi|k
|λ1|k
= 0 → Ak
≈ a1λk
1u(1)
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
Conclusions.
The power method and PageRank.
Results.
The matrix P of the PageRank algorithm is a stochastic matrix therefore
the largest eigenvalue is 1.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
Conclusions.
The power method and PageRank.
Results.
The matrix P of the PageRank algorithm is a stochastic matrix therefore
the largest eigenvalue is 1.
The convergence rate of the power method depends on the ratio of λ2
λ1
.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
Conclusions.
The power method and PageRank.
Results.
The matrix P of the PageRank algorithm is a stochastic matrix therefore
the largest eigenvalue is 1.
The convergence rate of the power method depends on the ratio of λ2
λ1
.
It has been showed by [Haveliwala and Kamvar(2003)] that for the second
largest eigenvalue of P, we have
|λ2| ≤ α 0 ≤ α ≤ 1.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
Conclusions.
The power method and PageRank.
Results.
The matrix P of the PageRank algorithm is a stochastic matrix therefore
the largest eigenvalue is 1.
The convergence rate of the power method depends on the ratio of λ2
λ1
.
It has been showed by [Haveliwala and Kamvar(2003)] that for the second
largest eigenvalue of P, we have
|λ2| ≤ α 0 ≤ α ≤ 1.
Since λ1 = 1 the converge rate depends on α.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
Conclusions.
The power method and PageRank.
Results.
The matrix P of the PageRank algorithm is a stochastic matrix therefore
the largest eigenvalue is 1.
The convergence rate of the power method depends on the ratio of λ2
λ1
.
It has been showed by [Haveliwala and Kamvar(2003)] that for the second
largest eigenvalue of P, we have
|λ2| ≤ α 0 ≤ α ≤ 1.
Since λ1 = 1 the converge rate depends on α.
The most popular value for α is 0.85. With this value it has been proved
that the power method on web data set of over 80 million pages converges
in about 50 iterations.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
Conclusions.
Really thanks to GTUG Palermo
and
see you to the next meeting!
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 12 / 14
Bibliography.
Bibliography.
Brin, S. and Page, L. (1998).
The anatomy of a large-scale hypertextual Web search engine.
Computer networks and ISDN systems, 30(1-7), 107–117.
Ching, W. and Ng, M. (2006).
Markov Chains: Models, Algoritms and Applications.
Springer Science + Business Media, Inc.
Haveliwala, T. and Kamvar, M. (2003).
The second eigenvalue of the google matrix.
Technical report, Stanford University.
Langville, A., Meyer, C., and Fern´Andez, P. (2008).
Google’s PageRank and beyond: the science of search engine rankings.
The Mathematical Intelligencer, 30(1), 68–69.
Page, L., Brin, S., Motwani, R., and Winograd, T. (1999).
The PageRank Citation Ranking: Bringing Order to the Web.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 13 / 14
Internet web sites.
Internet web sites.
Jon Atle Gulla (2007) - From Google Search to Semantic Exploration. -
Norwegian University of Science Technology -
www.slideshare.net/sveino/semantics-and-search?type=presentation
Steven Levy (2010) - Exclusive: How Google’s Algorithm Rules the Web - Wired
Magazine - www.wired.com/magazine/2010/02/ff_google_algorithm/
Ann Smarty (2009) - Let’s Try to Find All 200 Parameters in Google Algorithm -
Search Engine Journal -
www.searchenginejournal.com/200-parameters-in-google-algorithm/15457/.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 14 / 14

Contenu connexe

Tendances

Generative AI and SEO
Generative AI and SEOGenerative AI and SEO
Generative AI and SEOJason Packer
 
page ranking algorithm
page ranking algorithmpage ranking algorithm
page ranking algorithmJaved Khan
 
Page-Rank Algorithm Final
Page-Rank Algorithm FinalPage-Rank Algorithm Final
Page-Rank Algorithm FinalWilliam Keene
 
Basics of Search Engine Optimisation
Basics of Search Engine OptimisationBasics of Search Engine Optimisation
Basics of Search Engine OptimisationWordCamp Sydney
 
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Tutori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Tutori...Machine Learning Algorithms | Machine Learning Tutorial | Data Science Tutori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Tutori...Edureka!
 
Search Engine Powerpoint
Search Engine  Powerpoint Search Engine  Powerpoint
Search Engine Powerpoint Partha Himu
 
Google Ads Tutorial for Beginners
Google Ads Tutorial for BeginnersGoogle Ads Tutorial for Beginners
Google Ads Tutorial for BeginnersSimplilearn
 
What is Keyword Research & How to Do it ?
What is Keyword Research & How to Do it ? What is Keyword Research & How to Do it ?
What is Keyword Research & How to Do it ? Jam Hassan
 
SEO Restart 2023: Zdeněk Nešpor - Titánské weby
SEO Restart 2023: Zdeněk Nešpor - Titánské webySEO Restart 2023: Zdeněk Nešpor - Titánské weby
SEO Restart 2023: Zdeněk Nešpor - Titánské webyTaste
 
SEO Restart 2023: Radek Kupr - Jak využít GA4 pro SEO
SEO Restart 2023: Radek Kupr - Jak využít GA4 pro SEOSEO Restart 2023: Radek Kupr - Jak využít GA4 pro SEO
SEO Restart 2023: Radek Kupr - Jak využít GA4 pro SEOTaste
 
Link analysis .. Data Mining
Link analysis .. Data MiningLink analysis .. Data Mining
Link analysis .. Data MiningMustafa Salam
 
SEO Restart 2023: Lukáš Kostka - AI a R studio – optimalizace meta tagů na ,,...
SEO Restart 2023: Lukáš Kostka - AI a R studio – optimalizace meta tagů na ,,...SEO Restart 2023: Lukáš Kostka - AI a R studio – optimalizace meta tagů na ,,...
SEO Restart 2023: Lukáš Kostka - AI a R studio – optimalizace meta tagů na ,,...Taste
 
Keyword research - Digital Marketing - SEO
Keyword research - Digital Marketing - SEOKeyword research - Digital Marketing - SEO
Keyword research - Digital Marketing - SEONeeraj Reddy
 
SEO Restart 2023: Zdeněk Dvořák aka Linki - Slabiny Ahrefs. Jaké jsou? A co s...
SEO Restart 2023: Zdeněk Dvořák aka Linki - Slabiny Ahrefs. Jaké jsou? A co s...SEO Restart 2023: Zdeněk Dvořák aka Linki - Slabiny Ahrefs. Jaké jsou? A co s...
SEO Restart 2023: Zdeněk Dvořák aka Linki - Slabiny Ahrefs. Jaké jsou? A co s...Taste
 
Data Restart 2022: David Janoušek - Jak na výkonnostní kampaně v období cooki...
Data Restart 2022: David Janoušek - Jak na výkonnostní kampaně v období cooki...Data Restart 2022: David Janoušek - Jak na výkonnostní kampaně v období cooki...
Data Restart 2022: David Janoušek - Jak na výkonnostní kampaně v období cooki...Taste
 

Tendances (20)

Page rank algortihm
Page rank algortihmPage rank algortihm
Page rank algortihm
 
Generative AI and SEO
Generative AI and SEOGenerative AI and SEO
Generative AI and SEO
 
page ranking algorithm
page ranking algorithmpage ranking algorithm
page ranking algorithm
 
Page-Rank Algorithm Final
Page-Rank Algorithm FinalPage-Rank Algorithm Final
Page-Rank Algorithm Final
 
Basics of Search Engine Optimisation
Basics of Search Engine OptimisationBasics of Search Engine Optimisation
Basics of Search Engine Optimisation
 
SEO
SEO SEO
SEO
 
Link Analysis
Link AnalysisLink Analysis
Link Analysis
 
Link Analysis
Link AnalysisLink Analysis
Link Analysis
 
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Tutori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Tutori...Machine Learning Algorithms | Machine Learning Tutorial | Data Science Tutori...
Machine Learning Algorithms | Machine Learning Tutorial | Data Science Tutori...
 
Search Engine Powerpoint
Search Engine  Powerpoint Search Engine  Powerpoint
Search Engine Powerpoint
 
Google Ads Tutorial for Beginners
Google Ads Tutorial for BeginnersGoogle Ads Tutorial for Beginners
Google Ads Tutorial for Beginners
 
What is Keyword Research & How to Do it ?
What is Keyword Research & How to Do it ? What is Keyword Research & How to Do it ?
What is Keyword Research & How to Do it ?
 
SEO Restart 2023: Zdeněk Nešpor - Titánské weby
SEO Restart 2023: Zdeněk Nešpor - Titánské webySEO Restart 2023: Zdeněk Nešpor - Titánské weby
SEO Restart 2023: Zdeněk Nešpor - Titánské weby
 
SEO Restart 2023: Radek Kupr - Jak využít GA4 pro SEO
SEO Restart 2023: Radek Kupr - Jak využít GA4 pro SEOSEO Restart 2023: Radek Kupr - Jak využít GA4 pro SEO
SEO Restart 2023: Radek Kupr - Jak využít GA4 pro SEO
 
Web spam
Web spamWeb spam
Web spam
 
Link analysis .. Data Mining
Link analysis .. Data MiningLink analysis .. Data Mining
Link analysis .. Data Mining
 
SEO Restart 2023: Lukáš Kostka - AI a R studio – optimalizace meta tagů na ,,...
SEO Restart 2023: Lukáš Kostka - AI a R studio – optimalizace meta tagů na ,,...SEO Restart 2023: Lukáš Kostka - AI a R studio – optimalizace meta tagů na ,,...
SEO Restart 2023: Lukáš Kostka - AI a R studio – optimalizace meta tagů na ,,...
 
Keyword research - Digital Marketing - SEO
Keyword research - Digital Marketing - SEOKeyword research - Digital Marketing - SEO
Keyword research - Digital Marketing - SEO
 
SEO Restart 2023: Zdeněk Dvořák aka Linki - Slabiny Ahrefs. Jaké jsou? A co s...
SEO Restart 2023: Zdeněk Dvořák aka Linki - Slabiny Ahrefs. Jaké jsou? A co s...SEO Restart 2023: Zdeněk Dvořák aka Linki - Slabiny Ahrefs. Jaké jsou? A co s...
SEO Restart 2023: Zdeněk Dvořák aka Linki - Slabiny Ahrefs. Jaké jsou? A co s...
 
Data Restart 2022: David Janoušek - Jak na výkonnostní kampaně v období cooki...
Data Restart 2022: David Janoušek - Jak na výkonnostní kampaně v období cooki...Data Restart 2022: David Janoušek - Jak na výkonnostní kampaně v období cooki...
Data Restart 2022: David Janoušek - Jak na výkonnostní kampaně v období cooki...
 

En vedette

The Google Pagerank algorithm - How does it work?
The Google Pagerank algorithm - How does it work?The Google Pagerank algorithm - How does it work?
The Google Pagerank algorithm - How does it work?Kundan Bhaduri
 
A Comparative Study of HITS vs PageRank Algorithms for Twitter Users Analysis
A Comparative Study of HITS vs PageRank Algorithms for Twitter Users AnalysisA Comparative Study of HITS vs PageRank Algorithms for Twitter Users Analysis
A Comparative Study of HITS vs PageRank Algorithms for Twitter Users AnalysisPoo Kuan Hoong
 
Population 3.5 - Spearman’S Rank
Population 3.5 - Spearman’S RankPopulation 3.5 - Spearman’S Rank
Population 3.5 - Spearman’S RankEcumene
 
LINEAR ALGEBRA BEHIND GOOGLE SEARCH
LINEAR ALGEBRA BEHIND GOOGLE SEARCHLINEAR ALGEBRA BEHIND GOOGLE SEARCH
LINEAR ALGEBRA BEHIND GOOGLE SEARCHDivyansh Verma
 
PageRank Algorithm In data mining
PageRank Algorithm In data miningPageRank Algorithm In data mining
PageRank Algorithm In data miningMai Mustafa
 
GCSE Geography: How And Why To Use Spearman’s Rank
GCSE Geography: How And Why To Use Spearman’s RankGCSE Geography: How And Why To Use Spearman’s Rank
GCSE Geography: How And Why To Use Spearman’s RankMark Cowan
 
Spearman Rank Correlation Presentation
Spearman Rank Correlation PresentationSpearman Rank Correlation Presentation
Spearman Rank Correlation Presentationcae_021
 
Halloween.Presentacion
Halloween.PresentacionHalloween.Presentacion
Halloween.PresentacionCarolinaMiau
 
Giornata Mondiale del Migrante e del Rifugiato
Giornata Mondiale del Migrante e del RifugiatoGiornata Mondiale del Migrante e del Rifugiato
Giornata Mondiale del Migrante e del RifugiatoMaike Loes
 
Secrets & Lies
Secrets & LiesSecrets & Lies
Secrets & LiesSam Baksh
 
Study skillsppt
Study skillspptStudy skillsppt
Study skillspptrnesbit
 
All-Pro Fasteners
All-Pro FastenersAll-Pro Fasteners
All-Pro FastenersTodd Grzych
 
Action adventure movie shots
Action adventure movie shotsAction adventure movie shots
Action adventure movie shotsstoliros
 
Presentation alevel work max
Presentation alevel work maxPresentation alevel work max
Presentation alevel work maxstoliros
 
A Story of Tiny Frog
A Story of Tiny FrogA Story of Tiny Frog
A Story of Tiny Frogbhatsumanta
 

En vedette (20)

The Google Pagerank algorithm - How does it work?
The Google Pagerank algorithm - How does it work?The Google Pagerank algorithm - How does it work?
The Google Pagerank algorithm - How does it work?
 
A Comparative Study of HITS vs PageRank Algorithms for Twitter Users Analysis
A Comparative Study of HITS vs PageRank Algorithms for Twitter Users AnalysisA Comparative Study of HITS vs PageRank Algorithms for Twitter Users Analysis
A Comparative Study of HITS vs PageRank Algorithms for Twitter Users Analysis
 
Population 3.5 - Spearman’S Rank
Population 3.5 - Spearman’S RankPopulation 3.5 - Spearman’S Rank
Population 3.5 - Spearman’S Rank
 
LINEAR ALGEBRA BEHIND GOOGLE SEARCH
LINEAR ALGEBRA BEHIND GOOGLE SEARCHLINEAR ALGEBRA BEHIND GOOGLE SEARCH
LINEAR ALGEBRA BEHIND GOOGLE SEARCH
 
PageRank Algorithm In data mining
PageRank Algorithm In data miningPageRank Algorithm In data mining
PageRank Algorithm In data mining
 
GCSE Geography: How And Why To Use Spearman’s Rank
GCSE Geography: How And Why To Use Spearman’s RankGCSE Geography: How And Why To Use Spearman’s Rank
GCSE Geography: How And Why To Use Spearman’s Rank
 
Spearman Rank Correlation Presentation
Spearman Rank Correlation PresentationSpearman Rank Correlation Presentation
Spearman Rank Correlation Presentation
 
Halloween.Presentacion
Halloween.PresentacionHalloween.Presentacion
Halloween.Presentacion
 
Giornata Mondiale del Migrante e del Rifugiato
Giornata Mondiale del Migrante e del RifugiatoGiornata Mondiale del Migrante e del Rifugiato
Giornata Mondiale del Migrante e del Rifugiato
 
Secrets & Lies
Secrets & LiesSecrets & Lies
Secrets & Lies
 
Study skillsppt
Study skillspptStudy skillsppt
Study skillsppt
 
All-Pro Fasteners
All-Pro FastenersAll-Pro Fasteners
All-Pro Fasteners
 
Action adventure movie shots
Action adventure movie shotsAction adventure movie shots
Action adventure movie shots
 
birthday
birthdaybirthday
birthday
 
Presentation alevel work max
Presentation alevel work maxPresentation alevel work max
Presentation alevel work max
 
A Story of Tiny Frog
A Story of Tiny FrogA Story of Tiny Frog
A Story of Tiny Frog
 
Tema 2
Tema 2Tema 2
Tema 2
 
Farewell (Mae, batch 2011)
Farewell (Mae, batch 2011)Farewell (Mae, batch 2011)
Farewell (Mae, batch 2011)
 
A3 sergiodelpino
A3 sergiodelpinoA3 sergiodelpino
A3 sergiodelpino
 
Pati alegre (fil eminimizer)
Pati alegre (fil eminimizer)Pati alegre (fil eminimizer)
Pati alegre (fil eminimizer)
 

Similaire à Markov Chains and PageRank Algorithm

PageRank_algorithm_Nfaoui_El_Habib
PageRank_algorithm_Nfaoui_El_HabibPageRank_algorithm_Nfaoui_El_Habib
PageRank_algorithm_Nfaoui_El_HabibEl Habib NFAOUI
 
Pagerank
Pagerank Pagerank
Pagerank C C
 
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...IOSR Journals
 
A Generalization of the PageRank Algorithm : NOTES
A Generalization of the PageRank Algorithm : NOTESA Generalization of the PageRank Algorithm : NOTES
A Generalization of the PageRank Algorithm : NOTESSubhajit Sahu
 
page rank explication et exemple formule
page rank explication et exemple  formulepage rank explication et exemple  formule
page rank explication et exemple formuleRamiHarrathi1
 
Done reread deeperinsidepagerank
Done reread deeperinsidepagerankDone reread deeperinsidepagerank
Done reread deeperinsidepagerankJames Arnold
 
Deeper Inside PageRank (NOTES)
Deeper Inside PageRank (NOTES)Deeper Inside PageRank (NOTES)
Deeper Inside PageRank (NOTES)Subhajit Sahu
 
Random web surfer pagerank algorithm
Random web surfer pagerank algorithmRandom web surfer pagerank algorithm
Random web surfer pagerank algorithmalexandrelevada
 
Data.Mining.C.8(Ii).Web Mining 570802461
Data.Mining.C.8(Ii).Web Mining 570802461Data.Mining.C.8(Ii).Web Mining 570802461
Data.Mining.C.8(Ii).Web Mining 570802461Margaret Wang
 
Hui xie 591r_presentation
Hui xie 591r_presentationHui xie 591r_presentation
Hui xie 591r_presentationramiro
 
Markov chains and page rankGraphs.pdf
Markov chains and page rankGraphs.pdfMarkov chains and page rankGraphs.pdf
Markov chains and page rankGraphs.pdfrayyverma
 
PageRank in Multithreading
PageRank in MultithreadingPageRank in Multithreading
PageRank in MultithreadingShujian Zhang
 
Done reread sketchinglandscapesofpagefarmsnpcomplete(3)
Done reread sketchinglandscapesofpagefarmsnpcomplete(3)Done reread sketchinglandscapesofpagefarmsnpcomplete(3)
Done reread sketchinglandscapesofpagefarmsnpcomplete(3)James Arnold
 
Done reread sketchinglandscapesofpagefarmsnpcomplete
Done reread sketchinglandscapesofpagefarmsnpcompleteDone reread sketchinglandscapesofpagefarmsnpcomplete
Done reread sketchinglandscapesofpagefarmsnpcompleteJames Arnold
 
Done reread sketchinglandscapesofpagefarmsnpcomplete(2)
Done reread sketchinglandscapesofpagefarmsnpcomplete(2)Done reread sketchinglandscapesofpagefarmsnpcomplete(2)
Done reread sketchinglandscapesofpagefarmsnpcomplete(2)James Arnold
 
Prediction Model Using Web Usage Mining Techniques
Prediction Model Using Web Usage Mining TechniquesPrediction Model Using Web Usage Mining Techniques
Prediction Model Using Web Usage Mining TechniquesEditor IJCATR
 
Done reread the effect of new links on google pagerank
Done reread the effect of new links on google pagerankDone reread the effect of new links on google pagerank
Done reread the effect of new links on google pagerankJames Arnold
 

Similaire à Markov Chains and PageRank Algorithm (20)

PageRank_algorithm_Nfaoui_El_Habib
PageRank_algorithm_Nfaoui_El_HabibPageRank_algorithm_Nfaoui_El_Habib
PageRank_algorithm_Nfaoui_El_Habib
 
Pagerank
Pagerank Pagerank
Pagerank
 
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
 
A Generalization of the PageRank Algorithm : NOTES
A Generalization of the PageRank Algorithm : NOTESA Generalization of the PageRank Algorithm : NOTES
A Generalization of the PageRank Algorithm : NOTES
 
page rank explication et exemple formule
page rank explication et exemple  formulepage rank explication et exemple  formule
page rank explication et exemple formule
 
Done reread deeperinsidepagerank
Done reread deeperinsidepagerankDone reread deeperinsidepagerank
Done reread deeperinsidepagerank
 
Deeper Inside PageRank (NOTES)
Deeper Inside PageRank (NOTES)Deeper Inside PageRank (NOTES)
Deeper Inside PageRank (NOTES)
 
Random web surfer pagerank algorithm
Random web surfer pagerank algorithmRandom web surfer pagerank algorithm
Random web surfer pagerank algorithm
 
Data.Mining.C.8(Ii).Web Mining 570802461
Data.Mining.C.8(Ii).Web Mining 570802461Data.Mining.C.8(Ii).Web Mining 570802461
Data.Mining.C.8(Ii).Web Mining 570802461
 
I04015559
I04015559I04015559
I04015559
 
Page Rank Link Farm Detection
Page Rank Link Farm DetectionPage Rank Link Farm Detection
Page Rank Link Farm Detection
 
Hui xie 591r_presentation
Hui xie 591r_presentationHui xie 591r_presentation
Hui xie 591r_presentation
 
Markov chains and page rankGraphs.pdf
Markov chains and page rankGraphs.pdfMarkov chains and page rankGraphs.pdf
Markov chains and page rankGraphs.pdf
 
PageRank in Multithreading
PageRank in MultithreadingPageRank in Multithreading
PageRank in Multithreading
 
Done reread sketchinglandscapesofpagefarmsnpcomplete(3)
Done reread sketchinglandscapesofpagefarmsnpcomplete(3)Done reread sketchinglandscapesofpagefarmsnpcomplete(3)
Done reread sketchinglandscapesofpagefarmsnpcomplete(3)
 
Done reread sketchinglandscapesofpagefarmsnpcomplete
Done reread sketchinglandscapesofpagefarmsnpcompleteDone reread sketchinglandscapesofpagefarmsnpcomplete
Done reread sketchinglandscapesofpagefarmsnpcomplete
 
Done reread sketchinglandscapesofpagefarmsnpcomplete(2)
Done reread sketchinglandscapesofpagefarmsnpcomplete(2)Done reread sketchinglandscapesofpagefarmsnpcomplete(2)
Done reread sketchinglandscapesofpagefarmsnpcomplete(2)
 
Prediction Model Using Web Usage Mining Techniques
Prediction Model Using Web Usage Mining TechniquesPrediction Model Using Web Usage Mining Techniques
Prediction Model Using Web Usage Mining Techniques
 
zanardi
zanardizanardi
zanardi
 
Done reread the effect of new links on google pagerank
Done reread the effect of new links on google pagerankDone reread the effect of new links on google pagerank
Done reread the effect of new links on google pagerank
 

Dernier

Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLScyllaDB
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsMark Billinghurst
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embeddingZilliz
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piececharlottematthew16
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenHervé Boutemy
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Wonjun Hwang
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsMiki Katsuragi
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxNavinnSomaal
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Manik S Magar
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfRankYa
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clashcharlottematthew16
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 

Dernier (20)

Developer Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQLDeveloper Data Modeling Mistakes: From Postgres to NoSQL
Developer Data Modeling Mistakes: From Postgres to NoSQL
 
Human Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR SystemsHuman Factors of XR: Using Human Factors to Design XR Systems
Human Factors of XR: Using Human Factors to Design XR Systems
 
Training state-of-the-art general text embedding
Training state-of-the-art general text embeddingTraining state-of-the-art general text embedding
Training state-of-the-art general text embedding
 
Story boards and shot lists for my a level piece
Story boards and shot lists for my a level pieceStory boards and shot lists for my a level piece
Story boards and shot lists for my a level piece
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
DevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache MavenDevoxxFR 2024 Reproducible Builds with Apache Maven
DevoxxFR 2024 Reproducible Builds with Apache Maven
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
Bun (KitWorks Team Study 노별마루 발표 2024.4.22)
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Vertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering TipsVertex AI Gemini Prompt Engineering Tips
Vertex AI Gemini Prompt Engineering Tips
 
SAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptxSAP Build Work Zone - Overview L2-L3.pptx
SAP Build Work Zone - Overview L2-L3.pptx
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!Anypoint Exchange: It’s Not Just a Repo!
Anypoint Exchange: It’s Not Just a Repo!
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
Search Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdfSearch Engine Optimization SEO PDF for 2024.pdf
Search Engine Optimization SEO PDF for 2024.pdf
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Powerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time ClashPowerpoint exploring the locations used in television show Time Clash
Powerpoint exploring the locations used in television show Time Clash
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 

Markov Chains and PageRank Algorithm

  • 1. Markov Chains as methodology used by PageRank to rank the Web Pages on Internet. Sergio S. Guirreri - www.guirreri.host22.com Google Technology User Group (GTUG) of Palermo. 5th March 2010 Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 1 / 14
  • 2. Overview 1 Concepts on Markov-Chains. 2 The idea of the PageRank algorithm. 3 The PageRank algorithm. 4 Solving the PageRank algorithm. 5 Conclusions. 6 Bibliography. 7 Internet web sites. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 2 / 14
  • 3. Concepts on Markov-Chains. Stochastic Process and Markov-Chains. Let assume the following stochastic process {Xn; n = 0, 1, 2, . . . } with values in a set E, called the state space, while its elements are called state of the process. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
  • 4. Concepts on Markov-Chains. Stochastic Process and Markov-Chains. Let assume the following stochastic process {Xn; n = 0, 1, 2, . . . } with values in a set E, called the state space, while its elements are called state of the process. Let assume the set E is finite or countable. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
  • 5. Concepts on Markov-Chains. Stochastic Process and Markov-Chains. Let assume the following stochastic process {Xn; n = 0, 1, 2, . . . } with values in a set E, called the state space, while its elements are called state of the process. Let assume the set E is finite or countable. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
  • 6. Concepts on Markov-Chains. Stochastic Process and Markov-Chains. Let assume the following stochastic process {Xn; n = 0, 1, 2, . . . } with values in a set E, called the state space, while its elements are called state of the process. Let assume the set E is finite or countable. Definition A Markov Chain is a stochastic process Xn that hold the following feature: Prob{Xn+1 = j|Xn = i, Xn−1 = in−1, . . . , X0 = i0} = = Prob{Xn+1 = j|Xn = i} = pij(n) where E is the state space set and j, i, in−1, . . . , i0 ∈ E, n ∈ N. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
  • 7. Concepts on Markov-Chains. Stochastic Process and Markov-Chains. Let assume the following stochastic process {Xn; n = 0, 1, 2, . . . } with values in a set E, called the state space, while its elements are called state of the process. Let assume the set E is finite or countable. Definition A Markov Chain is a stochastic process Xn that hold the following feature: Prob{Xn+1 = j|Xn = i, Xn−1 = in−1, . . . , X0 = i0} = = Prob{Xn+1 = j|Xn = i} = pij(n) where E is the state space set and j, i, in−1, . . . , i0 ∈ E, n ∈ N. The transition probability matrix P of the process Xn is composed of pij, ∀i, j ∈ E. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
  • 8. The idea of the PageRank algorithm. PageRank’s idea. The idea behind the PageRank algorithm is similar to the idea of the impact factor index used to rank the Journals [Page et al.(1999)] [Brin and Page(1998)] [Langville et al.(2008)]. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 4 / 14
  • 9. The idea of the PageRank algorithm. PageRank’s idea. The idea behind the PageRank algorithm is similar to the idea of the impact factor index used to rank the Journals [Page et al.(1999)] [Brin and Page(1998)] [Langville et al.(2008)]. PageRank the impact factor of Internet. The impact factor of a journal is defined as the average number of citations per recently published papers in that journal. By regarding each web page as a journal, this idea was then extended to measure the importance of the web page in the PageRank Algorithm. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 4 / 14
  • 10. The idea of the PageRank algorithm. Elements of the PageRank. To illustrate the PageRank algorithm I define the following variables [Ching and Ng(2006)]: let be N the total number of web pages in the web. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
  • 11. The idea of the PageRank algorithm. Elements of the PageRank. To illustrate the PageRank algorithm I define the following variables [Ching and Ng(2006)]: let be N the total number of web pages in the web. let be k the outgoing links of web page j. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
  • 12. The idea of the PageRank algorithm. Elements of the PageRank. To illustrate the PageRank algorithm I define the following variables [Ching and Ng(2006)]: let be N the total number of web pages in the web. let be k the outgoing links of web page j. let be Q the so called hyperlink matrix with elements: Qij =    1 k if web page i is an outgoing link of web page j; 0 otherwise; Qi,i > 0 ∀i. (1) Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
  • 13. The idea of the PageRank algorithm. Elements of the PageRank. To illustrate the PageRank algorithm I define the following variables [Ching and Ng(2006)]: let be N the total number of web pages in the web. let be k the outgoing links of web page j. let be Q the so called hyperlink matrix with elements: Qij =    1 k if web page i is an outgoing link of web page j; 0 otherwise; Qi,i > 0 ∀i. (1) Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
  • 14. The idea of the PageRank algorithm. Elements of the PageRank. To illustrate the PageRank algorithm I define the following variables [Ching and Ng(2006)]: let be N the total number of web pages in the web. let be k the outgoing links of web page j. let be Q the so called hyperlink matrix with elements: Qij =    1 k if web page i is an outgoing link of web page j; 0 otherwise; Qi,i > 0 ∀i. (1) The hyperlink matrix Q can be regarded as a transition probability matrix of a Markov chain. One may regard a surfer on the net as a random walker and the web pages as the states of the Markov chain. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
  • 15. The PageRank algorithm. The PageRank with irreducible Markov Chain. Assuming that the Markov chain is irreduciblea and aperiodicb then the steady-state probability distribution (p1, p2, . . . , pN )T of the states (web pages) exists. aA Markov chain is irreducible if all states communicate with each other. bA chain is periodic if there exists k > 1 such that the interval between two visits to some state s is always a multiple of k. Therefore a chain is aperiodic if k=1. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
  • 16. The PageRank algorithm. The PageRank with irreducible Markov Chain. Assuming that the Markov chain is irreduciblea and aperiodicb then the steady-state probability distribution (p1, p2, . . . , pN )T of the states (web pages) exists. aA Markov chain is irreducible if all states communicate with each other. bA chain is periodic if there exists k > 1 such that the interval between two visits to some state s is always a multiple of k. Therefore a chain is aperiodic if k=1. The PageRank Each pi is the proportion of time that the surfer visiting the web page i. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
  • 17. The PageRank algorithm. The PageRank with irreducible Markov Chain. Assuming that the Markov chain is irreduciblea and aperiodicb then the steady-state probability distribution (p1, p2, . . . , pN )T of the states (web pages) exists. aA Markov chain is irreducible if all states communicate with each other. bA chain is periodic if there exists k > 1 such that the interval between two visits to some state s is always a multiple of k. Therefore a chain is aperiodic if k=1. The PageRank Each pi is the proportion of time that the surfer visiting the web page i. The higher the value of pi is, the more important web page i will be. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
  • 18. The PageRank algorithm. The PageRank with irreducible Markov Chain. Assuming that the Markov chain is irreduciblea and aperiodicb then the steady-state probability distribution (p1, p2, . . . , pN )T of the states (web pages) exists. aA Markov chain is irreducible if all states communicate with each other. bA chain is periodic if there exists k > 1 such that the interval between two visits to some state s is always a multiple of k. Therefore a chain is aperiodic if k=1. The PageRank Each pi is the proportion of time that the surfer visiting the web page i. The higher the value of pi is, the more important web page i will be. The PageRank of web page i is then defined as pi. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
  • 19. The PageRank algorithm. The PageRank with reducible Markov Chain Since the matrix Q can be reducible to ensure that the steady-state probability exists and is unique the following matrix P must be considered: P = α     Q11 Q12 . . . Q1N Q21 Q22 . . . Q2N . . . . . . . . . . . . QN1 QN2 . . . QNN     + (1 − α) N     1 1 . . . 1 1 1 . . . 1 . . . . . . . . . . . . 1 1 . . . 1     (2) Where 0 < α < 1 and the most popular values of α are 0.85 and (1 − 1/N). Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 7 / 14
  • 20. The PageRank algorithm. The PageRank with reducible Markov Chain Since the matrix Q can be reducible to ensure that the steady-state probability exists and is unique the following matrix P must be considered: P = α     Q11 Q12 . . . Q1N Q21 Q22 . . . Q2N . . . . . . . . . . . . QN1 QN2 . . . QNN     + (1 − α) N     1 1 . . . 1 1 1 . . . 1 . . . . . . . . . . . . 1 1 . . . 1     (2) Where 0 < α < 1 and the most popular values of α are 0.85 and (1 − 1/N). Interpretation of PageRank The idea of the PageRank (2) is that, for a network of N web pages, each web page has an inherent importance of (1 − α)/N. If a page Pi has an importance of pi, then it will contribute an importance of α pi which is shared among the web pages that it points to. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 7 / 14
  • 21. The PageRank algorithm. The PageRank with reducible Markov Chain Solving the following linear system of equations subject to the normalization constraint one can obtain the importance of web page Pi :      p1 p2 ... pN      = α     Q11 Q12 . . . Q1N Q21 Q22 . . . Q2N . . . . . . . . . . . . QN1 QN2 . . . QNN          p1 p2 ... pN      + (1 − α) N      1 1 ... 1      (3) Since N i=1 pi = 1 the (3) can be rewritten as (p1, p2, . . . , pN )T = P(p1, p2, . . . , pN )T Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 8 / 14
  • 22. Solving the PageRank algorithm. The power method. The power method is an iterative method for solving the dominant eigenvalue and its corresponding eigenvectors of a matrix. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
  • 23. Solving the PageRank algorithm. The power method. The power method is an iterative method for solving the dominant eigenvalue and its corresponding eigenvectors of a matrix. Given an n × n matrix A, the hypothesis of power method are: there is a single dominant eigenvalue. The eigenvalues can be sorted: |λ1| > |λ2| ≥ |λ3| ≥ . . . |λn| Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
  • 24. Solving the PageRank algorithm. The power method. The power method is an iterative method for solving the dominant eigenvalue and its corresponding eigenvectors of a matrix. Given an n × n matrix A, the hypothesis of power method are: there is a single dominant eigenvalue. The eigenvalues can be sorted: |λ1| > |λ2| ≥ |λ3| ≥ . . . |λn| there is a linearly independent set of n eigenvectors: {u(1) , u(2) , . . . , u(n) } Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
  • 25. Solving the PageRank algorithm. The power method. The power method is an iterative method for solving the dominant eigenvalue and its corresponding eigenvectors of a matrix. Given an n × n matrix A, the hypothesis of power method are: there is a single dominant eigenvalue. The eigenvalues can be sorted: |λ1| > |λ2| ≥ |λ3| ≥ . . . |λn| there is a linearly independent set of n eigenvectors: {u(1) , u(2) , . . . , u(n) } Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
  • 26. Solving the PageRank algorithm. The power method. The power method is an iterative method for solving the dominant eigenvalue and its corresponding eigenvectors of a matrix. Given an n × n matrix A, the hypothesis of power method are: there is a single dominant eigenvalue. The eigenvalues can be sorted: |λ1| > |λ2| ≥ |λ3| ≥ . . . |λn| there is a linearly independent set of n eigenvectors: {u(1) , u(2) , . . . , u(n) } so that Au(i) = λiu(i) , i = 1, . . . , n. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
  • 27. Solving the PageRank algorithm. The power method. The initial vector x0 can be wrote: x(0) = a1u(1) + a2u(2) + · · · + anu(n) Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
  • 28. Solving the PageRank algorithm. The power method. The initial vector x0 can be wrote: x(0) = a1u(1) + a2u(2) + · · · + anu(n) iterating the initial vector with the A matrix: Ak x(0) = a1Ak u(1) + a2Ak u(2) + · · · + anAk u(n) Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
  • 29. Solving the PageRank algorithm. The power method. The initial vector x0 can be wrote: x(0) = a1u(1) + a2u(2) + · · · + anu(n) iterating the initial vector with the A matrix: Ak x(0) = a1Ak u(1) + a2Ak u(2) + · · · + anAk u(n) = a1λk 1u(1) + a2λk 2u(2) + · · · + anλk nu(n) . Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
  • 30. Solving the PageRank algorithm. The power method. The initial vector x0 can be wrote: x(0) = a1u(1) + a2u(2) + · · · + anu(n) iterating the initial vector with the A matrix: Ak x(0) = a1Ak u(1) + a2Ak u(2) + · · · + anAk u(n) = a1λk 1u(1) + a2λk 2u(2) + · · · + anλk nu(n) . dividing by λk 1 Ak x(0) λk 1 = a1u(1) + a2 λ2 λ1 k u(2) + · · · + an λn λ1 k u(n) , Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
  • 31. Solving the PageRank algorithm. The power method. The initial vector x0 can be wrote: x(0) = a1u(1) + a2u(2) + · · · + anu(n) iterating the initial vector with the A matrix: Ak x(0) = a1Ak u(1) + a2Ak u(2) + · · · + anAk u(n) = a1λk 1u(1) + a2λk 2u(2) + · · · + anλk nu(n) . dividing by λk 1 Ak x(0) λk 1 = a1u(1) + a2 λ2 λ1 k u(2) + · · · + an λn λ1 k u(n) , Since |λi| |λ1| < 1 → Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
  • 32. Solving the PageRank algorithm. The power method. The initial vector x0 can be wrote: x(0) = a1u(1) + a2u(2) + · · · + anu(n) iterating the initial vector with the A matrix: Ak x(0) = a1Ak u(1) + a2Ak u(2) + · · · + anAk u(n) = a1λk 1u(1) + a2λk 2u(2) + · · · + anλk nu(n) . dividing by λk 1 Ak x(0) λk 1 = a1u(1) + a2 λ2 λ1 k u(2) + · · · + an λn λ1 k u(n) , Since |λi| |λ1| < 1 → lim k→∞ |λi|k |λ1|k = 0 → Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
  • 33. Solving the PageRank algorithm. The power method. The initial vector x0 can be wrote: x(0) = a1u(1) + a2u(2) + · · · + anu(n) iterating the initial vector with the A matrix: Ak x(0) = a1Ak u(1) + a2Ak u(2) + · · · + anAk u(n) = a1λk 1u(1) + a2λk 2u(2) + · · · + anλk nu(n) . dividing by λk 1 Ak x(0) λk 1 = a1u(1) + a2 λ2 λ1 k u(2) + · · · + an λn λ1 k u(n) , Since |λi| |λ1| < 1 → lim k→∞ |λi|k |λ1|k = 0 → Ak ≈ a1λk 1u(1) Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
  • 34. Conclusions. The power method and PageRank. Results. The matrix P of the PageRank algorithm is a stochastic matrix therefore the largest eigenvalue is 1. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
  • 35. Conclusions. The power method and PageRank. Results. The matrix P of the PageRank algorithm is a stochastic matrix therefore the largest eigenvalue is 1. The convergence rate of the power method depends on the ratio of λ2 λ1 . Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
  • 36. Conclusions. The power method and PageRank. Results. The matrix P of the PageRank algorithm is a stochastic matrix therefore the largest eigenvalue is 1. The convergence rate of the power method depends on the ratio of λ2 λ1 . It has been showed by [Haveliwala and Kamvar(2003)] that for the second largest eigenvalue of P, we have |λ2| ≤ α 0 ≤ α ≤ 1. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
  • 37. Conclusions. The power method and PageRank. Results. The matrix P of the PageRank algorithm is a stochastic matrix therefore the largest eigenvalue is 1. The convergence rate of the power method depends on the ratio of λ2 λ1 . It has been showed by [Haveliwala and Kamvar(2003)] that for the second largest eigenvalue of P, we have |λ2| ≤ α 0 ≤ α ≤ 1. Since λ1 = 1 the converge rate depends on α. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
  • 38. Conclusions. The power method and PageRank. Results. The matrix P of the PageRank algorithm is a stochastic matrix therefore the largest eigenvalue is 1. The convergence rate of the power method depends on the ratio of λ2 λ1 . It has been showed by [Haveliwala and Kamvar(2003)] that for the second largest eigenvalue of P, we have |λ2| ≤ α 0 ≤ α ≤ 1. Since λ1 = 1 the converge rate depends on α. The most popular value for α is 0.85. With this value it has been proved that the power method on web data set of over 80 million pages converges in about 50 iterations. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
  • 39. Conclusions. Really thanks to GTUG Palermo and see you to the next meeting! Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 12 / 14
  • 40. Bibliography. Bibliography. Brin, S. and Page, L. (1998). The anatomy of a large-scale hypertextual Web search engine. Computer networks and ISDN systems, 30(1-7), 107–117. Ching, W. and Ng, M. (2006). Markov Chains: Models, Algoritms and Applications. Springer Science + Business Media, Inc. Haveliwala, T. and Kamvar, M. (2003). The second eigenvalue of the google matrix. Technical report, Stanford University. Langville, A., Meyer, C., and Fern´Andez, P. (2008). Google’s PageRank and beyond: the science of search engine rankings. The Mathematical Intelligencer, 30(1), 68–69. Page, L., Brin, S., Motwani, R., and Winograd, T. (1999). The PageRank Citation Ranking: Bringing Order to the Web. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 13 / 14
  • 41. Internet web sites. Internet web sites. Jon Atle Gulla (2007) - From Google Search to Semantic Exploration. - Norwegian University of Science Technology - www.slideshare.net/sveino/semantics-and-search?type=presentation Steven Levy (2010) - Exclusive: How Google’s Algorithm Rules the Web - Wired Magazine - www.wired.com/magazine/2010/02/ff_google_algorithm/ Ann Smarty (2009) - Let’s Try to Find All 200 Parameters in Google Algorithm - Search Engine Journal - www.searchenginejournal.com/200-parameters-in-google-algorithm/15457/. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 14 / 14