SlideShare une entreprise Scribd logo
1  sur  41
Télécharger pour lire hors ligne
Markov Chains as methodology used by PageRank to
rank the Web Pages on Internet.
Sergio S. Guirreri - www.guirreri.host22.com
Google Technology User Group (GTUG) of Palermo.
5th March 2010
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 1 / 14
Overview
1 Concepts on Markov-Chains.
2 The idea of the PageRank algorithm.
3 The PageRank algorithm.
4 Solving the PageRank algorithm.
5 Conclusions.
6 Bibliography.
7 Internet web sites.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 2 / 14
Concepts on Markov-Chains.
Stochastic Process and Markov-Chains.
Let assume the following stochastic process
{Xn; n = 0, 1, 2, . . . }
with values in a set E, called the state space, while its elements are called
state of the process.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
Concepts on Markov-Chains.
Stochastic Process and Markov-Chains.
Let assume the following stochastic process
{Xn; n = 0, 1, 2, . . . }
with values in a set E, called the state space, while its elements are called
state of the process.
Let assume the set E is finite or countable.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
Concepts on Markov-Chains.
Stochastic Process and Markov-Chains.
Let assume the following stochastic process
{Xn; n = 0, 1, 2, . . . }
with values in a set E, called the state space, while its elements are called
state of the process.
Let assume the set E is finite or countable.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
Concepts on Markov-Chains.
Stochastic Process and Markov-Chains.
Let assume the following stochastic process
{Xn; n = 0, 1, 2, . . . }
with values in a set E, called the state space, while its elements are called
state of the process.
Let assume the set E is finite or countable.
Definition
A Markov Chain is a stochastic process Xn that hold the following feature:
Prob{Xn+1 = j|Xn = i, Xn−1 = in−1, . . . , X0 = i0} =
= Prob{Xn+1 = j|Xn = i} = pij(n)
where E is the state space set and j, i, in−1, . . . , i0 ∈ E, n ∈ N.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
Concepts on Markov-Chains.
Stochastic Process and Markov-Chains.
Let assume the following stochastic process
{Xn; n = 0, 1, 2, . . . }
with values in a set E, called the state space, while its elements are called
state of the process.
Let assume the set E is finite or countable.
Definition
A Markov Chain is a stochastic process Xn that hold the following feature:
Prob{Xn+1 = j|Xn = i, Xn−1 = in−1, . . . , X0 = i0} =
= Prob{Xn+1 = j|Xn = i} = pij(n)
where E is the state space set and j, i, in−1, . . . , i0 ∈ E, n ∈ N.
The transition probability matrix P of the process Xn is composed of pij,
∀i, j ∈ E.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
The idea of the PageRank algorithm.
PageRank’s idea.
The idea behind the PageRank algorithm is similar to the idea of the impact
factor index used to rank the Journals [Page et al.(1999)]
[Brin and Page(1998)] [Langville et al.(2008)].
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 4 / 14
The idea of the PageRank algorithm.
PageRank’s idea.
The idea behind the PageRank algorithm is similar to the idea of the impact
factor index used to rank the Journals [Page et al.(1999)]
[Brin and Page(1998)] [Langville et al.(2008)].
PageRank the impact factor of Internet.
The impact factor of a journal is defined as the average number of citations
per recently published papers in that journal.
By regarding each web page as a journal, this idea was then extended to
measure the importance of the web page in the PageRank Algorithm.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 4 / 14
The idea of the PageRank algorithm.
Elements of the PageRank.
To illustrate the PageRank algorithm I define the following variables
[Ching and Ng(2006)]:
let be N the total number of web pages in the web.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
The idea of the PageRank algorithm.
Elements of the PageRank.
To illustrate the PageRank algorithm I define the following variables
[Ching and Ng(2006)]:
let be N the total number of web pages in the web.
let be k the outgoing links of web page j.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
The idea of the PageRank algorithm.
Elements of the PageRank.
To illustrate the PageRank algorithm I define the following variables
[Ching and Ng(2006)]:
let be N the total number of web pages in the web.
let be k the outgoing links of web page j.
let be Q the so called hyperlink matrix with elements:
Qij =



1
k if web page i is an outgoing link of web page j;
0 otherwise;
Qi,i > 0 ∀i.
(1)
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
The idea of the PageRank algorithm.
Elements of the PageRank.
To illustrate the PageRank algorithm I define the following variables
[Ching and Ng(2006)]:
let be N the total number of web pages in the web.
let be k the outgoing links of web page j.
let be Q the so called hyperlink matrix with elements:
Qij =



1
k if web page i is an outgoing link of web page j;
0 otherwise;
Qi,i > 0 ∀i.
(1)
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
The idea of the PageRank algorithm.
Elements of the PageRank.
To illustrate the PageRank algorithm I define the following variables
[Ching and Ng(2006)]:
let be N the total number of web pages in the web.
let be k the outgoing links of web page j.
let be Q the so called hyperlink matrix with elements:
Qij =



1
k if web page i is an outgoing link of web page j;
0 otherwise;
Qi,i > 0 ∀i.
(1)
The hyperlink matrix Q can be regarded as a transition probability matrix of
a Markov chain.
One may regard a surfer on the net as a random walker and the web pages as
the states of the Markov chain.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
The PageRank algorithm.
The PageRank with irreducible Markov Chain.
Assuming that the Markov chain is irreduciblea
and aperiodicb
then the
steady-state probability distribution (p1, p2, . . . , pN )T
of the states (web
pages) exists.
aA Markov chain is irreducible if all states communicate with each other.
bA chain is periodic if there exists k > 1 such that the interval between two visits to some
state s is always a multiple of k. Therefore a chain is aperiodic if k=1.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
The PageRank algorithm.
The PageRank with irreducible Markov Chain.
Assuming that the Markov chain is irreduciblea
and aperiodicb
then the
steady-state probability distribution (p1, p2, . . . , pN )T
of the states (web
pages) exists.
aA Markov chain is irreducible if all states communicate with each other.
bA chain is periodic if there exists k > 1 such that the interval between two visits to some
state s is always a multiple of k. Therefore a chain is aperiodic if k=1.
The PageRank
Each pi is the proportion of time that the surfer visiting the web page i.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
The PageRank algorithm.
The PageRank with irreducible Markov Chain.
Assuming that the Markov chain is irreduciblea
and aperiodicb
then the
steady-state probability distribution (p1, p2, . . . , pN )T
of the states (web
pages) exists.
aA Markov chain is irreducible if all states communicate with each other.
bA chain is periodic if there exists k > 1 such that the interval between two visits to some
state s is always a multiple of k. Therefore a chain is aperiodic if k=1.
The PageRank
Each pi is the proportion of time that the surfer visiting the web page i.
The higher the value of pi is, the more important web page i will be.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
The PageRank algorithm.
The PageRank with irreducible Markov Chain.
Assuming that the Markov chain is irreduciblea
and aperiodicb
then the
steady-state probability distribution (p1, p2, . . . , pN )T
of the states (web
pages) exists.
aA Markov chain is irreducible if all states communicate with each other.
bA chain is periodic if there exists k > 1 such that the interval between two visits to some
state s is always a multiple of k. Therefore a chain is aperiodic if k=1.
The PageRank
Each pi is the proportion of time that the surfer visiting the web page i.
The higher the value of pi is, the more important web page i will be.
The PageRank of web page i is then defined as pi.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
The PageRank algorithm.
The PageRank with reducible Markov Chain
Since the matrix Q can be reducible to ensure that the steady-state
probability exists and is unique the following matrix P must be considered:
P = α




Q11 Q12 . . . Q1N
Q21 Q22 . . . Q2N
. . . . . . . . . . . .
QN1 QN2 . . . QNN



 +
(1 − α)
N




1 1 . . . 1
1 1 . . . 1
. . . . . . . . . . . .
1 1 . . . 1



 (2)
Where 0 < α < 1 and the most popular values of α are 0.85 and (1 − 1/N).
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 7 / 14
The PageRank algorithm.
The PageRank with reducible Markov Chain
Since the matrix Q can be reducible to ensure that the steady-state
probability exists and is unique the following matrix P must be considered:
P = α




Q11 Q12 . . . Q1N
Q21 Q22 . . . Q2N
. . . . . . . . . . . .
QN1 QN2 . . . QNN



 +
(1 − α)
N




1 1 . . . 1
1 1 . . . 1
. . . . . . . . . . . .
1 1 . . . 1



 (2)
Where 0 < α < 1 and the most popular values of α are 0.85 and (1 − 1/N).
Interpretation of PageRank
The idea of the PageRank (2) is that, for a network of N web pages, each web
page has an inherent importance of (1 − α)/N.
If a page Pi has an importance of pi, then it will contribute an importance of
α pi which is shared among the web pages that it points to.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 7 / 14
The PageRank algorithm.
The PageRank with reducible Markov Chain
Solving the following linear system of equations subject to the normalization
constraint one can obtain the importance of web page Pi :





p1
p2
...
pN





= α




Q11 Q12 . . . Q1N
Q21 Q22 . . . Q2N
. . . . . . . . . . . .
QN1 QN2 . . . QNN









p1
p2
...
pN





+
(1 − α)
N





1
1
...
1





(3)
Since
N
i=1
pi = 1
the (3) can be rewritten as
(p1, p2, . . . , pN )T
= P(p1, p2, . . . , pN )T
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 8 / 14
Solving the PageRank algorithm.
The power method.
The power method is an iterative method for solving the dominant eigenvalue
and its corresponding eigenvectors of a matrix.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
Solving the PageRank algorithm.
The power method.
The power method is an iterative method for solving the dominant eigenvalue
and its corresponding eigenvectors of a matrix.
Given an n × n matrix A, the hypothesis of power method are:
there is a single dominant eigenvalue. The eigenvalues can be sorted:
|λ1| > |λ2| ≥ |λ3| ≥ . . . |λn|
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
Solving the PageRank algorithm.
The power method.
The power method is an iterative method for solving the dominant eigenvalue
and its corresponding eigenvectors of a matrix.
Given an n × n matrix A, the hypothesis of power method are:
there is a single dominant eigenvalue. The eigenvalues can be sorted:
|λ1| > |λ2| ≥ |λ3| ≥ . . . |λn|
there is a linearly independent set of n eigenvectors:
{u(1)
, u(2)
, . . . , u(n)
}
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
Solving the PageRank algorithm.
The power method.
The power method is an iterative method for solving the dominant eigenvalue
and its corresponding eigenvectors of a matrix.
Given an n × n matrix A, the hypothesis of power method are:
there is a single dominant eigenvalue. The eigenvalues can be sorted:
|λ1| > |λ2| ≥ |λ3| ≥ . . . |λn|
there is a linearly independent set of n eigenvectors:
{u(1)
, u(2)
, . . . , u(n)
}
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
Solving the PageRank algorithm.
The power method.
The power method is an iterative method for solving the dominant eigenvalue
and its corresponding eigenvectors of a matrix.
Given an n × n matrix A, the hypothesis of power method are:
there is a single dominant eigenvalue. The eigenvalues can be sorted:
|λ1| > |λ2| ≥ |λ3| ≥ . . . |λn|
there is a linearly independent set of n eigenvectors:
{u(1)
, u(2)
, . . . , u(n)
}
so that
Au(i)
= λiu(i)
, i = 1, . . . , n.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
Solving the PageRank algorithm.
The power method.
The initial vector x0 can be wrote:
x(0)
= a1u(1)
+ a2u(2)
+ · · · + anu(n)
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
Solving the PageRank algorithm.
The power method.
The initial vector x0 can be wrote:
x(0)
= a1u(1)
+ a2u(2)
+ · · · + anu(n)
iterating the initial vector with the A matrix:
Ak
x(0)
= a1Ak
u(1)
+ a2Ak
u(2)
+ · · · + anAk
u(n)
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
Solving the PageRank algorithm.
The power method.
The initial vector x0 can be wrote:
x(0)
= a1u(1)
+ a2u(2)
+ · · · + anu(n)
iterating the initial vector with the A matrix:
Ak
x(0)
= a1Ak
u(1)
+ a2Ak
u(2)
+ · · · + anAk
u(n)
= a1λk
1u(1)
+ a2λk
2u(2)
+ · · · + anλk
nu(n)
.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
Solving the PageRank algorithm.
The power method.
The initial vector x0 can be wrote:
x(0)
= a1u(1)
+ a2u(2)
+ · · · + anu(n)
iterating the initial vector with the A matrix:
Ak
x(0)
= a1Ak
u(1)
+ a2Ak
u(2)
+ · · · + anAk
u(n)
= a1λk
1u(1)
+ a2λk
2u(2)
+ · · · + anλk
nu(n)
.
dividing by λk
1
Ak
x(0)
λk
1
= a1u(1)
+ a2
λ2
λ1
k
u(2)
+ · · · + an
λn
λ1
k
u(n)
,
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
Solving the PageRank algorithm.
The power method.
The initial vector x0 can be wrote:
x(0)
= a1u(1)
+ a2u(2)
+ · · · + anu(n)
iterating the initial vector with the A matrix:
Ak
x(0)
= a1Ak
u(1)
+ a2Ak
u(2)
+ · · · + anAk
u(n)
= a1λk
1u(1)
+ a2λk
2u(2)
+ · · · + anλk
nu(n)
.
dividing by λk
1
Ak
x(0)
λk
1
= a1u(1)
+ a2
λ2
λ1
k
u(2)
+ · · · + an
λn
λ1
k
u(n)
,
Since
|λi|
|λ1|
< 1 →
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
Solving the PageRank algorithm.
The power method.
The initial vector x0 can be wrote:
x(0)
= a1u(1)
+ a2u(2)
+ · · · + anu(n)
iterating the initial vector with the A matrix:
Ak
x(0)
= a1Ak
u(1)
+ a2Ak
u(2)
+ · · · + anAk
u(n)
= a1λk
1u(1)
+ a2λk
2u(2)
+ · · · + anλk
nu(n)
.
dividing by λk
1
Ak
x(0)
λk
1
= a1u(1)
+ a2
λ2
λ1
k
u(2)
+ · · · + an
λn
λ1
k
u(n)
,
Since
|λi|
|λ1|
< 1 → lim
k→∞
|λi|k
|λ1|k
= 0 →
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
Solving the PageRank algorithm.
The power method.
The initial vector x0 can be wrote:
x(0)
= a1u(1)
+ a2u(2)
+ · · · + anu(n)
iterating the initial vector with the A matrix:
Ak
x(0)
= a1Ak
u(1)
+ a2Ak
u(2)
+ · · · + anAk
u(n)
= a1λk
1u(1)
+ a2λk
2u(2)
+ · · · + anλk
nu(n)
.
dividing by λk
1
Ak
x(0)
λk
1
= a1u(1)
+ a2
λ2
λ1
k
u(2)
+ · · · + an
λn
λ1
k
u(n)
,
Since
|λi|
|λ1|
< 1 → lim
k→∞
|λi|k
|λ1|k
= 0 → Ak
≈ a1λk
1u(1)
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
Conclusions.
The power method and PageRank.
Results.
The matrix P of the PageRank algorithm is a stochastic matrix therefore
the largest eigenvalue is 1.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
Conclusions.
The power method and PageRank.
Results.
The matrix P of the PageRank algorithm is a stochastic matrix therefore
the largest eigenvalue is 1.
The convergence rate of the power method depends on the ratio of λ2
λ1
.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
Conclusions.
The power method and PageRank.
Results.
The matrix P of the PageRank algorithm is a stochastic matrix therefore
the largest eigenvalue is 1.
The convergence rate of the power method depends on the ratio of λ2
λ1
.
It has been showed by [Haveliwala and Kamvar(2003)] that for the second
largest eigenvalue of P, we have
|λ2| ≤ α 0 ≤ α ≤ 1.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
Conclusions.
The power method and PageRank.
Results.
The matrix P of the PageRank algorithm is a stochastic matrix therefore
the largest eigenvalue is 1.
The convergence rate of the power method depends on the ratio of λ2
λ1
.
It has been showed by [Haveliwala and Kamvar(2003)] that for the second
largest eigenvalue of P, we have
|λ2| ≤ α 0 ≤ α ≤ 1.
Since λ1 = 1 the converge rate depends on α.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
Conclusions.
The power method and PageRank.
Results.
The matrix P of the PageRank algorithm is a stochastic matrix therefore
the largest eigenvalue is 1.
The convergence rate of the power method depends on the ratio of λ2
λ1
.
It has been showed by [Haveliwala and Kamvar(2003)] that for the second
largest eigenvalue of P, we have
|λ2| ≤ α 0 ≤ α ≤ 1.
Since λ1 = 1 the converge rate depends on α.
The most popular value for α is 0.85. With this value it has been proved
that the power method on web data set of over 80 million pages converges
in about 50 iterations.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
Conclusions.
Really thanks to GTUG Palermo
and
see you to the next meeting!
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 12 / 14
Bibliography.
Bibliography.
Brin, S. and Page, L. (1998).
The anatomy of a large-scale hypertextual Web search engine.
Computer networks and ISDN systems, 30(1-7), 107–117.
Ching, W. and Ng, M. (2006).
Markov Chains: Models, Algoritms and Applications.
Springer Science + Business Media, Inc.
Haveliwala, T. and Kamvar, M. (2003).
The second eigenvalue of the google matrix.
Technical report, Stanford University.
Langville, A., Meyer, C., and Fern´Andez, P. (2008).
Google’s PageRank and beyond: the science of search engine rankings.
The Mathematical Intelligencer, 30(1), 68–69.
Page, L., Brin, S., Motwani, R., and Winograd, T. (1999).
The PageRank Citation Ranking: Bringing Order to the Web.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 13 / 14
Internet web sites.
Internet web sites.
Jon Atle Gulla (2007) - From Google Search to Semantic Exploration. -
Norwegian University of Science Technology -
www.slideshare.net/sveino/semantics-and-search?type=presentation
Steven Levy (2010) - Exclusive: How Google’s Algorithm Rules the Web - Wired
Magazine - www.wired.com/magazine/2010/02/ff_google_algorithm/
Ann Smarty (2009) - Let’s Try to Find All 200 Parameters in Google Algorithm -
Search Engine Journal -
www.searchenginejournal.com/200-parameters-in-google-algorithm/15457/.
Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 14 / 14

Contenu connexe

Tendances

Pagerank Algorithm Explained
Pagerank Algorithm ExplainedPagerank Algorithm Explained
Pagerank Algorithm Explainedjdhaar
 
page ranking algorithm
page ranking algorithmpage ranking algorithm
page ranking algorithmJaved Khan
 
Linear algebra behind Google search
Linear algebra behind Google searchLinear algebra behind Google search
Linear algebra behind Google searchPlusOrMinusZero
 
Page-Rank Algorithm Final
Page-Rank Algorithm FinalPage-Rank Algorithm Final
Page-Rank Algorithm FinalWilliam Keene
 
Page rank
Page rankPage rank
Page rankCarlos
 
Webpage Classification
Webpage ClassificationWebpage Classification
Webpage ClassificationPacharaStudio
 
PageRank_algorithm_Nfaoui_El_Habib
PageRank_algorithm_Nfaoui_El_HabibPageRank_algorithm_Nfaoui_El_Habib
PageRank_algorithm_Nfaoui_El_HabibEl Habib NFAOUI
 
Implementing page rank algorithm using hadoop map reduce
Implementing page rank algorithm using hadoop map reduceImplementing page rank algorithm using hadoop map reduce
Implementing page rank algorithm using hadoop map reduceFarzan Hajian
 
LINEAR ALGEBRA BEHIND GOOGLE SEARCH
LINEAR ALGEBRA BEHIND GOOGLE SEARCHLINEAR ALGEBRA BEHIND GOOGLE SEARCH
LINEAR ALGEBRA BEHIND GOOGLE SEARCHDivyansh Verma
 
Logistic Regression | Logistic Regression In Python | Machine Learning Algori...
Logistic Regression | Logistic Regression In Python | Machine Learning Algori...Logistic Regression | Logistic Regression In Python | Machine Learning Algori...
Logistic Regression | Logistic Regression In Python | Machine Learning Algori...Simplilearn
 
Search Engine Optimization (SEO) Seminar Report
Search Engine Optimization (SEO) Seminar ReportSearch Engine Optimization (SEO) Seminar Report
Search Engine Optimization (SEO) Seminar ReportNandu B Rajan
 
Stock market prediction using Twitter sentiment analysis
Stock market prediction using Twitter sentiment analysisStock market prediction using Twitter sentiment analysis
Stock market prediction using Twitter sentiment analysisjournal ijrtem
 
Linear Regression Algorithm | Linear Regression in Python | Machine Learning ...
Linear Regression Algorithm | Linear Regression in Python | Machine Learning ...Linear Regression Algorithm | Linear Regression in Python | Machine Learning ...
Linear Regression Algorithm | Linear Regression in Python | Machine Learning ...Edureka!
 
Keyword Research | Keyword Research For SEO 2019 | SEO Tutorial For Beginners...
Keyword Research | Keyword Research For SEO 2019 | SEO Tutorial For Beginners...Keyword Research | Keyword Research For SEO 2019 | SEO Tutorial For Beginners...
Keyword Research | Keyword Research For SEO 2019 | SEO Tutorial For Beginners...Simplilearn
 

Tendances (20)

Google PageRank
Google PageRankGoogle PageRank
Google PageRank
 
Pagerank Algorithm Explained
Pagerank Algorithm ExplainedPagerank Algorithm Explained
Pagerank Algorithm Explained
 
page ranking algorithm
page ranking algorithmpage ranking algorithm
page ranking algorithm
 
Linear algebra behind Google search
Linear algebra behind Google searchLinear algebra behind Google search
Linear algebra behind Google search
 
Page-Rank Algorithm Final
Page-Rank Algorithm FinalPage-Rank Algorithm Final
Page-Rank Algorithm Final
 
Link Analysis
Link AnalysisLink Analysis
Link Analysis
 
Page rank
Page rankPage rank
Page rank
 
Webpage Classification
Webpage ClassificationWebpage Classification
Webpage Classification
 
Web Crawler
Web CrawlerWeb Crawler
Web Crawler
 
PageRank_algorithm_Nfaoui_El_Habib
PageRank_algorithm_Nfaoui_El_HabibPageRank_algorithm_Nfaoui_El_Habib
PageRank_algorithm_Nfaoui_El_Habib
 
K Nearest Neighbor Algorithm
K Nearest Neighbor AlgorithmK Nearest Neighbor Algorithm
K Nearest Neighbor Algorithm
 
Implementing page rank algorithm using hadoop map reduce
Implementing page rank algorithm using hadoop map reduceImplementing page rank algorithm using hadoop map reduce
Implementing page rank algorithm using hadoop map reduce
 
LINEAR ALGEBRA BEHIND GOOGLE SEARCH
LINEAR ALGEBRA BEHIND GOOGLE SEARCHLINEAR ALGEBRA BEHIND GOOGLE SEARCH
LINEAR ALGEBRA BEHIND GOOGLE SEARCH
 
Recommender Systems
Recommender SystemsRecommender Systems
Recommender Systems
 
Logistic Regression | Logistic Regression In Python | Machine Learning Algori...
Logistic Regression | Logistic Regression In Python | Machine Learning Algori...Logistic Regression | Logistic Regression In Python | Machine Learning Algori...
Logistic Regression | Logistic Regression In Python | Machine Learning Algori...
 
Search Engine Optimization (SEO) Seminar Report
Search Engine Optimization (SEO) Seminar ReportSearch Engine Optimization (SEO) Seminar Report
Search Engine Optimization (SEO) Seminar Report
 
Seo syllabus pdf
Seo syllabus pdfSeo syllabus pdf
Seo syllabus pdf
 
Stock market prediction using Twitter sentiment analysis
Stock market prediction using Twitter sentiment analysisStock market prediction using Twitter sentiment analysis
Stock market prediction using Twitter sentiment analysis
 
Linear Regression Algorithm | Linear Regression in Python | Machine Learning ...
Linear Regression Algorithm | Linear Regression in Python | Machine Learning ...Linear Regression Algorithm | Linear Regression in Python | Machine Learning ...
Linear Regression Algorithm | Linear Regression in Python | Machine Learning ...
 
Keyword Research | Keyword Research For SEO 2019 | SEO Tutorial For Beginners...
Keyword Research | Keyword Research For SEO 2019 | SEO Tutorial For Beginners...Keyword Research | Keyword Research For SEO 2019 | SEO Tutorial For Beginners...
Keyword Research | Keyword Research For SEO 2019 | SEO Tutorial For Beginners...
 

En vedette

The Google Pagerank algorithm - How does it work?
The Google Pagerank algorithm - How does it work?The Google Pagerank algorithm - How does it work?
The Google Pagerank algorithm - How does it work?Kundan Bhaduri
 
A Comparative Study of HITS vs PageRank Algorithms for Twitter Users Analysis
A Comparative Study of HITS vs PageRank Algorithms for Twitter Users AnalysisA Comparative Study of HITS vs PageRank Algorithms for Twitter Users Analysis
A Comparative Study of HITS vs PageRank Algorithms for Twitter Users AnalysisPoo Kuan Hoong
 
Population 3.5 - Spearman’S Rank
Population 3.5 - Spearman’S RankPopulation 3.5 - Spearman’S Rank
Population 3.5 - Spearman’S RankEcumene
 
GCSE Geography: How And Why To Use Spearman’s Rank
GCSE Geography: How And Why To Use Spearman’s RankGCSE Geography: How And Why To Use Spearman’s Rank
GCSE Geography: How And Why To Use Spearman’s RankMark Cowan
 
Spearman Rank Correlation Presentation
Spearman Rank Correlation PresentationSpearman Rank Correlation Presentation
Spearman Rank Correlation Presentationcae_021
 
Halloween.Presentacion
Halloween.PresentacionHalloween.Presentacion
Halloween.PresentacionCarolinaMiau
 
Giornata Mondiale del Migrante e del Rifugiato
Giornata Mondiale del Migrante e del RifugiatoGiornata Mondiale del Migrante e del Rifugiato
Giornata Mondiale del Migrante e del RifugiatoMaike Loes
 
Secrets & Lies
Secrets & LiesSecrets & Lies
Secrets & LiesSam Baksh
 
Study skillsppt
Study skillspptStudy skillsppt
Study skillspptrnesbit
 
All-Pro Fasteners
All-Pro FastenersAll-Pro Fasteners
All-Pro FastenersTodd Grzych
 
Action adventure movie shots
Action adventure movie shotsAction adventure movie shots
Action adventure movie shotsstoliros
 
Presentation alevel work max
Presentation alevel work maxPresentation alevel work max
Presentation alevel work maxstoliros
 
A Story of Tiny Frog
A Story of Tiny FrogA Story of Tiny Frog
A Story of Tiny Frogbhatsumanta
 
印刷に耐えられる品質のQRCodeをPerlで生成したよ! - Mishima.pm #1
印刷に耐えられる品質のQRCodeをPerlで生成したよ! - Mishima.pm #1印刷に耐えられる品質のQRCodeをPerlで生成したよ! - Mishima.pm #1
印刷に耐えられる品質のQRCodeをPerlで生成したよ! - Mishima.pm #1azuma satoshi
 
инвестиционный паспорт города орла
инвестиционный паспорт города орлаинвестиционный паспорт города орла
инвестиционный паспорт города орлаlitavrika
 

En vedette (20)

The Google Pagerank algorithm - How does it work?
The Google Pagerank algorithm - How does it work?The Google Pagerank algorithm - How does it work?
The Google Pagerank algorithm - How does it work?
 
A Comparative Study of HITS vs PageRank Algorithms for Twitter Users Analysis
A Comparative Study of HITS vs PageRank Algorithms for Twitter Users AnalysisA Comparative Study of HITS vs PageRank Algorithms for Twitter Users Analysis
A Comparative Study of HITS vs PageRank Algorithms for Twitter Users Analysis
 
Population 3.5 - Spearman’S Rank
Population 3.5 - Spearman’S RankPopulation 3.5 - Spearman’S Rank
Population 3.5 - Spearman’S Rank
 
GCSE Geography: How And Why To Use Spearman’s Rank
GCSE Geography: How And Why To Use Spearman’s RankGCSE Geography: How And Why To Use Spearman’s Rank
GCSE Geography: How And Why To Use Spearman’s Rank
 
Spearman Rank Correlation Presentation
Spearman Rank Correlation PresentationSpearman Rank Correlation Presentation
Spearman Rank Correlation Presentation
 
Halloween.Presentacion
Halloween.PresentacionHalloween.Presentacion
Halloween.Presentacion
 
Giornata Mondiale del Migrante e del Rifugiato
Giornata Mondiale del Migrante e del RifugiatoGiornata Mondiale del Migrante e del Rifugiato
Giornata Mondiale del Migrante e del Rifugiato
 
Secrets & Lies
Secrets & LiesSecrets & Lies
Secrets & Lies
 
Study skillsppt
Study skillspptStudy skillsppt
Study skillsppt
 
All-Pro Fasteners
All-Pro FastenersAll-Pro Fasteners
All-Pro Fasteners
 
Action adventure movie shots
Action adventure movie shotsAction adventure movie shots
Action adventure movie shots
 
birthday
birthdaybirthday
birthday
 
Presentation alevel work max
Presentation alevel work maxPresentation alevel work max
Presentation alevel work max
 
A Story of Tiny Frog
A Story of Tiny FrogA Story of Tiny Frog
A Story of Tiny Frog
 
Tema 2
Tema 2Tema 2
Tema 2
 
Farewell (Mae, batch 2011)
Farewell (Mae, batch 2011)Farewell (Mae, batch 2011)
Farewell (Mae, batch 2011)
 
A3 sergiodelpino
A3 sergiodelpinoA3 sergiodelpino
A3 sergiodelpino
 
Pati alegre (fil eminimizer)
Pati alegre (fil eminimizer)Pati alegre (fil eminimizer)
Pati alegre (fil eminimizer)
 
印刷に耐えられる品質のQRCodeをPerlで生成したよ! - Mishima.pm #1
印刷に耐えられる品質のQRCodeをPerlで生成したよ! - Mishima.pm #1印刷に耐えられる品質のQRCodeをPerlで生成したよ! - Mishima.pm #1
印刷に耐えられる品質のQRCodeをPerlで生成したよ! - Mishima.pm #1
 
инвестиционный паспорт города орла
инвестиционный паспорт города орлаинвестиционный паспорт города орла
инвестиционный паспорт города орла
 

Similaire à PageRank and Markov Chain

Pagerank
Pagerank Pagerank
Pagerank C C
 
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...IOSR Journals
 
A Generalization of the PageRank Algorithm : NOTES
A Generalization of the PageRank Algorithm : NOTESA Generalization of the PageRank Algorithm : NOTES
A Generalization of the PageRank Algorithm : NOTESSubhajit Sahu
 
page rank explication et exemple formule
page rank explication et exemple  formulepage rank explication et exemple  formule
page rank explication et exemple formuleRamiHarrathi1
 
Done reread deeperinsidepagerank
Done reread deeperinsidepagerankDone reread deeperinsidepagerank
Done reread deeperinsidepagerankJames Arnold
 
Deeper Inside PageRank (NOTES)
Deeper Inside PageRank (NOTES)Deeper Inside PageRank (NOTES)
Deeper Inside PageRank (NOTES)Subhajit Sahu
 
Random web surfer pagerank algorithm
Random web surfer pagerank algorithmRandom web surfer pagerank algorithm
Random web surfer pagerank algorithmalexandrelevada
 
Data.Mining.C.8(Ii).Web Mining 570802461
Data.Mining.C.8(Ii).Web Mining 570802461Data.Mining.C.8(Ii).Web Mining 570802461
Data.Mining.C.8(Ii).Web Mining 570802461Margaret Wang
 
Hui xie 591r_presentation
Hui xie 591r_presentationHui xie 591r_presentation
Hui xie 591r_presentationramiro
 
Markov chains and page rankGraphs.pdf
Markov chains and page rankGraphs.pdfMarkov chains and page rankGraphs.pdf
Markov chains and page rankGraphs.pdfrayyverma
 
PageRank in Multithreading
PageRank in MultithreadingPageRank in Multithreading
PageRank in MultithreadingShujian Zhang
 
Done reread sketchinglandscapesofpagefarmsnpcomplete(3)
Done reread sketchinglandscapesofpagefarmsnpcomplete(3)Done reread sketchinglandscapesofpagefarmsnpcomplete(3)
Done reread sketchinglandscapesofpagefarmsnpcomplete(3)James Arnold
 
Done reread sketchinglandscapesofpagefarmsnpcomplete
Done reread sketchinglandscapesofpagefarmsnpcompleteDone reread sketchinglandscapesofpagefarmsnpcomplete
Done reread sketchinglandscapesofpagefarmsnpcompleteJames Arnold
 
Done reread sketchinglandscapesofpagefarmsnpcomplete(2)
Done reread sketchinglandscapesofpagefarmsnpcomplete(2)Done reread sketchinglandscapesofpagefarmsnpcomplete(2)
Done reread sketchinglandscapesofpagefarmsnpcomplete(2)James Arnold
 
Prediction Model Using Web Usage Mining Techniques
Prediction Model Using Web Usage Mining TechniquesPrediction Model Using Web Usage Mining Techniques
Prediction Model Using Web Usage Mining TechniquesEditor IJCATR
 
Done reread the effect of new links on google pagerank
Done reread the effect of new links on google pagerankDone reread the effect of new links on google pagerank
Done reread the effect of new links on google pagerankJames Arnold
 
Markov chain and its Application
Markov chain and its Application Markov chain and its Application
Markov chain and its Application Tilakpoudel2
 

Similaire à PageRank and Markov Chain (20)

Pagerank
Pagerank Pagerank
Pagerank
 
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
 
A Generalization of the PageRank Algorithm : NOTES
A Generalization of the PageRank Algorithm : NOTESA Generalization of the PageRank Algorithm : NOTES
A Generalization of the PageRank Algorithm : NOTES
 
page rank explication et exemple formule
page rank explication et exemple  formulepage rank explication et exemple  formule
page rank explication et exemple formule
 
Done reread deeperinsidepagerank
Done reread deeperinsidepagerankDone reread deeperinsidepagerank
Done reread deeperinsidepagerank
 
Deeper Inside PageRank (NOTES)
Deeper Inside PageRank (NOTES)Deeper Inside PageRank (NOTES)
Deeper Inside PageRank (NOTES)
 
Random web surfer pagerank algorithm
Random web surfer pagerank algorithmRandom web surfer pagerank algorithm
Random web surfer pagerank algorithm
 
Data.Mining.C.8(Ii).Web Mining 570802461
Data.Mining.C.8(Ii).Web Mining 570802461Data.Mining.C.8(Ii).Web Mining 570802461
Data.Mining.C.8(Ii).Web Mining 570802461
 
I04015559
I04015559I04015559
I04015559
 
Page Rank Link Farm Detection
Page Rank Link Farm DetectionPage Rank Link Farm Detection
Page Rank Link Farm Detection
 
Hui xie 591r_presentation
Hui xie 591r_presentationHui xie 591r_presentation
Hui xie 591r_presentation
 
Markov chains and page rankGraphs.pdf
Markov chains and page rankGraphs.pdfMarkov chains and page rankGraphs.pdf
Markov chains and page rankGraphs.pdf
 
PageRank in Multithreading
PageRank in MultithreadingPageRank in Multithreading
PageRank in Multithreading
 
Done reread sketchinglandscapesofpagefarmsnpcomplete(3)
Done reread sketchinglandscapesofpagefarmsnpcomplete(3)Done reread sketchinglandscapesofpagefarmsnpcomplete(3)
Done reread sketchinglandscapesofpagefarmsnpcomplete(3)
 
Done reread sketchinglandscapesofpagefarmsnpcomplete
Done reread sketchinglandscapesofpagefarmsnpcompleteDone reread sketchinglandscapesofpagefarmsnpcomplete
Done reread sketchinglandscapesofpagefarmsnpcomplete
 
Done reread sketchinglandscapesofpagefarmsnpcomplete(2)
Done reread sketchinglandscapesofpagefarmsnpcomplete(2)Done reread sketchinglandscapesofpagefarmsnpcomplete(2)
Done reread sketchinglandscapesofpagefarmsnpcomplete(2)
 
Prediction Model Using Web Usage Mining Techniques
Prediction Model Using Web Usage Mining TechniquesPrediction Model Using Web Usage Mining Techniques
Prediction Model Using Web Usage Mining Techniques
 
zanardi
zanardizanardi
zanardi
 
Done reread the effect of new links on google pagerank
Done reread the effect of new links on google pagerankDone reread the effect of new links on google pagerank
Done reread the effect of new links on google pagerank
 
Markov chain and its Application
Markov chain and its Application Markov chain and its Application
Markov chain and its Application
 

Dernier

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoffsammart93
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobeapidays
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...Martijn de Jong
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)wesley chun
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonAnna Loughnan Colquhoun
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processorsdebabhi2
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsMaria Levchenko
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdfhans926745
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptxHampshireHUG
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Scriptwesley chun
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?Antenna Manufacturer Coco
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 

Dernier (20)

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt RobisonData Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 

PageRank and Markov Chain

  • 1. Markov Chains as methodology used by PageRank to rank the Web Pages on Internet. Sergio S. Guirreri - www.guirreri.host22.com Google Technology User Group (GTUG) of Palermo. 5th March 2010 Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 1 / 14
  • 2. Overview 1 Concepts on Markov-Chains. 2 The idea of the PageRank algorithm. 3 The PageRank algorithm. 4 Solving the PageRank algorithm. 5 Conclusions. 6 Bibliography. 7 Internet web sites. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 2 / 14
  • 3. Concepts on Markov-Chains. Stochastic Process and Markov-Chains. Let assume the following stochastic process {Xn; n = 0, 1, 2, . . . } with values in a set E, called the state space, while its elements are called state of the process. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
  • 4. Concepts on Markov-Chains. Stochastic Process and Markov-Chains. Let assume the following stochastic process {Xn; n = 0, 1, 2, . . . } with values in a set E, called the state space, while its elements are called state of the process. Let assume the set E is finite or countable. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
  • 5. Concepts on Markov-Chains. Stochastic Process and Markov-Chains. Let assume the following stochastic process {Xn; n = 0, 1, 2, . . . } with values in a set E, called the state space, while its elements are called state of the process. Let assume the set E is finite or countable. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
  • 6. Concepts on Markov-Chains. Stochastic Process and Markov-Chains. Let assume the following stochastic process {Xn; n = 0, 1, 2, . . . } with values in a set E, called the state space, while its elements are called state of the process. Let assume the set E is finite or countable. Definition A Markov Chain is a stochastic process Xn that hold the following feature: Prob{Xn+1 = j|Xn = i, Xn−1 = in−1, . . . , X0 = i0} = = Prob{Xn+1 = j|Xn = i} = pij(n) where E is the state space set and j, i, in−1, . . . , i0 ∈ E, n ∈ N. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
  • 7. Concepts on Markov-Chains. Stochastic Process and Markov-Chains. Let assume the following stochastic process {Xn; n = 0, 1, 2, . . . } with values in a set E, called the state space, while its elements are called state of the process. Let assume the set E is finite or countable. Definition A Markov Chain is a stochastic process Xn that hold the following feature: Prob{Xn+1 = j|Xn = i, Xn−1 = in−1, . . . , X0 = i0} = = Prob{Xn+1 = j|Xn = i} = pij(n) where E is the state space set and j, i, in−1, . . . , i0 ∈ E, n ∈ N. The transition probability matrix P of the process Xn is composed of pij, ∀i, j ∈ E. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 3 / 14
  • 8. The idea of the PageRank algorithm. PageRank’s idea. The idea behind the PageRank algorithm is similar to the idea of the impact factor index used to rank the Journals [Page et al.(1999)] [Brin and Page(1998)] [Langville et al.(2008)]. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 4 / 14
  • 9. The idea of the PageRank algorithm. PageRank’s idea. The idea behind the PageRank algorithm is similar to the idea of the impact factor index used to rank the Journals [Page et al.(1999)] [Brin and Page(1998)] [Langville et al.(2008)]. PageRank the impact factor of Internet. The impact factor of a journal is defined as the average number of citations per recently published papers in that journal. By regarding each web page as a journal, this idea was then extended to measure the importance of the web page in the PageRank Algorithm. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 4 / 14
  • 10. The idea of the PageRank algorithm. Elements of the PageRank. To illustrate the PageRank algorithm I define the following variables [Ching and Ng(2006)]: let be N the total number of web pages in the web. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
  • 11. The idea of the PageRank algorithm. Elements of the PageRank. To illustrate the PageRank algorithm I define the following variables [Ching and Ng(2006)]: let be N the total number of web pages in the web. let be k the outgoing links of web page j. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
  • 12. The idea of the PageRank algorithm. Elements of the PageRank. To illustrate the PageRank algorithm I define the following variables [Ching and Ng(2006)]: let be N the total number of web pages in the web. let be k the outgoing links of web page j. let be Q the so called hyperlink matrix with elements: Qij =    1 k if web page i is an outgoing link of web page j; 0 otherwise; Qi,i > 0 ∀i. (1) Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
  • 13. The idea of the PageRank algorithm. Elements of the PageRank. To illustrate the PageRank algorithm I define the following variables [Ching and Ng(2006)]: let be N the total number of web pages in the web. let be k the outgoing links of web page j. let be Q the so called hyperlink matrix with elements: Qij =    1 k if web page i is an outgoing link of web page j; 0 otherwise; Qi,i > 0 ∀i. (1) Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
  • 14. The idea of the PageRank algorithm. Elements of the PageRank. To illustrate the PageRank algorithm I define the following variables [Ching and Ng(2006)]: let be N the total number of web pages in the web. let be k the outgoing links of web page j. let be Q the so called hyperlink matrix with elements: Qij =    1 k if web page i is an outgoing link of web page j; 0 otherwise; Qi,i > 0 ∀i. (1) The hyperlink matrix Q can be regarded as a transition probability matrix of a Markov chain. One may regard a surfer on the net as a random walker and the web pages as the states of the Markov chain. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 5 / 14
  • 15. The PageRank algorithm. The PageRank with irreducible Markov Chain. Assuming that the Markov chain is irreduciblea and aperiodicb then the steady-state probability distribution (p1, p2, . . . , pN )T of the states (web pages) exists. aA Markov chain is irreducible if all states communicate with each other. bA chain is periodic if there exists k > 1 such that the interval between two visits to some state s is always a multiple of k. Therefore a chain is aperiodic if k=1. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
  • 16. The PageRank algorithm. The PageRank with irreducible Markov Chain. Assuming that the Markov chain is irreduciblea and aperiodicb then the steady-state probability distribution (p1, p2, . . . , pN )T of the states (web pages) exists. aA Markov chain is irreducible if all states communicate with each other. bA chain is periodic if there exists k > 1 such that the interval between two visits to some state s is always a multiple of k. Therefore a chain is aperiodic if k=1. The PageRank Each pi is the proportion of time that the surfer visiting the web page i. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
  • 17. The PageRank algorithm. The PageRank with irreducible Markov Chain. Assuming that the Markov chain is irreduciblea and aperiodicb then the steady-state probability distribution (p1, p2, . . . , pN )T of the states (web pages) exists. aA Markov chain is irreducible if all states communicate with each other. bA chain is periodic if there exists k > 1 such that the interval between two visits to some state s is always a multiple of k. Therefore a chain is aperiodic if k=1. The PageRank Each pi is the proportion of time that the surfer visiting the web page i. The higher the value of pi is, the more important web page i will be. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
  • 18. The PageRank algorithm. The PageRank with irreducible Markov Chain. Assuming that the Markov chain is irreduciblea and aperiodicb then the steady-state probability distribution (p1, p2, . . . , pN )T of the states (web pages) exists. aA Markov chain is irreducible if all states communicate with each other. bA chain is periodic if there exists k > 1 such that the interval between two visits to some state s is always a multiple of k. Therefore a chain is aperiodic if k=1. The PageRank Each pi is the proportion of time that the surfer visiting the web page i. The higher the value of pi is, the more important web page i will be. The PageRank of web page i is then defined as pi. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 6 / 14
  • 19. The PageRank algorithm. The PageRank with reducible Markov Chain Since the matrix Q can be reducible to ensure that the steady-state probability exists and is unique the following matrix P must be considered: P = α     Q11 Q12 . . . Q1N Q21 Q22 . . . Q2N . . . . . . . . . . . . QN1 QN2 . . . QNN     + (1 − α) N     1 1 . . . 1 1 1 . . . 1 . . . . . . . . . . . . 1 1 . . . 1     (2) Where 0 < α < 1 and the most popular values of α are 0.85 and (1 − 1/N). Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 7 / 14
  • 20. The PageRank algorithm. The PageRank with reducible Markov Chain Since the matrix Q can be reducible to ensure that the steady-state probability exists and is unique the following matrix P must be considered: P = α     Q11 Q12 . . . Q1N Q21 Q22 . . . Q2N . . . . . . . . . . . . QN1 QN2 . . . QNN     + (1 − α) N     1 1 . . . 1 1 1 . . . 1 . . . . . . . . . . . . 1 1 . . . 1     (2) Where 0 < α < 1 and the most popular values of α are 0.85 and (1 − 1/N). Interpretation of PageRank The idea of the PageRank (2) is that, for a network of N web pages, each web page has an inherent importance of (1 − α)/N. If a page Pi has an importance of pi, then it will contribute an importance of α pi which is shared among the web pages that it points to. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 7 / 14
  • 21. The PageRank algorithm. The PageRank with reducible Markov Chain Solving the following linear system of equations subject to the normalization constraint one can obtain the importance of web page Pi :      p1 p2 ... pN      = α     Q11 Q12 . . . Q1N Q21 Q22 . . . Q2N . . . . . . . . . . . . QN1 QN2 . . . QNN          p1 p2 ... pN      + (1 − α) N      1 1 ... 1      (3) Since N i=1 pi = 1 the (3) can be rewritten as (p1, p2, . . . , pN )T = P(p1, p2, . . . , pN )T Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 8 / 14
  • 22. Solving the PageRank algorithm. The power method. The power method is an iterative method for solving the dominant eigenvalue and its corresponding eigenvectors of a matrix. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
  • 23. Solving the PageRank algorithm. The power method. The power method is an iterative method for solving the dominant eigenvalue and its corresponding eigenvectors of a matrix. Given an n × n matrix A, the hypothesis of power method are: there is a single dominant eigenvalue. The eigenvalues can be sorted: |λ1| > |λ2| ≥ |λ3| ≥ . . . |λn| Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
  • 24. Solving the PageRank algorithm. The power method. The power method is an iterative method for solving the dominant eigenvalue and its corresponding eigenvectors of a matrix. Given an n × n matrix A, the hypothesis of power method are: there is a single dominant eigenvalue. The eigenvalues can be sorted: |λ1| > |λ2| ≥ |λ3| ≥ . . . |λn| there is a linearly independent set of n eigenvectors: {u(1) , u(2) , . . . , u(n) } Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
  • 25. Solving the PageRank algorithm. The power method. The power method is an iterative method for solving the dominant eigenvalue and its corresponding eigenvectors of a matrix. Given an n × n matrix A, the hypothesis of power method are: there is a single dominant eigenvalue. The eigenvalues can be sorted: |λ1| > |λ2| ≥ |λ3| ≥ . . . |λn| there is a linearly independent set of n eigenvectors: {u(1) , u(2) , . . . , u(n) } Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
  • 26. Solving the PageRank algorithm. The power method. The power method is an iterative method for solving the dominant eigenvalue and its corresponding eigenvectors of a matrix. Given an n × n matrix A, the hypothesis of power method are: there is a single dominant eigenvalue. The eigenvalues can be sorted: |λ1| > |λ2| ≥ |λ3| ≥ . . . |λn| there is a linearly independent set of n eigenvectors: {u(1) , u(2) , . . . , u(n) } so that Au(i) = λiu(i) , i = 1, . . . , n. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 9 / 14
  • 27. Solving the PageRank algorithm. The power method. The initial vector x0 can be wrote: x(0) = a1u(1) + a2u(2) + · · · + anu(n) Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
  • 28. Solving the PageRank algorithm. The power method. The initial vector x0 can be wrote: x(0) = a1u(1) + a2u(2) + · · · + anu(n) iterating the initial vector with the A matrix: Ak x(0) = a1Ak u(1) + a2Ak u(2) + · · · + anAk u(n) Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
  • 29. Solving the PageRank algorithm. The power method. The initial vector x0 can be wrote: x(0) = a1u(1) + a2u(2) + · · · + anu(n) iterating the initial vector with the A matrix: Ak x(0) = a1Ak u(1) + a2Ak u(2) + · · · + anAk u(n) = a1λk 1u(1) + a2λk 2u(2) + · · · + anλk nu(n) . Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
  • 30. Solving the PageRank algorithm. The power method. The initial vector x0 can be wrote: x(0) = a1u(1) + a2u(2) + · · · + anu(n) iterating the initial vector with the A matrix: Ak x(0) = a1Ak u(1) + a2Ak u(2) + · · · + anAk u(n) = a1λk 1u(1) + a2λk 2u(2) + · · · + anλk nu(n) . dividing by λk 1 Ak x(0) λk 1 = a1u(1) + a2 λ2 λ1 k u(2) + · · · + an λn λ1 k u(n) , Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
  • 31. Solving the PageRank algorithm. The power method. The initial vector x0 can be wrote: x(0) = a1u(1) + a2u(2) + · · · + anu(n) iterating the initial vector with the A matrix: Ak x(0) = a1Ak u(1) + a2Ak u(2) + · · · + anAk u(n) = a1λk 1u(1) + a2λk 2u(2) + · · · + anλk nu(n) . dividing by λk 1 Ak x(0) λk 1 = a1u(1) + a2 λ2 λ1 k u(2) + · · · + an λn λ1 k u(n) , Since |λi| |λ1| < 1 → Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
  • 32. Solving the PageRank algorithm. The power method. The initial vector x0 can be wrote: x(0) = a1u(1) + a2u(2) + · · · + anu(n) iterating the initial vector with the A matrix: Ak x(0) = a1Ak u(1) + a2Ak u(2) + · · · + anAk u(n) = a1λk 1u(1) + a2λk 2u(2) + · · · + anλk nu(n) . dividing by λk 1 Ak x(0) λk 1 = a1u(1) + a2 λ2 λ1 k u(2) + · · · + an λn λ1 k u(n) , Since |λi| |λ1| < 1 → lim k→∞ |λi|k |λ1|k = 0 → Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
  • 33. Solving the PageRank algorithm. The power method. The initial vector x0 can be wrote: x(0) = a1u(1) + a2u(2) + · · · + anu(n) iterating the initial vector with the A matrix: Ak x(0) = a1Ak u(1) + a2Ak u(2) + · · · + anAk u(n) = a1λk 1u(1) + a2λk 2u(2) + · · · + anλk nu(n) . dividing by λk 1 Ak x(0) λk 1 = a1u(1) + a2 λ2 λ1 k u(2) + · · · + an λn λ1 k u(n) , Since |λi| |λ1| < 1 → lim k→∞ |λi|k |λ1|k = 0 → Ak ≈ a1λk 1u(1) Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 10 / 14
  • 34. Conclusions. The power method and PageRank. Results. The matrix P of the PageRank algorithm is a stochastic matrix therefore the largest eigenvalue is 1. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
  • 35. Conclusions. The power method and PageRank. Results. The matrix P of the PageRank algorithm is a stochastic matrix therefore the largest eigenvalue is 1. The convergence rate of the power method depends on the ratio of λ2 λ1 . Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
  • 36. Conclusions. The power method and PageRank. Results. The matrix P of the PageRank algorithm is a stochastic matrix therefore the largest eigenvalue is 1. The convergence rate of the power method depends on the ratio of λ2 λ1 . It has been showed by [Haveliwala and Kamvar(2003)] that for the second largest eigenvalue of P, we have |λ2| ≤ α 0 ≤ α ≤ 1. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
  • 37. Conclusions. The power method and PageRank. Results. The matrix P of the PageRank algorithm is a stochastic matrix therefore the largest eigenvalue is 1. The convergence rate of the power method depends on the ratio of λ2 λ1 . It has been showed by [Haveliwala and Kamvar(2003)] that for the second largest eigenvalue of P, we have |λ2| ≤ α 0 ≤ α ≤ 1. Since λ1 = 1 the converge rate depends on α. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
  • 38. Conclusions. The power method and PageRank. Results. The matrix P of the PageRank algorithm is a stochastic matrix therefore the largest eigenvalue is 1. The convergence rate of the power method depends on the ratio of λ2 λ1 . It has been showed by [Haveliwala and Kamvar(2003)] that for the second largest eigenvalue of P, we have |λ2| ≤ α 0 ≤ α ≤ 1. Since λ1 = 1 the converge rate depends on α. The most popular value for α is 0.85. With this value it has been proved that the power method on web data set of over 80 million pages converges in about 50 iterations. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 11 / 14
  • 39. Conclusions. Really thanks to GTUG Palermo and see you to the next meeting! Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 12 / 14
  • 40. Bibliography. Bibliography. Brin, S. and Page, L. (1998). The anatomy of a large-scale hypertextual Web search engine. Computer networks and ISDN systems, 30(1-7), 107–117. Ching, W. and Ng, M. (2006). Markov Chains: Models, Algoritms and Applications. Springer Science + Business Media, Inc. Haveliwala, T. and Kamvar, M. (2003). The second eigenvalue of the google matrix. Technical report, Stanford University. Langville, A., Meyer, C., and Fern´Andez, P. (2008). Google’s PageRank and beyond: the science of search engine rankings. The Mathematical Intelligencer, 30(1), 68–69. Page, L., Brin, S., Motwani, R., and Winograd, T. (1999). The PageRank Citation Ranking: Bringing Order to the Web. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 13 / 14
  • 41. Internet web sites. Internet web sites. Jon Atle Gulla (2007) - From Google Search to Semantic Exploration. - Norwegian University of Science Technology - www.slideshare.net/sveino/semantics-and-search?type=presentation Steven Levy (2010) - Exclusive: How Google’s Algorithm Rules the Web - Wired Magazine - www.wired.com/magazine/2010/02/ff_google_algorithm/ Ann Smarty (2009) - Let’s Try to Find All 200 Parameters in Google Algorithm - Search Engine Journal - www.searchenginejournal.com/200-parameters-in-google-algorithm/15457/. Sergio S. Guirreri - www.guirreri.host22.com (Google Technology User Group (GTUG) of Palermo.)Markov Chains as methodology used by PageRank to rank the Web Pages on Inte5th March 2010 14 / 14