SlideShare une entreprise Scribd logo
1  sur  12
Working Of “Search
Engine”
Nikhil
D-1
14BTCSERS033
Maths Assignment
What is Search Engine ?
“A web search engine is a software system that
is designed to search for information on the
World Wide Web.”
Purpose of Search Engines
Helping people find what they’re looking for:
• Starts with an “information need”
• Convert to a query
• Gets results
Types of Search Engines
• Search by Keywords
(e.g.AltaVista,Google)
• Search by categories
(e.g. Yahoo)
The Parts of a Search Engine
Spider (or “crawler”)
Index
Search software (an algorithm)
The “spider” or “crawler”
The spider visits a web page, reads it, and
then follows links to other pages within the
site. This is what it means when someone
refers to a site being "spidered" or
"crawled". This is also known as
“harvesting”. The spider returns to the site
on a regular basis, such as every month or
two, to look for changes.
The Indexer
Everything the spider finds goes
into the second part of a search
engine, the index. The index,
sometimes called the catalog, is like
a giant book containing a copy of
every web page that the spider
finds. If a web page changes, then
this book is updated new
information.
Search engine software
It is the third part of a search
engine. This is the program that
sifts through the millions of pages
recorded in the index to find
matches to a search and rank them
in order of what it believes is most
relevant.
Variations of the tf–idf weighting
scheme are often used by search
engines as a central tool in scoring and
ranking a document's relevance given a
user query.
Term Frequency–Inverse Document
Frequency, is a numerical statistic that is
intended to reflect how important a
word is to a document in a collection.
TF-IDF Ranking Algorithm
wij = weight of Term Tj in Document Di
tfij = frequency of Term Tj in Document Dj
N = number of Documents in collection
n = number of Documents where term Tj occurs at least once
• The equation:
PR(A) = (1-d) + d(PR(t1)/C(t1) + … + PR(tn)/C(tn))
• Used by WebQuery and Google
• Google simulates users using the search engine to
rank documents.
• Google uses citation graph (518 million links)
• Google computes 26 million in a few hours.
PageRank
PageRank works by counting
the number and quality of
links to a page to determine a
rough estimate of how
important the website is. The
underlying assumption is that
more important websites are
likely to receive more links
from other websites
The End
Thank you for listening patiently.

Contenu connexe

Tendances (20)

search engines
search enginessearch engines
search engines
 
Google Search Presentation
Google Search PresentationGoogle Search Presentation
Google Search Presentation
 
Internet Research Presentation
Internet Research PresentationInternet Research Presentation
Internet Research Presentation
 
How search engine work ppt
How search engine work pptHow search engine work ppt
How search engine work ppt
 
Search engine
Search engineSearch engine
Search engine
 
Search engine
Search engineSearch engine
Search engine
 
Search Engine Powerpoint
Search Engine PowerpointSearch Engine Powerpoint
Search Engine Powerpoint
 
Search Engine
Search EngineSearch Engine
Search Engine
 
Meta search engine
Meta search engineMeta search engine
Meta search engine
 
Google Search Engine
Google Search EngineGoogle Search Engine
Google Search Engine
 
basic Seo ppt
basic Seo pptbasic Seo ppt
basic Seo ppt
 
Effective Internet Searching
Effective Internet SearchingEffective Internet Searching
Effective Internet Searching
 
Search engine
Search engineSearch engine
Search engine
 
Creating WebPages using HTML
Creating WebPages using HTMLCreating WebPages using HTML
Creating WebPages using HTML
 
Search Engine
Search EngineSearch Engine
Search Engine
 
On page seo
On page seoOn page seo
On page seo
 
Meta tags
Meta tagsMeta tags
Meta tags
 
Introduction to SEO Presentation
Introduction to SEO PresentationIntroduction to SEO Presentation
Introduction to SEO Presentation
 
White hat seo vs black hat seo
White hat seo vs black hat seoWhite hat seo vs black hat seo
White hat seo vs black hat seo
 
Introduction to Search Engines
Introduction to Search EnginesIntroduction to Search Engines
Introduction to Search Engines
 

En vedette

Working Of Search Engine
Working Of Search EngineWorking Of Search Engine
Working Of Search EngineNIKHIL NAIR
 
How Google Search Algorithm Works ??
How Google Search Algorithm Works ??How Google Search Algorithm Works ??
How Google Search Algorithm Works ??viralshahb
 
Search Engines Presentation
Search Engines PresentationSearch Engines Presentation
Search Engines PresentationJSCHO9
 
How search engines work
How search engines workHow search engines work
How search engines workChinna Botla
 
Search Engine Strategies
Search  Engine  StrategiesSearch  Engine  Strategies
Search Engine Strategiesjsotir
 
Working of a Web Crawler
Working of a Web CrawlerWorking of a Web Crawler
Working of a Web CrawlerSanchit Saini
 
How Do Search Engines Work
How Do Search Engines WorkHow Do Search Engines Work
How Do Search Engines WorkPromozSEO
 
Social Media Slides Hubspot Pdf
Social Media Slides Hubspot PdfSocial Media Slides Hubspot Pdf
Social Media Slides Hubspot PdfSilviaConti
 
SearchLove London 2015 | Talia Wolf | Emotional Targeting
SearchLove London 2015 | Talia Wolf | Emotional TargetingSearchLove London 2015 | Talia Wolf | Emotional Targeting
SearchLove London 2015 | Talia Wolf | Emotional TargetingDistilled
 
Search love 2015 - Google's Predictable Content Preference - Aaron Friedman
Search love 2015 - Google's Predictable Content Preference - Aaron FriedmanSearch love 2015 - Google's Predictable Content Preference - Aaron Friedman
Search love 2015 - Google's Predictable Content Preference - Aaron FriedmanAaron Friedman
 

En vedette (19)

Working Of Search Engine
Working Of Search EngineWorking Of Search Engine
Working Of Search Engine
 
Search engines
Search enginesSearch engines
Search engines
 
How Google Search Algorithm Works ??
How Google Search Algorithm Works ??How Google Search Algorithm Works ??
How Google Search Algorithm Works ??
 
Search Engines Presentation
Search Engines PresentationSearch Engines Presentation
Search Engines Presentation
 
How Google Works
How Google WorksHow Google Works
How Google Works
 
Information organization
Information organization Information organization
Information organization
 
How search engines work
How search engines workHow search engines work
How search engines work
 
Search Engine Strategies
Search  Engine  StrategiesSearch  Engine  Strategies
Search Engine Strategies
 
Smart crawler a two stage crawler
Smart crawler a two stage crawlerSmart crawler a two stage crawler
Smart crawler a two stage crawler
 
Working of a Web Crawler
Working of a Web CrawlerWorking of a Web Crawler
Working of a Web Crawler
 
Smart Crawler
Smart CrawlerSmart Crawler
Smart Crawler
 
Types of Search Engines
Types of Search EnginesTypes of Search Engines
Types of Search Engines
 
How Do Search Engines Work
How Do Search Engines WorkHow Do Search Engines Work
How Do Search Engines Work
 
Web crawler
Web crawlerWeb crawler
Web crawler
 
Gaap ppt
Gaap pptGaap ppt
Gaap ppt
 
Social Media Slides Hubspot Pdf
Social Media Slides Hubspot PdfSocial Media Slides Hubspot Pdf
Social Media Slides Hubspot Pdf
 
How do search engines work? A visual model.
How do search engines work?  A visual model. How do search engines work?  A visual model.
How do search engines work? A visual model.
 
SearchLove London 2015 | Talia Wolf | Emotional Targeting
SearchLove London 2015 | Talia Wolf | Emotional TargetingSearchLove London 2015 | Talia Wolf | Emotional Targeting
SearchLove London 2015 | Talia Wolf | Emotional Targeting
 
Search love 2015 - Google's Predictable Content Preference - Aaron Friedman
Search love 2015 - Google's Predictable Content Preference - Aaron FriedmanSearch love 2015 - Google's Predictable Content Preference - Aaron Friedman
Search love 2015 - Google's Predictable Content Preference - Aaron Friedman
 

Similaire à Working of search engine

How a search engine works slide
How a search engine works slideHow a search engine works slide
How a search engine works slideSovan Misra
 
Seminar report(rohitsahu cs 17 vth sem)
Seminar report(rohitsahu cs 17 vth sem)Seminar report(rohitsahu cs 17 vth sem)
Seminar report(rohitsahu cs 17 vth sem)ROHIT SAHU
 
Try It The Google Way .
Try It The Google Way .Try It The Google Way .
Try It The Google Way .abhinavbom
 
How a search engine works report
How a search engine works reportHow a search engine works report
How a search engine works reportSovan Misra
 
Search engines by Gulshan K Maheshwari(QAU)
Search engines by Gulshan  K Maheshwari(QAU)Search engines by Gulshan  K Maheshwari(QAU)
Search engines by Gulshan K Maheshwari(QAU)GulshanKumar368
 
Google indexing
Google indexingGoogle indexing
Google indexingtahoor71
 
Web technology: Web search
Web technology: Web searchWeb technology: Web search
Web technology: Web searchVictor de Boer
 
Comparisons of ranking algorithms
Comparisons of ranking algorithmsComparisons of ranking algorithms
Comparisons of ranking algorithmsPravin Patil
 
Search engine
Search engineSearch engine
Search engineswaraj27
 
Components of a search engine
Components of a search engineComponents of a search engine
Components of a search enginePrimya Tamil
 
CS6007 information retrieval - 5 units notes
CS6007   information retrieval - 5 units notesCS6007   information retrieval - 5 units notes
CS6007 information retrieval - 5 units notesAnandh Arumugakan
 
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...Lucidworks
 
Reflected Intelligence: Lucene/Solr as a self-learning data system
Reflected Intelligence: Lucene/Solr as a self-learning data systemReflected Intelligence: Lucene/Solr as a self-learning data system
Reflected Intelligence: Lucene/Solr as a self-learning data systemTrey Grainger
 
Search Engine working, Crawlers working, Search Engine mechanism
Search Engine working, Crawlers working, Search Engine mechanismSearch Engine working, Crawlers working, Search Engine mechanism
Search Engine working, Crawlers working, Search Engine mechanismUmang MIshra
 
Chapter 1: Introduction to Information Storage and Retrieval
Chapter 1: Introduction to Information Storage and RetrievalChapter 1: Introduction to Information Storage and Retrieval
Chapter 1: Introduction to Information Storage and Retrievalcaptainmactavish1996
 

Similaire à Working of search engine (20)

how google works
how google workshow google works
how google works
 
How a search engine works slide
How a search engine works slideHow a search engine works slide
How a search engine works slide
 
Seminar report(rohitsahu cs 17 vth sem)
Seminar report(rohitsahu cs 17 vth sem)Seminar report(rohitsahu cs 17 vth sem)
Seminar report(rohitsahu cs 17 vth sem)
 
Try It The Google Way .
Try It The Google Way .Try It The Google Way .
Try It The Google Way .
 
How a search engine works report
How a search engine works reportHow a search engine works report
How a search engine works report
 
Search engines by Gulshan K Maheshwari(QAU)
Search engines by Gulshan  K Maheshwari(QAU)Search engines by Gulshan  K Maheshwari(QAU)
Search engines by Gulshan K Maheshwari(QAU)
 
How Google Works
How Google WorksHow Google Works
How Google Works
 
Google indexing
Google indexingGoogle indexing
Google indexing
 
Web technology: Web search
Web technology: Web searchWeb technology: Web search
Web technology: Web search
 
How web searching engines work
How web searching engines workHow web searching engines work
How web searching engines work
 
Comparisons of ranking algorithms
Comparisons of ranking algorithmsComparisons of ranking algorithms
Comparisons of ranking algorithms
 
Web Search Engine
Web Search EngineWeb Search Engine
Web Search Engine
 
Search engine
Search engineSearch engine
Search engine
 
Components of a search engine
Components of a search engineComponents of a search engine
Components of a search engine
 
CS6007 information retrieval - 5 units notes
CS6007   information retrieval - 5 units notesCS6007   information retrieval - 5 units notes
CS6007 information retrieval - 5 units notes
 
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
Reflected Intelligence - Lucene/Solr as a self-learning data system: Presente...
 
Reflected Intelligence: Lucene/Solr as a self-learning data system
Reflected Intelligence: Lucene/Solr as a self-learning data systemReflected Intelligence: Lucene/Solr as a self-learning data system
Reflected Intelligence: Lucene/Solr as a self-learning data system
 
Search Engine working, Crawlers working, Search Engine mechanism
Search Engine working, Crawlers working, Search Engine mechanismSearch Engine working, Crawlers working, Search Engine mechanism
Search Engine working, Crawlers working, Search Engine mechanism
 
Chapter 1: Introduction to Information Storage and Retrieval
Chapter 1: Introduction to Information Storage and RetrievalChapter 1: Introduction to Information Storage and Retrieval
Chapter 1: Introduction to Information Storage and Retrieval
 
intro.ppt
intro.pptintro.ppt
intro.ppt
 

Plus de Nikhil Deswal

Linear Algebra's Applications
Linear Algebra's ApplicationsLinear Algebra's Applications
Linear Algebra's ApplicationsNikhil Deswal
 
Complex Number's Applications
Complex Number's ApplicationsComplex Number's Applications
Complex Number's ApplicationsNikhil Deswal
 
Cardinality and participation constraints
Cardinality and participation constraintsCardinality and participation constraints
Cardinality and participation constraintsNikhil Deswal
 

Plus de Nikhil Deswal (6)

Blood donation
Blood donationBlood donation
Blood donation
 
Microbiology
Microbiology Microbiology
Microbiology
 
Linear Algebra's Applications
Linear Algebra's ApplicationsLinear Algebra's Applications
Linear Algebra's Applications
 
Complex Number's Applications
Complex Number's ApplicationsComplex Number's Applications
Complex Number's Applications
 
Fun Science
Fun ScienceFun Science
Fun Science
 
Cardinality and participation constraints
Cardinality and participation constraintsCardinality and participation constraints
Cardinality and participation constraints
 

Dernier

Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...roncy bisnoi
 
22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf203318pmpc
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756dollysharma2066
 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityMorshed Ahmed Rahath
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfKamal Acharya
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startQuintin Balsdon
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfJiananWang21
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptNANDHAKUMARA10
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdfKamal Acharya
 
notes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptnotes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptMsecMca
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdfKamal Acharya
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Bookingdharasingh5698
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VDineshKumar4165
 
Unit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfUnit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfRagavanV2
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...tanu pandey
 
Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086anil_gaur
 

Dernier (20)

Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
Call Girls Pimpri Chinchwad Call Me 7737669865 Budget Friendly No Advance Boo...
 
22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf22-prompt engineering noted slide shown.pdf
22-prompt engineering noted slide shown.pdf
 
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
FULL ENJOY Call Girls In Mahipalpur Delhi Contact Us 8377877756
 
A Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna MunicipalityA Study of Urban Area Plan for Pabna Municipality
A Study of Urban Area Plan for Pabna Municipality
 
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdfONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
ONLINE FOOD ORDER SYSTEM PROJECT REPORT.pdf
 
Call Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort ServiceCall Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
Call Girls in Netaji Nagar, Delhi 💯 Call Us 🔝9953056974 🔝 Escort Service
 
Design For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the startDesign For Accessibility: Getting it right from the start
Design For Accessibility: Getting it right from the start
 
data_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdfdata_management_and _data_science_cheat_sheet.pdf
data_management_and _data_science_cheat_sheet.pdf
 
Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024Water Industry Process Automation & Control Monthly - April 2024
Water Industry Process Automation & Control Monthly - April 2024
 
Block diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.pptBlock diagram reduction techniques in control systems.ppt
Block diagram reduction techniques in control systems.ppt
 
Hostel management system project report..pdf
Hostel management system project report..pdfHostel management system project report..pdf
Hostel management system project report..pdf
 
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
(INDIRA) Call Girl Meerut Call Now 8617697112 Meerut Escorts 24x7
 
notes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.pptnotes on Evolution Of Analytic Scalability.ppt
notes on Evolution Of Analytic Scalability.ppt
 
University management System project report..pdf
University management System project report..pdfUniversity management System project report..pdf
University management System project report..pdf
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 BookingVIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
VIP Call Girls Ankleshwar 7001035870 Whatsapp Number, 24/07 Booking
 
Thermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - VThermal Engineering-R & A / C - unit - V
Thermal Engineering-R & A / C - unit - V
 
Unit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdfUnit 2- Effective stress & Permeability.pdf
Unit 2- Effective stress & Permeability.pdf
 
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...Bhosari ( Call Girls ) Pune  6297143586  Hot Model With Sexy Bhabi Ready For ...
Bhosari ( Call Girls ) Pune 6297143586 Hot Model With Sexy Bhabi Ready For ...
 
Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086Minimum and Maximum Modes of microprocessor 8086
Minimum and Maximum Modes of microprocessor 8086
 

Working of search engine

  • 2. What is Search Engine ? “A web search engine is a software system that is designed to search for information on the World Wide Web.”
  • 3. Purpose of Search Engines Helping people find what they’re looking for: • Starts with an “information need” • Convert to a query • Gets results
  • 4. Types of Search Engines • Search by Keywords (e.g.AltaVista,Google) • Search by categories (e.g. Yahoo)
  • 5. The Parts of a Search Engine Spider (or “crawler”) Index Search software (an algorithm)
  • 6. The “spider” or “crawler” The spider visits a web page, reads it, and then follows links to other pages within the site. This is what it means when someone refers to a site being "spidered" or "crawled". This is also known as “harvesting”. The spider returns to the site on a regular basis, such as every month or two, to look for changes.
  • 7. The Indexer Everything the spider finds goes into the second part of a search engine, the index. The index, sometimes called the catalog, is like a giant book containing a copy of every web page that the spider finds. If a web page changes, then this book is updated new information.
  • 8. Search engine software It is the third part of a search engine. This is the program that sifts through the millions of pages recorded in the index to find matches to a search and rank them in order of what it believes is most relevant.
  • 9. Variations of the tf–idf weighting scheme are often used by search engines as a central tool in scoring and ranking a document's relevance given a user query. Term Frequency–Inverse Document Frequency, is a numerical statistic that is intended to reflect how important a word is to a document in a collection. TF-IDF Ranking Algorithm wij = weight of Term Tj in Document Di tfij = frequency of Term Tj in Document Dj N = number of Documents in collection n = number of Documents where term Tj occurs at least once
  • 10. • The equation: PR(A) = (1-d) + d(PR(t1)/C(t1) + … + PR(tn)/C(tn)) • Used by WebQuery and Google • Google simulates users using the search engine to rank documents. • Google uses citation graph (518 million links) • Google computes 26 million in a few hours. PageRank
  • 11. PageRank works by counting the number and quality of links to a page to determine a rough estimate of how important the website is. The underlying assumption is that more important websites are likely to receive more links from other websites
  • 12. The End Thank you for listening patiently.