SlideShare a Scribd company logo
Presentation by:
ADMEC MULTIMEDIA INSTITUTE
www.admecindia.co.in
Indexing and Working Process
of Search Engines
The first basic truths, that’s you need to understand in SEO that search
engines are not a human.
 While this might be obvious for everybody, the differences between
how humans and search engines view web pages aren't. Search engines
are text-driven, voice driven and image driven.
 Although now a day’s technology advances rapidly grow, search engines
are far from intelligent creatures that can feel the beauty of a cool design
or enjoy the sounds and movement in movies.
 Instead, search engines crawl the web pages, looking at particular site
content (mainly text) to get an idea about a site.
Firstly, search engines crawl the website to see what is on the website. This
task is performed by software, called a crawler or a spider.
Spiders go to website and follow links from one page to another and index
all things, whatever they find on their way. More than 20 billion pages on
the web available, so it is impossible for a spider to visit all site daily just to
see if a new pages is added or any existing page is modified on the web. So
it may be possible that crawlers may not end up visiting your site for a
month or two.
Crawling-
Crawling is a process by which search
engines discover publicly available
web pages. Google uses software
name “web crawlers” for crawling.
The crawl process begins with a list of
web address from past crawls and
sitemaps provided by website owners.
What you can do is to check what a crawler sees from your site. As above
mentioned, crawlers are not humans and they do not see images, Flash
movies, JavaScript, frames, password-protected pages and directories, so if
you have added these on your site, you'd better run the Spider
Simulator below to see if these goodies are viewable by the spider. If they
are not viewable, they will not be spidered, not indexed, not processed,
etc. - in a word they will be non-existent for search engines.
Spider-
Spider is a program (set of instructions) that
automatically fetches Web pages. Spiders
are used to feed pages to search engines.
It's crawls over the Web, so it’s called
spider. Another term for these programs is
known as WebCrawler.
Example:
Name of Google Spider is “Googlebot”.
Name of Bing Spider is “Bingbot”.
Name of Alta Vista Spider is “Scooter”.
a) When page is crawled by crawler the next step is to index its all the
content.
b) The index page stored in a giant database, from where it can be
access or retrieved later as per requirement.
c) Essentially, the process of indexing is identifying the words that best
describe the page and provides the page to particular keywords
which search on the web.
d) So typical work is very difficult for a human to process such
amounts of information but generally search engines manage just
fine with this task within a few time.
e) Sometimes search engine not get the meaning of a page right but if
we help them by optimizing it, it will be easier for to search engine
to classify your pages correctly and for you – to get higher rankings
and better results.
When anybody Query anything in search engine, the search
engine processes it – i.e. it compares the search keywords or string in the
search request with the indexed pages in the stored database.
Since it is likely that more than one page (practically it is millions of pages)
contains the search string or keyword, the search engine starts calculating
the relevancy of each of the pages in its index as the keywords or string
searched and provides best result after calculating the relevancy.
1. The Web server sends the query to the index servers. The content
inside the index server is similar to the index in the back of a book-it
tells which pages contain the words that match the query.
2. The query travels to the doc servers , which actually retrieve the stored
documents. Snippets are generated to describe each search result.
3. The search result are returned to the user in a fraction of a second.
Contact Us:
ADMEC MULTIMEDIA INSTITUTE
C-7/114, IInd Floor, Sector- 7, Rohini, Delhi- 85
Landmark: Near Rohini East Metro Station
Helpline 1: +91 9811 818 122
Helpline 2: +91 9911 782 350
ADMEC MULTIMEDIA INSTITUTE
For More information you can visit :
http://www.admecindia.co.in
Or email : info@admecindia.co.in

More Related Content

Viewers also liked

Ted Dunning - Whither Hadoop
Ted Dunning - Whither HadoopTed Dunning - Whither Hadoop
Ted Dunning - Whither Hadoop
Ed Kohlwey
 
Intelligent crawling and indexing using lucene
Intelligent crawling and indexing using luceneIntelligent crawling and indexing using lucene
Intelligent crawling and indexing using lucene
Swapnil & Patil
 
Web publishing
Web publishingWeb publishing
Web publishing
Kanav Sood
 

Viewers also liked (17)

Ted Dunning - Whither Hadoop
Ted Dunning - Whither HadoopTed Dunning - Whither Hadoop
Ted Dunning - Whither Hadoop
 
Blogging With Word Press -Social Media Bootcamp
Blogging With Word Press -Social Media BootcampBlogging With Word Press -Social Media Bootcamp
Blogging With Word Press -Social Media Bootcamp
 
Search Engine Optimization - Social Media Bootcamp
Search Engine Optimization - Social Media BootcampSearch Engine Optimization - Social Media Bootcamp
Search Engine Optimization - Social Media Bootcamp
 
Web Browser
Web BrowserWeb Browser
Web Browser
 
Challenges Distributed Information Retrieval [RBY] (ICDE 2007 Turkey)
Challenges Distributed Information Retrieval [RBY] (ICDE 2007 Turkey)Challenges Distributed Information Retrieval [RBY] (ICDE 2007 Turkey)
Challenges Distributed Information Retrieval [RBY] (ICDE 2007 Turkey)
 
Web crawler
Web crawlerWeb crawler
Web crawler
 
Crawling and Indexing
Crawling and IndexingCrawling and Indexing
Crawling and Indexing
 
Web crawler
Web crawlerWeb crawler
Web crawler
 
Intelligent crawling and indexing using lucene
Intelligent crawling and indexing using luceneIntelligent crawling and indexing using lucene
Intelligent crawling and indexing using lucene
 
Recommendation for dummy
Recommendation for dummyRecommendation for dummy
Recommendation for dummy
 
MicroComputer Application 1
MicroComputer Application 1MicroComputer Application 1
MicroComputer Application 1
 
Crawling, indexing, ranking: Make the search engine crawlers and algorithms y...
Crawling, indexing, ranking: Make the search engine crawlers and algorithms y...Crawling, indexing, ranking: Make the search engine crawlers and algorithms y...
Crawling, indexing, ranking: Make the search engine crawlers and algorithms y...
 
Lucene Introduction
Lucene IntroductionLucene Introduction
Lucene Introduction
 
Web publishing
Web publishingWeb publishing
Web publishing
 
Search engine
Search engineSearch engine
Search engine
 
Introduction to Search Engines
Introduction to Search EnginesIntroduction to Search Engines
Introduction to Search Engines
 
Mis vacaciones
Mis vacacionesMis vacaciones
Mis vacaciones
 

More from Ravi Bhadauria

More from Ravi Bhadauria (20)

3 Important Terms of Post Production
3 Important Terms of Post Production3 Important Terms of Post Production
3 Important Terms of Post Production
 
Basics of Video Editing | Types of Video Editing | Video Production Process
Basics of Video Editing | Types of Video Editing | Video Production ProcessBasics of Video Editing | Types of Video Editing | Video Production Process
Basics of Video Editing | Types of Video Editing | Video Production Process
 
Basics of Media | Types of Media | Units in Media | Software in Media | Color...
Basics of Media | Types of Media | Units in Media | Software in Media | Color...Basics of Media | Types of Media | Units in Media | Software in Media | Color...
Basics of Media | Types of Media | Units in Media | Software in Media | Color...
 
History of Visual Communication | Guide to Visual Communication by ADMEC Mult...
History of Visual Communication | Guide to Visual Communication by ADMEC Mult...History of Visual Communication | Guide to Visual Communication by ADMEC Mult...
History of Visual Communication | Guide to Visual Communication by ADMEC Mult...
 
Elements and Principles of Design (Updated)
Elements and Principles of Design (Updated)Elements and Principles of Design (Updated)
Elements and Principles of Design (Updated)
 
Top Graphic Designing Hacks to Make You a Designing Pro Today
Top Graphic Designing Hacks to Make You a Designing Pro Today Top Graphic Designing Hacks to Make You a Designing Pro Today
Top Graphic Designing Hacks to Make You a Designing Pro Today
 
12 Famous Typographers to Inspire You
12 Famous Typographers to Inspire You12 Famous Typographers to Inspire You
12 Famous Typographers to Inspire You
 
Sargam UI Design
Sargam UI DesignSargam UI Design
Sargam UI Design
 
Use of Shapes in Graphic Design | Psychology of Shapes by ADMEC (Updated)
Use of Shapes in Graphic Design | Psychology of Shapes by ADMEC (Updated)Use of Shapes in Graphic Design | Psychology of Shapes by ADMEC (Updated)
Use of Shapes in Graphic Design | Psychology of Shapes by ADMEC (Updated)
 
UX Design Essential Theories
UX Design Essential TheoriesUX Design Essential Theories
UX Design Essential Theories
 
Top 10 Ad Gurus
Top 10 Ad GurusTop 10 Ad Gurus
Top 10 Ad Gurus
 
Workshop on resume, portfolio, interview
Workshop on resume, portfolio, interviewWorkshop on resume, portfolio, interview
Workshop on resume, portfolio, interview
 
Top 10 Architecture Design Colleges in India
Top 10 Architecture Design Colleges in IndiaTop 10 Architecture Design Colleges in India
Top 10 Architecture Design Colleges in India
 
User interface and user experience ui ux design basics
User interface  and user experience ui ux design basicsUser interface  and user experience ui ux design basics
User interface and user experience ui ux design basics
 
How to create Frost Neon Effect in Photoshop?
How to create Frost Neon Effect in Photoshop?How to create Frost Neon Effect in Photoshop?
How to create Frost Neon Effect in Photoshop?
 
Top 10 design colleges and institutes of india
Top 10 design colleges and institutes of indiaTop 10 design colleges and institutes of india
Top 10 design colleges and institutes of india
 
Best Hollywood poster designers
Best Hollywood poster designersBest Hollywood poster designers
Best Hollywood poster designers
 
Design Principles for All the Designers
Design Principles for All the DesignersDesign Principles for All the Designers
Design Principles for All the Designers
 
Content Writing Tips for SEO
Content Writing Tips for SEOContent Writing Tips for SEO
Content Writing Tips for SEO
 
6 Great Steps to Know to Create Successful Web GUI
6 Great Steps to Know to Create Successful Web GUI6 Great Steps to Know to Create Successful Web GUI
6 Great Steps to Know to Create Successful Web GUI
 

Recently uploaded

一比一原版UTS毕业证悉尼科技大学毕业证成绩单如何办理
一比一原版UTS毕业证悉尼科技大学毕业证成绩单如何办理一比一原版UTS毕业证悉尼科技大学毕业证成绩单如何办理
一比一原版UTS毕业证悉尼科技大学毕业证成绩单如何办理
aagad
 
Article writing on excessive use of internet.pptx
Article writing on excessive use of internet.pptxArticle writing on excessive use of internet.pptx
Article writing on excessive use of internet.pptx
abhinandnam9997
 
audience research (emma) 1.pptxkkkkkkkkkkkkkkkkk
audience research (emma) 1.pptxkkkkkkkkkkkkkkkkkaudience research (emma) 1.pptxkkkkkkkkkkkkkkkkk
audience research (emma) 1.pptxkkkkkkkkkkkkkkkkk
lolsDocherty
 
Production 2024 sunderland culture final - Copy.pptx
Production 2024 sunderland culture final - Copy.pptxProduction 2024 sunderland culture final - Copy.pptx
Production 2024 sunderland culture final - Copy.pptx
ChloeMeadows1
 

Recently uploaded (14)

Case study on merger of Vodafone and Idea (VI).pptx
Case study on merger of Vodafone and Idea (VI).pptxCase study on merger of Vodafone and Idea (VI).pptx
Case study on merger of Vodafone and Idea (VI).pptx
 
Cyber Security Services Unveiled: Strategies to Secure Your Digital Presence
Cyber Security Services Unveiled: Strategies to Secure Your Digital PresenceCyber Security Services Unveiled: Strategies to Secure Your Digital Presence
Cyber Security Services Unveiled: Strategies to Secure Your Digital Presence
 
一比一原版UTS毕业证悉尼科技大学毕业证成绩单如何办理
一比一原版UTS毕业证悉尼科技大学毕业证成绩单如何办理一比一原版UTS毕业证悉尼科技大学毕业证成绩单如何办理
一比一原版UTS毕业证悉尼科技大学毕业证成绩单如何办理
 
Statistical Analysis of DNS Latencies.pdf
Statistical Analysis of DNS Latencies.pdfStatistical Analysis of DNS Latencies.pdf
Statistical Analysis of DNS Latencies.pdf
 
Article writing on excessive use of internet.pptx
Article writing on excessive use of internet.pptxArticle writing on excessive use of internet.pptx
Article writing on excessive use of internet.pptx
 
ER(Entity Relationship) Diagram for online shopping - TAE
ER(Entity Relationship) Diagram for online shopping - TAEER(Entity Relationship) Diagram for online shopping - TAE
ER(Entity Relationship) Diagram for online shopping - TAE
 
Pvtaan Social media marketing proposal.pdf
Pvtaan Social media marketing proposal.pdfPvtaan Social media marketing proposal.pdf
Pvtaan Social media marketing proposal.pdf
 
audience research (emma) 1.pptxkkkkkkkkkkkkkkkkk
audience research (emma) 1.pptxkkkkkkkkkkkkkkkkkaudience research (emma) 1.pptxkkkkkkkkkkkkkkkkk
audience research (emma) 1.pptxkkkkkkkkkkkkkkkkk
 
Bug Bounty Blueprint : A Beginner's Guide
Bug Bounty Blueprint : A Beginner's GuideBug Bounty Blueprint : A Beginner's Guide
Bug Bounty Blueprint : A Beginner's Guide
 
How Do I Begin the Linksys Velop Setup Process?
How Do I Begin the Linksys Velop Setup Process?How Do I Begin the Linksys Velop Setup Process?
How Do I Begin the Linksys Velop Setup Process?
 
Production 2024 sunderland culture final - Copy.pptx
Production 2024 sunderland culture final - Copy.pptxProduction 2024 sunderland culture final - Copy.pptx
Production 2024 sunderland culture final - Copy.pptx
 
Premier Mobile App Development Agency in USA.pdf
Premier Mobile App Development Agency in USA.pdfPremier Mobile App Development Agency in USA.pdf
Premier Mobile App Development Agency in USA.pdf
 
Multi-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Multi-cluster Kubernetes Networking- Patterns, Projects and GuidelinesMulti-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Multi-cluster Kubernetes Networking- Patterns, Projects and Guidelines
 
The Use of AI in Indonesia Election 2024: A Case Study
The Use of AI in Indonesia Election 2024: A Case StudyThe Use of AI in Indonesia Election 2024: A Case Study
The Use of AI in Indonesia Election 2024: A Case Study
 

Indexing and working process of search engine

  • 1. Presentation by: ADMEC MULTIMEDIA INSTITUTE www.admecindia.co.in Indexing and Working Process of Search Engines
  • 2. The first basic truths, that’s you need to understand in SEO that search engines are not a human.  While this might be obvious for everybody, the differences between how humans and search engines view web pages aren't. Search engines are text-driven, voice driven and image driven.  Although now a day’s technology advances rapidly grow, search engines are far from intelligent creatures that can feel the beauty of a cool design or enjoy the sounds and movement in movies.  Instead, search engines crawl the web pages, looking at particular site content (mainly text) to get an idea about a site.
  • 3.
  • 4. Firstly, search engines crawl the website to see what is on the website. This task is performed by software, called a crawler or a spider. Spiders go to website and follow links from one page to another and index all things, whatever they find on their way. More than 20 billion pages on the web available, so it is impossible for a spider to visit all site daily just to see if a new pages is added or any existing page is modified on the web. So it may be possible that crawlers may not end up visiting your site for a month or two. Crawling- Crawling is a process by which search engines discover publicly available web pages. Google uses software name “web crawlers” for crawling. The crawl process begins with a list of web address from past crawls and sitemaps provided by website owners.
  • 5. What you can do is to check what a crawler sees from your site. As above mentioned, crawlers are not humans and they do not see images, Flash movies, JavaScript, frames, password-protected pages and directories, so if you have added these on your site, you'd better run the Spider Simulator below to see if these goodies are viewable by the spider. If they are not viewable, they will not be spidered, not indexed, not processed, etc. - in a word they will be non-existent for search engines. Spider- Spider is a program (set of instructions) that automatically fetches Web pages. Spiders are used to feed pages to search engines. It's crawls over the Web, so it’s called spider. Another term for these programs is known as WebCrawler. Example: Name of Google Spider is “Googlebot”. Name of Bing Spider is “Bingbot”. Name of Alta Vista Spider is “Scooter”.
  • 6. a) When page is crawled by crawler the next step is to index its all the content. b) The index page stored in a giant database, from where it can be access or retrieved later as per requirement. c) Essentially, the process of indexing is identifying the words that best describe the page and provides the page to particular keywords which search on the web. d) So typical work is very difficult for a human to process such amounts of information but generally search engines manage just fine with this task within a few time. e) Sometimes search engine not get the meaning of a page right but if we help them by optimizing it, it will be easier for to search engine to classify your pages correctly and for you – to get higher rankings and better results.
  • 7.
  • 8. When anybody Query anything in search engine, the search engine processes it – i.e. it compares the search keywords or string in the search request with the indexed pages in the stored database. Since it is likely that more than one page (practically it is millions of pages) contains the search string or keyword, the search engine starts calculating the relevancy of each of the pages in its index as the keywords or string searched and provides best result after calculating the relevancy. 1. The Web server sends the query to the index servers. The content inside the index server is similar to the index in the back of a book-it tells which pages contain the words that match the query. 2. The query travels to the doc servers , which actually retrieve the stored documents. Snippets are generated to describe each search result. 3. The search result are returned to the user in a fraction of a second.
  • 9. Contact Us: ADMEC MULTIMEDIA INSTITUTE C-7/114, IInd Floor, Sector- 7, Rohini, Delhi- 85 Landmark: Near Rohini East Metro Station Helpline 1: +91 9811 818 122 Helpline 2: +91 9911 782 350 ADMEC MULTIMEDIA INSTITUTE For More information you can visit : http://www.admecindia.co.in Or email : info@admecindia.co.in