OpenChain AI Study Group - Europe and Asia Recap - 2024-04-11 - Full Recording
The heart of a search engine: Inverted Index
1. The heart of a search engine - Inverted Index
The inverted index is the foundation of a search engine.
When you build a search engine such as Yahoo or Google, an inverted index lies at its core.
Steps involved in building and using an inverted index:
Step 01: Build an index by crawling the web.
Step 02: Build an inverted index.
Step 03: Look up the inverted index for relevant webpages.
www.npntraining.com/courses/big-data-and-hadoop.php
2. Step 01
Build an index by crawling the web
For example: which sites does the word Selenium occur on? To build an inverted index, we first have to
crawl webpages from the web and store each page along with its contents.
www.abc.com Training provided on Selenium Big Data Hadoop
www.xyz.com Trainings provided on Apache Spark Scala J2EE
www.def.com Training provided on Java J2EE Python Selenium
This is an index of webpages and their contents
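The index of webpages and their contents can be sketched as a plain Python dictionary. This is only an illustration: the page text is hard-coded from the slide rather than crawled, and the names `forward_index` and `pages_containing` are hypothetical. It also shows why this index alone is not enough: answering "which sites mention Selenium?" means scanning the contents of every page.

```python
# Index of webpages and their contents (page text copied from the slide).
# Assumption: a real crawler would fetch this text over HTTP; here it is hard-coded.
forward_index = {
    "www.abc.com": "Training provided on Selenium Big Data Hadoop",
    "www.xyz.com": "Trainings provided on Apache Spark Scala J2EE",
    "www.def.com": "Training provided on Java J2EE Python Selenium",
}

def pages_containing(word, index):
    """Return the URLs whose text contains the word, by scanning every page."""
    return sorted(url for url, text in index.items() if word in text.split())

print(pages_containing("Selenium", forward_index))
# ['www.abc.com', 'www.def.com']
```

The scan touches every stored page for every query, which is the cost the inverted index in the next step is meant to avoid.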
3. Step 02
Build an inverted index
www.abc.com Training provided on Selenium Big Data Hadoop
www.xyz.com Trainings provided on Apache Spark Scala J2EE
www.def.com Training provided on Java J2EE Python Selenium
This is an index of webpages and their contents
Training www.abc.com, www.xyz.com, www.def.com
Big Data www.abc.com
Spark www.xyz.com
J2EE www.xyz.com, www.def.com
Build an index of words to webpages they appear in
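A minimal sketch of this inversion step follows, under a few stated assumptions. Words are split on whitespace and lowercased, so a multi-word term such as "Big Data" ends up as the two separate keys "big" and "data". The slide lists www.xyz.com under "Training" although that page says "Trainings", so the hypothetical `normalize` helper crudely strips a trailing "s" as a stand-in for real stemming. None of these names come from the slides.

```python
from collections import defaultdict

# Pages and contents as shown on the slide (hard-coded, not crawled).
pages = {
    "www.abc.com": "Training provided on Selenium Big Data Hadoop",
    "www.xyz.com": "Trainings provided on Apache Spark Scala J2EE",
    "www.def.com": "Training provided on Java J2EE Python Selenium",
}

def normalize(word):
    """Lowercase a word and strip a plural 's' (crude stand-in for a stemmer)."""
    word = word.lower()
    if len(word) > 3 and word.endswith("s"):
        word = word[:-1]
    return word

def build_inverted_index(pages):
    """Map each normalized word to the set of URLs it appears on."""
    inverted = defaultdict(set)
    for url, text in pages.items():
        for word in text.split():
            inverted[normalize(word)].add(url)
    return inverted

inverted_index = build_inverted_index(pages)
for term in ("training", "spark", "j2ee"):
    print(term, sorted(inverted_index[term]))
# training ['www.abc.com', 'www.def.com', 'www.xyz.com']
# spark ['www.xyz.com']
# j2ee ['www.def.com', 'www.xyz.com']
```

Using a set per word keeps each URL listed once even if the word occurs several times on the same page.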
4. Step 03
Given a search term, look up the inverted index for the relevant webpages
Training www.abc.com, www.xyz.com, www.def.com
Big Data www.abc.com
Spark www.xyz.com
J2EE www.xyz.com, www.def.com
This is the index of words to the webpages they appear in
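The lookup step can be sketched as a dictionary access. The index below is hard-coded from the four entries shown on the slide, with lowercased keys as an assumption. The `lookup` and `lookup_all` helpers are hypothetical, and the multi-term AND query is an extension beyond what the slide describes.

```python
# Inverted index copied from the slide (keys lowercased by assumption).
inverted_index = {
    "training": {"www.abc.com", "www.xyz.com", "www.def.com"},
    "big data": {"www.abc.com"},
    "spark": {"www.xyz.com"},
    "j2ee": {"www.xyz.com", "www.def.com"},
}

def lookup(term, index):
    """Return the sorted URLs for one search term, or an empty list if unknown."""
    return sorted(index.get(term.lower(), set()))

def lookup_all(terms, index):
    """Return the URLs that contain every term (AND query via set intersection)."""
    result = None
    for term in terms:
        urls = index.get(term.lower(), set())
        result = urls if result is None else result & urls
    return sorted(result or [])

print(lookup("J2EE", inverted_index))
# ['www.def.com', 'www.xyz.com']
print(lookup_all(["Training", "Spark"], inverted_index))
# ['www.xyz.com']
```

Each single-term lookup is one dictionary access, regardless of how many pages were crawled, which is why the inverted index sits at the core of the search engine.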