Internet Intelligence




Robert Crayford


 •   Copyright © Halliwells LLP 2008 All rights reserved.
The Internet



  “Have you heard of this new thing called
  the internet? It's giving people new
  expectations. It's allowing them to
  become their own expert. Knowledge lies
  anxious at their fingertips”
Roy H. Williams
Internet Intelligence


• Open Source
• Social networking sites.
• Internet footprint
• Questions
Open Source searching


• Open source searching refers to searching any site that
  does not need a password or login to enter.
• The more common open source searches relate
  to search engines.
Deep Web Searching


• The term Deep Web refers to information found
  on websites that is hidden or generally
  inaccessible through traditional search methods.
Deep Web searching




• Searching social networking sites and newsgroups/forums
  is an example of deep web searching.
• This information would not be found by searching search
  engines.
• It is important to remember that there is a lot of data that
  can only be found through deep web searching.
• To search the deep web you need to locate online
  databases and forums and search them individually.
Search Engines


• When you search the web using a search engine, you are
  always searching a somewhat stale copy of the real web
  page. When you click on links provided in a search engine's
  search results, you retrieve from the server the current
  version of the page.
• Search engine databases are selected and built by
  computer robot programs called spiders. These "crawl" the
  web, finding pages for potential inclusion by following the
  links in the pages they already have in their database (i.e.,
  already "know about").
Search engines


• If a web page is never linked to in any other page, search engine
  spiders cannot find it. The only way a brand new page - one that
  no other page has ever linked to - can get into a search engine is
  for its URL to be sent by some human to the search engine
  companies as a request that the new page be included. All search
  engine companies offer ways to do this.
• Many web pages are excluded from most search engines by
  policy. The contents of most of the searchable databases
  mounted on the web, such as library catalogs and article
  databases, are excluded because search engine spiders cannot
  access them. All this material is referred to as the Invisible Web:
  what you don't see in search engine results.
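The crawling behaviour described on these slides can be sketched in a few lines of Python. This is only an illustration, not how any real search engine is implemented: the tiny link graph `LINKS`, the page names and the `crawl` function are all made up for the example. It shows a spider discovering pages by following links from pages it already knows about, and why a page that nothing links to stays out of the index unless its URL is submitted directly.

```python
from collections import deque

# Hypothetical miniature web: each page maps to the pages it links to.
LINKS = {
    "home": ["about", "news"],
    "about": ["home"],
    "news": ["article"],
    "article": [],
    "orphan": ["home"],  # links out, but no other page links to it
}


def crawl(seeds, links):
    """Breadth-first crawl: follow links outward from the pages already known."""
    known = set(seeds)
    queue = deque(seeds)
    while queue:
        page = queue.popleft()
        for target in links.get(page, []):
            if target not in known:
                known.add(target)
                queue.append(target)
    return known


# Starting from "home", the spider never reaches "orphan".
index = crawl(["home"], LINKS)
print(sorted(index))

# Submitting the URL by hand is modelled here as adding it to the seeds.
index_after_submission = crawl(["home", "orphan"], LINKS)
print(sorted(index_after_submission))
```

A page behind a search form or login would behave like "orphan" here: the spider has no link to follow, so the content stays invisible to it.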
Is One Search Engine Enough?


• Less than half the searchable Web is fully searchable in Google.
• The percent of total results unique to one search engine was
  established to be 88.3 percent.

• The percent of total results shared by any two search engines
  was established to be 8.9 percent.

• The percent of total results shared by three search engines was
  established to be 2.2 percent.

• The percent of total results shared by the top four search engines
  was established to be 0.6 percent.
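The four figures above add up to 100 percent, which is consistent with reading them as the share of results returned by exactly one, exactly two, exactly three and all four engines. The sketch below shows how such a breakdown could be computed under that assumption. The engine names, the result sets in `RESULTS` and the `overlap_breakdown` function are invented for illustration and are not the data or method behind the slide.

```python
from collections import Counter

# Hypothetical result sets for the same query on four engines.
RESULTS = {
    "engine_a": {"u1", "u2", "u3", "u4"},
    "engine_b": {"u3", "u5", "u6"},
    "engine_c": {"u3", "u4", "u7"},
    "engine_d": {"u3", "u8"},
}


def overlap_breakdown(results):
    """Percent of distinct results returned by exactly n engines, for each n."""
    # How many engines returned each distinct URL.
    engines_per_url = Counter(url for urls in results.values() for url in urls)
    total = len(engines_per_url)
    # How many URLs were returned by exactly 1, 2, 3, ... engines.
    urls_per_count = Counter(engines_per_url.values())
    return {
        n: 100.0 * urls_per_count.get(n, 0) / total
        for n in range(1, len(results) + 1)
    }


for n, percent in overlap_breakdown(RESULTS).items():
    print(f"returned by exactly {n} engine(s): {percent:.1f}%")
```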
Is One Search Engine Enough?


• The majority of first page results are unique:
• On average, 69.6 percent of Google first page search results
  were unique to Google.

• On average, 79.4 percent of Yahoo! first page search results
  were unique to Yahoo!

• On average, 80.1 percent of Live first page search results were
  unique to Live.

• On average, 75.0 percent of Ask first page search results were
  unique to Ask.
Social Networking Sites
The Top 9 Social Networking Sites by internet visits

Rank    Name                 Domain                      Market Share %
1       Facebook             www.facebook.com            37.7
2       Bebo                 www.bebo.com                28
3       Myspace              www.myspace.com             18.97
4       Faceparty            www.faceparty.com           2.01
5       Windows Live Space   Spaces.live.com             1.99
6       BBC h2g2             www.bbc.co.uk/dna           1.25
7       Stumble Upon         www.stumbleupon.com         1.19
8       Club Penguin         www.clubpenguin.com         1.05
9       Friends Reunited     www.friendsreunited.co.uk   0.88
Investigator footprint
IP Addresses


• All computers across the internet are assigned a
  unique identifier called an IP address. They are
  used like street addresses so other computers
  can find them. An IP address could look
  something like this: 87.242.211.23.
• Websites can log any IP addresses that look at
  their site.
• IP addresses can then be traced back to the
  server.
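As a rough illustration of the points above, the sketch below uses Python's standard `ipaddress` module to inspect the example address from the slide and to pull the visitor's address out of a web server log line. The log line is invented and assumes the common access-log layout in which the client address is the first field; real servers can be configured differently.

```python
import ipaddress

# The example address from the slide.
ip = ipaddress.ip_address("87.242.211.23")
print(ip.version)          # IPv4 or IPv6
print(ip.is_private)       # whether it falls in a private (non-routable) range
print(ip.reverse_pointer)  # the name used for a reverse DNS lookup

# Hypothetical access-log line; the first field is assumed to be the visitor's IP.
LOG_LINE = '87.242.211.23 - - [10/Oct/2008:13:55:36 +0000] "GET /index.html HTTP/1.1" 200 2326'


def visitor_ip(log_line):
    """Return the first field of a log line as a validated IP address object."""
    first_field = log_line.split()[0]
    return ipaddress.ip_address(first_field)  # raises ValueError if not an IP


print(visitor_ip(LOG_LINE))
```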
IP Address


• Having traced the IP address, the site owner could then search
  Google or Yahoo! for “Halliwells” and “Manchester” to find our address.
• IP Address finder:


• http://www.ip-adress.com/
Search Results


• Webmasters can even trace what search term
  you used to find their website.
• For example, if you searched for fraudulent
  people in Liverpool and then clicked on one of
  the search results, the owner of the site found in
  the search could see that you were searching for
  fraudulent people in Liverpool.
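A minimal sketch of how a site owner might recover the search term: when a visitor clicks a search result, the browser can send the search page's URL in the HTTP `Referer` header, and the query is often part of that URL. The referrer URL, the `search_terms` function and the list of parameter names (`q`, `p`, `query`) are assumptions for illustration; not every search engine passes the query on in this way.

```python
from urllib.parse import urlsplit, parse_qs

# Query-string parameter names assumed to carry the search term.
QUERY_PARAMS = ("q", "p", "query")


def search_terms(referrer):
    """Return the search term found in a referrer URL, or None if absent."""
    params = parse_qs(urlsplit(referrer).query)
    for name in QUERY_PARAMS:
        if name in params:
            return params[name][0]
    return None


# Hypothetical referrer logged by the visited site.
referrer = "http://www.google.com/search?q=fraudulent+people+in+Liverpool"
print(search_terms(referrer))
```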
Search Results


• To avoid this, use the URL that most search engines show
  with each result. Rather than clicking the link, copy the URL
  and paste it into a new browser window.
Cloaking


• There are many web-based proxies that claim to
  hide your IP address.
• These sites are untested, and this must be
  considered while using them.
• These websites themselves record who hid behind
  them and which sites were viewed.
• http://www.the-cloak.com/anonymous-surfing-home.html
Tracing Emails


• You can trace the IP address of the server the email was sent
  from.
• Tracing web mail would reveal the IP address of the web
  mail server (e.g. Hotmail).
• The IP address is hidden in the internet headers of the
  email.
• You can either search through the headers to find the IP
  address or paste the headers into an online
  tool that will find it for you.
• http://www.ip2location.com/emailtracer.aspx
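Searching the headers by hand can be sketched with Python's standard `email` module. The message below is invented, the addresses come from ranges reserved for documentation, and the `received_ips` function is a hypothetical helper. It assumes each relay adds its `Received` line at the top and writes the sending host's address in square brackets, so the last address in the list is the earliest hop recorded. Headers can be forged, so the result is a lead, not proof.

```python
import ipaddress
import re
from email import message_from_string

# Hypothetical raw message; real headers vary between mail servers.
RAW = """\
Received: from mail.example.org (mail.example.org [203.0.113.5])
\tby mx.example.com with ESMTP id abc123; Tue, 1 Jan 2008 10:00:00 +0000
Received: from [198.51.100.7] by mail.example.org with HTTP; Tue, 1 Jan 2008 09:59:58 +0000
From: sender@example.org
To: recipient@example.com
Subject: test

body
"""

# An IPv4-looking value inside square brackets.
IP_IN_BRACKETS = re.compile(r"\[(\d{1,3}(?:\.\d{1,3}){3})\]")


def received_ips(raw_message):
    """Return bracketed IPv4 addresses from Received headers, top to bottom."""
    msg = message_from_string(raw_message)
    ips = []
    for header in msg.get_all("Received", []):
        for candidate in IP_IN_BRACKETS.findall(header):
            try:
                ipaddress.ip_address(candidate)  # discard values like 999.1.1.1
            except ValueError:
                continue
            ips.append(candidate)
    return ips


hops = received_ips(RAW)
print(hops)
print("earliest hop recorded:", hops[-1])
```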
BBC news 6/12/1998
Halliwells Website 27.11.2004
Any Questions?


• Robert Crayford
• Robert.crayford@halliwells.com
• 0161 618 4312
