Dr. Mike Lowndes,
Interactive Media Manager,
Natural History Museum, London
– Houses 350 permanent scientific staff, plus postgraduate
students; one of the largest UK research institutes in the
natural sciences.
IWMW 2005: Whose web is it anyway?
Lies, Damn lies and Web Statistics
Contents
• Why bother?
• Issues with web logs
• Issues with analytic tools
• Browser tracking
• Comparison between approaches
• Known issues with browser tracking
• Nedstat input and findings from Newcastle
University
Why bother?
• Web log analysis is currently the main method used to
quantify web site usage for reporting.
• Results are used by the government as performance
indicators for institutional websites.
• Not accurate or meaningful most of the time
– no good for absolute measurement of usage.
Can be used for:
• Trend analysis
• Content preferences
• ROI estimation
• Checking and fixing your site
• Understanding users' behaviour
• Testing assumed pathways
Issues with server logs
• Dynamic IP
– Many users using the same IP number over time.
– Same user assigned many IP numbers over time.
• Proxies
– Several or many users behind 1 IP number
• Caches (can be ‘in’ Proxies)
– Commonly requested files cached closer to the users.
– Can form the top 20-50 hosts accessing sites.
• Robots and spiders
– Few visits but lots of hits.
– Analytic packages cannot keep up to date with all of them for exclusion.
• Syndication
– RSS feeds generate huge logs, but are not ‘read’ by humans initially.
– Click-through configuration.
• Reporting by analysis tools
– Often weekly or monthly reports: realtime is very labour/server intensive
– Reports often complex and techy.
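A minimal sketch of two of the issues above, assuming logs in Combined Log Format. The sample lines, the `ROBOT_MARKERS` list and the function names are hypothetical and for illustration only. It counts hits per host with and without a crude robot filter, and shows how two different browsers behind one IP number collapse into a single host.

```python
import re
from collections import Counter

# Combined Log Format:
# host ident user [time] "request" status bytes "referrer" "user-agent"
LOG_PATTERN = re.compile(
    r'^(?P<host>\S+) \S+ \S+ \[(?P<time>[^\]]+)\] '
    r'"(?P<request>[^"]*)" (?P<status>\d{3}) \S+ '
    r'"(?P<referrer>[^"]*)" "(?P<agent>[^"]*)"$'
)

# Hypothetical and deliberately incomplete: no such list stays up to date.
ROBOT_MARKERS = ("bot", "crawler", "spider", "slurp")


def is_robot(agent):
    """Guess whether a user-agent string belongs to a robot."""
    lowered = agent.lower()
    return any(marker in lowered for marker in ROBOT_MARKERS)


def hits_per_host(lines, exclude_robots=True):
    """Count hits per requesting host, optionally dropping robot traffic."""
    counts = Counter()
    for line in lines:
        match = LOG_PATTERN.match(line.strip())
        if match is None:
            continue  # skip lines that are not in Combined Log Format
        if exclude_robots and is_robot(match.group("agent")):
            continue
        counts[match.group("host")] += 1
    return counts


# Invented sample data: two browsers share 192.0.2.10 (as behind a proxy),
# and one robot makes two hits from 198.51.100.7.
SAMPLE_LOG = [
    '192.0.2.10 - - [01/Jun/2005:10:00:00 +0100] "GET /visiting/ HTTP/1.1" 200 5120 "-" "Mozilla/4.0 (compatible; MSIE 6.0)"',
    '192.0.2.10 - - [01/Jun/2005:10:00:05 +0100] "GET /visiting/map.html HTTP/1.1" 200 2048 "-" "Mozilla/5.0 (Macintosh) Safari/412"',
    '198.51.100.7 - - [01/Jun/2005:10:01:00 +0100] "GET /robots.txt HTTP/1.0" 200 64 "-" "ExampleBot/1.0 (+http://example.com/bot)"',
    '198.51.100.7 - - [01/Jun/2005:10:01:01 +0100] "GET /visiting/ HTTP/1.0" 200 5120 "-" "ExampleBot/1.0 (+http://example.com/bot)"',
]

print(hits_per_host(SAMPLE_LOG))
print(hits_per_host(SAMPLE_LOG, exclude_robots=False))
```

With the filter on, only 192.0.2.10 remains, and it looks like one visitor although the user-agent strings suggest two. A robot that does not match any marker would pass straight through.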
Issues with log analysis tools
• Webtrends vs Summary.net
• 1. Natural History Museum
– Summary SP (summary.net) Version 4.2.1, unregistered demo, default configuration
• 2. UKOLN (Bath)
– WebTrends (www.webtrends.com) Version 5, default configuration
• Both tools were applied to the same log file
• Default configurations – not removing robots
– Note: WebTrends documentation not clear on this point
Measurement discrepancies
Measurement                Summary SP   Webtrends 7
Connections (hits)         -            +0.67% hits
Page views (page hits)     -            +5.00%
Visits (user sessions)     -            +0.07%
Failed hits                -            +0.30%
Average visit duration     -            -30.0% (+250%)
Browsers
  IE                       75%          86%
  Netscape compatible      2%           4%
Referrers
  Top Level Domains        US           US
                           UK           UK
                           AUS          CAN
                           NETHER       NETHER
                           CAN          AUS
                           JAP          JAP
Comparison between tools
• Not a single measurement was identical.
• Most measurements were within 5%
• Visit duration measurement widely different, and
can depend on configuration. Possible bug in
WebTrends version 5.
• Page view measurements were quite different.
Results broadly similar but direct comparisons,
especially of Page Views, are not really justified.
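One way visit counts and visit duration can depend on configuration is the session timeout. The sketch below is an illustration under stated assumptions, not how either tool is known to work: the function names, the timeout values and the request times are hypothetical. It groups one visitor's requests into visits and shows that changing only the timeout changes both the number of visits and the average duration.

```python
from datetime import datetime, timedelta


def sessionise(timestamps, timeout_minutes=30):
    """Group one visitor's request times into visits.

    A new visit starts whenever the gap since the previous request
    exceeds the timeout. Returns a list of (start, end) pairs.
    """
    timeout = timedelta(minutes=timeout_minutes)
    visits = []
    for stamp in sorted(timestamps):
        if visits and stamp - visits[-1][1] <= timeout:
            visits[-1] = (visits[-1][0], stamp)  # extend the current visit
        else:
            visits.append((stamp, stamp))  # start a new visit
    return visits


def average_duration(visits):
    """Mean of (last request - first request) over all visits."""
    total = sum((end - start for start, end in visits), timedelta())
    return total / len(visits)


# Hypothetical request times for a single visitor.
REQUESTS = [
    datetime(2005, 6, 1, 10, 0),
    datetime(2005, 6, 1, 10, 4),
    datetime(2005, 6, 1, 10, 24),
    datetime(2005, 6, 1, 11, 30),
]

for minutes in (10, 30):
    visits = sessionise(REQUESTS, timeout_minutes=minutes)
    print(minutes, len(visits), average_duration(visits))
```

With a 10 minute timeout the same four requests become three visits averaging 1 minute 20 seconds; with 30 minutes they become two visits averaging 12 minutes. Time spent on the last page of a visit is not captured at all in this scheme.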
Browser tracking
• Does it have fewer inaccuracies and distortions?
• Is it easier on the web team?
• Is it affordable?
• Does it give us more information / better
information?
Browser tracking
• Requires code to be added to pages
• Uses an image, sourced from the tracking website.
Also uses javascript and cookies for gathering
extended and repeat-visit information
• Usually hosted services
• Provide near real-time tracking
• Few of the issues distorting logs affect these
measurements (according to the blurb)
• Main players: Nedstat, Nielsen/Netratings,
WebSideStory
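The image-based mechanism described above can be sketched as follows. This is a simplified illustration, not any vendor's actual interface: the tracker URL, the parameter names (`page`, `ref`, `vid`) and both functions are hypothetical. The page code asks the browser to fetch an image from the tracking site, with the page details packed into the query string; the tracking site reads them back out of the request.

```python
from urllib.parse import parse_qs, urlencode, urlsplit

# Hypothetical tracking host; a real service supplies its own URL and parameters.
TRACKER_URL = "https://tracker.example.org/pixel.gif"


def build_beacon_url(page, referrer, visitor_id):
    """Build the image URL that the tagged page asks the browser to fetch."""
    query = urlencode({"page": page, "ref": referrer, "vid": visitor_id})
    return f"{TRACKER_URL}?{query}"


def parse_beacon(url):
    """Recover the tracked fields on the tracking server side."""
    params = parse_qs(urlsplit(url).query, keep_blank_values=True)
    return {key: values[0] for key, values in params.items()}


url = build_beacon_url(
    page="/visiting/index.html",
    referrer="https://www.google.com/search?q=nhm",
    visitor_id="abc123",  # would normally come from a cookie
)
print(url)
print(parse_beacon(url))
```

Because the request is built when the page is displayed, it would normally be issued by the browser on each view, which may be why such services are less affected by caches and by robots that do not load images or run scripts.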
Comparison between tools
• Summary SP vs Nielsen/Netratings
• Run on one section of a site over a month.
• ‘Visiting’ section of the Natural History Museum site
– small but popular and easily tagged.
Results 1 – visits and visitors
Visits / User sessions                27,663    40,402   -32%    35,395
Visits per day (ave)                     922     1,347            1,180
Visits per visitor per month (ave)       1.1       1.7              1.5
Unique visitors (browsers)            25,127    23,585           23,084
Pages per visit (ave)                   3.31         3              2.1
Visit duration (ave)                   02:09     07:13            04:08
Page impressions                      91,506   117,447           71,895
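The derived figures in this table can be checked with a few lines of arithmetic. The sketch below makes two assumptions that the slide itself does not state: that the first column is the browser-tracking figure and the second the log-analysis figure (the order used in Results 2), and that the month had 30 days. The variable names are illustrative only.

```python
def percent_difference(value, baseline):
    """Difference of value from baseline, as a percentage of baseline."""
    return (value - baseline) / baseline * 100


# First two columns of the table; column meaning is an assumption (see above).
visits_a, visits_b = 27_663, 40_402
visitors_a, visitors_b = 25_127, 23_585
impressions_a = 91_506
DAYS_IN_MONTH = 30  # assumption: the slide does not say which month

print(round(percent_difference(visits_a, visits_b)))          # -32
print(round(visits_a / DAYS_IN_MONTH), round(visits_b / DAYS_IN_MONTH))  # 922 1347
print(round(visits_a / visitors_a, 1), round(visits_b / visitors_b, 1))  # 1.1 1.7
print(round(impressions_a / visits_a, 2))                      # 3.31
```

Under those assumptions the -32% is the first column's visits relative to the second, and the per-day, per-visitor and pages-per-visit rows follow from the totals.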
Results 2 – pages viewed
Top 10 pages                        Browser track   Log analysis
index.html, Visiting home                  31,117         28,591
where are we? page                         17,897         26,566
planning your visit page                    6,835         16,773
events calendar page                        9,221          9,369
howtogethere - local map page               4,700          5,005
access guide introduction page              1,978          4,653
travel details page                         3,550          3,668
facilities page                             2,767          3,497
activities page                             3,293          3,375
multilingual info.                            828          1,901
Top ten totals                             82,186        103,398
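A short check of the table above: the sketch recomputes the top-ten totals and the ratio of log-analysis views to browser-tracking views for each page. The figures are taken from the slide; the dictionary and variable names are illustrative only.

```python
# (browser tracking, log analysis) page views, as given in the table.
PAGE_VIEWS = {
    "index.html, Visiting home": (31_117, 28_591),
    "where are we? page": (17_897, 26_566),
    "planning your visit page": (6_835, 16_773),
    "events calendar page": (9_221, 9_369),
    "howtogethere - local map page": (4_700, 5_005),
    "access guide introduction page": (1_978, 4_653),
    "travel details page": (3_550, 3_668),
    "facilities page": (2_767, 3_497),
    "activities page": (3_293, 3_375),
    "multilingual info.": (828, 1_901),
}

browser_total = sum(browser for browser, _ in PAGE_VIEWS.values())
log_total = sum(log for _, log in PAGE_VIEWS.values())

# Ratio above 1: log analysis reports more views than browser tracking.
ratios = {page: log / browser for page, (browser, log) in PAGE_VIEWS.items()}
log_lower = [page for page, ratio in ratios.items() if ratio < 1]

print(browser_total, log_total)
for page, ratio in sorted(ratios.items(), key=lambda item: item[1]):
    print(f"{ratio:5.2f}  {page}")
print(log_lower)
```

The totals match the slide, and the section home page is the only entry where log analysis reports fewer views than browser tracking.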
Results 3 – country
Countries     Browser tr.      GeoIP (Sum.)
              uk 75%           uk 62%
              us 5%            us 8%
              spain            spain
              italy            netherlands
              netherlands      germany
              france           italy
              germany          france
              belgium          canada
              poland           poland
• Depends on the quality of the geographical IP database, not
the mode of tracking?
Conclusions regarding traditional Log
analysis
Assuming browser tracking is more accurate…
• We have fewer visit sessions than we thought, but
more visitors
– Fewer visits (sessions), possibly due to robot exclusion
– More visitors (unique users), possibly due to the masking
effect of proxies/caches and browser caches
• Visit duration is much shorter than thought
– possibly due to robots/spiders and cache updating.
• Country information is roughly accurate so long as a
geographical lookup is used.
• Activity of popular pages, which are often cached,
will be underestimated
Browser tracking advantages
• Almost real-time analysis, incremental data.
• Better repeat user tracking and individual pathway
analysis.
• Configurable, graphical reports for non-techies
– Techie still needs to configure those reports however, as
an understanding of web analytics is required
• Cut our monthly staff time down from 1.5 days to 1
hour
• Appears to be more accurate in describing the
activity of real people, but we would like to see
some independent research.
Issues with browser tracking
• Setup is not trivial: You need to add code to every page.
– Multiple server / ownership issues.
• Does not always work (or get full user details) if Javascript is turned
off or cookies disallowed.
• Does not work with text-only browsers.
• Unknown compatibility with PDAs, mobiles etc.
Questions:
• Would we get different results with different hosted services?
– ABCE: industry standards for measurement
• Cookies often deleted unless user is confident in the source?
– This would affect the measurement of repeat visitors and behaviour
Political issues:
• Issues with external hosting of institutional data
• Security of personal data issues with external hosting
Next steps
• Many private sector and public sector sites have
already moved to browser tracking.
• About 6 National Museums are currently discussing
hosted browser tracking.
• 5 Universities currently involved in a trial of
NedStat.
Thank you

IWMW 2005: Lies, Damn Lies, and Web Statistics (1)

  • 1. Dr. Mike Lowndes, Interactive Media Manager, Natural History Museum, London – Houses 350-permanent scientific staff, plus postgraduate students; one of the largest UK research institutes in the natural sciences. (Right-click or click-hold (Mac) and press k or select Speaker Notes) IWMW 2005: Who’s web is it anyway? Lies, Damn lies and Web Statistics
  • 2. Contents • Why bother? • Issues with web logs • Issues with analytic tools • Browser tracking • Comparison between approaches • Known issues with browser tracking • Nedstat input and findings from Newcastle University
  • 3. Why bother? • Web log analysis is currently the main method used to quantify web site usage for reporting. • Results are used by the government as performance indicators for institutional websites. • Not accurate or meaningful most of the time – no good for absolute measurement of usage. Can be used for: • Trend analysis • Content preferences • ROI estimation • Checking and fixing your site • Understanding users behaviour • Testing assumed pathways
  • 4. Issues with server logs • Dynamic IP – Many users using the same IP number over time. – Same user assigned many IP numbers over time. • Proxies – Several or many users behind 1 IP number • Caches (can be ‘in’ Proxies) – Commonly requested files cached closer to the users. – Can form the top 20-50 hosts accessing sites. • Robots and spiders – Few visits but lots of hits. – Analytic packages cannot keep up to date with all of them for exclusion. • Syndication – RSS feeds generate huge logs, but are not ‘read’ by humans initially. – Click-through configuration. • Reporting by analysis tools – Often weekly or monthly reports: realtime is very labour/server intensive – Reports often complex and techy.
  • 5. Issues with log analysis tools • Webtrends vs Summary.net • 1. Natural History Museum – Summary SP (summary.net) Version 4.2.1, unregistered demo, default configuration • 2. UKOLN (Bath) – WebTrends (www.webtrends.com) Version 5, default configuration • Both tools were applied to the same log file • Default configurations – not removing robots – Note: WebTrends documentation not clear on this point
  • 6. Measurement discrepancies Summary SP Webtrends 7 Connections (hits) - +0.67% hits Page views (page hits) - +5.00% Visits (user sessions) - +0.07% Failed hits - +0.30% Average visit duration - -30.0% (+250%) Browsers IE 75% 86% Netscape compatible 2% 4% Referrers Top Level Domains US US UK UK AUS CAN NETHER NETHER CAN AUS JAP JAP
  • 7. Comparison between tools • Not a single measurement was identical. • Most measurements were within 5% • Visit duration measurement widely different, and can depend on configuration. Possible bug in WebTrends version 5. • Page view measurements were quite different. Results broadly similar but direct comparisons, especially of Page Views, are not really justified.
  • 8. Browser tracking • Do they have fewer inaccuracies and distortions? • Is it easier on the web team? • Is it affordable? • Does it give us more information / better information?
  • 9. Browser tracking • Requires code to be added to pages • Uses an image, sourced from the tracking website. Also uses JavaScript and cookies for gathering extended and repeat-visit information • Usually hosted services • Provide near real-time tracking • Few of the issues distorting logs affect these measurements (according to the blurb) • Main players: Nedstat, Nielsen//NetRatings, WebSideStory
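
The image-plus-cookie mechanism described above can be sketched roughly as follows. This is not how any of the named vendors implement their services; the `visitor_id` cookie name, the `handle_pixel_request` function and the one-year cookie lifetime are assumptions for illustration. Each tagged page would reference the tracking image, and the tracking server would answer each image request along these lines.

```python
import base64
import uuid
from http.cookies import SimpleCookie

# A 1x1 transparent GIF, a common payload for a tracking image.
PIXEL_GIF = base64.b64decode(
    "R0lGODlhAQABAIAAAAAAAP///yH5BAEAAAAALAAAAAABAAEAAAIBRAA7"
)


def handle_pixel_request(cookie_header, page):
    """Return (headers, body, record) for one request to the tracking image.

    cookie_header is the raw Cookie header sent by the browser (or None);
    page is the tagged page that embedded the image.
    """
    cookie = SimpleCookie(cookie_header or "")
    returning = "visitor_id" in cookie
    # Reuse the existing ID for repeat visitors, otherwise issue a new one.
    visitor_id = cookie["visitor_id"].value if returning else uuid.uuid4().hex
    headers = [
        ("Content-Type", "image/gif"),
        # Ask browsers and proxies not to cache, so every view reaches the tracker.
        ("Cache-Control", "no-store"),
        ("Set-Cookie", f"visitor_id={visitor_id}; Path=/; Max-Age=31536000"),
    ]
    record = {"visitor_id": visitor_id, "page": page, "returning": returning}
    return headers, PIXEL_GIF, record


if __name__ == "__main__":
    _, _, first = handle_pixel_request(None, "/visiting/")
    _, _, second = handle_pixel_request(
        "visitor_id=" + first["visitor_id"], "/visiting/map.html"
    )
    print(first)
    print(second)
```

Because the image is requested on every page view and marked as non-cacheable, proxy and browser caches hide fewer views than they do in server logs. The sketch also shows why deleted or blocked cookies matter: without the cookie, a repeat visitor looks new.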
  • 10. Comparison between tools • Summary SP vs Nielsen//NetRatings • Run on one section of a site over a month. • ‘Visiting’ section of the Natural History Museum site – small but popular and easily tagged.
  • 11. Results 1 – visits and visitors
      Visits / User sessions                27,663    40,402    -32%    35,395
      Visits per day (ave)                  922       1,347             1,180
      Visits per visitor per month (ave)    1.1       1.7               1.5
      Unique visitors (browsers)            25,127    23,585            23,084
      Pages per visit (ave)                 3.31      3                 2.1
      Visit duration (ave)                  02:09     07:13             04:08
      Page impressions                      91,506    117,447           71,895
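
The averages in these results appear to be derived from the raw counts. As a rough check, the sketch below recomputes them from the first value in each row, assuming a 30-day month (the slides say only "over a month"). The function names are hypothetical.

```python
def derived_metrics(visits, visitors, page_impressions, days=30):
    """Recompute the per-day and per-visit averages from raw counts."""
    return {
        "visits_per_day": round(visits / days),
        "visits_per_visitor": round(visits / visitors, 1),
        "pages_per_visit": round(page_impressions / visits, 2),
    }


def percent_difference(value, baseline):
    """Difference of value from baseline, as a whole-number percentage."""
    return round(100 * (value - baseline) / baseline)


if __name__ == "__main__":
    # First value in each row of the results.
    print(derived_metrics(27663, 25127, 91506))
    # First visit count compared with the second.
    print(percent_difference(27663, 40402))
```

Under that assumption the first set of counts gives 922 visits per day, 1.1 visits per visitor and 3.31 pages per visit, and 27,663 is 32% below 40,402, matching the figures shown.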
  • 12. Results 2 – pages viewed
      Top 10 pages                        Browser track    Log analysis
      index.html, Visiting home           31,117           28,591
      where are we? page                  17,897           26,566
      planning your visit page            6,835            16,773
      events calendar page                9,221            9,369
      howtogethere - local map page       4,700            5,005
      access guide introduction page      1,978            4,653
      travel details page                 3,550            3,668
      facilities page                     2,767            3,497
      activities page                     3,293            3,375
      multilingual info.                  828              1,901
      top ten totals                      82,186           103,398
  • 13. Results 3 – country
      Countries
      Browser tr.       GeoIP (Sum.)
      UK 75%            UK 62%
      US 5%             US 8%
      Spain             Spain
      Italy             Netherlands
      Netherlands       Germany
      France            Italy
      Germany           France
      Belgium           Canada
      Poland            Poland
    • Depends on the quality of the geographical IP database, not the mode of tracking?
  • 14. Conclusions regarding traditional Log analysis Assuming browser tracking is more accurate… • We have fewer visit sessions than we thought, but more visitors – Fewer visits (sessions), possibly due to robot exclusion – More visitors (unique users), possibly due to the masking effect of proxies/caches and browser caches • Visit duration is much shorter than thought – possibly due to robots/spiders and cache updating. • Country information is roughly accurate so long as a geographical lookup is used. • Activity of popular pages, which are often cached, will be underestimated
  • 15. Browser tracking advantages • Almost real-time analysis, incremental data. • Better repeat user tracking and individual pathway analysis. • Configurable, graphical reports for non-techies – Techie still needs to configure those reports however, as an understanding of web analytics is required • Cut our monthly staff time down from 1.5 days to 1 hour • Appear to be more accurate in describing the activity of real people, but we would like to see some independent research.
  • 16. Issues with browser tracking • Setup is not trivial: you need to add code to every page. – Multiple server / ownership issues. • Does not always work (or get full user details) if JavaScript is turned off or cookies are disallowed. • Does not work with text-only browsers. • Unknown compatibility with PDAs, mobiles etc. Questions: • Would we get different results with different hosted services? – ABCE: industry standards for measurement • Cookies often deleted unless user is confident in the source? – This would affect the measurement of repeat visitors and behaviour Political issues: • Issues with external hosting of institutional data • Security of personal data issues with external hosting
  • 17. Next steps • Many private sector and public sector sites have already moved to browser tracking. • About 6 National Museums are currently discussing hosted browser tracking. • 5 Universities currently involved in a trial of Nedstat.

Editor's notes

  1. Museum community, but large academic website, over 1 million research records online (not including DNA).