SlideShare une entreprise Scribd logo
1  sur  14
Web analytics is dead!
Long live event analytics
How data scientists and big data tech are killing one
industry and creating another
What role Snowplow plays
Web analytics is a big industry
• Spend in the just the US on web analytics software (Adobe
Sitecatalyst, Webtrends, Google Analytics Premium etc.) estimated at $500m
and growing 17 – 20% p.a. in 2011*
• Likely that at that amount is spent again on consulting services related to the
use of web analytics data
• Whole industry of web consultants e.g.:
• Semphonic in the USA (bought by Ernst and Young)
• Logan Tod in the UK (bought by PwC)
• Big 4 accounting firms only buy businesses they can sell into (tens of) thousands of
companies

• Whole ecosystem around web analytics
• “Digital analytics professionals” – it is a career path (retailers, media agencies)
• Events, books, organisations geared towards web analysts

*Source: Quora
http://www.quora.com/Web-Analytics-what-is-the-size-of-the-web-analytics-market
Web analytics is an old industry, predating the recent wave in
big data technology
Web analytics
1990

Web is born

1993

Big data

Log file based web analytics

1996
1997

Javascript tagging

…
2004

publishes MapReduce paper

2006

Hadoop project split out of Nutch

2008

Facebook develops Hive

2010

publishes Dremel paper

2011

open sources Storm
Two problems with web analytics, that stem from the fact web
analytics came of age in the 1990s
The web was static, hyperlinked documents

Tech to handle massive data sets was
prohibitively expensive

• The entities and events that web analytics
programmes understand is limited

• Web analytics programmes aggregate raw data
to reduce data volumes

• Page views, link clicks, transactions, goals,
sessions, visitors

• This requires specifying in advance how data
can be analysed, so that the data can be ‘precut’

Hard to model the rich interactions in
today’s interactive webapps

Web analytics reporting is very inflexible
In particular, web analytics insistence on aggregating data is an
anathema to data scientists
Data scientist approach
Give me the data and I’ll figure out
how to answer the question

Web analytics approach
You can’t get your answer from one of
our pre-canned reports? Have a go
with our “advanced report-builder”

What if I want to: build a model? Understand underlying causality? Use the data in
my web application? Dynamically optimize spend / content?
We built Snowplow to address the two weaknesses in the web
analytics approach
Describe web events in much richer
grammar and vocabulary

Liberate your data
• Where you store your data has a big
impact on what types of analyses you can
quickly run on it
Snowplow is an event data collection and warehousing platform
Snowplow data pipeline
Website / webapp

Amazon
S3

Mobile apps

Other applications
(e.g. on games
consoles, connected
TVs, desktops, connected
devices)

Collect

Transform
and
enrich

Amazon
Redshift /
PostgreSQL

Other
(Neo4J, Big
Query…)
Snowplow delivers your
complete, granular event data in
your own data warehouse(s), so
you can plugin any tool to analyse
it
Snowplow is composed of a set of loosely coupled
subsystems, architected to be robust and scalable

1. Trackers

A

Generate event
data

Examples:
• Javascript
tracker
• Ruby / Lua /
No-JS /
Arduino
tracker

2. Collectors

B

Receive data
from trackers
and put it in a
queue
Examples:
• Cloudfront
collector
• Clojure
collector for
Amazon EB

3. Enrich

C

Clean and
enrich raw data

Built on
Scalding /
Cascading /
Hadoop and
powered by
Amazon EMR

4. Storage

D

5. Analytics

Store data
ready for
analysis
Examples:
• Amazon
Redshift
• PostgreSQL
• Amazon S3

A

D

Standardised data protocols
Snowplow is open source and cloud-based
• Open source but easy to deploy via integration with Amazon Web Services
(cloud infrastructure)

• Our technology is free!
• Collecting massive quantities of digital event data should be easy and cheap…
• … so that we can focus time and effort on using the data productively

• We charge for Professional Services on top of our platform
• More value in how you use the data, than in collecting / storing it
• Lots of scope to build applications on top of our platform going forwards
Our users…
…use our tech to solve some of their most intractable problems
• What is the impact of different ad campaigns and creative on the way users
behave, subsequently? What is the return on that ad spend?

• How do visitors use social channels (Facebook / Twitter) to interact around video
content? How can we predict which content will “go viral”?

• How do updates to our product change the “stickiness” of our service? ARPU?
Does that vary by customer segment?
We believe that event data is one of the most exciting data
sources to work with, today
We are only at the beginning of figuring out how to use this
data…
• How do we represent different types of event sequence?
• What makes journeys similar and what makes them different? How can we
cluster them?
• How can we “spot” those events that are predictive of future events? Of
consumer value? Of consumer interest?
• How can we unpick the effects of marketing / digital products and user’s
predisposition to the way sequences of events unfold?
• How best should we model different users at different points on different types
of journeys?
We hope people like you will use our tech to do amazing things
with the data!

Questions?

More information
• Snowplow repo: https://github.com/snowplow/snowplow
• Twitter: @SnowPlowData
• Website: http://snowplowanaltyics.com
• My LinkedIn:
• My Twitter:

Contenu connexe

Plus de yalisassoon

Snowplow: evolve your analytics stack with your business
Snowplow: evolve your analytics stack with your businessSnowplow: evolve your analytics stack with your business
Snowplow: evolve your analytics stack with your businessyalisassoon
 
Snowplow at Sigfig
Snowplow at SigfigSnowplow at Sigfig
Snowplow at Sigfigyalisassoon
 
2016 09 measurecamp - event data modeling
2016 09 measurecamp - event data modeling2016 09 measurecamp - event data modeling
2016 09 measurecamp - event data modelingyalisassoon
 
Snowplow: putting digital analysts at the heart of digital analytics - the fo...
Snowplow: putting digital analysts at the heart of digital analytics - the fo...Snowplow: putting digital analysts at the heart of digital analytics - the fo...
Snowplow: putting digital analysts at the heart of digital analytics - the fo...yalisassoon
 
Snowplow the evolving data pipeline
Snowplow   the evolving data pipelineSnowplow   the evolving data pipeline
Snowplow the evolving data pipelineyalisassoon
 
Capturing online customer data to create better insights and targeted actions...
Capturing online customer data to create better insights and targeted actions...Capturing online customer data to create better insights and targeted actions...
Capturing online customer data to create better insights and targeted actions...yalisassoon
 
Yali presentation for snowplow amsterdam meetup number 2
Yali presentation for snowplow amsterdam meetup number 2Yali presentation for snowplow amsterdam meetup number 2
Yali presentation for snowplow amsterdam meetup number 2yalisassoon
 
Snowplow at DA Hub emerging technology showcase
Snowplow at DA Hub emerging technology showcaseSnowplow at DA Hub emerging technology showcase
Snowplow at DA Hub emerging technology showcaseyalisassoon
 
Using Snowplow for A/B testing and user journey analysis at CustomMade
Using Snowplow for A/B testing and user journey analysis at CustomMadeUsing Snowplow for A/B testing and user journey analysis at CustomMade
Using Snowplow for A/B testing and user journey analysis at CustomMadeyalisassoon
 
Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016
Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016
Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016yalisassoon
 
Modeling event data
Modeling event dataModeling event data
Modeling event datayalisassoon
 
The analytics journey at Viewbix - how they came to use Snowplow and the setu...
The analytics journey at Viewbix - how they came to use Snowplow and the setu...The analytics journey at Viewbix - how they came to use Snowplow and the setu...
The analytics journey at Viewbix - how they came to use Snowplow and the setu...yalisassoon
 
Snowplow Analytics and Looker at Oyster.com
Snowplow Analytics and Looker at Oyster.comSnowplow Analytics and Looker at Oyster.com
Snowplow Analytics and Looker at Oyster.comyalisassoon
 
Snowplow: where we came from and where we are going - March 2016
Snowplow: where we came from and where we are going - March 2016Snowplow: where we came from and where we are going - March 2016
Snowplow: where we came from and where we are going - March 2016yalisassoon
 
Snowplow is at the core of everything we do
Snowplow is at the core of everything we doSnowplow is at the core of everything we do
Snowplow is at the core of everything we doyalisassoon
 
Implementing improved and consistent arbitrary event tracking company-wide us...
Implementing improved and consistent arbitrary event tracking company-wide us...Implementing improved and consistent arbitrary event tracking company-wide us...
Implementing improved and consistent arbitrary event tracking company-wide us...yalisassoon
 
Chefsfeed presentation to Snowplow Meetup San Francisco, Oct 2015
Chefsfeed presentation to Snowplow Meetup San Francisco, Oct 2015Chefsfeed presentation to Snowplow Meetup San Francisco, Oct 2015
Chefsfeed presentation to Snowplow Meetup San Francisco, Oct 2015yalisassoon
 
Understanding event data
Understanding event dataUnderstanding event data
Understanding event datayalisassoon
 
Big data meetup budapest adding data schemas to snowplow
Big data meetup budapest   adding data schemas to snowplowBig data meetup budapest   adding data schemas to snowplow
Big data meetup budapest adding data schemas to snowplowyalisassoon
 
Customer lifetime value
Customer lifetime valueCustomer lifetime value
Customer lifetime valueyalisassoon
 

Plus de yalisassoon (20)

Snowplow: evolve your analytics stack with your business
Snowplow: evolve your analytics stack with your businessSnowplow: evolve your analytics stack with your business
Snowplow: evolve your analytics stack with your business
 
Snowplow at Sigfig
Snowplow at SigfigSnowplow at Sigfig
Snowplow at Sigfig
 
2016 09 measurecamp - event data modeling
2016 09 measurecamp - event data modeling2016 09 measurecamp - event data modeling
2016 09 measurecamp - event data modeling
 
Snowplow: putting digital analysts at the heart of digital analytics - the fo...
Snowplow: putting digital analysts at the heart of digital analytics - the fo...Snowplow: putting digital analysts at the heart of digital analytics - the fo...
Snowplow: putting digital analysts at the heart of digital analytics - the fo...
 
Snowplow the evolving data pipeline
Snowplow   the evolving data pipelineSnowplow   the evolving data pipeline
Snowplow the evolving data pipeline
 
Capturing online customer data to create better insights and targeted actions...
Capturing online customer data to create better insights and targeted actions...Capturing online customer data to create better insights and targeted actions...
Capturing online customer data to create better insights and targeted actions...
 
Yali presentation for snowplow amsterdam meetup number 2
Yali presentation for snowplow amsterdam meetup number 2Yali presentation for snowplow amsterdam meetup number 2
Yali presentation for snowplow amsterdam meetup number 2
 
Snowplow at DA Hub emerging technology showcase
Snowplow at DA Hub emerging technology showcaseSnowplow at DA Hub emerging technology showcase
Snowplow at DA Hub emerging technology showcase
 
Using Snowplow for A/B testing and user journey analysis at CustomMade
Using Snowplow for A/B testing and user journey analysis at CustomMadeUsing Snowplow for A/B testing and user journey analysis at CustomMade
Using Snowplow for A/B testing and user journey analysis at CustomMade
 
Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016
Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016
Analytics at Carbonite: presentation to Snowplow Meetup Boston April 2016
 
Modeling event data
Modeling event dataModeling event data
Modeling event data
 
The analytics journey at Viewbix - how they came to use Snowplow and the setu...
The analytics journey at Viewbix - how they came to use Snowplow and the setu...The analytics journey at Viewbix - how they came to use Snowplow and the setu...
The analytics journey at Viewbix - how they came to use Snowplow and the setu...
 
Snowplow Analytics and Looker at Oyster.com
Snowplow Analytics and Looker at Oyster.comSnowplow Analytics and Looker at Oyster.com
Snowplow Analytics and Looker at Oyster.com
 
Snowplow: where we came from and where we are going - March 2016
Snowplow: where we came from and where we are going - March 2016Snowplow: where we came from and where we are going - March 2016
Snowplow: where we came from and where we are going - March 2016
 
Snowplow is at the core of everything we do
Snowplow is at the core of everything we doSnowplow is at the core of everything we do
Snowplow is at the core of everything we do
 
Implementing improved and consistent arbitrary event tracking company-wide us...
Implementing improved and consistent arbitrary event tracking company-wide us...Implementing improved and consistent arbitrary event tracking company-wide us...
Implementing improved and consistent arbitrary event tracking company-wide us...
 
Chefsfeed presentation to Snowplow Meetup San Francisco, Oct 2015
Chefsfeed presentation to Snowplow Meetup San Francisco, Oct 2015Chefsfeed presentation to Snowplow Meetup San Francisco, Oct 2015
Chefsfeed presentation to Snowplow Meetup San Francisco, Oct 2015
 
Understanding event data
Understanding event dataUnderstanding event data
Understanding event data
 
Big data meetup budapest adding data schemas to snowplow
Big data meetup budapest   adding data schemas to snowplowBig data meetup budapest   adding data schemas to snowplow
Big data meetup budapest adding data schemas to snowplow
 
Customer lifetime value
Customer lifetime valueCustomer lifetime value
Customer lifetime value
 

Dernier

Sales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessSales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessAggregage
 
Catalogue ONG NƯỚC uPVC - HDPE DE NHAT.pdf
Catalogue ONG NƯỚC uPVC - HDPE DE NHAT.pdfCatalogue ONG NƯỚC uPVC - HDPE DE NHAT.pdf
Catalogue ONG NƯỚC uPVC - HDPE DE NHAT.pdfOrient Homes
 
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...anilsa9823
 
RE Capital's Visionary Leadership under Newman Leech
RE Capital's Visionary Leadership under Newman LeechRE Capital's Visionary Leadership under Newman Leech
RE Capital's Visionary Leadership under Newman LeechNewman George Leech
 
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRLMONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRLSeo
 
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Lviv Startup Club
 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst SummitHolger Mueller
 
Vip Dewas Call Girls #9907093804 Contact Number Escorts Service Dewas
Vip Dewas Call Girls #9907093804 Contact Number Escorts Service DewasVip Dewas Call Girls #9907093804 Contact Number Escorts Service Dewas
Vip Dewas Call Girls #9907093804 Contact Number Escorts Service Dewasmakika9823
 
VIP Call Girls In Saharaganj ( Lucknow ) 🔝 8923113531 🔝 Cash Payment (COD) 👒
VIP Call Girls In Saharaganj ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment (COD) 👒VIP Call Girls In Saharaganj ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment (COD) 👒
VIP Call Girls In Saharaganj ( Lucknow ) 🔝 8923113531 🔝 Cash Payment (COD) 👒anilsa9823
 
Insurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usageInsurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usageMatteo Carbone
 
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...lizamodels9
 
Call Girls in Gomti Nagar - 7388211116 - With room Service
Call Girls in Gomti Nagar - 7388211116  - With room ServiceCall Girls in Gomti Nagar - 7388211116  - With room Service
Call Girls in Gomti Nagar - 7388211116 - With room Servicediscovermytutordmt
 
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Dipal Arora
 
Monte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMMonte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMRavindra Nath Shukla
 
Socio-economic-Impact-of-business-consumers-suppliers-and.pptx
Socio-economic-Impact-of-business-consumers-suppliers-and.pptxSocio-economic-Impact-of-business-consumers-suppliers-and.pptx
Socio-economic-Impact-of-business-consumers-suppliers-and.pptxtrishalcan8
 
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdfRenandantas16
 
Cash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call GirlsCash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call GirlsApsara Of India
 

Dernier (20)

Sales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for SuccessSales & Marketing Alignment: How to Synergize for Success
Sales & Marketing Alignment: How to Synergize for Success
 
Catalogue ONG NƯỚC uPVC - HDPE DE NHAT.pdf
Catalogue ONG NƯỚC uPVC - HDPE DE NHAT.pdfCatalogue ONG NƯỚC uPVC - HDPE DE NHAT.pdf
Catalogue ONG NƯỚC uPVC - HDPE DE NHAT.pdf
 
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
Lucknow 💋 Escorts in Lucknow - 450+ Call Girl Cash Payment 8923113531 Neha Th...
 
RE Capital's Visionary Leadership under Newman Leech
RE Capital's Visionary Leadership under Newman LeechRE Capital's Visionary Leadership under Newman Leech
RE Capital's Visionary Leadership under Newman Leech
 
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRLMONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
MONA 98765-12871 CALL GIRLS IN LUDHIANA LUDHIANA CALL GIRL
 
Best Practices for Implementing an External Recruiting Partnership
Best Practices for Implementing an External Recruiting PartnershipBest Practices for Implementing an External Recruiting Partnership
Best Practices for Implementing an External Recruiting Partnership
 
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Pune Just Call 9907093804 Top Class Call Girl Service Available
 
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
Yaroslav Rozhankivskyy: Три складові і три передумови максимальної продуктивн...
 
Forklift Operations: Safety through Cartoons
Forklift Operations: Safety through CartoonsForklift Operations: Safety through Cartoons
Forklift Operations: Safety through Cartoons
 
Progress Report - Oracle Database Analyst Summit
Progress  Report - Oracle Database Analyst SummitProgress  Report - Oracle Database Analyst Summit
Progress Report - Oracle Database Analyst Summit
 
Vip Dewas Call Girls #9907093804 Contact Number Escorts Service Dewas
Vip Dewas Call Girls #9907093804 Contact Number Escorts Service DewasVip Dewas Call Girls #9907093804 Contact Number Escorts Service Dewas
Vip Dewas Call Girls #9907093804 Contact Number Escorts Service Dewas
 
VIP Call Girls In Saharaganj ( Lucknow ) 🔝 8923113531 🔝 Cash Payment (COD) 👒
VIP Call Girls In Saharaganj ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment (COD) 👒VIP Call Girls In Saharaganj ( Lucknow  ) 🔝 8923113531 🔝  Cash Payment (COD) 👒
VIP Call Girls In Saharaganj ( Lucknow ) 🔝 8923113531 🔝 Cash Payment (COD) 👒
 
Insurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usageInsurers' journeys to build a mastery in the IoT usage
Insurers' journeys to build a mastery in the IoT usage
 
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...
Call Girls In DLf Gurgaon ➥99902@11544 ( Best price)100% Genuine Escort In 24...
 
Call Girls in Gomti Nagar - 7388211116 - With room Service
Call Girls in Gomti Nagar - 7388211116  - With room ServiceCall Girls in Gomti Nagar - 7388211116  - With room Service
Call Girls in Gomti Nagar - 7388211116 - With room Service
 
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
Call Girls Navi Mumbai Just Call 9907093804 Top Class Call Girl Service Avail...
 
Monte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSMMonte Carlo simulation : Simulation using MCSM
Monte Carlo simulation : Simulation using MCSM
 
Socio-economic-Impact-of-business-consumers-suppliers-and.pptx
Socio-economic-Impact-of-business-consumers-suppliers-and.pptxSocio-economic-Impact-of-business-consumers-suppliers-and.pptx
Socio-economic-Impact-of-business-consumers-suppliers-and.pptx
 
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf0183760ssssssssssssssssssssssssssss00101011 (27).pdf
0183760ssssssssssssssssssssssssssss00101011 (27).pdf
 
Cash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call GirlsCash Payment 9602870969 Escort Service in Udaipur Call Girls
Cash Payment 9602870969 Escort Service in Udaipur Call Girls
 

How snowplow and data scientists are transforming the web analytics industry (and creating a new event analytics industry)

  • 1. Web analytics is dead! Long live event analytics How data scientists and big data tech are killing one industry and creating another What role Snowplow plays
  • 2. Web analytics is a big industry • Spend in the just the US on web analytics software (Adobe Sitecatalyst, Webtrends, Google Analytics Premium etc.) estimated at $500m and growing 17 – 20% p.a. in 2011* • Likely that at that amount is spent again on consulting services related to the use of web analytics data • Whole industry of web consultants e.g.: • Semphonic in the USA (bought by Ernst and Young) • Logan Tod in the UK (bought by PwC) • Big 4 accounting firms only buy businesses they can sell into (tens of) thousands of companies • Whole ecosystem around web analytics • “Digital analytics professionals” – it is a career path (retailers, media agencies) • Events, books, organisations geared towards web analysts *Source: Quora http://www.quora.com/Web-Analytics-what-is-the-size-of-the-web-analytics-market
  • 3. Web analytics is an old industry, predating the recent wave in big data technology Web analytics 1990 Web is born 1993 Big data Log file based web analytics 1996 1997 Javascript tagging … 2004 publishes MapReduce paper 2006 Hadoop project split out of Nutch 2008 Facebook develops Hive 2010 publishes Dremel paper 2011 open sources Storm
  • 4. Two problems with web analytics, that stem from the fact web analytics came of age in the 1990s The web was static, hyperlinked documents Tech to handle massive data sets was prohibitively expensive • The entities and events that web analytics programmes understand is limited • Web analytics programmes aggregate raw data to reduce data volumes • Page views, link clicks, transactions, goals, sessions, visitors • This requires specifying in advance how data can be analysed, so that the data can be ‘precut’ Hard to model the rich interactions in today’s interactive webapps Web analytics reporting is very inflexible
  • 5. In particular, web analytics insistence on aggregating data is an anathema to data scientists Data scientist approach Give me the data and I’ll figure out how to answer the question Web analytics approach You can’t get your answer from one of our pre-canned reports? Have a go with our “advanced report-builder” What if I want to: build a model? Understand underlying causality? Use the data in my web application? Dynamically optimize spend / content?
  • 6. We built Snowplow to address the two weaknesses in the web analytics approach Describe web events in much richer grammar and vocabulary Liberate your data • Where you store your data has a big impact on what types of analyses you can quickly run on it
  • 7. Snowplow is an event data collection and warehousing platform Snowplow data pipeline Website / webapp Amazon S3 Mobile apps Other applications (e.g. on games consoles, connected TVs, desktops, connected devices) Collect Transform and enrich Amazon Redshift / PostgreSQL Other (Neo4J, Big Query…) Snowplow delivers your complete, granular event data in your own data warehouse(s), so you can plugin any tool to analyse it
  • 8. Snowplow is composed of a set of loosely coupled subsystems, architected to be robust and scalable 1. Trackers A Generate event data Examples: • Javascript tracker • Ruby / Lua / No-JS / Arduino tracker 2. Collectors B Receive data from trackers and put it in a queue Examples: • Cloudfront collector • Clojure collector for Amazon EB 3. Enrich C Clean and enrich raw data Built on Scalding / Cascading / Hadoop and powered by Amazon EMR 4. Storage D 5. Analytics Store data ready for analysis Examples: • Amazon Redshift • PostgreSQL • Amazon S3 A D Standardised data protocols
  • 9. Snowplow is open source and cloud-based • Open source but easy to deploy via integration with Amazon Web Services (cloud infrastructure) • Our technology is free! • Collecting massive quantities of digital event data should be easy and cheap… • … so that we can focus time and effort on using the data productively • We charge for Professional Services on top of our platform • More value in how you use the data, than in collecting / storing it • Lots of scope to build applications on top of our platform going forwards
  • 11. …use our tech to solve some of their most intractable problems • What is the impact of different ad campaigns and creative on the way users behave, subsequently? What is the return on that ad spend? • How do visitors use social channels (Facebook / Twitter) to interact around video content? How can we predict which content will “go viral”? • How do updates to our product change the “stickiness” of our service? ARPU? Does that vary by customer segment?
  • 12. We believe that event data is one of the most exciting data sources to work with, today
  • 13. We are only at the beginning of figuring out how to use this data… • How do we represent different types of event sequence? • What makes journeys similar and what makes them different? How can we cluster them? • How can we “spot” those events that are predictive of future events? Of consumer value? Of consumer interest? • How can we unpick the effects of marketing / digital products and user’s predisposition to the way sequences of events unfold? • How best should we model different users at different points on different types of journeys?
  • 14. We hope people like you will use our tech to do amazing things with the data! Questions? More information • Snowplow repo: https://github.com/snowplow/snowplow • Twitter: @SnowPlowData • Website: http://snowplowanaltyics.com • My LinkedIn: • My Twitter: