Snowplow: where we came from and where we are going - March 2016

Where we came from and where we’re going
March 2016
Snowplow was born in 2012
Web data: rich but GA / SiteCatalyst are limited
• Marketing, not product analytics
• Silo’d: can’t join with other customer data
“Big data” tech
• Open source frameworks
• Cloud services
Snowplow
• Open source click stream data warehouse
• Event level: any query
• Built on top of Cloudfront / EMR / Hadoop
The plan: spend 6 months building a pipeline…
…then get back to using the data
So what went wrong?
Increased project scope
• Click stream data warehouse -> Event analytics platform
• Collect events from anywhere, not just the web
• Make event data actionable in real-time
• Support more in-pipeline processing steps (enrichment and modeling)
• Support more storage targets (where your data lives has big implications for what you can do with it)
Track events from anywhere
• Events
• Entities
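The events-and-entities split above can be sketched as a small data structure. This is only an illustration, not Snowplow's actual data model: the class names, the `source` field and the example values are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Any, Dict, List


@dataclass
class Entity:
    """A thing involved in an event, e.g. a user, a product or a web page."""
    kind: str
    properties: Dict[str, Any]


@dataclass
class Event:
    """Something that happened, plus the entities that were involved."""
    name: str
    source: str  # hypothetical field: where it was tracked ("web", "mobile", "server", ...)
    properties: Dict[str, Any] = field(default_factory=dict)
    entities: List[Entity] = field(default_factory=list)

    def entities_of_kind(self, kind: str) -> List[Entity]:
        """Return the attached entities of one kind, in their original order."""
        return [e for e in self.entities if e.kind == kind]


# Hypothetical example: a mobile "add to basket" event with two entities.
event = Event(
    name="add_to_basket",
    source="mobile",
    properties={"quantity": 2},
    entities=[
        Entity("user", {"id": "u-123"}),
        Entity("product", {"sku": "sku-42", "price": 9.99}),
    ],
)
```

Keeping entities separate from the event itself means the same user or product shape can be reused across many event types, whatever channel they come from.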
Make event data actionable in real-time
• Personalization
• Marketing automation
• Content analytics
Today, Snowplow is an event data pipeline
What makes Snowplow special?
• Data pipeline evolves with your business
• Channel coverage
• Flexibility: where your data is delivered
• Flexibility: how your data is processed (enrichment and modeling)
• Data quality
• Speed
• Transparency
Used by 100s (1000s?) of companies…
…to answer their most important business questions
But there’s still much more to build!
• Improve automation around schema evolution (Iglu: machine-readable schema registry)
• Make modeling event data easier, more robust, more performant (data modeling in Spark)
• Support more storage targets (Druid, BigQuery, graph databases)
• Make it easier to act on event data (Analytics SDKs, Sauna)
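To make the schema registry idea concrete, here is a minimal sketch, assuming Iglu's self-describing JSON convention: a `schema` URI of the form `iglu:vendor/name/format/MODEL-REVISION-ADDITION` next to a `data` payload. The vendor `com.acme`, the event name and the helper names below are hypothetical, and the regular expression is a simplification rather than the registry's own validation rule.

```python
import re
from typing import NamedTuple


class SchemaKey(NamedTuple):
    vendor: str
    name: str
    format: str
    model: int
    revision: int
    addition: int


# Simplified pattern for iglu:vendor/name/format/MODEL-REVISION-ADDITION
_IGLU_URI = re.compile(
    r"^iglu:([a-zA-Z0-9_.-]+)/([a-zA-Z0-9_-]+)/([a-zA-Z0-9_-]+)/(\d+)-(\d+)-(\d+)$"
)


def parse_schema_uri(uri: str) -> SchemaKey:
    """Split a schema URI into its parts so code can reason about versions."""
    match = _IGLU_URI.match(uri)
    if match is None:
        raise ValueError(f"not a valid schema URI: {uri!r}")
    vendor, name, fmt, model, revision, addition = match.groups()
    return SchemaKey(vendor, name, fmt, int(model), int(revision), int(addition))


# Hypothetical self-describing event: the payload names the schema it conforms to.
event = {
    "schema": "iglu:com.acme/add_to_basket/jsonschema/1-0-2",
    "data": {"sku": "sku-42", "quantity": 2},
}

key = parse_schema_uri(event["schema"])
```

Because the version travels with every event, a pipeline step can tell which schema to validate against and when an event was written against an older or newer revision, which is the kind of schema evolution the first bullet refers to.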
Questions?
• Can take questions now or after the other talks