SlideShare une entreprise Scribd logo
1  sur  12
{
The Single Step
Beginning your big data journey
Today’s Stops
Spil Games: A leader in online gaming
• 180 million monthly and 12 million daily players
• More than one billion gameplays monthly
• >50 websites, local in 15 languages
• Active in every country of the world (even Vatican City!)
• Platform, Publisher, Developer
What is big data?
X Matters
Define Metrics
Define
Requirements
Develop Data
Source
Design Data
Mart
Design Report
Sign Off
Report
Reporting
Available
Slow
IT-Centric
Inflexible
Big Data BI: Agile approach, data first
Capture
Explore Define
Apply +
Track
Open
Adaptive
Evolving Structure
Do we need real time analytics?
Traditional ETLReal Time
• Once a day
• Once a week
• Delayed
• Faster than human
perception
• <200 milliseconds
“In Time”
In Time: Information is available fast enough to influence decisions
• Following a product release (hours)
• While a customer is in the shop/on the site (minutes)
• While the query runs (seconds)
The Velocity Continuum
In Time: Fast enough, Cheap enough, Easy enough
Parts and needs of a big data stack
Unstructured
data intake
Unstructured
data storage
Structured
data storage
Human
interface
layer
Predictive
analytics
tools
Select A,B,sum(C)
From X
Group by 1,2
• High Query Performance
• Denormalized
• Scalable; high concurrency
• Cheap
• Flexible Schema
• Easy Management
• Scalable
• Schemaless or adaptive schema
• Resilient
• Highly Flexible
• Simple to use
• In-tool metadata
• Not memory constrained
• Flexible inputs/outputs
• Easy iteration
Spil: Harmony of open source/commercial
Unstructured
data intake
Unstructured
data storage
Structured
data storage
Human
interface
layer
Predictive
analytics
tools
• >100x faster than based systems
• Handles tables >10B rows easily
• Excellent concurrency on load/query
• Data marts not required
• Cross-platform merging
• Anyone can develop
• Open source
• Easy development
• Integrates with rest of tools
• Industry standard
• Open source
• Ecosystem
• Existing infrastructure
• Integration with production systems
Demographic Prediction
Analytical use cases
Multivariate Testing/Site Optimization
Explore, Learn, Predict, Measure
Getting your big data off the ground
Start Fresh
Have a Problem
Be Agile
Pragmatism >
Perfection
Be Flexible
Be Fast
Make Mistakes
Find Value
A tool, not a goal
Good Luck on your
Journey!
Rob Winters
Director, Reporting/Analytics
Spil Games
www.robertdwinters.com

Contenu connexe

Tendances

Basho pres
Basho presBasho pres
Basho pres
Frank Wu
 

Tendances (20)

VoltDB and HPE Vertica Present: Building an IoT Architecture for Fast + Big Data
VoltDB and HPE Vertica Present: Building an IoT Architecture for Fast + Big DataVoltDB and HPE Vertica Present: Building an IoT Architecture for Fast + Big Data
VoltDB and HPE Vertica Present: Building an IoT Architecture for Fast + Big Data
 
The Big Data Ecosystem for Financial Services
The Big Data Ecosystem for Financial ServicesThe Big Data Ecosystem for Financial Services
The Big Data Ecosystem for Financial Services
 
Getting It Right Exactly Once: Principles for Streaming Architectures
Getting It Right Exactly Once: Principles for Streaming ArchitecturesGetting It Right Exactly Once: Principles for Streaming Architectures
Getting It Right Exactly Once: Principles for Streaming Architectures
 
Building an IoT Kafka Pipeline in Under 5 Minutes
Building an IoT Kafka Pipeline in Under 5 MinutesBuilding an IoT Kafka Pipeline in Under 5 Minutes
Building an IoT Kafka Pipeline in Under 5 Minutes
 
Building a Distributed Collaborative Data Pipeline with Apache Spark
Building a Distributed Collaborative Data Pipeline with Apache SparkBuilding a Distributed Collaborative Data Pipeline with Apache Spark
Building a Distributed Collaborative Data Pipeline with Apache Spark
 
Five ways database modernization simplifies your data life
Five ways database modernization simplifies your data lifeFive ways database modernization simplifies your data life
Five ways database modernization simplifies your data life
 
MemSQL - The Real-time Analytics Platform
MemSQL - The Real-time Analytics PlatformMemSQL - The Real-time Analytics Platform
MemSQL - The Real-time Analytics Platform
 
How to Build Fast Data Applications: Evaluating the Top Contenders
How to Build Fast Data Applications: Evaluating the Top ContendersHow to Build Fast Data Applications: Evaluating the Top Contenders
How to Build Fast Data Applications: Evaluating the Top Contenders
 
Denodo DataFest 2017: Outpace Your Competition with Real-Time Responses
Denodo DataFest 2017: Outpace Your Competition with Real-Time ResponsesDenodo DataFest 2017: Outpace Your Competition with Real-Time Responses
Denodo DataFest 2017: Outpace Your Competition with Real-Time Responses
 
Building the Foundation for a Latency-Free Life
Building the Foundation for a Latency-Free LifeBuilding the Foundation for a Latency-Free Life
Building the Foundation for a Latency-Free Life
 
CTO View: Driving the On-Demand Economy with Predictive Analytics
CTO View: Driving the On-Demand Economy with Predictive AnalyticsCTO View: Driving the On-Demand Economy with Predictive Analytics
CTO View: Driving the On-Demand Economy with Predictive Analytics
 
Introduction: Architecting for Scale
Introduction: Architecting for ScaleIntroduction: Architecting for Scale
Introduction: Architecting for Scale
 
Basho pres
Basho presBasho pres
Basho pres
 
ironSource Atom BigData Berlin
ironSource Atom BigData BerlinironSource Atom BigData Berlin
ironSource Atom BigData Berlin
 
Fast Data for Competitive Advantage: 4 Steps to Expand your Window of Opportu...
Fast Data for Competitive Advantage: 4 Steps to Expand your Window of Opportu...Fast Data for Competitive Advantage: 4 Steps to Expand your Window of Opportu...
Fast Data for Competitive Advantage: 4 Steps to Expand your Window of Opportu...
 
How Yellowbrick Data Integrates to Existing Environments Webcast
How Yellowbrick Data Integrates to Existing Environments WebcastHow Yellowbrick Data Integrates to Existing Environments Webcast
How Yellowbrick Data Integrates to Existing Environments Webcast
 
Yellowbrick Webcast with DBTA for Real-Time Analytics
Yellowbrick Webcast with DBTA for Real-Time AnalyticsYellowbrick Webcast with DBTA for Real-Time Analytics
Yellowbrick Webcast with DBTA for Real-Time Analytics
 
Lessons Learned: The Impact of Fast Data for Personalization
Lessons Learned: The Impact of Fast Data for PersonalizationLessons Learned: The Impact of Fast Data for Personalization
Lessons Learned: The Impact of Fast Data for Personalization
 
Kyvos Insights
Kyvos Insights Kyvos Insights
Kyvos Insights
 
Denodo DataFest 2017: Edge Computing: Collecting vs. Connecting to Streaming ...
Denodo DataFest 2017: Edge Computing: Collecting vs. Connecting to Streaming ...Denodo DataFest 2017: Edge Computing: Collecting vs. Connecting to Streaming ...
Denodo DataFest 2017: Edge Computing: Collecting vs. Connecting to Streaming ...
 

En vedette (7)

Big Data Expo 2015 - Infotopics Zien, Begrijpen, Doen!
Big Data Expo 2015 - Infotopics Zien, Begrijpen, Doen!Big Data Expo 2015 - Infotopics Zien, Begrijpen, Doen!
Big Data Expo 2015 - Infotopics Zien, Begrijpen, Doen!
 
Top bi travelbird
Top bi travelbirdTop bi travelbird
Top bi travelbird
 
Building a Personalized Offer Using Machine Learning
Building a Personalized Offer Using Machine LearningBuilding a Personalized Offer Using Machine Learning
Building a Personalized Offer Using Machine Learning
 
Tableau @ Spil Games
Tableau @ Spil GamesTableau @ Spil Games
Tableau @ Spil Games
 
Data Vault Automation at the Bijenkorf
Data Vault Automation at the BijenkorfData Vault Automation at the Bijenkorf
Data Vault Automation at the Bijenkorf
 
Design Principles for a Modern Data Warehouse
Design Principles for a Modern Data WarehouseDesign Principles for a Modern Data Warehouse
Design Principles for a Modern Data Warehouse
 
ProductTank AMS - Building Products to Win - Edial Dekker
ProductTank AMS - Building Products to Win - Edial DekkerProductTank AMS - Building Products to Win - Edial Dekker
ProductTank AMS - Building Products to Win - Edial Dekker
 

Similaire à Getting Started with Big Data Analytics

Presentation at Google Day on Big Data
Presentation at Google Day on Big DataPresentation at Google Day on Big Data
Presentation at Google Day on Big Data
Rezaur Rahman
 
Big-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigBig-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-Koenig
Manish Chopra
 

Similaire à Getting Started with Big Data Analytics (20)

Demystify Big Data, Data Science & Signal Extraction Deep Dive
Demystify Big Data, Data Science & Signal Extraction Deep DiveDemystify Big Data, Data Science & Signal Extraction Deep Dive
Demystify Big Data, Data Science & Signal Extraction Deep Dive
 
Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion Moving Targets: Harnessing Real-time Value from Data in Motion
Moving Targets: Harnessing Real-time Value from Data in Motion
 
In Loco - AWS Startup Day São Paulo
In Loco - AWS Startup Day São PauloIn Loco - AWS Startup Day São Paulo
In Loco - AWS Startup Day São Paulo
 
Big data
Big dataBig data
Big data
 
Big data
Big dataBig data
Big data
 
bigdatappt.pptx
bigdatappt.pptxbigdatappt.pptx
bigdatappt.pptx
 
Learnings from 7 Years of Integrating Mission-Critical IBM Z® and IBM i with ...
Learnings from 7 Years of Integrating Mission-Critical IBM Z® and IBM i with ...Learnings from 7 Years of Integrating Mission-Critical IBM Z® and IBM i with ...
Learnings from 7 Years of Integrating Mission-Critical IBM Z® and IBM i with ...
 
Turning data from insights into value
Turning data from insights into valueTurning data from insights into value
Turning data from insights into value
 
Presentation at Google Day on Big Data
Presentation at Google Day on Big DataPresentation at Google Day on Big Data
Presentation at Google Day on Big Data
 
Transform 2014: Kofax Altosoft™ Insight - Deep Dive
 Transform 2014: Kofax Altosoft™ Insight - Deep Dive Transform 2014: Kofax Altosoft™ Insight - Deep Dive
Transform 2014: Kofax Altosoft™ Insight - Deep Dive
 
Sr. Jon Ander, Internet de las Cosas y Big Data: ¿hacia dónde va la Industria?
Sr. Jon Ander, Internet de las Cosas y Big Data: ¿hacia dónde va la Industria? Sr. Jon Ander, Internet de las Cosas y Big Data: ¿hacia dónde va la Industria?
Sr. Jon Ander, Internet de las Cosas y Big Data: ¿hacia dónde va la Industria?
 
Data Science and Enterprise Engineering with Michael Finger and Chris Robison
Data Science and Enterprise Engineering with Michael Finger and Chris RobisonData Science and Enterprise Engineering with Michael Finger and Chris Robison
Data Science and Enterprise Engineering with Michael Finger and Chris Robison
 
Splunk Digital Intelligence
Splunk Digital IntelligenceSplunk Digital Intelligence
Splunk Digital Intelligence
 
Big-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-KoenigBig-Data-Seminar-6-Aug-2014-Koenig
Big-Data-Seminar-6-Aug-2014-Koenig
 
Presentation on Big Data
Presentation on Big DataPresentation on Big Data
Presentation on Big Data
 
Levelling up your data infrastructure
Levelling up your data infrastructureLevelling up your data infrastructure
Levelling up your data infrastructure
 
Big data and beyond
Big data and beyondBig data and beyond
Big data and beyond
 
Big Data
Big DataBig Data
Big Data
 
Big Data Analytics.pdfbgfjgjgghfhhffhdfyf
Big Data Analytics.pdfbgfjgjgghfhhffhdfyfBig Data Analytics.pdfbgfjgjgghfhhffhdfyf
Big Data Analytics.pdfbgfjgjgghfhhffhdfyf
 
Big Data Landscape 2018
Big Data Landscape 2018Big Data Landscape 2018
Big Data Landscape 2018
 

Dernier

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Dernier (20)

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 

Getting Started with Big Data Analytics

  • 1. { The Single Step Beginning your big data journey
  • 3. Spil Games: A leader in online gaming • 180 million monthly and 12 million daily players • More than one billion gameplays monthly • >50 websites, local in 15 languages • Active in every country of the world (even Vatican City!) • Platform, Publisher, Developer
  • 4. What is big data?
  • 5. X Matters Define Metrics Define Requirements Develop Data Source Design Data Mart Design Report Sign Off Report Reporting Available Slow IT-Centric Inflexible
  • 6. Big Data BI: Agile approach, data first Capture Explore Define Apply + Track Open Adaptive Evolving Structure
  • 7. Do we need real time analytics? Traditional ETLReal Time • Once a day • Once a week • Delayed • Faster than human perception • <200 milliseconds “In Time” In Time: Information is available fast enough to influence decisions • Following a product release (hours) • While a customer is in the shop/on the site (minutes) • While the query runs (seconds) The Velocity Continuum In Time: Fast enough, Cheap enough, Easy enough
  • 8. Parts and needs of a big data stack Unstructured data intake Unstructured data storage Structured data storage Human interface layer Predictive analytics tools Select A,B,sum(C) From X Group by 1,2 • High Query Performance • Denormalized • Scalable; high concurrency • Cheap • Flexible Schema • Easy Management • Scalable • Schemaless or adaptive schema • Resilient • Highly Flexible • Simple to use • In-tool metadata • Not memory constrained • Flexible inputs/outputs • Easy iteration
  • 9. Spil: Harmony of open source/commercial Unstructured data intake Unstructured data storage Structured data storage Human interface layer Predictive analytics tools • >100x faster than based systems • Handles tables >10B rows easily • Excellent concurrency on load/query • Data marts not required • Cross-platform merging • Anyone can develop • Open source • Easy development • Integrates with rest of tools • Industry standard • Open source • Ecosystem • Existing infrastructure • Integration with production systems
  • 10. Demographic Prediction Analytical use cases Multivariate Testing/Site Optimization Explore, Learn, Predict, Measure
  • 11. Getting your big data off the ground Start Fresh Have a Problem Be Agile Pragmatism > Perfection Be Flexible Be Fast Make Mistakes Find Value A tool, not a goal
  • 12. Good Luck on your Journey! Rob Winters Director, Reporting/Analytics Spil Games www.robertdwinters.com