SlideShare une entreprise Scribd logo
1  sur  18
Télécharger pour lire hors ligne
Big Data
                 + Social
                 + Games
                 @Is Cool
                        16/03/2012
TITRE DOCUMENT
Who is IsCool Entertainment?


  Social game publisher based in    Agenda
  Paris, France                     • What do we do?
                                    Social Gaming
  #1 French publisher in terms of   •   What kind of (Big) Analytics we do?
  audience (450k Daily Active       Lots
  Users) & revenue                  •   How we do it ?
                                    Hadoop, Python, R, Tableau, Geph and stuff…
  2.8 Millions Fans
  80 employees
                                                          Florian Douetteau
  9.1 million € revenue in 2010                           CTO
  4 live applications on Facebook                         @fdouetteau
Is Cool Games

                              IsCool,   Absolute Solitaire,
                Delirious Collectible   The best solitaire game
                               Game     available online




             Temple Of Mahjong,         Belote Multijoueur,
           Collect, Play, Exchange      Play, Win, Meet
Games & Virtual Goods

                         Play the Game & Gain some
                         virtual goods
                         Play again & Gain more
                         Collaborate with other players
                         & Gain More
                         ….
                         Possibly buy
                         To grow quicker
                         To help others
Virtual Goods  Virtual Economy

  Virtual Goods Must not be too
  easy to get
  The game would not be fun !
  No monetization
  Virtual Goods must not be hard
  to get
  People would churn because of    Let’s Trade 1
                                    Watch against
    frustration !                    3 Hammers
  Virtual Goods can be usually
  traded between players
  Virtual and actual “Price” of a
  good
Why is this Big Data ?

                           Number of object transactions per day
                           NYSE              3,600,000,000
   18 Million users
   generated actions
   per day                 IsCool            2,150,000,000
   7 Billions per year.
                           Nasdaq            1,600,000,000

   9,8 TB Data to
                           Nikkey            1,500,000,000
   analyze
                           Footsie           860,000,000

                           CAC 40            142,500,000
The Real Big Data Challenge
  Collaborate for collective insights
                                                        Programmers’ Perspective :
 Game Designer Perspective :                            Log Files & Work ?
 Nice Charts ?
                                                 Realtime?

                  what
                 metrics?
                               data scientist?




BI Veteran:                                           Business Guy Perspective:
Schema Definition ?                                   Revenue Forecast ?
Specifics of Game Analytics

  Virtual Goods
  We are the Factory AND the
    Shop, and most of the products
    are free.
  Social Networks
  Network effects are key
  Games
  The product changes EVERY day !
  Sudden wage of unexpected
    players from Guatemala !
  People try to cheat !
Use Case 1 : Understanding Users

  1: Defining engagement

                                        Tenure length
                                        Visit frequency
                                        Virality
 Traffic               Key drivers???   Paying user conversion
                                        ARPPU
                                        Score
                                        Use of feature A,B,C…
Case Study 1 - Segment User Behaviours

  2: Describing engagement patterns: Running a segment analysis
Use Case 2 : Understanding Users as a whole


  10 Million Nodes
  Around 1 000 Billion
  Edges




                               How does the graph evolve in
                               time ?
                               What are the
                               communities?
Understanding Users as a Whole

Lots of small clusters ((mostly 2
players)
                                              Some mid size communities




                                    A very large community
Use Case 3 : Analyze Long Terms effect of a feature

                             A/B Tests
                             Some features can be A/B tested
                             …and some cannot !
                             How to measure the uplift ?
                             Are players using the new feature…
                             More engaged?
                             Generate more virality ?
                             etc….
                             Complexity
                             Multiple variable to observe
                               (other features, history )



                                                 TITRE DOCUMENT   16/03/2012
… How




over the last 3 years   Analyzing the Offer
• Tools changed         • Online Analytics Platform
• Scale changed         • Commercial / Open Source ETL
• Focus Changed         • Commercial BI Visualization Software
                        • Commercial / Open Source databases
                          (column stores)
                        •…
What we learned


         Diversity                  Relativity                    Superciality

• There's no Hadoop+R       • Windows / Linux ? Cloud     • Ability to display is more
  Magic (Expertise, Entry     or on-premise ?               important than the
  Costs, Maintenance)       • Do you have internal data     result.
• There’s no XYZ Magical      mining experts (yes/no) ?
  Product                   • Do you have internal
                              scalability
                              experts (yes/no) ?
                            • What is _real_ budget ?
                              0K ? 10K ? 100K ? 1000K
                              ?
Mixed Approach
  SaaS Analytics Platforms
  For common, business metrics (virality,
     traffic, engagement)
  Corporate Level Visibility
  Day-to-day
  Internal Datawarehousing
  Detailed Business Metrics
  Virtual Economy Modeling
  Long term behaviours
  Business Level Visibility
  Week-to-Week
  Datamining tools
  Ad-hoc analytics
  Graph Analytics
Datawarehouse for the Big Data era

   Hadoop/Hive (through Amazon’s                   Open Source ETL (PyBabe)
   Elastric Map Reduce)                            • Pure Python ETL
   • Used to reduce the amount of information :    • Good integration with AWS/ S3
     10 GB a day => 1GB a day                      • Easy to integrate in our development
   • High cost of development for "business"         environment
     related processing




   Columnar Database (Infinidb, Open               Dashboarding (Tableau Software)
   Source)                                         • +Direct connection to the database
   • Free (as beer)                                • +Excel fan biz guy can use it with no training !
   • Good performance for analytics tasks on a
     few hundreds million lines ( SELECT … GROUP
     BY … ORDER … )
   • Featured and limited performance compared
     to commercial Column Stores
Questions ?

Contenu connexe

Tendances

Daniel Rabinovich - Etsy - New York
Daniel Rabinovich - Etsy - New YorkDaniel Rabinovich - Etsy - New York
Daniel Rabinovich - Etsy - New York
Daniel Rabinovich
 
New 2001 Net Odyssey
New 2001 Net OdysseyNew 2001 Net Odyssey
New 2001 Net Odyssey
Anna Pollock
 

Tendances (20)

Deliver New Customer Experiences Through AI-enabled Chatbots
 Deliver New Customer Experiences Through AI-enabled Chatbots Deliver New Customer Experiences Through AI-enabled Chatbots
Deliver New Customer Experiences Through AI-enabled Chatbots
 
AWS Empowering Digital Marketing with the AWS Cloud
AWS Empowering Digital Marketing with the AWS Cloud AWS Empowering Digital Marketing with the AWS Cloud
AWS Empowering Digital Marketing with the AWS Cloud
 
Welcome easypurl presentation small medium biz
Welcome easypurl presentation small medium bizWelcome easypurl presentation small medium biz
Welcome easypurl presentation small medium biz
 
Integrator role whitespace
Integrator role whitespaceIntegrator role whitespace
Integrator role whitespace
 
IBM Marketing Cloud mobile solutions
IBM Marketing Cloud mobile solutionsIBM Marketing Cloud mobile solutions
IBM Marketing Cloud mobile solutions
 
AWS Sydney Summit 2019 Re:Cap
AWS Sydney Summit 2019 Re:CapAWS Sydney Summit 2019 Re:Cap
AWS Sydney Summit 2019 Re:Cap
 
Amazon Web Services SWOT & Competitor Analysis
Amazon Web Services SWOT & Competitor AnalysisAmazon Web Services SWOT & Competitor Analysis
Amazon Web Services SWOT & Competitor Analysis
 
All The Screens: Cross Platform Design Strategies
All The Screens: Cross Platform Design StrategiesAll The Screens: Cross Platform Design Strategies
All The Screens: Cross Platform Design Strategies
 
Demystifying the desktop - Matt Groves
Demystifying the desktop - Matt GrovesDemystifying the desktop - Matt Groves
Demystifying the desktop - Matt Groves
 
Introducing Amazon Connect-Keynote-Enterprise Connect 2017
Introducing Amazon Connect-Keynote-Enterprise Connect 2017Introducing Amazon Connect-Keynote-Enterprise Connect 2017
Introducing Amazon Connect-Keynote-Enterprise Connect 2017
 
Implementation of Amazon Connect, Powered By Accenture
Implementation of Amazon Connect, Powered By AccentureImplementation of Amazon Connect, Powered By Accenture
Implementation of Amazon Connect, Powered By Accenture
 
Overview of IBM Watson Services via Blue Mix
Overview of IBM Watson Services via Blue Mix Overview of IBM Watson Services via Blue Mix
Overview of IBM Watson Services via Blue Mix
 
AWS Summit Singapore 2019 | Driving Business Outcomes with Data Lake on AWS
AWS Summit Singapore 2019 | Driving Business Outcomes with Data Lake on AWSAWS Summit Singapore 2019 | Driving Business Outcomes with Data Lake on AWS
AWS Summit Singapore 2019 | Driving Business Outcomes with Data Lake on AWS
 
Maximising the Customer Experience with Amazon Connect and AI Services
Maximising the Customer Experience with Amazon Connect and AI ServicesMaximising the Customer Experience with Amazon Connect and AI Services
Maximising the Customer Experience with Amazon Connect and AI Services
 
SharePoint Saturday Warsaw - Conversational AI applications in Microsoft Teams
SharePoint Saturday Warsaw - Conversational AI applications in Microsoft TeamsSharePoint Saturday Warsaw - Conversational AI applications in Microsoft Teams
SharePoint Saturday Warsaw - Conversational AI applications in Microsoft Teams
 
Daniel Rabinovich - Etsy - New York
Daniel Rabinovich - Etsy - New YorkDaniel Rabinovich - Etsy - New York
Daniel Rabinovich - Etsy - New York
 
Henry Stewart DAM Webinar - The Future of DAM: Enterprise Power Meets Departm...
Henry Stewart DAM Webinar - The Future of DAM: Enterprise Power Meets Departm...Henry Stewart DAM Webinar - The Future of DAM: Enterprise Power Meets Departm...
Henry Stewart DAM Webinar - The Future of DAM: Enterprise Power Meets Departm...
 
New 2001 Net Odyssey
New 2001 Net OdysseyNew 2001 Net Odyssey
New 2001 Net Odyssey
 
The pitch deck we used to raise $200,000 for our Brazilian startup
The pitch deck we used to raise $200,000 for our Brazilian startupThe pitch deck we used to raise $200,000 for our Brazilian startup
The pitch deck we used to raise $200,000 for our Brazilian startup
 
The Connected Data Imperative: The Shifting Enterprise Data Story
The Connected Data Imperative: The Shifting Enterprise Data StoryThe Connected Data Imperative: The Shifting Enterprise Data Story
The Connected Data Imperative: The Shifting Enterprise Data Story
 

En vedette

Codemasters presentation
Codemasters presentationCodemasters presentation
Codemasters presentation
Ethan Falconer
 
About Teamtop Games, 8th August, v1.4
About Teamtop Games, 8th August, v1.4About Teamtop Games, 8th August, v1.4
About Teamtop Games, 8th August, v1.4
Frank Guo 郭智宇
 

En vedette (12)

Codemasters presentation
Codemasters presentationCodemasters presentation
Codemasters presentation
 
Andrey Zimenko, WG Labs
Andrey Zimenko, WG LabsAndrey Zimenko, WG Labs
Andrey Zimenko, WG Labs
 
What is a Good Game? (From the Perspective of a Game Publisher) | Christopher...
What is a Good Game? (From the Perspective of a Game Publisher) | Christopher...What is a Good Game? (From the Perspective of a Game Publisher) | Christopher...
What is a Good Game? (From the Perspective of a Game Publisher) | Christopher...
 
Leela Mobile Game Publisher
Leela Mobile Game PublisherLeela Mobile Game Publisher
Leela Mobile Game Publisher
 
FL Mobile 소개
FL Mobile 소개FL Mobile 소개
FL Mobile 소개
 
Atvi buy
Atvi buyAtvi buy
Atvi buy
 
About Teamtop Games, 8th August, v1.4
About Teamtop Games, 8th August, v1.4About Teamtop Games, 8th August, v1.4
About Teamtop Games, 8th August, v1.4
 
Palestra GamerTalks InfNet - 2015 - Mercado e Oportunidades para empresas de ...
Palestra GamerTalks InfNet - 2015 - Mercado e Oportunidades para empresas de ...Palestra GamerTalks InfNet - 2015 - Mercado e Oportunidades para empresas de ...
Palestra GamerTalks InfNet - 2015 - Mercado e Oportunidades para empresas de ...
 
Mobile Game Asia 2015 Ho Chi Minh City: The role of a game publisher in Vietnam
Mobile Game Asia 2015 Ho Chi Minh City: The role of a game publisher in VietnamMobile Game Asia 2015 Ho Chi Minh City: The role of a game publisher in Vietnam
Mobile Game Asia 2015 Ho Chi Minh City: The role of a game publisher in Vietnam
 
The Role Of A Game Publisher In Vietnam
The Role Of A Game Publisher In VietnamThe Role Of A Game Publisher In Vietnam
The Role Of A Game Publisher In Vietnam
 
Ubisoft
UbisoftUbisoft
Ubisoft
 
Business Plan
Business PlanBusiness Plan
Business Plan
 

Similaire à Big data paris 2011 is cool florian douetteau

01 9 15 paul frampton omma keynote-deck
01 9 15 paul frampton omma keynote-deck01 9 15 paul frampton omma keynote-deck
01 9 15 paul frampton omma keynote-deck
MediaPost
 
Business Model of Social Games (Detailed) - Gerald Tan (2012-Mar-01)
Business Model of Social Games (Detailed) - Gerald Tan (2012-Mar-01)Business Model of Social Games (Detailed) - Gerald Tan (2012-Mar-01)
Business Model of Social Games (Detailed) - Gerald Tan (2012-Mar-01)
Gerald Tan
 
OpTier McKinsey Big Data Overview
OpTier McKinsey Big Data OverviewOpTier McKinsey Big Data Overview
OpTier McKinsey Big Data Overview
nickychu
 
McKinsey Big Data Overview
McKinsey Big Data OverviewMcKinsey Big Data Overview
McKinsey Big Data Overview
optier
 
McKinsey Big Data Overview
McKinsey Big Data OverviewMcKinsey Big Data Overview
McKinsey Big Data Overview
optier
 
5. happy elements
5. happy elements5. happy elements
5. happy elements
Alice Kim
 
Io t idas_intro_ul20_nobkg
Io t idas_intro_ul20_nobkgIo t idas_intro_ul20_nobkg
Io t idas_intro_ul20_nobkg
FIWARE
 

Similaire à Big data paris 2011 is cool florian douetteau (20)

One Million Players Isn't Cool (at Festival of Games 2011)
One Million Players Isn't Cool (at Festival of Games 2011)One Million Players Isn't Cool (at Festival of Games 2011)
One Million Players Isn't Cool (at Festival of Games 2011)
 
danmcclary-pspresentation-katieboyle-171030115522.pdf
danmcclary-pspresentation-katieboyle-171030115522.pdfdanmcclary-pspresentation-katieboyle-171030115522.pdf
danmcclary-pspresentation-katieboyle-171030115522.pdf
 
Why Big and Small Data Is Important by Google's Product Manager
Why Big and Small Data Is Important by Google's Product ManagerWhy Big and Small Data Is Important by Google's Product Manager
Why Big and Small Data Is Important by Google's Product Manager
 
01 9 15 paul frampton omma keynote-deck
01 9 15 paul frampton omma keynote-deck01 9 15 paul frampton omma keynote-deck
01 9 15 paul frampton omma keynote-deck
 
Business Model of Social Games (Detailed) - Gerald Tan (2012-Mar-01)
Business Model of Social Games (Detailed) - Gerald Tan (2012-Mar-01)Business Model of Social Games (Detailed) - Gerald Tan (2012-Mar-01)
Business Model of Social Games (Detailed) - Gerald Tan (2012-Mar-01)
 
Wikibon Big Data Capital Markets Day 2014
Wikibon Big Data Capital Markets Day 2014Wikibon Big Data Capital Markets Day 2014
Wikibon Big Data Capital Markets Day 2014
 
Agile data science
Agile data scienceAgile data science
Agile data science
 
Microsoft BizSpark: Tips for Startups - midem 2012 presentation
Microsoft BizSpark: Tips for Startups - midem 2012 presentationMicrosoft BizSpark: Tips for Startups - midem 2012 presentation
Microsoft BizSpark: Tips for Startups - midem 2012 presentation
 
Continuum Analytics and Python
Continuum Analytics and PythonContinuum Analytics and Python
Continuum Analytics and Python
 
OpTier McKinsey Big Data Overview
OpTier McKinsey Big Data OverviewOpTier McKinsey Big Data Overview
OpTier McKinsey Big Data Overview
 
McKinsey Big Data Overview
McKinsey Big Data OverviewMcKinsey Big Data Overview
McKinsey Big Data Overview
 
McKinsey Big Data Overview
McKinsey Big Data OverviewMcKinsey Big Data Overview
McKinsey Big Data Overview
 
Tokens and Complex Systems
Tokens and Complex SystemsTokens and Complex Systems
Tokens and Complex Systems
 
Tokens, Complex Systems, and Nature
Tokens, Complex Systems, and NatureTokens, Complex Systems, and Nature
Tokens, Complex Systems, and Nature
 
5. happy elements
5. happy elements5. happy elements
5. happy elements
 
Massively multiplayer data challenges in mobile game analytics
Massively multiplayer data  challenges in mobile game analyticsMassively multiplayer data  challenges in mobile game analytics
Massively multiplayer data challenges in mobile game analytics
 
Massively multiplayer data challenges in mobile game analytics
Massively multiplayer data  challenges in mobile game analyticsMassively multiplayer data  challenges in mobile game analytics
Massively multiplayer data challenges in mobile game analytics
 
Fiware IoT_intro&scenarios
Fiware IoT_intro&scenariosFiware IoT_intro&scenarios
Fiware IoT_intro&scenarios
 
Io t idas_intro_ul20_nobkg
Io t idas_intro_ul20_nobkgIo t idas_intro_ul20_nobkg
Io t idas_intro_ul20_nobkg
 
Harnessing Big Data to Better Serve Your Audience - Core Audience / iCrossing
Harnessing Big Data to Better Serve Your Audience - Core Audience / iCrossingHarnessing Big Data to Better Serve Your Audience - Core Audience / iCrossing
Harnessing Big Data to Better Serve Your Audience - Core Audience / iCrossing
 

Dernier

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Dernier (20)

Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live StreamsTop 5 Benefits OF Using Muvi Live Paywall For Live Streams
Top 5 Benefits OF Using Muvi Live Paywall For Live Streams
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...Apidays New York 2024 - The value of a flexible API Management solution for O...
Apidays New York 2024 - The value of a flexible API Management solution for O...
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
Deploy with confidence: VMware Cloud Foundation 5.1 on next gen Dell PowerEdg...
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 

Big data paris 2011 is cool florian douetteau

  • 1. Big Data + Social + Games @Is Cool 16/03/2012 TITRE DOCUMENT
  • 2. Who is IsCool Entertainment? Social game publisher based in Agenda Paris, France • What do we do? Social Gaming #1 French publisher in terms of • What kind of (Big) Analytics we do? audience (450k Daily Active Lots Users) & revenue • How we do it ? Hadoop, Python, R, Tableau, Geph and stuff… 2.8 Millions Fans 80 employees Florian Douetteau 9.1 million € revenue in 2010 CTO 4 live applications on Facebook @fdouetteau
  • 3. Is Cool Games IsCool, Absolute Solitaire, Delirious Collectible The best solitaire game Game available online Temple Of Mahjong, Belote Multijoueur, Collect, Play, Exchange Play, Win, Meet
  • 4. Games & Virtual Goods Play the Game & Gain some virtual goods Play again & Gain more Collaborate with other players & Gain More …. Possibly buy  To grow quicker  To help others
  • 5. Virtual Goods  Virtual Economy Virtual Goods Must not be too easy to get  The game would not be fun !  No monetization Virtual Goods must not be hard to get  People would churn because of Let’s Trade 1 Watch against frustration ! 3 Hammers Virtual Goods can be usually traded between players Virtual and actual “Price” of a good
  • 6. Why is this Big Data ? Number of object transactions per day  NYSE 3,600,000,000 18 Million users generated actions per day  IsCool 2,150,000,000 7 Billions per year.  Nasdaq 1,600,000,000 9,8 TB Data to  Nikkey 1,500,000,000 analyze  Footsie 860,000,000  CAC 40 142,500,000
  • 7. The Real Big Data Challenge Collaborate for collective insights Programmers’ Perspective : Game Designer Perspective : Log Files & Work ? Nice Charts ? Realtime? what metrics? data scientist? BI Veteran: Business Guy Perspective: Schema Definition ? Revenue Forecast ?
  • 8. Specifics of Game Analytics Virtual Goods  We are the Factory AND the Shop, and most of the products are free. Social Networks  Network effects are key Games  The product changes EVERY day !  Sudden wage of unexpected players from Guatemala !  People try to cheat !
  • 9. Use Case 1 : Understanding Users 1: Defining engagement Tenure length Visit frequency Virality Traffic Key drivers??? Paying user conversion ARPPU Score Use of feature A,B,C…
  • 10. Case Study 1 - Segment User Behaviours 2: Describing engagement patterns: Running a segment analysis
  • 11. Use Case 2 : Understanding Users as a whole 10 Million Nodes Around 1 000 Billion Edges How does the graph evolve in time ? What are the communities?
  • 12. Understanding Users as a Whole Lots of small clusters ((mostly 2 players) Some mid size communities A very large community
  • 13. Use Case 3 : Analyze Long Terms effect of a feature A/B Tests  Some features can be A/B tested  …and some cannot !  How to measure the uplift ? Are players using the new feature…  More engaged?  Generate more virality ?  etc…. Complexity  Multiple variable to observe (other features, history ) TITRE DOCUMENT 16/03/2012
  • 14. … How over the last 3 years Analyzing the Offer • Tools changed • Online Analytics Platform • Scale changed • Commercial / Open Source ETL • Focus Changed • Commercial BI Visualization Software • Commercial / Open Source databases (column stores) •…
  • 15. What we learned Diversity Relativity Superciality • There's no Hadoop+R • Windows / Linux ? Cloud • Ability to display is more Magic (Expertise, Entry or on-premise ? important than the Costs, Maintenance) • Do you have internal data result. • There’s no XYZ Magical mining experts (yes/no) ? Product • Do you have internal scalability experts (yes/no) ? • What is _real_ budget ? 0K ? 10K ? 100K ? 1000K ?
  • 16. Mixed Approach SaaS Analytics Platforms  For common, business metrics (virality, traffic, engagement)  Corporate Level Visibility  Day-to-day Internal Datawarehousing  Detailed Business Metrics  Virtual Economy Modeling  Long term behaviours  Business Level Visibility  Week-to-Week Datamining tools  Ad-hoc analytics  Graph Analytics
  • 17. Datawarehouse for the Big Data era Hadoop/Hive (through Amazon’s Open Source ETL (PyBabe) Elastric Map Reduce) • Pure Python ETL • Used to reduce the amount of information : • Good integration with AWS/ S3 10 GB a day => 1GB a day • Easy to integrate in our development • High cost of development for "business" environment related processing Columnar Database (Infinidb, Open Dashboarding (Tableau Software) Source) • +Direct connection to the database • Free (as beer) • +Excel fan biz guy can use it with no training ! • Good performance for analytics tasks on a few hundreds million lines ( SELECT … GROUP BY … ORDER … ) • Featured and limited performance compared to commercial Column Stores