Big Data
Turning your data problem into a competitive advantage


Barak Regev
Head of Cloud Platform - EMEA
20 min in 1 minute


                      Managing big data is hard



                         There is a better way



                     Put your data to work for you.
How we do it - Google Infrastructure


                        4 billion hours of video per month
                             425 million Gmail users
                            100,000,000 GB web Index
                            0.25 secs to search results
Defining Big Data
Practical problems & opportunities
“ How are hotel reservations for Spain from New
  York compared with this time last year? ”

“ Do we need to adjust our marketing campaign?
  Where? ”


 CenterParcs - European hospitality
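
A question like the first one above reduces to a year-over-year comparison over filtered reservation records. The following is a minimal Python sketch, assuming a hypothetical record layout (origin, destination, booking date). None of these names come from CenterParcs or from any Google API.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class Reservation:
    # Hypothetical record layout, for illustration only.
    origin: str
    destination: str
    booked_on: date


def count_in_window(reservations, origin, destination, start, end):
    """Count reservations for one route booked in [start, end] (inclusive)."""
    return sum(
        1
        for r in reservations
        if r.origin == origin
        and r.destination == destination
        and start <= r.booked_on <= end
    )


def year_over_year_change(reservations, origin, destination, start, end):
    """Return (this_year, last_year, relative_change) for the same window one year apart.

    Assumes neither window endpoint is 29 February, since date.replace
    would raise ValueError when shifting that day to a non-leap year.
    """
    this_year = count_in_window(reservations, origin, destination, start, end)
    last_year = count_in_window(
        reservations,
        origin,
        destination,
        start.replace(year=start.year - 1),
        end.replace(year=end.year - 1),
    )
    # Relative change is undefined when there is no baseline to compare against.
    change = None if last_year == 0 else (this_year - last_year) / last_year
    return this_year, last_year, change
```

The second question ("Where?") would follow from running the same comparison per region and looking for the routes whose relative change is most negative.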
“ Which users who signed up last quarter,
 have also advanced at least 3 levels, and
 purchased an item worth more than $5? ”


 Claritics - mobile & social user analytics
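
The question above combines three conditions on each user: sign-up date, progress, and purchase value. As a rough Python sketch, assuming a hypothetical `User` record whose fields are invented here only to mirror the wording of the question:

```python
from dataclasses import dataclass, field
from datetime import date


@dataclass
class User:
    # Hypothetical fields, chosen only to mirror the question on the slide.
    user_id: str
    signed_up: date
    levels_advanced: int
    purchases_usd: list = field(default_factory=list)


def matching_users(users, quarter_start, quarter_end,
                   min_levels=3, min_purchase_usd=5.0):
    """Return ids of users who signed up in the quarter, advanced at least
    min_levels levels, and bought an item worth more than min_purchase_usd."""
    return [
        u.user_id
        for u in users
        if quarter_start <= u.signed_up <= quarter_end
        and u.levels_advanced >= min_levels
        and any(p > min_purchase_usd for p in u.purchases_usd)
    ]
```

On a single machine this is a simple filter. The difficulty the deck points to is running it interactively over millions or billions of rows.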
Business & IT trends driving Big Data

   Opportunities
      Data is a core business asset
      Increasingly data is out in the Cloud (e.g. social, CRM)
      New things are possible in the Cloud (unique algorithms, scale)
      Greatly increased speed of sharing and iteration

   Challenges
      Information is growing faster than ability to leverage it
      Tough for Enterprise to capture all the data they generate
      Scaling traditional BI for Big Data can be hard
      Skills: requires IT, analytics, software development
What does Big Data look like?

   Some common characteristics
      Structured, semi-structured, unstructured
      Millions if not billions of rows
      Too large to process on a single machine
      Too large to store on a single machine
      High rate of growth
      More daily

   Diverse industries
      Retail point of sales transactions
      User activity logs (mobile & social)
      Mobile telemetry & smart devices
      Industrial & manufacturing
      Financial trading
      Medical research (e.g. genomics)
      Movie rendering & production
Put the Data to work
Google cloud services for Big Data
Use the cloud


                Composable cloud services

                Focus on the solution rather than on the
                infrastructure

                Do new things that weren't possible before

                Pay for what you use.
BIG DATA LOG ANALYSIS

   Store all your data in the cloud
      POS, Clickstream, RFID, Customer Loyalty, Add clickthroughs..
      Corporate data
      3rd party data

   Analyze interactively (Product Affinity, Market Basket etc)
      BigQuery, Scalable Storage (SQL, API)

   Securely share/distribute the results
      Google Spreadsheets
      Other BI Tools
      Data sets for further Analysis
      App Engine App
      Marketing, Merchandising, Local Stores, Partners
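
The "Product Affinity, Market Basket" analysis named above comes down to counting which products are bought together. Here is a small Python sketch of that counting step, assuming each basket is simply a list of product names. The function names are hypothetical and this is not how BigQuery itself computes it.

```python
from collections import Counter
from itertools import combinations


def pair_counts(baskets):
    """Count how many baskets contain each unordered pair of products."""
    counts = Counter()
    for basket in baskets:
        # sorted(set(...)) drops repeated items and gives each pair one canonical order.
        for pair in combinations(sorted(set(basket)), 2):
            counts[pair] += 1
    return counts


def top_affinities(baskets, n=3):
    """Return the n most frequently co-purchased pairs with their support
    (the fraction of all baskets that contain both products)."""
    total = len(baskets)
    return [
        (pair, count / total)
        for pair, count in pair_counts(baskets).most_common(n)
    ]
```

For example, `top_affinities([["bread", "milk"], ["milk", "eggs"]], n=1)` would report one pair with support 0.5. At retail scale the same pair counting would be expressed as a query over the stored transaction logs rather than an in-memory loop.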
Scaling large ads reporting
   [Chart: Customer load test, On-prem MySQL vs BigQuery. Y axis: latency (seconds). X axis: # days of data.]



            Business: ads authoring tools and reporting
            Data: ad serving logs for 500 websites, ~300M rows/day
            Problem solved: interactively finding new trends and patterns
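
To give a sense of the volumes behind the "# days of data" axis, the quoted rate of about 300M rows per day can be multiplied out. In this short Python sketch the day windows are assumptions chosen for illustration, since the chart's actual values are not preserved here.

```python
ROWS_PER_DAY = 300_000_000  # "~300M rows/day" from the slide; an approximate figure


def rows_scanned(days, rows_per_day=ROWS_PER_DAY):
    """Approximate number of rows a report over `days` days has to scan."""
    return days * rows_per_day


if __name__ == "__main__":
    # Window sizes are illustrative assumptions, not values from the load test.
    for days in (1, 7, 30, 90):
        print(f"{days:>3} days -> {rows_scanned(days):,} rows")
```

At that rate a 30-day report covers roughly 9 billion rows.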
A New Hadoop Terasort World Record
What did we learn?



                        Store data with reliability, redundancy and consistency

                        Go from Data to Meaning


                        At Scale


                        ...fast



                     Google white papers
                     Google File System (2003)
                     MapReduce: Simplified Data Processing on Large Clusters (2004)
                     BigTable: A Distributed Storage System for Structured Data (2006)
                     Dremel: Interactive Analysis of Web-Scale Datasets (2010)
                     Machine Translation (2004-2011)
The virtuous cycle of data




   Collect Data (Cloud Storage, Datastore, Logstore)
   Process Data (App Engine, GCE)
   Analyze Data (BigQuery)
   Build application (GAE / GCE)
   (improve)
Build
The Next Generation of Data-Centric Applications
BigQuery use cases in industry

Ad Spend Attribution (online travel reservations)
   Mash up Adwords + Google Analytics data + customer reservations for high volume attribution analysis

Media consulting (global top-5 media agency)
   Analyze 20GB/day of DoubleClick display ads performance metrics for F500 clients

Ad authoring tools (online ads authoring)
   Deliver x-platform performance analytics dashboards to 100s of ads authoring customers

Social gaming (data analytics vendor)
   Cohort analysis on million+ gamers to monetize massive online social gaming

Revenue optimization (holiday/travel properties)
   Measure x-media campaign effectiveness to maximize occupancy rates

Business Requirements
   A single place to capture growing data
   Combine data from different sources
   Ad hoc detection of patterns and correlations
   Easily share data insights with org
   Distribute data-based decision making
BIME + BigQuery

Interactively analyze 450M rows of sales data
Mobile & social gaming user analysis




                                       Notice trend change




                                       Slice user data, identify segments




                                       Compare segments vs general
                                       population
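
The last two steps above, slicing users into a segment and comparing it with the general population, can be sketched in Python as follows. The user records, the segment predicate, and the metric are all hypothetical and passed in by the caller.

```python
from statistics import mean


def compare_segment(users, in_segment, metric):
    """Return (segment_mean, population_mean, ratio) for one metric.

    `in_segment` is a predicate selecting the segment and `metric` extracts
    a number from a user record. Both are supplied by the caller.
    """
    population = [metric(u) for u in users]
    segment = [metric(u) for u in users if in_segment(u)]
    if not population or not segment:
        raise ValueError("need at least one user in both segment and population")
    seg_mean, pop_mean = mean(segment), mean(population)
    # The ratio is undefined when the population average is zero.
    ratio = None if pop_mean == 0 else seg_mean / pop_mean
    return seg_mean, pop_mean, ratio
```

A ratio well above or below 1 marks a segment that behaves differently from the population as a whole, which is the kind of signal the "notice trend change" step looks for.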
Revenue optimization - hospitality industry


                            New solution for real-time decision making
                            Saves more than $150,00 a year




   [Architecture diagram. Labels: AppEngine, BigQuery, Cloud Storage, Regional Sales, Analysts, Execs, BI team, Oracle DB, Netezza appliance]
Thank you




            cloud.google.com
