SlideShare une entreprise Scribd logo
1  sur  37
Big data, big deal?

         February 2013




         Matt Turck
       Twitter: @mattturck
    Blog: http://mattturck.com
Background: I prepared this slide deck for a couple of
“Big Data 101” guest lectures I did in February 2012 at
New York University’s Stern School of Business and at
The New School. They’re intended for a college
level, non technical audience, as a first exposure to Big
Data and related concepts. I have re-used a number of
stats, graphics, cartoons and other materials freely
available on the internet. Thanks to the authors of those
materials.
What does Target know about
     pregnant women?
Hype

    Data is…
   "the new gold”
   “the new black”
   “the new plastic”
   "the new oil”
   “the new frontier”
Isn’t it what computers have always
                done?
What’s different this time?

         Volume.
         Variety.
         Velocity.
Facebook warehouses 180 petabytes
          of data a year
Twitter manages 1.2 million deliveries
            per second
New sources of data
Twitter manages 1.2 million deliveries
            per second
Open Government Data
Big data is data that exceeds the
processing capacity of conventional
database systems. The data is too
big, moves too fast, or doesn’t fit the
strictures of your database
architectures. To gain value from this
data, you must choose an alternative
way to process it.

               Edd Dumbill, O’Reilly
A new breed of technologies
Big Data Landscape
                  Infrastructure                                         Analytics                                      Applications
   NoSQL Databases              Hadoop Related           Analytics Solutions     Data Visualization                   Ad Optimization




                                                                                                            Publisher            Marketing
   NewSQL Databases
                                                        Statistical Computing                                 Tools

                                                                                      Social Media


MPP Databases     Management /     Cluster Services
                                                                                                                    Industry Applications
                   Monitoring
                                                         Sentiment Analysis      Analytics Services

                                       Security
                                                                                                               Application Service Providers
                                                         Location / People /
                                                                                  Big Data Search
                                                               Events
                      Storage
                                                                                      IT Analytics                   Data Sources
Crowdsourcing
                                                                                                              Data               Data Sources
                                     Collection /           Real-      Crowdsourced SMB Analytics          Marketplaces
                                      Transport             Time         Analytics




                                  Cross Infrastructure / Analytics                                                      Personal Data


                                                            Open Source Projects
 Framework      Query / Data           Data Access                   Coordination /         Real -    Statistical     Machine        Cloud
                   Flow                                                Workflow             Time        Tools         Learning     Deployment


                                         Matt Turck (@mattturck) and Shivon Zilis (@shivonz)
A new breed of people:
    Data scientists
     engineering
                                math

                     nerds


           nerds               nerds



                     nerds
comp sci
                             hacking




                   awesome nerds
                                       Credit: Hilary Mason, Bitly
Sexy nerds?




          “Data Scientist:
The Sexiest Job of the 21st Century”
           October 2012
Nerd talent shortage
Terms worth remembering

Structured vs. unstructured data
            Hadoop
        Cloud computing
       Data visualization
       Machine learning
      Predictive analytics
So what do you do with all that
        technology?
Lending
Trading
Insurance
Agriculture
Healthcare
Energy
Music
Education
But what about small data?
Moneyball is (relatively) small data
Nate Silver is (relatively) small data
Most companies only have small data
It’s not about big data
for the sake of big data
Data-driven management



“In God we trust. Everyone else, bring data”
Data-driven culture
Easier than ever for any business to be
           truly data-driven
Thanks!



           Learn more:

  NYC Data Business Meetup

meetup.com/NYC-Data-Business-Meetup/

Contenu connexe

Tendances

Big Data Landscape 2018
Big Data Landscape 2018Big Data Landscape 2018
Big Data Landscape 2018Leanne Hwee
 
Forecast of Big Data Trends
Forecast of Big Data TrendsForecast of Big Data Trends
Forecast of Big Data TrendsIMC Institute
 
BIG DATA & DATA ANALYTICS
BIG  DATA & DATA  ANALYTICSBIG  DATA & DATA  ANALYTICS
BIG DATA & DATA ANALYTICSNAGARAJAGIDDE
 
Tools for Unstructured Data Analytics
Tools for Unstructured Data AnalyticsTools for Unstructured Data Analytics
Tools for Unstructured Data AnalyticsRavi Teja
 
Big data using Public Cloud
Big data using Public CloudBig data using Public Cloud
Big data using Public CloudIMC Institute
 
Big data analytics use cases: all you need to know
Big data analytics use cases:  all you need to knowBig data analytics use cases:  all you need to know
Big data analytics use cases: all you need to knowJane Brewer
 
Real time analytics of big data
Real time analytics of big dataReal time analytics of big data
Real time analytics of big dataDeependra Jyoti
 
Big data analytics
Big data analyticsBig data analytics
Big data analyticsRavi Teja
 
The Business of Big Data - IA Ventures
The Business of Big Data - IA VenturesThe Business of Big Data - IA Ventures
The Business of Big Data - IA VenturesBen Siscovick
 
Big data Presentation
Big data PresentationBig data Presentation
Big data PresentationAswadmehar
 
Big Data on Public Cloud
Big Data on Public CloudBig Data on Public Cloud
Big Data on Public CloudIMC Institute
 

Tendances (20)

Big Data analytics
Big Data analyticsBig Data analytics
Big Data analytics
 
Big Data Landscape 2018
Big Data Landscape 2018Big Data Landscape 2018
Big Data Landscape 2018
 
Big Data 101
Big Data 101Big Data 101
Big Data 101
 
Forecast of Big Data Trends
Forecast of Big Data TrendsForecast of Big Data Trends
Forecast of Big Data Trends
 
BIG DATA & DATA ANALYTICS
BIG  DATA & DATA  ANALYTICSBIG  DATA & DATA  ANALYTICS
BIG DATA & DATA ANALYTICS
 
Tools for Unstructured Data Analytics
Tools for Unstructured Data AnalyticsTools for Unstructured Data Analytics
Tools for Unstructured Data Analytics
 
Importance of Big Data Analytics
Importance of Big Data AnalyticsImportance of Big Data Analytics
Importance of Big Data Analytics
 
Big data using Public Cloud
Big data using Public CloudBig data using Public Cloud
Big data using Public Cloud
 
Big data analytics use cases: all you need to know
Big data analytics use cases:  all you need to knowBig data analytics use cases:  all you need to know
Big data analytics use cases: all you need to know
 
Real time analytics of big data
Real time analytics of big dataReal time analytics of big data
Real time analytics of big data
 
Cloudant
CloudantCloudant
Cloudant
 
Big data analytics
Big data analyticsBig data analytics
Big data analytics
 
Apouc 2014-business-analytics-and-big-data
Apouc 2014-business-analytics-and-big-dataApouc 2014-business-analytics-and-big-data
Apouc 2014-business-analytics-and-big-data
 
Jobs Complexity
Jobs ComplexityJobs Complexity
Jobs Complexity
 
Big data case study collection
Big data   case study collectionBig data   case study collection
Big data case study collection
 
The Business of Big Data - IA Ventures
The Business of Big Data - IA VenturesThe Business of Big Data - IA Ventures
The Business of Big Data - IA Ventures
 
Introduction to Big Data
Introduction to Big DataIntroduction to Big Data
Introduction to Big Data
 
Big data Presentation
Big data PresentationBig data Presentation
Big data Presentation
 
Big Data on Public Cloud
Big Data on Public CloudBig Data on Public Cloud
Big Data on Public Cloud
 
Bigdata
Bigdata Bigdata
Bigdata
 

En vedette

The Astonishing Resurrection of AI (A Primer on Artificial Intelligence)
The Astonishing Resurrection of AI (A Primer on Artificial Intelligence)The Astonishing Resurrection of AI (A Primer on Artificial Intelligence)
The Astonishing Resurrection of AI (A Primer on Artificial Intelligence)Matt Turck
 
Hardware Startups: The VC Perspective
Hardware Startups: The VC PerspectiveHardware Startups: The VC Perspective
Hardware Startups: The VC PerspectiveMatt Turck
 
Sensors, Wearables and the Internet of Things: A Revolution in the Making
Sensors, Wearables and the Internet of Things: A Revolution in the MakingSensors, Wearables and the Internet of Things: A Revolution in the Making
Sensors, Wearables and the Internet of Things: A Revolution in the MakingMatt Turck
 
Building an AI Startup: Realities & Tactics
Building an AI Startup: Realities & TacticsBuilding an AI Startup: Realities & Tactics
Building an AI Startup: Realities & TacticsMatt Turck
 
NYC: A Natural Home for European Entrepreneurs
NYC: A Natural Home for European EntrepreneursNYC: A Natural Home for European Entrepreneurs
NYC: A Natural Home for European EntrepreneursMatt Turck
 
Seq2 seq learning
Seq2 seq learningSeq2 seq learning
Seq2 seq learningVu Pham
 
How to Interview a Data Scientist
How to Interview a Data ScientistHow to Interview a Data Scientist
How to Interview a Data ScientistDaniel Tunkelang
 
Big Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBig Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBernard Marr
 
Bett 2014 Learn Live Session - Big Data: School perspectives on what, how and...
Bett 2014 Learn Live Session - Big Data: School perspectives on what, how and...Bett 2014 Learn Live Session - Big Data: School perspectives on what, how and...
Bett 2014 Learn Live Session - Big Data: School perspectives on what, how and...FrogEducation
 
DIY IoT Backend
DIY IoT BackendDIY IoT Backend
DIY IoT BackendDiUS
 
Virtualized Network Services-An Overview of Alepo NFV Solutions
Virtualized Network Services-An Overview of Alepo NFV SolutionsVirtualized Network Services-An Overview of Alepo NFV Solutions
Virtualized Network Services-An Overview of Alepo NFV SolutionsAlepo
 
Target Holding - Big Dikes and Big Data
Target Holding - Big Dikes and Big DataTarget Holding - Big Dikes and Big Data
Target Holding - Big Dikes and Big DataFrens Jan Rumph
 
Not Provided - Search Marketing Thursday 7 November 2013
Not Provided - Search Marketing Thursday 7 November 2013Not Provided - Search Marketing Thursday 7 November 2013
Not Provided - Search Marketing Thursday 7 November 2013Martijn Scheijbeler
 
2016.07.21. 최신 소프트웨어 기술에 대한 이해
2016.07.21. 최신 소프트웨어 기술에 대한 이해 2016.07.21. 최신 소프트웨어 기술에 대한 이해
2016.07.21. 최신 소프트웨어 기술에 대한 이해 Chanjin Park
 
101 Internet of Things
101 Internet of Things 101 Internet of Things
101 Internet of Things Redweb Ltd
 

En vedette (20)

The Astonishing Resurrection of AI (A Primer on Artificial Intelligence)
The Astonishing Resurrection of AI (A Primer on Artificial Intelligence)The Astonishing Resurrection of AI (A Primer on Artificial Intelligence)
The Astonishing Resurrection of AI (A Primer on Artificial Intelligence)
 
Hardware Startups: The VC Perspective
Hardware Startups: The VC PerspectiveHardware Startups: The VC Perspective
Hardware Startups: The VC Perspective
 
Big data 101
Big data 101Big data 101
Big data 101
 
Sensors, Wearables and the Internet of Things: A Revolution in the Making
Sensors, Wearables and the Internet of Things: A Revolution in the MakingSensors, Wearables and the Internet of Things: A Revolution in the Making
Sensors, Wearables and the Internet of Things: A Revolution in the Making
 
Building an AI Startup: Realities & Tactics
Building an AI Startup: Realities & TacticsBuilding an AI Startup: Realities & Tactics
Building an AI Startup: Realities & Tactics
 
NYC: A Natural Home for European Entrepreneurs
NYC: A Natural Home for European EntrepreneursNYC: A Natural Home for European Entrepreneurs
NYC: A Natural Home for European Entrepreneurs
 
What is Big Data?
What is Big Data?What is Big Data?
What is Big Data?
 
Big data ppt
Big  data pptBig  data ppt
Big data ppt
 
Seq2 seq learning
Seq2 seq learningSeq2 seq learning
Seq2 seq learning
 
How to Interview a Data Scientist
How to Interview a Data ScientistHow to Interview a Data Scientist
How to Interview a Data Scientist
 
Big Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should KnowBig Data - 25 Amazing Facts Everyone Should Know
Big Data - 25 Amazing Facts Everyone Should Know
 
Bett 2014 Learn Live Session - Big Data: School perspectives on what, how and...
Bett 2014 Learn Live Session - Big Data: School perspectives on what, how and...Bett 2014 Learn Live Session - Big Data: School perspectives on what, how and...
Bett 2014 Learn Live Session - Big Data: School perspectives on what, how and...
 
DIY IOT
DIY IOTDIY IOT
DIY IOT
 
DIY IoT Backend
DIY IoT BackendDIY IoT Backend
DIY IoT Backend
 
Virtualized Network Services-An Overview of Alepo NFV Solutions
Virtualized Network Services-An Overview of Alepo NFV SolutionsVirtualized Network Services-An Overview of Alepo NFV Solutions
Virtualized Network Services-An Overview of Alepo NFV Solutions
 
Target Holding - Big Dikes and Big Data
Target Holding - Big Dikes and Big DataTarget Holding - Big Dikes and Big Data
Target Holding - Big Dikes and Big Data
 
Not Provided - Search Marketing Thursday 7 November 2013
Not Provided - Search Marketing Thursday 7 November 2013Not Provided - Search Marketing Thursday 7 November 2013
Not Provided - Search Marketing Thursday 7 November 2013
 
2016.07.21. 최신 소프트웨어 기술에 대한 이해
2016.07.21. 최신 소프트웨어 기술에 대한 이해 2016.07.21. 최신 소프트웨어 기술에 대한 이해
2016.07.21. 최신 소프트웨어 기술에 대한 이해
 
101 Internet of Things
101 Internet of Things 101 Internet of Things
101 Internet of Things
 
IoT - Quick Look
IoT - Quick LookIoT - Quick Look
IoT - Quick Look
 

Similaire à Big Data, Big Deal? (A Big Data 101 presentation)

Introduction to Big Data An analogy between Sugar Cane & Big Data
Introduction to Big Data An analogy  between Sugar Cane & Big DataIntroduction to Big Data An analogy  between Sugar Cane & Big Data
Introduction to Big Data An analogy between Sugar Cane & Big DataJean-Marc Desvaux
 
Customer summit - big data (final)
Customer summit  - big data (final)Customer summit  - big data (final)
Customer summit - big data (final)Anand Deshpande
 
01 im overview high level
01 im overview high level01 im overview high level
01 im overview high levelJames Findlay
 
Data analysis trend 2015 2016 v071
Data analysis trend 2015 2016 v071Data analysis trend 2015 2016 v071
Data analysis trend 2015 2016 v071Chun Myung Kyu
 
Big Data and Implications on Platform Architecture
Big Data and Implications on Platform ArchitectureBig Data and Implications on Platform Architecture
Big Data and Implications on Platform ArchitectureOdinot Stanislas
 
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)
Ibm big data    hadoop summit 2012 james kobielus final 6-13-12(1)Ibm big data    hadoop summit 2012 james kobielus final 6-13-12(1)
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)Ajay Ohri
 
Big Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the FutureBig Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the FutureOdinot Stanislas
 
Big Data = Big Decisions
Big Data = Big DecisionsBig Data = Big Decisions
Big Data = Big DecisionsInnoTech
 
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)Mark Heid
 
What is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use CasesWhat is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use CasesTony Pearson
 
Sample Paper.doc.doc
Sample Paper.doc.docSample Paper.doc.doc
Sample Paper.doc.docbutest
 
The Comprehensive Approach: A Unified Information Architecture
The Comprehensive Approach: A Unified Information ArchitectureThe Comprehensive Approach: A Unified Information Architecture
The Comprehensive Approach: A Unified Information ArchitectureInside Analysis
 
Big Data For Investment Research Management
Big Data For Investment Research ManagementBig Data For Investment Research Management
Big Data For Investment Research ManagementIDT Partners
 
An Encyclopedic Overview Of Big Data Analytics
An Encyclopedic Overview Of Big Data AnalyticsAn Encyclopedic Overview Of Big Data Analytics
An Encyclopedic Overview Of Big Data AnalyticsAudrey Britton
 
Social Business in a World of Abundant Real-time Data
Social Business in a World of Abundant Real-time DataSocial Business in a World of Abundant Real-time Data
Social Business in a World of Abundant Real-time DataLee Bryant
 

Similaire à Big Data, Big Deal? (A Big Data 101 presentation) (20)

Introduction to Big Data An analogy between Sugar Cane & Big Data
Introduction to Big Data An analogy  between Sugar Cane & Big DataIntroduction to Big Data An analogy  between Sugar Cane & Big Data
Introduction to Big Data An analogy between Sugar Cane & Big Data
 
Customer summit - big data (final)
Customer summit  - big data (final)Customer summit  - big data (final)
Customer summit - big data (final)
 
01 im overview high level
01 im overview high level01 im overview high level
01 im overview high level
 
Data analysis trend 2015 2016 v071
Data analysis trend 2015 2016 v071Data analysis trend 2015 2016 v071
Data analysis trend 2015 2016 v071
 
Big Data and Implications on Platform Architecture
Big Data and Implications on Platform ArchitectureBig Data and Implications on Platform Architecture
Big Data and Implications on Platform Architecture
 
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)
Ibm big data    hadoop summit 2012 james kobielus final 6-13-12(1)Ibm big data    hadoop summit 2012 james kobielus final 6-13-12(1)
Ibm big data hadoop summit 2012 james kobielus final 6-13-12(1)
 
Big Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the FutureBig Data Beyond Hadoop*: Research Directions for the Future
Big Data Beyond Hadoop*: Research Directions for the Future
 
Big Data = Big Decisions
Big Data = Big DecisionsBig Data = Big Decisions
Big Data = Big Decisions
 
Barak regev
Barak regevBarak regev
Barak regev
 
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)
Big Data Meets Social Analytics - IBM Connect 2012 (CN-CC13)
 
What is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use CasesWhat is big data - Architectures and Practical Use Cases
What is big data - Architectures and Practical Use Cases
 
Sample Paper.doc.doc
Sample Paper.doc.docSample Paper.doc.doc
Sample Paper.doc.doc
 
The New Enterprise Data Platform
The New Enterprise Data PlatformThe New Enterprise Data Platform
The New Enterprise Data Platform
 
Big data-ppt-
Big data-ppt-Big data-ppt-
Big data-ppt-
 
Data mining
Data miningData mining
Data mining
 
IBM Stream au Hadoop User Group
IBM Stream au Hadoop User GroupIBM Stream au Hadoop User Group
IBM Stream au Hadoop User Group
 
The Comprehensive Approach: A Unified Information Architecture
The Comprehensive Approach: A Unified Information ArchitectureThe Comprehensive Approach: A Unified Information Architecture
The Comprehensive Approach: A Unified Information Architecture
 
Big Data For Investment Research Management
Big Data For Investment Research ManagementBig Data For Investment Research Management
Big Data For Investment Research Management
 
An Encyclopedic Overview Of Big Data Analytics
An Encyclopedic Overview Of Big Data AnalyticsAn Encyclopedic Overview Of Big Data Analytics
An Encyclopedic Overview Of Big Data Analytics
 
Social Business in a World of Abundant Real-time Data
Social Business in a World of Abundant Real-time DataSocial Business in a World of Abundant Real-time Data
Social Business in a World of Abundant Real-time Data
 

Dernier

Digital Tools & AI in Career Development
Digital Tools & AI in Career DevelopmentDigital Tools & AI in Career Development
Digital Tools & AI in Career DevelopmentMahmoud Rabie
 
Infrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsInfrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsYoss Cohen
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...panagenda
 
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)Mark Simos
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
React JS; all concepts. Contains React Features, JSX, functional & Class comp...
React JS; all concepts. Contains React Features, JSX, functional & Class comp...React JS; all concepts. Contains React Features, JSX, functional & Class comp...
React JS; all concepts. Contains React Features, JSX, functional & Class comp...Karmanjay Verma
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...itnewsafrica
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...Jeffrey Haguewood
 
All These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDFAll These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDFMichael Gough
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkPixlogix Infotech
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Strongerpanagenda
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfpanagenda
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesManik S Magar
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality AssuranceInflectra
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Microservices, Docker deploy and Microservices source code in C#
Microservices, Docker deploy and Microservices source code in C#Microservices, Docker deploy and Microservices source code in C#
Microservices, Docker deploy and Microservices source code in C#Karmanjay Verma
 
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...BookNet Canada
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 

Dernier (20)

Digital Tools & AI in Career Development
Digital Tools & AI in Career DevelopmentDigital Tools & AI in Career Development
Digital Tools & AI in Career Development
 
Infrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platformsInfrared simulation and processing on Nvidia platforms
Infrared simulation and processing on Nvidia platforms
 
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
Why device, WIFI, and ISP insights are crucial to supporting remote Microsoft...
 
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
Tampa BSides - The No BS SOC (slides from April 6, 2024 talk)
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
React JS; all concepts. Contains React Features, JSX, functional & Class comp...
React JS; all concepts. Contains React Features, JSX, functional & Class comp...React JS; all concepts. Contains React Features, JSX, functional & Class comp...
React JS; all concepts. Contains React Features, JSX, functional & Class comp...
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...Abdul Kader Baba- Managing Cybersecurity Risks  and Compliance Requirements i...
Abdul Kader Baba- Managing Cybersecurity Risks and Compliance Requirements i...
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
Email Marketing Automation for Bonterra Impact Management (fka Social Solutio...
 
All These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDFAll These Sophisticated Attacks, Can We Really Detect Them - PDF
All These Sophisticated Attacks, Can We Really Detect Them - PDF
 
React Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App FrameworkReact Native vs Ionic - The Best Mobile App Framework
React Native vs Ionic - The Best Mobile App Framework
 
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better StrongerModern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
Modern Roaming for Notes and Nomad – Cheaper Faster Better Stronger
 
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdfSo einfach geht modernes Roaming fuer Notes und Nomad.pdf
So einfach geht modernes Roaming fuer Notes und Nomad.pdf
 
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotesMuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
MuleSoft Online Meetup Group - B2B Crash Course: Release SparkNotes
 
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance[Webinar] SpiraTest - Setting New Standards in Quality Assurance
[Webinar] SpiraTest - Setting New Standards in Quality Assurance
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Microservices, Docker deploy and Microservices source code in C#
Microservices, Docker deploy and Microservices source code in C#Microservices, Docker deploy and Microservices source code in C#
Microservices, Docker deploy and Microservices source code in C#
 
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
Transcript: New from BookNet Canada for 2024: BNC SalesData and LibraryData -...
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 

Big Data, Big Deal? (A Big Data 101 presentation)

  • 1. Big data, big deal? February 2013 Matt Turck Twitter: @mattturck Blog: http://mattturck.com
  • 2. Background: I prepared this slide deck for a couple of “Big Data 101” guest lectures I did in February 2012 at New York University’s Stern School of Business and at The New School. They’re intended for a college level, non technical audience, as a first exposure to Big Data and related concepts. I have re-used a number of stats, graphics, cartoons and other materials freely available on the internet. Thanks to the authors of those materials.
  • 3. What does Target know about pregnant women?
  • 4. Hype Data is… "the new gold” “the new black” “the new plastic” "the new oil” “the new frontier”
  • 5. Isn’t it what computers have always done?
  • 6. What’s different this time? Volume. Variety. Velocity.
  • 7.
  • 8. Facebook warehouses 180 petabytes of data a year
  • 9. Twitter manages 1.2 million deliveries per second
  • 11. Twitter manages 1.2 million deliveries per second
  • 13. Big data is data that exceeds the processing capacity of conventional database systems. The data is too big, moves too fast, or doesn’t fit the strictures of your database architectures. To gain value from this data, you must choose an alternative way to process it. Edd Dumbill, O’Reilly
  • 14. A new breed of technologies
  • 15. Big Data Landscape Infrastructure Analytics Applications NoSQL Databases Hadoop Related Analytics Solutions Data Visualization Ad Optimization Publisher Marketing NewSQL Databases Statistical Computing Tools Social Media MPP Databases Management / Cluster Services Industry Applications Monitoring Sentiment Analysis Analytics Services Security Application Service Providers Location / People / Big Data Search Events Storage IT Analytics Data Sources Crowdsourcing Data Data Sources Collection / Real- Crowdsourced SMB Analytics Marketplaces Transport Time Analytics Cross Infrastructure / Analytics Personal Data Open Source Projects Framework Query / Data Data Access Coordination / Real - Statistical Machine Cloud Flow Workflow Time Tools Learning Deployment Matt Turck (@mattturck) and Shivon Zilis (@shivonz)
  • 16. A new breed of people: Data scientists engineering math nerds nerds nerds nerds comp sci hacking awesome nerds Credit: Hilary Mason, Bitly
  • 17. Sexy nerds? “Data Scientist: The Sexiest Job of the 21st Century” October 2012
  • 19. Terms worth remembering Structured vs. unstructured data Hadoop Cloud computing Data visualization Machine learning Predictive analytics
  • 20. So what do you do with all that technology?
  • 27. Music
  • 29. But what about small data?
  • 31. Nate Silver is (relatively) small data
  • 32. Most companies only have small data
  • 33. It’s not about big data for the sake of big data
  • 34. Data-driven management “In God we trust. Everyone else, bring data”
  • 36. Easier than ever for any business to be truly data-driven
  • 37. Thanks! Learn more: NYC Data Business Meetup meetup.com/NYC-Data-Business-Meetup/

Notes de l'éditeur

  1. This is going to be a talk for people who love the internet.
  2. The true story of bitly, engineering, data science, loveHow to do data science at scaleBuilding teams and keeping people happyClever tricks
  3. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  4. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  5. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  6. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  7. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  8. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  9. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  10. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  11. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  12. Asking questions.
  13. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  14. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  15. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  16. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  17. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  18. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  19. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  20. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  21. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  22. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  23. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  24. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  25. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  26. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  27. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  28. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  29. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  30. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  31. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.
  32. Very different perspective, we have constrained resources, short time, and an expectation that what we do is relevant to the real world in some way.We build the system on this data, and then scale it for production use.