Hadoop & The New Analytics
Casey Kiernan
Sr. Director / Data Architecture - Shopzilla.com
November 1, 2011
Agenda
The New “Data”
The New Business Model
The New Analytic Scenarios
The New Analytic Architectures
The New Analytic Technologies
And, Yes… The New Data-Center




The World as I See It
SERVICES • SOA • JSON • AVRO • APPLICATIONS • HTML • JAVA • C# • THE CLOUD • HADOOP • OLAP
PYTHON • SITES • TENANT • ORG • SSN
FINANCE • ID • CUSTOMER • EXPERIAN
SQL SERVER • ORACLE • UNIX • SUBVERSION • COMPLIANCE • SECURITY • SALESFORCE • MYSQL
My Mountain Bike as a Data Platform
Data Collection
    Heart Rate
    Altitude
    Temperature
    Time
    Speed / Trip Miles
    Cadence / RPM
Guidance
    Performance
    Rate of Climb
    Calories Burned
    Miles Obtained
    Total Climbed
    Elapsed Time
    Current, Average, Max Values
                 Data Architecture - on a Local Wireless Network (ANT+ Protocol)
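
The step from collected readings to guidance on the bike slide can be sketched in a few lines of Python. This is only an illustration: the `Sample` fields, the units, and the sample ride are assumptions, not data from the talk, and nothing here speaks the ANT+ protocol.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class Sample:
    """One reading from the bike's sensors (field names are hypothetical)."""
    t_s: float         # elapsed time in seconds
    heart_bpm: int     # heart rate
    altitude_m: float  # altitude in metres
    speed_kmh: float   # speed


def summarize(values):
    """Return the current, average, and max of a series of readings."""
    return {"current": values[-1], "average": mean(values), "max": max(values)}


def rate_of_climb_m_per_min(samples):
    """Net altitude change between the first and last sample, per minute."""
    first, last = samples[0], samples[-1]
    minutes = (last.t_s - first.t_s) / 60
    return (last.altitude_m - first.altitude_m) / minutes


def total_climbed_m(samples):
    """Sum of the positive altitude changes only (descents are ignored)."""
    pairs = zip(samples, samples[1:])
    return sum(max(0.0, b.altitude_m - a.altitude_m) for a, b in pairs)


# Made-up ride: one sample per minute.
ride = [
    Sample(0, 110, 500.0, 12.0),
    Sample(60, 135, 512.0, 10.0),
    Sample(120, 150, 530.0, 8.0),
    Sample(180, 140, 524.0, 20.0),
]

heart = summarize([s.heart_bpm for s in ride])
climb_rate = rate_of_climb_m_per_min(ride)
climbed = total_climbed_m(ride)
```

The raw samples are the "data collection" side; `heart`, `climb_rate`, and `climbed` are the derived values a rider would actually look at.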
“Business” Analytics




    BUSINESS INTELLIGENCE.
    DATA WAREHOUSE/OLAP.
          OLTP DATA.

                       What are our most profitable Movie titles?
“Business” Analytics




         What did Happen?                 What will Happen?

         Operational Reporting           Tactical Analytics     Strategic

                   Months   Weeks      Weeks   Months     Years




“Personal” Analytics



              SELF-SERVICE.
               GUIDANCE.
              BEHAVIOURS.
                       What Movie should I watch tonight?
“Personal” Analytics


                       What is Happening NOW?



         What did Happen?                 What will Happen?

           Historical Behaviors          Tactical Analytics     Strategic

                   Months   Weeks      Weeks   Months     Years




Meaningful
GUIDANCE




                             Massive
           DATA COLLECTION


“Business” Analytics




   OLTP App, Orders App, FIN App
   Staging
   Data Warehouse (OLTP to OLAP Mapping)
   OLAP / Reports
   Business Analyst




                                 What are our most profitable Movie titles?

“Personal” Analytics



                          End User
                          Application


                       Data      Analytics




                                What Movie should I watch tonight?

End-User Experience
    Browser, Tablet, Mobile,…
    Self-Service Application
    Personalization, Preferences, State
    Personalized Recommendations
Persistence/Analytics
    App Persistence: “State” Persistence
    Analytics: “Read” Performance
Big Data
    Behaviors / “Write” Performance




                                     “Personal Analytics” Data Architecture
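
One way to read the three layers of this architecture is as three stores with different access patterns. The sketch below is a toy model in plain Python, not the speaker's implementation: the in-memory structures, function names, and the popularity-based recommender are all assumptions made for illustration.

```python
from collections import Counter, defaultdict

behavior_log = []      # "Big Data": append-only, optimized for writes
recommendations = {}   # "Analytics": precomputed, optimized for reads
app_state = {}         # "App Persistence": per-user state and preferences


def record_view(user, movie):
    """Write path: append the behavior and update the user's state."""
    behavior_log.append((user, movie))
    app_state[user] = {"last_viewed": movie}


def rebuild_recommendations(top_n=2):
    """Batch path: recommend the most-viewed movies a user has not seen."""
    popularity = Counter(movie for _, movie in behavior_log)
    ranked = sorted(popularity, key=lambda m: (-popularity[m], m))
    seen = defaultdict(set)
    for user, movie in behavior_log:
        seen[user].add(movie)
    for user, watched in seen.items():
        recommendations[user] = [m for m in ranked if m not in watched][:top_n]


def recommend(user):
    """Read path: a single key lookup, no computation at request time."""
    return recommendations.get(user, [])


for user, movie in [
    ("ann", "Alien"), ("bob", "Alien"), ("bob", "Brazil"),
    ("cy", "Alien"), ("cy", "Brazil"), ("cy", "Casablanca"),
]:
    record_view(user, movie)

rebuild_recommendations()
```

Answering "What Movie should I watch tonight?" then costs one lookup, `recommend("ann")`, because the heavy work happened in the batch step.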

RDBMS
Highly Structured Environment
Formalized intake process
ACID Transactional Semantics
Target Scenario – OLTP R/W
High Level Query Syntax - SQL


Hadoop
Late-Binding Structures
Non-Formal Intake (“Copy”)
Minimal Transaction Semantics
Target Scenario - Writers
Procedural Query Syntax - MapReduce
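
The contrast between a high-level query syntax and a procedural one can be made concrete with the deck's own question, "What are our most profitable Movie titles?". The sketch below answers it twice in Python: once with SQL (using the standard-library `sqlite3` module as a stand-in RDBMS) and once with hand-written map, shuffle, and reduce steps. The order records are made up, and the MapReduce half only imitates the programming model in a single process; it does not use the Hadoop API.

```python
import sqlite3
from collections import defaultdict

# Hypothetical order records: (movie title, profit in dollars).
orders = [
    ("Alien", 4), ("Brazil", 2), ("Alien", 3), ("Casablanca", 5), ("Brazil", 1),
]

# High-level: state the result wanted and let the engine plan the work.
db = sqlite3.connect(":memory:")
db.execute("CREATE TABLE orders (title TEXT, profit INTEGER)")
db.executemany("INSERT INTO orders VALUES (?, ?)", orders)
sql_result = db.execute(
    "SELECT title, SUM(profit) AS total FROM orders "
    "GROUP BY title ORDER BY total DESC, title"
).fetchall()


# Procedural: spell out the map, shuffle, and reduce steps by hand.
def map_phase(record):
    """Emit one (key, value) pair per input record."""
    title, profit = record
    yield title, profit


def shuffle(pairs):
    """Group all values by key, as the framework would between phases."""
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return groups


def reduce_phase(key, values):
    """Collapse the values for one key into a single total."""
    return key, sum(values)


mapped = [pair for record in orders for pair in map_phase(record)]
reduced = [reduce_phase(k, v) for k, v in shuffle(mapped).items()]
mr_result = sorted(reduced, key=lambda kv: (-kv[1], kv[0]))
```

Both paths produce the same ranking; the difference is how much of the execution the author of the query has to write.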


The New Technology Stack
Specialization / Individual Scalability / Late-Binding - for each component

Technology              Data Warehousing            New Analytics
Analytics               OLAP                        OLAP + Open-Source
Data Movement           ETL Tool                    MapReduce
SQL                     RDBMS                       Hive
Schema Metadata         RDBMS                       JSON / AVRO
Indexing (Readers)      RDBMS                       HBase
RI                      RDBMS                       Application Logic
App Store (Objects)     RDBMS                       Key/Value - Cassandra,…
Schema / Columns        RDBMS                       Column Families / Dynamic
Logs (Writers)          RDBMS                       Scalable - Hadoop
Infrastructure          Data-Center                 Cloud
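
"Late-binding" in the table above means the structure is applied when data is read rather than when it is written. A minimal Python sketch of that idea follows, using plain JSON lines and a reader-side schema. The field names, defaults, and `bind` helper are hypothetical, and this is not the Avro API; it only mimics the idea of a reader resolving records against its own schema.

```python
import json

# Raw log lines are copied in as-is; no schema is enforced at write time.
raw_log = [
    '{"user": "u1", "movie": "Alien", "rating": 5}',
    '{"user": "u2", "movie": "Brazil"}',
    '{"user": "u3", "movie": "Alien", "rating": "4", "device": "tablet"}',
]

# Hypothetical reader-side schema: field name -> (type, default).
READER_SCHEMA = {"user": (str, ""), "movie": (str, ""), "rating": (int, 0)}


def bind(line, schema):
    """Apply the schema when the record is read, not when it is written.

    Missing fields take the default, present fields are cast to the
    declared type, and fields the reader did not ask for are dropped.
    """
    record = json.loads(line)
    return {
        name: cast(record[name]) if name in record else default
        for name, (cast, default) in schema.items()
    }


rows = [bind(line, READER_SCHEMA) for line in raw_log]
```

A different reader could bind the same raw lines with a different schema (for example, one that keeps `device`) without rewriting the stored data.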




End-User Experience
    Browser, Tablet, Mobile,…
    Self-Service Application
    Personalization, Preferences, State
    Personalized Recommendations
Persistence/Analytics (Data-Center or Cloud)
    App Persistence: Cassandra (JSON)
    Analytics: HBase (Column-Families)
MapReduce
Big Data
    Hadoop (AVRO)
    SQL: Hive




                                         Specialization of Data Technologies

Personal Analytics + Business Intelligence




   App, OLTP App, OLTP App
   Staging
   Data Warehouse (OLTP to OLAP Mapping)
   OLAP / Reports
   Business Analyst




Contact Information
If you have further questions or comments:



                        Casey Kiernan
                Sr. Director / Data Architecture
                         Shopzilla.com
                 casey.kiernan@hotmail.com
              BLOG: www.the-data-platform.com




A recent ride in AZ

2011 - TDWI Big Data Forum - The New Analytics
