SlideShare une entreprise Scribd logo
1  sur  26
Télécharger pour lire hors ligne
Big Data Wonderland:
Two Views on the Big Data Revolution
Mark Madsen
Third Nature, Inc.
mark@thirdnature.net
@markmadsen
Marc Demarest
Noumenal, Inc.
marc@noumenal.com
Strata Santa Clara
February 2013
2 Third Nature, Inc. || Noumenal, Inc.
Preamble
Twenty Years On
• We came up together in this industry
in the early 1990s, as pointy-headed
advocates of star schema design,
trained by the deity himself, Ralph
Kimball
• Back then, it was a simpler
world...big iron, big DBMS, hand-
coded ETL, star schema, a
thousand rinky-dink query tools
• Mostly, conversation was
dominated by ETL and schema
design
• “There will never be a decisional
database larger than 10 GB...”
St. Ralph
Our Alma Mater
3 Third Nature, Inc. || Noumenal, Inc.
Preamble
Twenty Years On
• Twenty years on, we find ourselves
with opposing view on what is either
the biggest con, or the biggest sea-
change, in our data warehousing
odyssey
• Question: Is the big data revolution
big, or a revolution?
• Question: do we have to change? and
if so, how?
• Not a round table. A slugfest....
Demarest as
Shana Alexander?
Madsen as
Jack Kilpatrick?
4 Third Nature, Inc. || Noumenal, Inc.
Regular Programming Is Suspended
Demarest Madsen
5 Third Nature, Inc. || Noumenal, Inc.
Compromise
Demarest Madsen
You take the blue pill.
The story ends, you
wake up in your bed
and believe whatever
you want to believe.
You take the red pill,
you stay in Wonderland,
and I show you how
deep the rabbit hole
goes.
Remember, all I am
offering is the truth:
nothing more.
6 Third Nature, Inc. || Noumenal, Inc.
The Issues
1. Data As A Factor of Production
RED BLUE
Amen.
This change has been
in process for more
than a decade. Social
media leads the way,
but we’re all affected.
7 Third Nature, Inc. || Noumenal, Inc.
The Issues
1. Data As A Factor of Production
RED BLUE
Amen.
This change has been
in process for more
than a decade. Social
media leads the way,
but we’re all affected.
Hype.
For most companies,
data remains an
asset, but not a factor
in the production of its
products or services.
8 Third Nature, Inc. || Noumenal, Inc.
The Issues
2. The Reality of Big Data
RED BLUE
Few companies
transformed.
No quantification of
benefits, right now.
Leverage? Maybe.
9 Third Nature, Inc. || Noumenal, Inc.
The Issues
2. The Reality of Big Data
RED BLUE
No company escapes.
Text, social, sensors,
streaming -- the
instrumentation of the
real world transforms
company decision-
making processes.
Few companies
transformed.
No quantification of
benefits, right now.
Leverage? Maybe.
10 Third Nature, Inc. || Noumenal, Inc.
The Issues
3. Merchant DBMSs
RED BLUE
Increasingly irrelevant.
We’ve been over-
structured and under-
resourced for 20
years.
CSV is still the
international standard.
11 Third Nature, Inc. || Noumenal, Inc.
The Issues
3. Merchant DBMSs
RED BLUE
Increasingly irrelevant.
We’ve been over-
structured and under-
resourced for 20
years.
CSV is still the
international standard.
Will rise to the
challenge.
Any worthwhile
innovation will be
absorbed by the
merchant DBMS
players.
12 Third Nature, Inc. || Noumenal, Inc.
The Issues
4. Query, Reporting & Dashboarding Tools
RED BLUE
Will rise to the
challenge.
We have two
generations of
analysts trained to
feed using these tools.
13 Third Nature, Inc. || Noumenal, Inc.
The Issues
4. Query, Reporting & Dashboarding Tools
RED BLUE
Ineffective, now and in
the future.
Can’t do real-time,
can’t visualize large
data sets, can’t
support discovery and
exploration.
Will rise to the
challenge.
We have two
generations of
analysts trained to
feed using these tools.
14 Third Nature, Inc. || Noumenal, Inc.
The Issues
5. The Commodity Hardware Revolution & Radical Scale-Out
RED BLUE
The new topology.
Cheap compute,
unintelligent direct-
attach storage and
free comms make
large scale-out grids
the future.
15 Third Nature, Inc. || Noumenal, Inc.
The Issues
5. The Commodity Hardware Revolution & Radical Scale-Out
RED BLUE
The new topology.
Cheap compute,
unintelligent direct-
attach storage and
free comms make
large scale-out grids
the future.
The current topology
is alive and well.
These commodity
building blocks are,
after all, just SMP
platforms.
16 Third Nature, Inc. || Noumenal, Inc.
The Issues
6. Structured Query Language
RED BLUE
Tried-and-True.
Powerful, expressive
language for complex
analytical problems.
That’s why the noSQL
vendors reinvent it all
the time.
17 Third Nature, Inc. || Noumenal, Inc.
The Issues
6. Structured Query Language
RED BLUE
Toast.
Too complex, too hard
to code, too hard to
debug. A way of
ensuring dependency
on merchant DBMSs.
Tried-and-True.
Powerful, expressive
language for complex
analytical problems.
That’s why the noSQL
vendors reinvent it all
the time.
18 Third Nature, Inc. || Noumenal, Inc.
The Issues
7. New Programming Models
RED BLUE
Say hello to Pig.
New analytical
problems (decisioning,
discovery, exploration)
require new
languages, new tools
and new programming
models.
19 Third Nature, Inc. || Noumenal, Inc.
The Issues
7. New Programming Models
RED BLUE
Say hello to Pig.
New analytical
problems (decisioning,
discovery, exploration)
require new
languages, new tools
and new programming
models.
Say hello to Java.
Open source doesn’t
mean free. Or easy.
The skills gap here is
huge. And there are
few truly new
analytical problems.
20 Third Nature, Inc. || Noumenal, Inc.
The Issues
8. Conventional DW Architecture
RED BLUE
Perfectly viable.
No need to change
anything. Some new
technologies may play
roles in the existing
architecture, but we’re
good to go, generally.
21 Third Nature, Inc. || Noumenal, Inc.
The Issues
8. Conventional DW Architecture
RED BLUE
A relic.
Overly complex.
Difficult to implement.
Controlled by the
supply side of the
market, anyway.
Perfectly viable.
No need to change
anything. Some new
technologies may play
roles in the existing
architecture, but we’re
good to go, generally.
22 Third Nature, Inc. || Noumenal, Inc.
The Issues
9. The Cloud
RED BLUE
We all go there.
Most of the interesting
data is there; it’s more
effective to move our
data, and our
analyses, to where the
data is, already.
23 Third Nature, Inc. || Noumenal, Inc.
The Issues
9. The Cloud
RED BLUE
We all go there.
Most of the interesting
data is there; it’s more
effective to move our
data, and our
analyses, to where the
data is, already.
Don’t go there.
Public cloud security
is an oxymoron.
Your inside-the-firewall
apps remain the core
information asset.
24 Third Nature, Inc. || Noumenal, Inc.
The Issues
10. New Technologies
RED BLUE
Distract Us.
We’ve already seen
what best-of-breed
gives us: a circus.
25 Third Nature, Inc. || Noumenal, Inc.
The Issues
10. New Technologies
RED BLUE
Save Us.
Best of breed
integration led by in-
house designers ins
back, with a
vengeance.
Distract Us.
We’ve already seen
what best-of-breed
gives us: a circus.
26 Third Nature, Inc. || Noumenal, Inc.
What We Really Think
1. Data As A Factor of Production
2. The Reality of Big Data
3. Merchant DBMSs
4. Query, Reporting & Dashboarding Tools
5. The Commodity Hardware Revolution
6. Structured Query Language
7. New Programming Models
8. Conventional DW Architecture
9. The Cloud
10. New Technologies

Contenu connexe

Similaire à Big Data Wonderland: Two Views on the Big Data Revolution

The biggest data centre decision it decision makers will ever have to make by...
The biggest data centre decision it decision makers will ever have to make by...The biggest data centre decision it decision makers will ever have to make by...
The biggest data centre decision it decision makers will ever have to make by...Jonathan Blain
 
CSE Conference Keynote
CSE Conference KeynoteCSE Conference Keynote
CSE Conference KeynoteSrinath Perera
 
Everything Has Changed Except Us: Modernizing the Data Warehouse
Everything Has Changed Except Us: Modernizing the Data WarehouseEverything Has Changed Except Us: Modernizing the Data Warehouse
Everything Has Changed Except Us: Modernizing the Data Warehousemark madsen
 
DEEP Scott Killoh-3
DEEP Scott Killoh-3DEEP Scott Killoh-3
DEEP Scott Killoh-3jonobermeyer
 
Soderstrom
SoderstromSoderstrom
SoderstromNASAPMC
 
Mobile Monday (October 2014) - Riding Global Tech Trends
Mobile Monday (October 2014) - Riding Global Tech TrendsMobile Monday (October 2014) - Riding Global Tech Trends
Mobile Monday (October 2014) - Riding Global Tech TrendsMobile Monday Yangon
 
Big Data LDN 2017: Deep Learning Demystified
Big Data LDN 2017: Deep Learning DemystifiedBig Data LDN 2017: Deep Learning Demystified
Big Data LDN 2017: Deep Learning DemystifiedMatt Stubbs
 
Mind and the machine
Mind and the machineMind and the machine
Mind and the machineTNS
 
The changing nature of things: getting ready for a connected world.
The changing nature of things: getting ready for a connected world.The changing nature of things: getting ready for a connected world.
The changing nature of things: getting ready for a connected world.Alexandra Deschamps-Sonsino
 
Enabling the digital business
Enabling the digital businessEnabling the digital business
Enabling the digital businessDaisy Group
 
Dark Side of Cloud Adoption: People and Organizations Unable to Adapt and Imp...
Dark Side of Cloud Adoption: People and Organizations Unable to Adapt and Imp...Dark Side of Cloud Adoption: People and Organizations Unable to Adapt and Imp...
Dark Side of Cloud Adoption: People and Organizations Unable to Adapt and Imp...Dana Gardner
 
Big data and the data quality imperative
Big data and the data quality imperativeBig data and the data quality imperative
Big data and the data quality imperativeTrillium Software
 
Big Data By Vijay Bhaskar Semwal
Big Data By Vijay Bhaskar SemwalBig Data By Vijay Bhaskar Semwal
Big Data By Vijay Bhaskar SemwalIIIT Allahabad
 
Big Data and Bad Analogies
Big Data and Bad AnalogiesBig Data and Bad Analogies
Big Data and Bad Analogiesmark madsen
 
Using AI to Solve Data and IT Complexity -- And Better Enable AI
Using AI to Solve Data and IT Complexity -- And Better Enable AIUsing AI to Solve Data and IT Complexity -- And Better Enable AI
Using AI to Solve Data and IT Complexity -- And Better Enable AIDana Gardner
 

Similaire à Big Data Wonderland: Two Views on the Big Data Revolution (20)

Big data wonderland
Big data wonderlandBig data wonderland
Big data wonderland
 
Horse meat or beef? (3) D Murphy, National Grid, 21/3/13
Horse meat or beef? (3) D Murphy, National Grid, 21/3/13Horse meat or beef? (3) D Murphy, National Grid, 21/3/13
Horse meat or beef? (3) D Murphy, National Grid, 21/3/13
 
The biggest data centre decision it decision makers will ever have to make by...
The biggest data centre decision it decision makers will ever have to make by...The biggest data centre decision it decision makers will ever have to make by...
The biggest data centre decision it decision makers will ever have to make by...
 
CSE Conference Keynote
CSE Conference KeynoteCSE Conference Keynote
CSE Conference Keynote
 
Everything Has Changed Except Us: Modernizing the Data Warehouse
Everything Has Changed Except Us: Modernizing the Data WarehouseEverything Has Changed Except Us: Modernizing the Data Warehouse
Everything Has Changed Except Us: Modernizing the Data Warehouse
 
DEEP Scott Killoh-3
DEEP Scott Killoh-3DEEP Scott Killoh-3
DEEP Scott Killoh-3
 
Soderstrom
SoderstromSoderstrom
Soderstrom
 
Mobile Monday (October 2014) - Riding Global Tech Trends
Mobile Monday (October 2014) - Riding Global Tech TrendsMobile Monday (October 2014) - Riding Global Tech Trends
Mobile Monday (October 2014) - Riding Global Tech Trends
 
Big Data LDN 2017: Deep Learning Demystified
Big Data LDN 2017: Deep Learning DemystifiedBig Data LDN 2017: Deep Learning Demystified
Big Data LDN 2017: Deep Learning Demystified
 
Mind and the machine
Mind and the machineMind and the machine
Mind and the machine
 
The changing nature of things: getting ready for a connected world.
The changing nature of things: getting ready for a connected world.The changing nature of things: getting ready for a connected world.
The changing nature of things: getting ready for a connected world.
 
Fundamentals of Big Data
Fundamentals of Big DataFundamentals of Big Data
Fundamentals of Big Data
 
Enabling the digital business
Enabling the digital businessEnabling the digital business
Enabling the digital business
 
Tim willoughby
Tim willoughbyTim willoughby
Tim willoughby
 
Dark Side of Cloud Adoption: People and Organizations Unable to Adapt and Imp...
Dark Side of Cloud Adoption: People and Organizations Unable to Adapt and Imp...Dark Side of Cloud Adoption: People and Organizations Unable to Adapt and Imp...
Dark Side of Cloud Adoption: People and Organizations Unable to Adapt and Imp...
 
Big data and the data quality imperative
Big data and the data quality imperativeBig data and the data quality imperative
Big data and the data quality imperative
 
Big Data By Vijay Bhaskar Semwal
Big Data By Vijay Bhaskar SemwalBig Data By Vijay Bhaskar Semwal
Big Data By Vijay Bhaskar Semwal
 
Big Data and Bad Analogies
Big Data and Bad AnalogiesBig Data and Bad Analogies
Big Data and Bad Analogies
 
Using AI to Solve Data and IT Complexity -- And Better Enable AI
Using AI to Solve Data and IT Complexity -- And Better Enable AIUsing AI to Solve Data and IT Complexity -- And Better Enable AI
Using AI to Solve Data and IT Complexity -- And Better Enable AI
 
Big Data RF
Big Data RFBig Data RF
Big Data RF
 

Plus de mark madsen

Data Architecture: OMG It’s Made of People
Data Architecture: OMG It’s Made of PeopleData Architecture: OMG It’s Made of People
Data Architecture: OMG It’s Made of Peoplemark madsen
 
Solve User Problems: Data Architecture for Humans
Solve User Problems: Data Architecture for HumansSolve User Problems: Data Architecture for Humans
Solve User Problems: Data Architecture for Humansmark madsen
 
The Black Box: Interpretability, Reproducibility, and Data Management
The Black Box: Interpretability, Reproducibility, and Data ManagementThe Black Box: Interpretability, Reproducibility, and Data Management
The Black Box: Interpretability, Reproducibility, and Data Managementmark madsen
 
Operationalizing Machine Learning in the Enterprise
Operationalizing Machine Learning in the EnterpriseOperationalizing Machine Learning in the Enterprise
Operationalizing Machine Learning in the Enterprisemark madsen
 
Building a Data Platform Strata SF 2019
Building a Data Platform Strata SF 2019Building a Data Platform Strata SF 2019
Building a Data Platform Strata SF 2019mark madsen
 
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)mark madsen
 
Architecting a Platform for Enterprise Use - Strata London 2018
Architecting a Platform for Enterprise Use - Strata London 2018Architecting a Platform for Enterprise Use - Strata London 2018
Architecting a Platform for Enterprise Use - Strata London 2018mark madsen
 
A Brief Tour through the Geology & Endemic Botany of the Klamath-Siskiyou Range
A Brief Tour through the Geology & Endemic Botany of the Klamath-Siskiyou RangeA Brief Tour through the Geology & Endemic Botany of the Klamath-Siskiyou Range
A Brief Tour through the Geology & Endemic Botany of the Klamath-Siskiyou Rangemark madsen
 
How to understand trends in the data & software market
How to understand trends in the data & software marketHow to understand trends in the data & software market
How to understand trends in the data & software marketmark madsen
 
Pay no attention to the man behind the curtain - the unseen work behind data ...
Pay no attention to the man behind the curtain - the unseen work behind data ...Pay no attention to the man behind the curtain - the unseen work behind data ...
Pay no attention to the man behind the curtain - the unseen work behind data ...mark madsen
 
Assumptions about Data and Analysis: Briefing room webcast slides
Assumptions about Data and Analysis: Briefing room webcast slidesAssumptions about Data and Analysis: Briefing room webcast slides
Assumptions about Data and Analysis: Briefing room webcast slidesmark madsen
 
A Pragmatic Approach to Analyzing Customers
A Pragmatic Approach to Analyzing CustomersA Pragmatic Approach to Analyzing Customers
A Pragmatic Approach to Analyzing Customersmark madsen
 
Disruptive Innovation: how do you use these theories to manage your IT?
Disruptive Innovation: how do you use these theories to manage your IT?Disruptive Innovation: how do you use these theories to manage your IT?
Disruptive Innovation: how do you use these theories to manage your IT?mark madsen
 
Briefing room: An alternative for streaming data collection
Briefing room: An alternative for streaming data collectionBriefing room: An alternative for streaming data collection
Briefing room: An alternative for streaming data collectionmark madsen
 
Building the Enterprise Data Lake: A look at architecture
Building the Enterprise Data Lake: A look at architectureBuilding the Enterprise Data Lake: A look at architecture
Building the Enterprise Data Lake: A look at architecturemark madsen
 
Briefing Room analyst comments - streaming analytics
Briefing Room analyst comments - streaming analyticsBriefing Room analyst comments - streaming analytics
Briefing Room analyst comments - streaming analyticsmark madsen
 
Everything has changed except us
Everything has changed except usEverything has changed except us
Everything has changed except usmark madsen
 
Bi isn't big data and big data isn't BI (updated)
Bi isn't big data and big data isn't BI (updated)Bi isn't big data and big data isn't BI (updated)
Bi isn't big data and big data isn't BI (updated)mark madsen
 
On the edge: analytics for the modern enterprise (analyst comments)
On the edge: analytics for the modern enterprise (analyst comments)On the edge: analytics for the modern enterprise (analyst comments)
On the edge: analytics for the modern enterprise (analyst comments)mark madsen
 
Crossing the chasm with a high performance dynamically scalable open source p...
Crossing the chasm with a high performance dynamically scalable open source p...Crossing the chasm with a high performance dynamically scalable open source p...
Crossing the chasm with a high performance dynamically scalable open source p...mark madsen
 

Plus de mark madsen (20)

Data Architecture: OMG It’s Made of People
Data Architecture: OMG It’s Made of PeopleData Architecture: OMG It’s Made of People
Data Architecture: OMG It’s Made of People
 
Solve User Problems: Data Architecture for Humans
Solve User Problems: Data Architecture for HumansSolve User Problems: Data Architecture for Humans
Solve User Problems: Data Architecture for Humans
 
The Black Box: Interpretability, Reproducibility, and Data Management
The Black Box: Interpretability, Reproducibility, and Data ManagementThe Black Box: Interpretability, Reproducibility, and Data Management
The Black Box: Interpretability, Reproducibility, and Data Management
 
Operationalizing Machine Learning in the Enterprise
Operationalizing Machine Learning in the EnterpriseOperationalizing Machine Learning in the Enterprise
Operationalizing Machine Learning in the Enterprise
 
Building a Data Platform Strata SF 2019
Building a Data Platform Strata SF 2019Building a Data Platform Strata SF 2019
Building a Data Platform Strata SF 2019
 
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)Architecting a Data Platform For Enterprise Use (Strata NY 2018)
Architecting a Data Platform For Enterprise Use (Strata NY 2018)
 
Architecting a Platform for Enterprise Use - Strata London 2018
Architecting a Platform for Enterprise Use - Strata London 2018Architecting a Platform for Enterprise Use - Strata London 2018
Architecting a Platform for Enterprise Use - Strata London 2018
 
A Brief Tour through the Geology & Endemic Botany of the Klamath-Siskiyou Range
A Brief Tour through the Geology & Endemic Botany of the Klamath-Siskiyou RangeA Brief Tour through the Geology & Endemic Botany of the Klamath-Siskiyou Range
A Brief Tour through the Geology & Endemic Botany of the Klamath-Siskiyou Range
 
How to understand trends in the data & software market
How to understand trends in the data & software marketHow to understand trends in the data & software market
How to understand trends in the data & software market
 
Pay no attention to the man behind the curtain - the unseen work behind data ...
Pay no attention to the man behind the curtain - the unseen work behind data ...Pay no attention to the man behind the curtain - the unseen work behind data ...
Pay no attention to the man behind the curtain - the unseen work behind data ...
 
Assumptions about Data and Analysis: Briefing room webcast slides
Assumptions about Data and Analysis: Briefing room webcast slidesAssumptions about Data and Analysis: Briefing room webcast slides
Assumptions about Data and Analysis: Briefing room webcast slides
 
A Pragmatic Approach to Analyzing Customers
A Pragmatic Approach to Analyzing CustomersA Pragmatic Approach to Analyzing Customers
A Pragmatic Approach to Analyzing Customers
 
Disruptive Innovation: how do you use these theories to manage your IT?
Disruptive Innovation: how do you use these theories to manage your IT?Disruptive Innovation: how do you use these theories to manage your IT?
Disruptive Innovation: how do you use these theories to manage your IT?
 
Briefing room: An alternative for streaming data collection
Briefing room: An alternative for streaming data collectionBriefing room: An alternative for streaming data collection
Briefing room: An alternative for streaming data collection
 
Building the Enterprise Data Lake: A look at architecture
Building the Enterprise Data Lake: A look at architectureBuilding the Enterprise Data Lake: A look at architecture
Building the Enterprise Data Lake: A look at architecture
 
Briefing Room analyst comments - streaming analytics
Briefing Room analyst comments - streaming analyticsBriefing Room analyst comments - streaming analytics
Briefing Room analyst comments - streaming analytics
 
Everything has changed except us
Everything has changed except usEverything has changed except us
Everything has changed except us
 
Bi isn't big data and big data isn't BI (updated)
Bi isn't big data and big data isn't BI (updated)Bi isn't big data and big data isn't BI (updated)
Bi isn't big data and big data isn't BI (updated)
 
On the edge: analytics for the modern enterprise (analyst comments)
On the edge: analytics for the modern enterprise (analyst comments)On the edge: analytics for the modern enterprise (analyst comments)
On the edge: analytics for the modern enterprise (analyst comments)
 
Crossing the chasm with a high performance dynamically scalable open source p...
Crossing the chasm with a high performance dynamically scalable open source p...Crossing the chasm with a high performance dynamically scalable open source p...
Crossing the chasm with a high performance dynamically scalable open source p...
 

Dernier

DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningLars Bell
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Commit University
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionDilum Bandara
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxBkGupta21
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfAlex Barbosa Coqueiro
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebUiPathCommunity
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfMounikaPolabathina
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 3652toLead Limited
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024Lorenzo Miniero
 

Dernier (20)

DSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine TuningDSPy a system for AI to Write Prompts and Do Fine Tuning
DSPy a system for AI to Write Prompts and Do Fine Tuning
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!Nell’iperspazio con Rocket: il Framework Web di Rust!
Nell’iperspazio con Rocket: il Framework Web di Rust!
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Advanced Computer Architecture – An Introduction
Advanced Computer Architecture – An IntroductionAdvanced Computer Architecture – An Introduction
Advanced Computer Architecture – An Introduction
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
unit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptxunit 4 immunoblotting technique complete.pptx
unit 4 immunoblotting technique complete.pptx
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Unraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdfUnraveling Multimodality with Large Language Models.pdf
Unraveling Multimodality with Large Language Models.pdf
 
Dev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio WebDev Dives: Streamline document processing with UiPath Studio Web
Dev Dives: Streamline document processing with UiPath Studio Web
 
What is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdfWhat is DBT - The Ultimate Data Build Tool.pdf
What is DBT - The Ultimate Data Build Tool.pdf
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365Ensuring Technical Readiness For Copilot in Microsoft 365
Ensuring Technical Readiness For Copilot in Microsoft 365
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024SIP trunking in Janus @ Kamailio World 2024
SIP trunking in Janus @ Kamailio World 2024
 

Big Data Wonderland: Two Views on the Big Data Revolution

  • 1. Big Data Wonderland: Two Views on the Big Data Revolution Mark Madsen Third Nature, Inc. mark@thirdnature.net @markmadsen Marc Demarest Noumenal, Inc. marc@noumenal.com Strata Santa Clara February 2013
  • 2. 2 Third Nature, Inc. || Noumenal, Inc. Preamble Twenty Years On • We came up together in this industry in the early 1990s, as pointy-headed advocates of star schema design, trained by the deity himself, Ralph Kimball • Back then, it was a simpler world...big iron, big DBMS, hand- coded ETL, star schema, a thousand rinky-dink query tools • Mostly, conversation was dominated by ETL and schema design • “There will never be a decisional database larger than 10 GB...” St. Ralph Our Alma Mater
  • 3. 3 Third Nature, Inc. || Noumenal, Inc. Preamble Twenty Years On • Twenty years on, we find ourselves with opposing view on what is either the biggest con, or the biggest sea- change, in our data warehousing odyssey • Question: Is the big data revolution big, or a revolution? • Question: do we have to change? and if so, how? • Not a round table. A slugfest.... Demarest as Shana Alexander? Madsen as Jack Kilpatrick?
  • 4. 4 Third Nature, Inc. || Noumenal, Inc. Regular Programming Is Suspended Demarest Madsen
  • 5. 5 Third Nature, Inc. || Noumenal, Inc. Compromise Demarest Madsen You take the blue pill. The story ends, you wake up in your bed and believe whatever you want to believe. You take the red pill, you stay in Wonderland, and I show you how deep the rabbit hole goes. Remember, all I am offering is the truth: nothing more.
  • 6. 6 Third Nature, Inc. || Noumenal, Inc. The Issues 1. Data As A Factor of Production RED BLUE Amen. This change has been in process for more than a decade. Social media leads the way, but we’re all affected.
  • 7. 7 Third Nature, Inc. || Noumenal, Inc. The Issues 1. Data As A Factor of Production RED BLUE Amen. This change has been in process for more than a decade. Social media leads the way, but we’re all affected. Hype. For most companies, data remains an asset, but not a factor in the production of its products or services.
  • 8. 8 Third Nature, Inc. || Noumenal, Inc. The Issues 2. The Reality of Big Data RED BLUE Few companies transformed. No quantification of benefits, right now. Leverage? Maybe.
  • 9. 9 Third Nature, Inc. || Noumenal, Inc. The Issues 2. The Reality of Big Data RED BLUE No company escapes. Text, social, sensors, streaming -- the instrumentation of the real world transforms company decision- making processes. Few companies transformed. No quantification of benefits, right now. Leverage? Maybe.
  • 10. 10 Third Nature, Inc. || Noumenal, Inc. The Issues 3. Merchant DBMSs RED BLUE Increasingly irrelevant. We’ve been over- structured and under- resourced for 20 years. CSV is still the international standard.
  • 11. 11 Third Nature, Inc. || Noumenal, Inc. The Issues 3. Merchant DBMSs RED BLUE Increasingly irrelevant. We’ve been over- structured and under- resourced for 20 years. CSV is still the international standard. Will rise to the challenge. Any worthwhile innovation will be absorbed by the merchant DBMS players.
  • 12. 12 Third Nature, Inc. || Noumenal, Inc. The Issues 4. Query, Reporting & Dashboarding Tools RED BLUE Will rise to the challenge. We have two generations of analysts trained to feed using these tools.
  • 13. 13 Third Nature, Inc. || Noumenal, Inc. The Issues 4. Query, Reporting & Dashboarding Tools RED BLUE Ineffective, now and in the future. Can’t do real-time, can’t visualize large data sets, can’t support discovery and exploration. Will rise to the challenge. We have two generations of analysts trained to feed using these tools.
  • 14. 14 Third Nature, Inc. || Noumenal, Inc. The Issues 5. The Commodity Hardware Revolution & Radical Scale-Out RED BLUE The new topology. Cheap compute, unintelligent direct- attach storage and free comms make large scale-out grids the future.
  • 15. 15 Third Nature, Inc. || Noumenal, Inc. The Issues 5. The Commodity Hardware Revolution & Radical Scale-Out RED BLUE The new topology. Cheap compute, unintelligent direct- attach storage and free comms make large scale-out grids the future. The current topology is alive and well. These commodity building blocks are, after all, just SMP platforms.
  • 16. 16 Third Nature, Inc. || Noumenal, Inc. The Issues 6. Structured Query Language RED BLUE Tried-and-True. Powerful, expressive language for complex analytical problems. That’s why the noSQL vendors reinvent it all the time.
  • 17. 17 Third Nature, Inc. || Noumenal, Inc. The Issues 6. Structured Query Language RED BLUE Toast. Too complex, too hard to code, too hard to debug. A way of ensuring dependency on merchant DBMSs. Tried-and-True. Powerful, expressive language for complex analytical problems. That’s why the noSQL vendors reinvent it all the time.
  • 18. 18 Third Nature, Inc. || Noumenal, Inc. The Issues 7. New Programming Models RED BLUE Say hello to Pig. New analytical problems (decisioning, discovery, exploration) require new languages, new tools and new programming models.
  • 19. 19 Third Nature, Inc. || Noumenal, Inc. The Issues 7. New Programming Models RED BLUE Say hello to Pig. New analytical problems (decisioning, discovery, exploration) require new languages, new tools and new programming models. Say hello to Java. Open source doesn’t mean free. Or easy. The skills gap here is huge. And there are few truly new analytical problems.
  • 20. 20 Third Nature, Inc. || Noumenal, Inc. The Issues 8. Conventional DW Architecture RED BLUE Perfectly viable. No need to change anything. Some new technologies may play roles in the existing architecture, but we’re good to go, generally.
  • 21. 21 Third Nature, Inc. || Noumenal, Inc. The Issues 8. Conventional DW Architecture RED BLUE A relic. Overly complex. Difficult to implement. Controlled by the supply side of the market, anyway. Perfectly viable. No need to change anything. Some new technologies may play roles in the existing architecture, but we’re good to go, generally.
  • 22. 22 Third Nature, Inc. || Noumenal, Inc. The Issues 9. The Cloud RED BLUE We all go there. Most of the interesting data is there; it’s more effective to move our data, and our analyses, to where the data is, already.
  • 23. 23 Third Nature, Inc. || Noumenal, Inc. The Issues 9. The Cloud RED BLUE We all go there. Most of the interesting data is there; it’s more effective to move our data, and our analyses, to where the data is, already. Don’t go there. Public cloud security is an oxymoron. Your inside-the-firewall apps remain the core information asset.
  • 24. 24 Third Nature, Inc. || Noumenal, Inc. The Issues 10. New Technologies RED BLUE Distract Us. We’ve already seen what best-of-breed gives us: a circus.
  • 25. 25 Third Nature, Inc. || Noumenal, Inc. The Issues 10. New Technologies RED BLUE Save Us. Best of breed integration led by in- house designers ins back, with a vengeance. Distract Us. We’ve already seen what best-of-breed gives us: a circus.
  • 26. 26 Third Nature, Inc. || Noumenal, Inc. What We Really Think 1. Data As A Factor of Production 2. The Reality of Big Data 3. Merchant DBMSs 4. Query, Reporting & Dashboarding Tools 5. The Commodity Hardware Revolution 6. Structured Query Language 7. New Programming Models 8. Conventional DW Architecture 9. The Cloud 10. New Technologies