SlideShare une entreprise Scribd logo
1  sur  46
Télécharger pour lire hors ligne
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
How to Merge the Data Lake and the Data Warehouse
The Power of a Unified Analytics Warehouse
John Santaferraro
Research Director
EMA
Jeremiah Morrow
Senior Product Marketing Manager
Vertica
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
Watch the On-Demand Webinar
Slide 2
 How to Merge the Data Lake and the Data Warehouse: The
Power of a Unified Analytics Warehouse on-demand webinar:
https://info.enterprisemanagement.com/how-to-merge-the-data-lake-
and-the-data-warehouse-webinar-ws
 Check out upcoming webinars from EMA here:
https://www.enterprisemanagement.com/freeResearch
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
Featured Speakers
Slide 3 © 2020 Enterprise Management Associates, Inc.
John Santaferraro, Research Director, EMA
John is the research director for analytics, business intelligence, and data management
at EMA. He has 23 years of experience in data and analytics, from startups to executive
positions at Fortune 50 companies. His deep understanding of the industry comes from
years of leadership in implementation, product and marketing organizations, along with
multiple big data imagineering efforts for finance, communications, retail, manufacturing,
healthcare, events, oil and gas, and utilities.
Jeremiah Morrow, Senior Product Marketing Manager, Vertica
Jeremiah is responsible for marketing initiatives involving Vertica’s channel and
technology partners. He has worked in sales, marketing, and analyst relations for
technology companies ranging from infrastructure to software, and he is passionate
about building a strong and collaborative ecosystem to drive value for customers,
partners, and Vertica.
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
Sponsor
Trusted by AT&T, Cerner, Uber, The Trade Desk, and many other data-driven organizations, Vertica is the
unified analytics warehouse that solves three current market challenges that every organization faces.
Despite the disappointment in Hadoop, HDFS data lakes represent a very significant investment for many
companies, but the value is not equivalent to original expectations. Combined with the explosion of cloud
object storage, organizations struggle even more to unify their data. In addition, organizations favor a multi-
cloud or hybrid cloud and on-premises deployment strategy as they face the reality of cloud vendor lock-in,
costs, and migration challenges. Finally, machine learning is no longer a data science project, but must be
put into production to deliver the predictive analytics information in time to allow proactive actions.
Vertica is a Unified Analytics Warehouse that:
• Unifies HDFS data and Object Storage data lakes to capitalize on storage investments and maximize
business value.
• Unifies a company’s deployment options spanning multi-cloud and on-premises to embrace cloud
innovations, prevent lock-in, and meet regulatory and security requirements.
• Unifies the data science community and the business analyst and IT community, enabling each to
continue to use their preferred tools and languages while operationalizing machine learning at scale for
real-time predictive analytics.
Visit www.vertica.com for more information.
Slide 4 © 2020 Enterprise Management Associates, Inc.
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
Agenda
• The changing business and data landscape
• Inadequacies of the data lake and data warehouse
• The race for a unified analytics warehouse
• Requirements for a unified analytics warehouse
• Deploying a unified analytics warehouse
• Unified analytics warehouse for cloud and hybrid
• Vertica and the unified analytics warehouse
Slide 5 © 2020 Enterprise Management Associates, Inc.
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTINGSlide 6
The changing business and
data landscape
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
Digital, Mobile, IoT
Digital Mobile Internet of Things
Slide 7 © 2020 Enterprise Management Associates, Inc.
Evolution of Data Warehouse
Architecture
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
Business
Intelligence
CRM ERP
Billing
Application Data
Customer
Operational
Financial
ETL
Analytics Database
Transactional
Data
Message Queues
Files
Data Warehouse
Batch
Visualization
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
Data Lake
1
0
Contextual
Data
Files
Weather
Geo
Low
Latency
Batch
Transactional
Data
Application Data
OLTP/ODS
Batch ETL
or EL with T
done on
mass
storage
Data Prep /
Enrichment
Streaming
Data
Application data
Web clicks
Logs
Sensors
Operational metrics
User tracking
Geo-location
Visualization
Applications
Artificial Intelligence
Cloud
On-Premises
AND / OR
Distributed
Pub/Sub
Distributed
Prepped
Data
Object
Storage
Stream Processing
Data Lake
Mass Storage
Query Engine /
Machine
Learning
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
Cooperative Data Architecture
1
1
Contextual
Data
Files
Weather
Geo
Low
Latency
Batch
Transactional
Data
Application Data
OLTP/ODS
Batch ETL
or EL with T
done on
mass
storage
Data Lake
Mass Storage
Data Prep /
Enrichment
Streaming
Data
Application data
Web clicks
Logs
Sensors
Operational metrics
User tracking
Geo-location
Visualization
Applications
Artificial Intelligence
Import
Export
Query
Cloud
On-Premises
AND / OR
Distributed
Pub/Sub
Distributed
Columnar
Data
Object
Storage
Stream Processing
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
The Need for a Unified Analytics Warehouse
“Only structured and semi-structured data
combined can deliver on the promise of complete
business insights.”
Slide 12 © 2020 Enterprise Management Associates, Inc.
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTINGSlide 13
Inadequacies of the data lake
and data warehouse
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
The Data Swamp (Lake)
Data Lake
Shortcomings
• Built for semi-structured data
and requires structuring of data
on read to analyze
transactional data.
• Database component lacks
enterprise capabilities and
does not perform will
compared to analytic platforms.
Slide 14 © 2020 Enterprise Management Associates, Inc.
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
The Data Jailhouse (Warehouse)
Data Warehouse
Shortcomings
• Built for structured data and
requires structuring and
ingestion of semi-structured
data to analyze it.
• Too much effort required and
doesn’t support real-time ad-
hoc analysis of semi
structured data.
Slide 15 © 2020 Enterprise Management Associates, Inc.
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTINGSlide 16
The race for the unified
analytics warehouse
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
From the Data Lake Side
Build On
• Database Technology
• SQL Query Engines
Slide 17 © 2020 Enterprise Management Associates, Inc.
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
From the Data Warehouse Side
Drill Through
• Semi-Structured Data
• Tiered Storage
• Data Science Tools
Slide 18 © 2020 Enterprise Management Associates, Inc.
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTINGSlide 19
Requirements for a unified
analytics warehouse
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
What is a Unified Analytics Warehouse?
• Unified
• Adequately handles multi-structured
data in a single platform
• Analytics
• Data lake and the data warehouse
are primarily for analytics.
• Warehouse
• Stores multi-structured data in an
organized and accessible manner.
Slide 20 © 2020 Enterprise Management Associates, Inc.
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
Unified Analytics Warehouse
Slide 21 © 2020 Enterprise Management Associates, Inc.
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
Unified Analytics Warehouse
Slide 22 © 2020 Enterprise Management Associates, Inc.
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
Unified Analytics Warehouse
Slide 23 © 2020 Enterprise Management Associates, Inc.
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
What is a Unified Analytics Warehouse?
• Unified
• Adequately handles multi-structured
data in a single platform
• Analytics
• Data lake and the data warehouse
are primarily for analytics.
• Warehouse
• Stores multi-structured data in an
organized and accessible manner.
Slide 24 © 2020 Enterprise Management Associates, Inc.
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
Unified Analytics Warehouse
2
5
Contextual
Data
Files
Weather
Geo
Low
Latency
Batch
Transactional
Data
Application Data
OLTP/ODS
Batch ETL
or EL with T
done in
warehouse
Streaming
Data
Application data
Web clicks
Logs
Sensors
Operational metrics
User tracking
Geo-location
Stream Processing
Cloud
Visualization
Applications
Artificial Intelligence
Shared
Storage
On-Premises
HDFS HDFS
AND / OR Ingestion/ ELT/
Data Prep / Enrichment
Managed ML
Models
Reporting /
BI
Data Science /
ML
Departmental
Use
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
Data and the Unified Analytics Warehouse
• Structured Data
• Complex Data Types
• JSON
• XML
• Textual Data
• Streaming Data
Slide 26 © 2020 Enterprise Management Associates, Inc.
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
Enterprise and the Unified Analytics Warehouse
• Administration
• Management
• Orchestration
• Security
• Privacy
• Performance
• Scalability
• Agility
• Resiliency
• AI Enablement
• Recommendations
- Structure, Schema, Data
Relationships, Tuning
• Automation
- Metadata Generation,
Tuning, Maintenance,
Elasticity, Query Routing,
Data Tiering, Change
Management, Code
Generation
Slide 27 © 2020 Enterprise Management Associates, Inc.
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
Infrastructure and the Unified Analytics Warehouse
• Analytic Platform
• Distributed File System
• Object Storage
Slide 28 © 2020 Enterprise Management Associates, Inc.
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
Cloud and the Unified Analytics Warehouse
“53% of all data is
now in the cloud…”
• Single Pane of Glass
• Simple Migration
• Common Foundation
Slide 29 © 2020 Enterprise Management Associates, Inc.
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
Analytics and the Unified Analytics Warehouse
Embedded Analytics
• Prebuilt
• Common
• High-Performance
• Rapid Deployment
Analytical Performance
• Data Intensive
• Compute Intensive
• High Intensity
• High Concurrency
Slide 30 © 2020 Enterprise Management Associates, Inc.
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
Users and the Unified Analytics Warehouse
“Users also expect a platform that allows them
to put aside religious beliefs about how analytics
should be done.”
Slide 31 © 2020 Enterprise Management Associates, Inc.
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTINGSlide 32
Deploying a unified analytics
warehouse
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
Best Practices for the Unified Analytics Warehouse
 Choose a vendor with a vision for the UAW
 Choose a platform well into UAW transformation
 Choose a project with high value
 Choose a project with a mix of data and use cases
 Test use case capabilities in a hybrid environment
 Plan for end of life and continued migration
Slide 33 © 2020 Enterprise Management Associates, Inc.
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTINGSlide 34
Unified analytics warehouse
for cloud and hybrid environments
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
3 Necessities of for Cloud and Hybrid Success
• Common Management
• Common Infrastructure
• Interoperability
• Data interoperability
• Query interoperability
• Migration interoperability
Slide 35 © 2020 Enterprise Management Associates, Inc.
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTINGSlide 36
Vertica and the unified analytics
warehouse
The Unified Analytics Warehouse
What is Vertica?
3
8
SQL Database
Load and store data in a
data warehouse
designed for blazing
fast analytics
Query Engine
Ask complex analytical
questions and get fast
answers regardless of
where the data resides
Vertica is an advanced analytics platform built for the scale and complexity of today’s data-driven
world. It combines the power of a high-performance, MPP query engine with advanced analytics
and machine learning.
Analytics & ML
Create, train and deploy
advanced analytics and
machine learning models
at massive scale
Remove scale, performance and capacity constraints
3
9
Get data quickly enough to act upon it, explore your data interactively,
and enable everyone to make their own data-driven decisions
Fear of more uses or growing data volumes is a thing of the past
Scale Data Volumes Scale Users
SQL Database
+
Vertica Advanced Analytics Platform
+
Get data quickly enough to act upon it, explore your data interactively,
and enable everyone to make their own data-driven decisions
Analytics & ML Query Engine
2020 McKnight Consulting Group Benchmark
4
0
“At 60 concurrent users, Vertica in Eon
Mode was 1.8x less expensive than
Redshift. The unnamed data cloud
platform consistently had the highest price
performance.”
“Vertica in Eon Mode had 1.14x the QPH
of the next highest database (unnamed
data cloud platform) for the 60
concurrent user workload.”
“Vertica in Eon Mod consistently had the
shortest elapsed time for the longest
running [query] thread across the
concurrency profiles at 250 TB.”
Vertica in Eon Mode, Amazon Redshift, and an Unnamed Data Cloud Platform
Philips Healthcare –
Predictive Maintenance
Machine Learning can simplify business processes and
improve the customer experience
4
2
Predictive Maintenance
System
Problem
Customer
Call Dispatch
Onsite
Trouble-
shooting
Remote Monitoring Predicts
Potential failure Service Scheduled Problem Avoided
Parts
Delivery
Repair or
Replace
System
Functional
Reactive Maintenance
Predictive Maintenance
High Level Architecture
4
3
Data Lake (Raw Data Archive)
Parallel Extract Transform Load
Legacy
Database
s
CRM &
Business
Systems
Installed Base
Philips Remote Service Network
DSP
(HSI)
Remote
Monitoring Radar
4.0
Remote Service
RSW & CHA
Reliability
Dashboards
R&D Access
Outcomes
4
4
"Remote service provides us with
an engineer online all the time.
They tell us when we've got a fault
before we know we've got a fault.
And not only that, they can fix the
fault before we knew we had a
fault. And that's impressive."
Cobalt Imaging, Gloucester
"Now we will have more uptime
on the scanner and potentially be
able to see more patients…It's a
new level of service for us, with a
greater satisfaction." -
Radiographer, New Stobhill
Hospital in Glasgow
500 TB of data in more than 300 tables. 30 trillion data points
80 different data sources integrated
24/7
months from scratch to production8
live data feeds. Millions of logs per week.
…and marching towards 0 unplanned downtime
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
EMA Independent Analysis of Vertica and the UAW
45 © 2020 Enterprise Management Associates, Inc.
0
2
4
6
8
10
Data Requirements
Enterprise
Requirements
Infrastructure
Requirements
Hybrid Requirements
Analytical
Requirements
User Requirements
From EMA - The Race for a Unified Analytics Warehouse
IT & DATA MANAGEMENT RESEARCH,
INDUSTRY ANALYSIS & CONSULTING
Question and Answer: Log Questions in the Q&A panel
© 2020 Enterprise Management Associates, Inc.
Interested in Learning More?
LEARN ABOUT:
DOWNLOAD
EMA – The Race for a
Unified Analytics Warehouse -
https://www.vertica.com/resource/the-race-for-
a-unified-analytics-warehouse/
McKnight Consulting Group – Cloud Database
Performance Benchmark -
https://www.vertica.com/cloud-database-
benchmark-report-2/
Vertica Community Edition -
https://www.vertica.com/try/
Blog – https://www.vertica.com/blog
LinkedIn -
https://www.linkedin.com/showcase
/vertica-co/

Contenu connexe

Tendances

(BDT317) Building A Data Lake On AWS
(BDT317) Building A Data Lake On AWS(BDT317) Building A Data Lake On AWS
(BDT317) Building A Data Lake On AWS
Amazon Web Services
 

Tendances (20)

Data Lake Overview
Data Lake OverviewData Lake Overview
Data Lake Overview
 
Databricks Delta Lake and Its Benefits
Databricks Delta Lake and Its BenefitsDatabricks Delta Lake and Its Benefits
Databricks Delta Lake and Its Benefits
 
(BDT317) Building A Data Lake On AWS
(BDT317) Building A Data Lake On AWS(BDT317) Building A Data Lake On AWS
(BDT317) Building A Data Lake On AWS
 
Azure DataBricks for Data Engineering by Eugene Polonichko
Azure DataBricks for Data Engineering by Eugene PolonichkoAzure DataBricks for Data Engineering by Eugene Polonichko
Azure DataBricks for Data Engineering by Eugene Polonichko
 
KSnow: Getting started with Snowflake
KSnow: Getting started with SnowflakeKSnow: Getting started with Snowflake
KSnow: Getting started with Snowflake
 
Introducing Databricks Delta
Introducing Databricks DeltaIntroducing Databricks Delta
Introducing Databricks Delta
 
Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4Data Lakehouse Symposium | Day 4
Data Lakehouse Symposium | Day 4
 
Snowflake: Your Data. No Limits (Session sponsored by Snowflake) - AWS Summit...
Snowflake: Your Data. No Limits (Session sponsored by Snowflake) - AWS Summit...Snowflake: Your Data. No Limits (Session sponsored by Snowflake) - AWS Summit...
Snowflake: Your Data. No Limits (Session sponsored by Snowflake) - AWS Summit...
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data Architecture
 
Spark access control on Amazon EMR with AWS Lake Formation
Spark access control on Amazon EMR with AWS Lake FormationSpark access control on Amazon EMR with AWS Lake Formation
Spark access control on Amazon EMR with AWS Lake Formation
 
(ZDM) Zero Downtime DB Migration to Oracle Cloud
(ZDM) Zero Downtime DB Migration to Oracle Cloud(ZDM) Zero Downtime DB Migration to Oracle Cloud
(ZDM) Zero Downtime DB Migration to Oracle Cloud
 
The Marriage of the Data Lake and the Data Warehouse and Why You Need Both
The Marriage of the Data Lake and the Data Warehouse and Why You Need BothThe Marriage of the Data Lake and the Data Warehouse and Why You Need Both
The Marriage of the Data Lake and the Data Warehouse and Why You Need Both
 
[NEW LAUNCH!] Deep Dive on Amazon FSx for Windows File Server (STG322-R) - AW...
[NEW LAUNCH!] Deep Dive on Amazon FSx for Windows File Server (STG322-R) - AW...[NEW LAUNCH!] Deep Dive on Amazon FSx for Windows File Server (STG322-R) - AW...
[NEW LAUNCH!] Deep Dive on Amazon FSx for Windows File Server (STG322-R) - AW...
 
An overview of snowflake
An overview of snowflakeAn overview of snowflake
An overview of snowflake
 
Snowflake Architecture.pptx
Snowflake Architecture.pptxSnowflake Architecture.pptx
Snowflake Architecture.pptx
 
Serverlesss Big Data Analytics with Amazon Athena and Quicksight
Serverlesss Big Data Analytics with Amazon Athena and QuicksightServerlesss Big Data Analytics with Amazon Athena and Quicksight
Serverlesss Big Data Analytics with Amazon Athena and Quicksight
 
Snowflake + Power BI: Cloud Analytics for Everyone
Snowflake + Power BI: Cloud Analytics for EveryoneSnowflake + Power BI: Cloud Analytics for Everyone
Snowflake + Power BI: Cloud Analytics for Everyone
 
Migrating SAP Workloads to AWS: Stories and Tips - AWS Summit Sydney
Migrating SAP Workloads to AWS: Stories and Tips - AWS Summit SydneyMigrating SAP Workloads to AWS: Stories and Tips - AWS Summit Sydney
Migrating SAP Workloads to AWS: Stories and Tips - AWS Summit Sydney
 
Snowflake essentials
Snowflake essentialsSnowflake essentials
Snowflake essentials
 
Databricks Fundamentals
Databricks FundamentalsDatabricks Fundamentals
Databricks Fundamentals
 

Similaire à How to Merge the Data Lake and the Data Warehouse: The Power of a Unified Analytics Warehouse

Analytics in a Day Virtual Workshop
Analytics in a Day Virtual WorkshopAnalytics in a Day Virtual Workshop
Analytics in a Day Virtual Workshop
CCG
 

Similaire à How to Merge the Data Lake and the Data Warehouse: The Power of a Unified Analytics Warehouse (20)

Profiting from the Digital Shift: Time Series Databases as Value Creation Eng...
Profiting from the Digital Shift: Time Series Databases as Value Creation Eng...Profiting from the Digital Shift: Time Series Databases as Value Creation Eng...
Profiting from the Digital Shift: Time Series Databases as Value Creation Eng...
 
6 Principles of Modern Change Data Capture: How to Build Fast, Agile, Reliabl...
6 Principles of Modern Change Data Capture: How to Build Fast, Agile, Reliabl...6 Principles of Modern Change Data Capture: How to Build Fast, Agile, Reliabl...
6 Principles of Modern Change Data Capture: How to Build Fast, Agile, Reliabl...
 
Four Key Considerations for your Big Data Analytics Strategy
Four Key Considerations for your Big Data Analytics StrategyFour Key Considerations for your Big Data Analytics Strategy
Four Key Considerations for your Big Data Analytics Strategy
 
Data Virtualization: An Introduction
Data Virtualization: An IntroductionData Virtualization: An Introduction
Data Virtualization: An Introduction
 
Delivering Analytics at The Speed of Transactions with Data Fabric
Delivering Analytics at The Speed of Transactions with Data FabricDelivering Analytics at The Speed of Transactions with Data Fabric
Delivering Analytics at The Speed of Transactions with Data Fabric
 
Analytics in a Day Virtual Workshop
Analytics in a Day Virtual WorkshopAnalytics in a Day Virtual Workshop
Analytics in a Day Virtual Workshop
 
Big data an elephant business opportunities
Big data an elephant   business opportunitiesBig data an elephant   business opportunities
Big data an elephant business opportunities
 
Exploiting Data Lakes: Architecture, Capabilities & Future
Exploiting Data Lakes: Architecture, Capabilities & FutureExploiting Data Lakes: Architecture, Capabilities & Future
Exploiting Data Lakes: Architecture, Capabilities & Future
 
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and ClouderaIs your big data journey stalling? Take the Leap with Capgemini and Cloudera
Is your big data journey stalling? Take the Leap with Capgemini and Cloudera
 
Enabling 360-degree Business Insights with SAP Data
Enabling 360-degree Business Insights with SAP DataEnabling 360-degree Business Insights with SAP Data
Enabling 360-degree Business Insights with SAP Data
 
Accelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and VisualizationAccelerate Self-Service Analytics with Data Virtualization and Visualization
Accelerate Self-Service Analytics with Data Virtualization and Visualization
 
Where does Fast Data Strategy Fit within IT Projects
Where does Fast Data Strategy Fit within IT ProjectsWhere does Fast Data Strategy Fit within IT Projects
Where does Fast Data Strategy Fit within IT Projects
 
When SAP alone is not enough
When SAP alone is not enoughWhen SAP alone is not enough
When SAP alone is not enough
 
When and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data ArchitectureWhen and How Data Lakes Fit into a Modern Data Architecture
When and How Data Lakes Fit into a Modern Data Architecture
 
Analyst Keynote: Forrester: Data Fabric Strategy is Vital for Business Innova...
Analyst Keynote: Forrester: Data Fabric Strategy is Vital for Business Innova...Analyst Keynote: Forrester: Data Fabric Strategy is Vital for Business Innova...
Analyst Keynote: Forrester: Data Fabric Strategy is Vital for Business Innova...
 
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BIAugmentation, Collaboration, Governance: Defining the Future of Self-Service BI
Augmentation, Collaboration, Governance: Defining the Future of Self-Service BI
 
Visualisation and forecasting on IT capacity planning data
Visualisation and forecasting on IT capacity planning dataVisualisation and forecasting on IT capacity planning data
Visualisation and forecasting on IT capacity planning data
 
How Businesses use Big Data to Impact the Bottom Line
How Businesses use Big Data to Impact the Bottom LineHow Businesses use Big Data to Impact the Bottom Line
How Businesses use Big Data to Impact the Bottom Line
 
IDC Portugal | Como Libertar os Seus Dados com Virtualização de Dados
IDC Portugal | Como Libertar os Seus Dados com Virtualização de DadosIDC Portugal | Como Libertar os Seus Dados com Virtualização de Dados
IDC Portugal | Como Libertar os Seus Dados com Virtualização de Dados
 
Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)Future of Data Strategy (ASEAN)
Future of Data Strategy (ASEAN)
 

Plus de Enterprise Management Associates

Plus de Enterprise Management Associates (20)

Highlights from the EMA Radar™ Report for Workload Automation and Orchestrati...
Highlights from the EMA Radar™ Report for Workload Automation and Orchestrati...Highlights from the EMA Radar™ Report for Workload Automation and Orchestrati...
Highlights from the EMA Radar™ Report for Workload Automation and Orchestrati...
 
Real-world incident response, management, and prevention
Real-world incident response, management, and preventionReal-world incident response, management, and prevention
Real-world incident response, management, and prevention
 
Observability: Challenges, Priorities, Solutions, and the Role of OpenTelemetry
Observability: Challenges, Priorities, Solutions, and the Role of OpenTelemetryObservability: Challenges, Priorities, Solutions, and the Role of OpenTelemetry
Observability: Challenges, Priorities, Solutions, and the Role of OpenTelemetry
 
NetSecOps: Examining How Network and Security Teams Collaborate for a Better ...
NetSecOps: Examining How Network and Security Teams Collaborate for a Better ...NetSecOps: Examining How Network and Security Teams Collaborate for a Better ...
NetSecOps: Examining How Network and Security Teams Collaborate for a Better ...
 
Modern ITSM—the untapped game-changer for midsize organizations
Modern ITSM—the untapped game-changer for midsize organizationsModern ITSM—the untapped game-changer for midsize organizations
Modern ITSM—the untapped game-changer for midsize organizations
 
Unveiling Strategic Trends in Global Finance, Banking, and Insurance - IT Ex...
Unveiling Strategic Trends in Global Finance, Banking, and Insurance -  IT Ex...Unveiling Strategic Trends in Global Finance, Banking, and Insurance -  IT Ex...
Unveiling Strategic Trends in Global Finance, Banking, and Insurance - IT Ex...
 
Unlocking Master Data Management (MDM) Success: Real-World Insights and Strat...
Unlocking Master Data Management (MDM) Success: Real-World Insights and Strat...Unlocking Master Data Management (MDM) Success: Real-World Insights and Strat...
Unlocking Master Data Management (MDM) Success: Real-World Insights and Strat...
 
Transcending Passwords: Emerging Trends in Authentication
Transcending Passwords: Emerging Trends in AuthenticationTranscending Passwords: Emerging Trends in Authentication
Transcending Passwords: Emerging Trends in Authentication
 
Modernize NetOps with Business-Aware Network Monitoring
Modernize NetOps with Business-Aware Network MonitoringModernize NetOps with Business-Aware Network Monitoring
Modernize NetOps with Business-Aware Network Monitoring
 
Navigating the Complexity of Distributed Microservices across AWS, Azure, and...
Navigating the Complexity of Distributed Microservices across AWS, Azure, and...Navigating the Complexity of Distributed Microservices across AWS, Azure, and...
Navigating the Complexity of Distributed Microservices across AWS, Azure, and...
 
Navigating Today’s Threat Landscape: Discussing Hype vs. Reality
Navigating Today’s Threat Landscape: Discussing Hype vs. RealityNavigating Today’s Threat Landscape: Discussing Hype vs. Reality
Navigating Today’s Threat Landscape: Discussing Hype vs. Reality
 
Kubernetes Unveiled: Trends, Challenges, and Opportunities
Kubernetes Unveiled: Trends, Challenges, and OpportunitiesKubernetes Unveiled: Trends, Challenges, and Opportunities
Kubernetes Unveiled: Trends, Challenges, and Opportunities
 
DDI Directions: DNS, DHCP and IP Address Management Strategies for the Multi-...
DDI Directions: DNS, DHCP and IP Address Management Strategies for the Multi-...DDI Directions: DNS, DHCP and IP Address Management Strategies for the Multi-...
DDI Directions: DNS, DHCP and IP Address Management Strategies for the Multi-...
 
Challenges and Best Practices for Securing Modern Operational Technology Netw...
Challenges and Best Practices for Securing Modern Operational Technology Netw...Challenges and Best Practices for Securing Modern Operational Technology Netw...
Challenges and Best Practices for Securing Modern Operational Technology Netw...
 
CMDB in Cloud Times: Myths, Mistakes, and Mastery
CMDB in Cloud Times: Myths, Mistakes, and Mastery CMDB in Cloud Times: Myths, Mistakes, and Mastery
CMDB in Cloud Times: Myths, Mistakes, and Mastery
 
Modernizing Network Engineering and Operations in the Era of Hybrid and Remot...
Modernizing Network Engineering and Operations in the Era of Hybrid and Remot...Modernizing Network Engineering and Operations in the Era of Hybrid and Remot...
Modernizing Network Engineering and Operations in the Era of Hybrid and Remot...
 
Why Should Organizations Consider Extended Detection and Response (XDR)?
Why Should Organizations Consider Extended Detection and Response (XDR)?Why Should Organizations Consider Extended Detection and Response (XDR)?
Why Should Organizations Consider Extended Detection and Response (XDR)?
 
Five Managed SD-WAN Trends to Watch in 2023
Five Managed SD-WAN Trends to Watch in 2023Five Managed SD-WAN Trends to Watch in 2023
Five Managed SD-WAN Trends to Watch in 2023
 
Moving Beyond Remote Access: Discover the Power of Zero Trust Network Access
Moving Beyond Remote Access: Discover the Power of Zero Trust Network AccessMoving Beyond Remote Access: Discover the Power of Zero Trust Network Access
Moving Beyond Remote Access: Discover the Power of Zero Trust Network Access
 
[Analyst Research Slides] Build vs. Buy: Finding the Best Path to Network Aut...
[Analyst Research Slides] Build vs. Buy: Finding the Best Path to Network Aut...[Analyst Research Slides] Build vs. Buy: Finding the Best Path to Network Aut...
[Analyst Research Slides] Build vs. Buy: Finding the Best Path to Network Aut...
 

Dernier

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 

Dernier (20)

Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
AWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of TerraformAWS Community Day CPH - Three problems of Terraform
AWS Community Day CPH - Three problems of Terraform
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Tech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdfTech Trends Report 2024 Future Today Institute.pdf
Tech Trends Report 2024 Future Today Institute.pdf
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Developing An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of BrazilDeveloping An App To Navigate The Roads of Brazil
Developing An App To Navigate The Roads of Brazil
 
Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024Tata AIG General Insurance Company - Insurer Innovation Award 2024
Tata AIG General Insurance Company - Insurer Innovation Award 2024
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Exploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone ProcessorsExploring the Future Potential of AI-Enabled Smartphone Processors
Exploring the Future Potential of AI-Enabled Smartphone Processors
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...
 

How to Merge the Data Lake and the Data Warehouse: The Power of a Unified Analytics Warehouse

  • 1. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING How to Merge the Data Lake and the Data Warehouse The Power of a Unified Analytics Warehouse John Santaferraro Research Director EMA Jeremiah Morrow Senior Product Marketing Manager Vertica
  • 2. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Watch the On-Demand Webinar Slide 2  How to Merge the Data Lake and the Data Warehouse: The Power of a Unified Analytics Warehouse on-demand webinar: https://info.enterprisemanagement.com/how-to-merge-the-data-lake- and-the-data-warehouse-webinar-ws  Check out upcoming webinars from EMA here: https://www.enterprisemanagement.com/freeResearch
  • 3. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Featured Speakers Slide 3 © 2020 Enterprise Management Associates, Inc. John Santaferraro, Research Director, EMA John is the research director for analytics, business intelligence, and data management at EMA. He has 23 years of experience in data and analytics, from startups to executive positions at Fortune 50 companies. His deep understanding of the industry comes from years of leadership in implementation, product and marketing organizations, along with multiple big data imagineering efforts for finance, communications, retail, manufacturing, healthcare, events, oil and gas, and utilities. Jeremiah Morrow, Senior Product Marketing Manager, Vertica Jeremiah is responsible for marketing initiatives involving Vertica’s channel and technology partners. He has worked in sales, marketing, and analyst relations for technology companies ranging from infrastructure to software, and he is passionate about building a strong and collaborative ecosystem to drive value for customers, partners, and Vertica.
  • 4. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Sponsor Trusted by AT&T, Cerner, Uber, The Trade Desk, and many other data-driven organizations, Vertica is the unified analytics warehouse that solves three current market challenges that every organization faces. Despite the disappointment in Hadoop, HDFS data lakes represent a very significant investment for many companies, but the value is not equivalent to original expectations. Combined with the explosion of cloud object storage, organizations struggle even more to unify their data. In addition, organizations favor a multi- cloud or hybrid cloud and on-premises deployment strategy as they face the reality of cloud vendor lock-in, costs, and migration challenges. Finally, machine learning is no longer a data science project, but must be put into production to deliver the predictive analytics information in time to allow proactive actions. Vertica is a Unified Analytics Warehouse that: • Unifies HDFS data and Object Storage data lakes to capitalize on storage investments and maximize business value. • Unifies a company’s deployment options spanning multi-cloud and on-premises to embrace cloud innovations, prevent lock-in, and meet regulatory and security requirements. • Unifies the data science community and the business analyst and IT community, enabling each to continue to use their preferred tools and languages while operationalizing machine learning at scale for real-time predictive analytics. Visit www.vertica.com for more information. Slide 4 © 2020 Enterprise Management Associates, Inc.
  • 5. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Agenda • The changing business and data landscape • Inadequacies of the data lake and data warehouse • The race for a unified analytics warehouse • Requirements for a unified analytics warehouse • Deploying a unified analytics warehouse • Unified analytics warehouse for cloud and hybrid • Vertica and the unified analytics warehouse Slide 5 © 2020 Enterprise Management Associates, Inc.
  • 6. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTINGSlide 6 The changing business and data landscape
  • 7. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Digital, Mobile, IoT Digital Mobile Internet of Things Slide 7 © 2020 Enterprise Management Associates, Inc.
  • 8. Evolution of Data Warehouse Architecture
  • 9. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Business Intelligence CRM ERP Billing Application Data Customer Operational Financial ETL Analytics Database Transactional Data Message Queues Files Data Warehouse Batch Visualization
  • 10. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Data Lake 1 0 Contextual Data Files Weather Geo Low Latency Batch Transactional Data Application Data OLTP/ODS Batch ETL or EL with T done on mass storage Data Prep / Enrichment Streaming Data Application data Web clicks Logs Sensors Operational metrics User tracking Geo-location Visualization Applications Artificial Intelligence Cloud On-Premises AND / OR Distributed Pub/Sub Distributed Prepped Data Object Storage Stream Processing Data Lake Mass Storage Query Engine / Machine Learning
  • 11. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Cooperative Data Architecture 1 1 Contextual Data Files Weather Geo Low Latency Batch Transactional Data Application Data OLTP/ODS Batch ETL or EL with T done on mass storage Data Lake Mass Storage Data Prep / Enrichment Streaming Data Application data Web clicks Logs Sensors Operational metrics User tracking Geo-location Visualization Applications Artificial Intelligence Import Export Query Cloud On-Premises AND / OR Distributed Pub/Sub Distributed Columnar Data Object Storage Stream Processing
  • 12. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING The Need for a Unified Analytics Warehouse “Only structured and semi-structured data combined can deliver on the promise of complete business insights.” Slide 12 © 2020 Enterprise Management Associates, Inc.
  • 13. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTINGSlide 13 Inadequacies of the data lake and data warehouse
  • 14. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING The Data Swamp (Lake) Data Lake Shortcomings • Built for semi-structured data and requires structuring of data on read to analyze transactional data. • Database component lacks enterprise capabilities and does not perform will compared to analytic platforms. Slide 14 © 2020 Enterprise Management Associates, Inc.
  • 15. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING The Data Jailhouse (Warehouse) Data Warehouse Shortcomings • Built for structured data and requires structuring and ingestion of semi-structured data to analyze it. • Too much effort required and doesn’t support real-time ad- hoc analysis of semi structured data. Slide 15 © 2020 Enterprise Management Associates, Inc.
  • 16. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTINGSlide 16 The race for the unified analytics warehouse
  • 17. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING From the Data Lake Side Build On • Database Technology • SQL Query Engines Slide 17 © 2020 Enterprise Management Associates, Inc.
  • 18. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING From the Data Warehouse Side Drill Through • Semi-Structured Data • Tiered Storage • Data Science Tools Slide 18 © 2020 Enterprise Management Associates, Inc.
  • 19. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTINGSlide 19 Requirements for a unified analytics warehouse
  • 20. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING What is a Unified Analytics Warehouse? • Unified • Adequately handles multi-structured data in a single platform • Analytics • Data lake and the data warehouse are primarily for analytics. • Warehouse • Stores multi-structured data in an organized and accessible manner. Slide 20 © 2020 Enterprise Management Associates, Inc.
  • 21. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Unified Analytics Warehouse Slide 21 © 2020 Enterprise Management Associates, Inc.
  • 22. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Unified Analytics Warehouse Slide 22 © 2020 Enterprise Management Associates, Inc.
  • 23. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Unified Analytics Warehouse Slide 23 © 2020 Enterprise Management Associates, Inc.
  • 24. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING What is a Unified Analytics Warehouse? • Unified • Adequately handles multi-structured data in a single platform • Analytics • Data lake and the data warehouse are primarily for analytics. • Warehouse • Stores multi-structured data in an organized and accessible manner. Slide 24 © 2020 Enterprise Management Associates, Inc.
  • 25. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Unified Analytics Warehouse 2 5 Contextual Data Files Weather Geo Low Latency Batch Transactional Data Application Data OLTP/ODS Batch ETL or EL with T done in warehouse Streaming Data Application data Web clicks Logs Sensors Operational metrics User tracking Geo-location Stream Processing Cloud Visualization Applications Artificial Intelligence Shared Storage On-Premises HDFS HDFS AND / OR Ingestion/ ELT/ Data Prep / Enrichment Managed ML Models Reporting / BI Data Science / ML Departmental Use
  • 26. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Data and the Unified Analytics Warehouse • Structured Data • Complex Data Types • JSON • XML • Textual Data • Streaming Data Slide 26 © 2020 Enterprise Management Associates, Inc.
  • 27. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Enterprise and the Unified Analytics Warehouse • Administration • Management • Orchestration • Security • Privacy • Performance • Scalability • Agility • Resiliency • AI Enablement • Recommendations - Structure, Schema, Data Relationships, Tuning • Automation - Metadata Generation, Tuning, Maintenance, Elasticity, Query Routing, Data Tiering, Change Management, Code Generation Slide 27 © 2020 Enterprise Management Associates, Inc.
  • 28. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Infrastructure and the Unified Analytics Warehouse • Analytic Platform • Distributed File System • Object Storage Slide 28 © 2020 Enterprise Management Associates, Inc.
  • 29. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Cloud and the Unified Analytics Warehouse “53% of all data is now in the cloud…” • Single Pane of Glass • Simple Migration • Common Foundation Slide 29 © 2020 Enterprise Management Associates, Inc.
  • 30. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Analytics and the Unified Analytics Warehouse Embedded Analytics • Prebuilt • Common • High-Performance • Rapid Deployment Analytical Performance • Data Intensive • Compute Intensive • High Intensity • High Concurrency Slide 30 © 2020 Enterprise Management Associates, Inc.
  • 31. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Users and the Unified Analytics Warehouse “Users also expect a platform that allows them to put aside religious beliefs about how analytics should be done.” Slide 31 © 2020 Enterprise Management Associates, Inc.
  • 32. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTINGSlide 32 Deploying a unified analytics warehouse
  • 33. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Best Practices for the Unified Analytics Warehouse  Choose a vendor with a vision for the UAW  Choose a platform well into UAW transformation  Choose a project with high value  Choose a project with a mix of data and use cases  Test use case capabilities in a hybrid environment  Plan for end of life and continued migration Slide 33 © 2020 Enterprise Management Associates, Inc.
  • 34. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTINGSlide 34 Unified analytics warehouse for cloud and hybrid environments
  • 35. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING 3 Necessities of for Cloud and Hybrid Success • Common Management • Common Infrastructure • Interoperability • Data interoperability • Query interoperability • Migration interoperability Slide 35 © 2020 Enterprise Management Associates, Inc.
  • 36. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTINGSlide 36 Vertica and the unified analytics warehouse
  • 38. What is Vertica? 3 8 SQL Database Load and store data in a data warehouse designed for blazing fast analytics Query Engine Ask complex analytical questions and get fast answers regardless of where the data resides Vertica is an advanced analytics platform built for the scale and complexity of today’s data-driven world. It combines the power of a high-performance, MPP query engine with advanced analytics and machine learning. Analytics & ML Create, train and deploy advanced analytics and machine learning models at massive scale
  • 39. Remove scale, performance and capacity constraints 3 9 Get data quickly enough to act upon it, explore your data interactively, and enable everyone to make their own data-driven decisions Fear of more uses or growing data volumes is a thing of the past Scale Data Volumes Scale Users SQL Database + Vertica Advanced Analytics Platform + Get data quickly enough to act upon it, explore your data interactively, and enable everyone to make their own data-driven decisions Analytics & ML Query Engine
  • 40. 2020 McKnight Consulting Group Benchmark 4 0 “At 60 concurrent users, Vertica in Eon Mode was 1.8x less expensive than Redshift. The unnamed data cloud platform consistently had the highest price performance.” “Vertica in Eon Mode had 1.14x the QPH of the next highest database (unnamed data cloud platform) for the 60 concurrent user workload.” “Vertica in Eon Mod consistently had the shortest elapsed time for the longest running [query] thread across the concurrency profiles at 250 TB.” Vertica in Eon Mode, Amazon Redshift, and an Unnamed Data Cloud Platform
  • 42. Machine Learning can simplify business processes and improve the customer experience 4 2 Predictive Maintenance System Problem Customer Call Dispatch Onsite Trouble- shooting Remote Monitoring Predicts Potential failure Service Scheduled Problem Avoided Parts Delivery Repair or Replace System Functional Reactive Maintenance Predictive Maintenance
  • 43. High Level Architecture 4 3 Data Lake (Raw Data Archive) Parallel Extract Transform Load Legacy Database s CRM & Business Systems Installed Base Philips Remote Service Network DSP (HSI) Remote Monitoring Radar 4.0 Remote Service RSW & CHA Reliability Dashboards R&D Access
  • 44. Outcomes 4 4 "Remote service provides us with an engineer online all the time. They tell us when we've got a fault before we know we've got a fault. And not only that, they can fix the fault before we knew we had a fault. And that's impressive." Cobalt Imaging, Gloucester "Now we will have more uptime on the scanner and potentially be able to see more patients…It's a new level of service for us, with a greater satisfaction." - Radiographer, New Stobhill Hospital in Glasgow 500 TB of data in more than 300 tables. 30 trillion data points 80 different data sources integrated 24/7 months from scratch to production8 live data feeds. Millions of logs per week. …and marching towards 0 unplanned downtime
  • 45. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING EMA Independent Analysis of Vertica and the UAW 45 © 2020 Enterprise Management Associates, Inc. 0 2 4 6 8 10 Data Requirements Enterprise Requirements Infrastructure Requirements Hybrid Requirements Analytical Requirements User Requirements From EMA - The Race for a Unified Analytics Warehouse
  • 46. IT & DATA MANAGEMENT RESEARCH, INDUSTRY ANALYSIS & CONSULTING Question and Answer: Log Questions in the Q&A panel © 2020 Enterprise Management Associates, Inc. Interested in Learning More? LEARN ABOUT: DOWNLOAD EMA – The Race for a Unified Analytics Warehouse - https://www.vertica.com/resource/the-race-for- a-unified-analytics-warehouse/ McKnight Consulting Group – Cloud Database Performance Benchmark - https://www.vertica.com/cloud-database- benchmark-report-2/ Vertica Community Edition - https://www.vertica.com/try/ Blog – https://www.vertica.com/blog LinkedIn - https://www.linkedin.com/showcase /vertica-co/