SlideShare a Scribd company logo
1 of 32
© 2017 MapR Technologies 1
+
Clarke Patterson, Head of Product Marketing at StreamSets
Ankur Desai, Sr. Product Marketing Manager at MapR
Data Warehouse Modernization:
Accelerating Time-to-Action
© 2017 MapR Technologies 2
Traditional Data Warehouse Architecture: Batch Processing
User
Interaction
Message
Bus
Operational
Database
ETL Analytics/Big
Data Cluster
Analytics
Batch analysis
(e.g., 24 hours)
Runs periodically
to minimize load
on DBMS
Data
collection
© 2017 MapR Technologies 3
More Complexity: Multiple Data Sources and Pipelines
User
Interaction
Message
Bus
Operational
Database ETL Analytics/Big
Data Cluster
Analytics
Batch analysis
(e.g., 24 hours)
© 2017 MapR Technologies 4
Challenges with Traditional Data Warehouse Based Architectures
Ever increasing cost
Expensive licensing models combined with the
massive volumes of data being created means the
cost of data warehousing is significantly rising.
Inability to scale-out
Data warehouses cannot scale-out linearly using
commodity hardware. Buying new expensive
hardware is straining IT budgets.
Unused data driving cost up
70% of data in DW is unused, i.e. never queried in
past 1 year.
Misuse of CPU capacity
Almost 60% of CPU capacity is used for ETL/ELT.
15% of CPU consumed by ETL to load unused data.
This affects performance of queries.
Inability to support non-relational data
Designed for relational data, data warehouses are
not suitable for unstructured data coming from
sensors, logs, devices, social media etc.
Inability to support modern analytics
DWs do not support modern analytics technologies
such as machine learning and stream processing.
>$10K
Cost/TB
70%
Data Unused
60%
CPU used for ETL
© 2017 MapR Technologies 5
Optimizing Your Data Warehouse
with MapR
© 2017 MapR Technologies 6
Select the ideal offload candidates
MapR experts will help you select the data and ETL workload ideal for offload. Keep the frequently
queried data in the data warehouse. Select unused data (often up to 70% of the data in the DW) for
offloading into the MapR Converged Data Platform,
Build the data pipeline
Data migration can be performed using batch methods using NFS or Sqoop or real-time methods using
tools such as Kafka Connect for MapR-ES. Many data warehouses provide connectors for Hadoop that help
to simplify the migration. Upon migration, the data can be stored in MapR-DB or Hive tables, or Parquet or
Avro files depending on requirements.
Deliver the data to the stakeholders
Utilize SQL engines such as Apache Drill, Hive, or Spark SQL to deliver data to traditional BI tools. You can
continue using your favorite BI tools such as Tableau, Qlik, Microstrategy etc. The existing BI teams can
continue querying the offloaded data using SQL. This solution ensures smooth and continuous operation for
BI teams.
Optimizing your Data Warehouse: 3 Step Solution from MapR
1
2
3
3
Steps
5
Weeks
Time-to-Value
40%
Offload Target
© 2017 MapR Technologies 7
Data Warehouse Optimization Reference Architecture
© 2017 MapR Technologies 8
MapR Converged Data Platform: Key Features
Interactive SQL analysis
Apache Drill on MapR platform allows you to use
ANSI SQL to query any data. BI teams can continue
using SQL and the same BI tools.
Multi-temperature storage
MapR provides multi-temperature storage. Store
your hot, warm, and cold data within MapR on
hardware of your choice, further optimizing for cost.
Streaming for real-time insights
MapR Streams allows you to bring data for analysis
as soon as the data is created. In contrast, legacy
DW solutions are batch oriented.
Multi-tenant big data platform
MapR is the only big data platform that provides
multi-tenancy on the data placement level, helping
you meet the regulatory requirements.
Enterprise grade data governance
The MapR platform provides enterprise grade
security, auditing, and lineage to meet your data
governance needs.
Converge SQL and Machine Learning
Single platform for storage, database, and
streaming. Your choice of compute engine on top
(Spark, Hadoop, SQL, Machine learning)
1st
Rank for Data
Warehousing among
Big Data Solutions*
1st
Rank for
Price/Performance**
1st
Rank for Streaming***
*2016 Gartner Critical Capabilities for DW
**TPCx-HS Performance Benchmarks
***DBTA Reader’s Choice Awards
© 2017 MapR Technologies 9
Data
Exploration
Using Drill
Event
Streaming with
MapR-ES
Transformations
Real-Time Analytics and Dashboards with Stream Processing
Operational
Database
Stream Processing
Operational
Database
Operational
Database
Change Data
Capture
Change Data
Capture
Change Data
Capture
Real-time
Business
Intelligence
Static Data
– Inserts
only
Frequently
updated
data
© 2017 MapR Technologies 10
StreamSets Solves the Data
Drift
Top challenges for the big data
warehouse
68%
60%
52%
47%
40%
32%
1%
0% 18% 35% 53% 70% 88%
Ensuring the quality of the data (accuracy,
completeness, consistency)
Complying with security and data privacy policies
Keeping data flow pipelines operating effectively
Building pipelines for getting data into the data
store
Upgrading big data infrastructure components
(Kafka, Hadoop, etc.).
Adapting pipelines to meet new requirements
We have no challenges
What challenges does your company face when
managing your big data flows?
What’s the impact?
Yes
87%
No
13%
Yes
74%
No
26%
Does ‘bad data’ occasionally
get into your data stores?
Do you believe there is any
‘bad data’ in your data
stores currently?
In response…
53% change data
flow pipelines at
least several
times a month
New standards for data warehousing
ETL ETL
Ingest Analyze
Past (ETL)
➢ Fixed schema ETL for Data
warehouses
➢ Source Data structured and
rigid transaction data
Data Sources Data Stores Data Consumers
Emerging (Ingest)
➢ Explosion of Data Stores –
fluid infrastructure
➢ Source Data predominantly
multi-structured interaction
data
➢ Data Drift: Structure,
Semantic, Infrastructure
Delayed and
False Insights
Solving Data Drift
Tools
Applications
Data Stores Data ConsumersData Sources
Poor Data Trust &
Quality
Data Drift
Custom code
Fixed-schema
Trusted & Timely
Insights
Data KPIs
(Trusted High
Quality Data)
Solving Data Drift
Tools
Applications
Data Stores Data ConsumersData Sources
Data Drift
Intent-Driven
Drift-Handling
Think of dataflows as cyclical
processes
Build
Development
processes are far more
complex and drawn out
than they need to be
Execute
The economics of data
have changed, giving
way to a choice of
executing and
deployment options
Operate
Architectures are
constantly changing
and have more
stringent SLA’s
Build
Not all
developers are
created equally
>_
Integrations are
abundant and
unnecessarily rigid
Build-to-deploy takes
far longer than
necessary
Execute
Multiple deployment
options exist yet
constraints limit making
use of them
Mixed workloads are the
norm, must handle both
batch and streaming
11001001001001101001
00101010010010010010
10100100100101010101
01001001001010100100
11010001110010100100
10010010100101110101
Scalability is a must, both
today and into the future
Operate
Increasingly, the business
expects SLA’s on the
quality and timeliness of
data
Architectures are
constantly evolving, with
new versions or new
projects regularly being
added
Data, and it’s structure,
will inevitably change,
causing wide spread
impact
StreamSets Data Operations
Platform
EFFICIENCY
Intent Driven Flows
Batch & Streaming Ingest
In-stream Sanitization
MASTER
Availability & Accuracy
Proactive Remediation
MEASURE
Any Path
Any Time
MAP
Dataflow Lineage
Live Data Architecture
CONTROL
Drift Handling
Stage & Flow Metrics
Lineage & Impact Analysis
AGILITY
Flexible deployment
Exception Handling
Seamless Evolution
EVOLVE (Proactive)
REMEDIATE (Reactive)
DEVELOP OPERATE
CloudClusterStandalone
StreamSets Data Collector Dataflow Performance Manager
Edge
StreamSets & MapR optimize the
EDW
StreamSets & MapR enable real-
time streams
Operational
Database
Operational
Database
Operational
Database
Change Data
Capture
Change Data
Capture
Change Data
Capture
Data
Exploration
Using Drill
Event Streaming
with MapR-ES
Transformations
Stream Processing
Real-time
Business
Intelligence
Static Data
– Inserts
only
Frequently
updated
data
© 2017 MapR Technologies 23
Business Benefits
© 2017 MapR Technologies 24
MapR Converged Data Platform: Key Business Benefits
$10M
In Savings per 1 PB
over 5 years
53%
Lower CapEx for the
combined DW +
MapR Solution
90%
Lower CapEX for
MapR vs Legacy DW
Reduce TCO of data analysis
Sharply reduce the cost of data management
and analytics. The cost saving can be utilized
towards revenue generating innovations.
Maximize value of current investment
Increase available “headroom” and avoid or
minimize new CapEx. Improve performance of
your existing data warehousing assets.
Answer new questions
Use analytics tools unavailable in legacy data
warehousing (Drill, Parquet, Spark, Machine
Learning, others).
Leverage existing skillsets
BI teams can continue using familiar tools such
as Tableau, Qlik, Microstrategy etc. on both
DW and the MapR data lake.
Leverage hybrid deployment model
MapR provides single global namespace to
help you create a homogeneous data fabric
across on-premises and cloud hosted data.
Get results fasters
The MapR Quick Start Solution for Data
Warehousing will help you get value for
optimization project within 5 weeks.
© 2017 MapR Technologies 25
Beyond Cost Reduction: IDC discovers 4X ROI for MapR Customers
4X
ROI
8.2
Months
Payback Time
39%
Higher Developer
Productivity
$19.44M
Avg. Business
Benefits
42%
Lower TCO vs. Other
Big Data Vendors
31%
Higher Data Scientist
Productivity
© 2017 MapR Technologies 26
Case Studies
© 2017 MapR Technologies 27
Cisco was able to analyze sales opportunities in 1/10 the time, at 1/10 the cost, and
generated $40 million in incremental service bookings in the first year.
Cisco uses integrated customer data to increase revenues
• Create shared view of customer & operations across 75,000 employees
• Increase revenue opportunities with sales partners
• Customer information was siloed in different divisions
• Customer interactions were inconsistent and not satisfying
• Missed opportunities for upselling/cross selling
• Use MapR to collect customer information across touch points
• Integrate billing, support, manufacturing, social media, websites, dial-in data
• Generate new sales leads internally and for partners
OBJECTIVES
CHALLENGES
SOLUTION
Architecture for
Sales Partner Opportunities
Business
Impact
© 2017 MapR Technologies 28
Zions Bank builds cost effective security analytics and fraud detection
on one platform
• Fraud Operations and Security Analytics team at Zions maintains data stores, builds
statistical models to detect fraud, and then uses these models to data mine and
evaluate suspicious activity
“We initially got into centralizing all of our data from an information security perspective. We
then saw that we could use this same environment to help with fraud detection”
Michael Fowkes - SVP Fraud Operations and Security Analytics
• Existing technology infrastructure could not scale
• Timeliness of reports degraded over the last several years
• Chose MapR and cut storage costs by 50%
• Querying time reduced from 24 hours to 30 min on 1.2 PB of data
• Leverage MapR scale for increased model accuracy and deeper insights
OBJECTIVES
CHALLENGES
SOLUTION
Business
Impact
Rich history of industry recognition
Cool Vendor in Data
Management, 2017
Best Open Source Tool,
2016
10 Coolest Big Data
Startups of 2016
© 2017 MapR Technologies 30
Thank You
© 2017 MapR Technologies 31
Q&A
ENGAGE WITH US
Contact us at:
855-NOW-MAPR
Or
maprisr@mapr.com
https://twitter.com/mapr
https://www.linkedin.com/company/mapr-
technologies
Follow us at:
© 2017 MapR Technologies 32
Additional Resources
• Learn more at our Solution Page at: https://mapr.com/dwo
• Try MapR at https://mapr.com/download/
• Blog: Best Practices on Migrating from a Data Warehouse to a Big Data Platform

More Related Content

What's hot

Introduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse ArchitectureIntroduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse ArchitectureDatabricks
 
Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...
Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...
Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...Cathrine Wilhelmsen
 
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...DataScienceConferenc1
 
Intro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on SnowflakeIntro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on SnowflakeKent Graziano
 
Databricks Fundamentals
Databricks FundamentalsDatabricks Fundamentals
Databricks FundamentalsDalibor Wijas
 
Data Mesh for Dinner
Data Mesh for DinnerData Mesh for Dinner
Data Mesh for DinnerKent Graziano
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureDatabricks
 
Snowflake Architecture.pptx
Snowflake Architecture.pptxSnowflake Architecture.pptx
Snowflake Architecture.pptxchennakesava44
 
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021Tristan Baker
 
Big Data and Data Warehousing Together with Azure Synapse Analytics (SQLBits ...
Big Data and Data Warehousing Together with Azure Synapse Analytics (SQLBits ...Big Data and Data Warehousing Together with Azure Synapse Analytics (SQLBits ...
Big Data and Data Warehousing Together with Azure Synapse Analytics (SQLBits ...Michael Rys
 
Data Warehouse or Data Lake, Which Do I Choose?
Data Warehouse or Data Lake, Which Do I Choose?Data Warehouse or Data Lake, Which Do I Choose?
Data Warehouse or Data Lake, Which Do I Choose?DATAVERSITY
 
Data Lake Overview
Data Lake OverviewData Lake Overview
Data Lake OverviewJames Serra
 
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)Kent Graziano
 
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data PipelinesPutting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data PipelinesDATAVERSITY
 

What's hot (20)

Introduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse ArchitectureIntroduction SQL Analytics on Lakehouse Architecture
Introduction SQL Analytics on Lakehouse Architecture
 
Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...
Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...
Lessons Learned: Understanding Pipeline Pricing in Azure Data Factory and Azu...
 
Modern Data Architecture
Modern Data ArchitectureModern Data Architecture
Modern Data Architecture
 
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
[DSC Europe 22] Lakehouse architecture with Delta Lake and Databricks - Draga...
 
Intro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on SnowflakeIntro to Data Vault 2.0 on Snowflake
Intro to Data Vault 2.0 on Snowflake
 
Databricks Fundamentals
Databricks FundamentalsDatabricks Fundamentals
Databricks Fundamentals
 
Data Mesh for Dinner
Data Mesh for DinnerData Mesh for Dinner
Data Mesh for Dinner
 
Modernizing to a Cloud Data Architecture
Modernizing to a Cloud Data ArchitectureModernizing to a Cloud Data Architecture
Modernizing to a Cloud Data Architecture
 
Snowflake Architecture.pptx
Snowflake Architecture.pptxSnowflake Architecture.pptx
Snowflake Architecture.pptx
 
Lakehouse in Azure
Lakehouse in AzureLakehouse in Azure
Lakehouse in Azure
 
Data Mesh
Data MeshData Mesh
Data Mesh
 
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
 
Webinar Data Mesh - Part 3
Webinar Data Mesh - Part 3Webinar Data Mesh - Part 3
Webinar Data Mesh - Part 3
 
Big Data and Data Warehousing Together with Azure Synapse Analytics (SQLBits ...
Big Data and Data Warehousing Together with Azure Synapse Analytics (SQLBits ...Big Data and Data Warehousing Together with Azure Synapse Analytics (SQLBits ...
Big Data and Data Warehousing Together with Azure Synapse Analytics (SQLBits ...
 
Data Warehouse or Data Lake, Which Do I Choose?
Data Warehouse or Data Lake, Which Do I Choose?Data Warehouse or Data Lake, Which Do I Choose?
Data Warehouse or Data Lake, Which Do I Choose?
 
Architecting a datalake
Architecting a datalakeArchitecting a datalake
Architecting a datalake
 
Data Vault Overview
Data Vault OverviewData Vault Overview
Data Vault Overview
 
Data Lake Overview
Data Lake OverviewData Lake Overview
Data Lake Overview
 
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
Agile Data Engineering: Introduction to Data Vault 2.0 (2018)
 
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data PipelinesPutting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
Putting the Ops in DataOps: Orchestrate the Flow of Data Across Data Pipelines
 

Similar to Data Warehouse Modernization: Accelerating Time-To-Action

Enabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data CaptureEnabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data CaptureMapR Technologies
 
Accelerating Big Data Analytics
Accelerating Big Data AnalyticsAccelerating Big Data Analytics
Accelerating Big Data AnalyticsAttunity
 
Vikram Andem Big Data Strategy @ IATA Technology Roadmap
Vikram Andem Big Data Strategy @ IATA Technology Roadmap Vikram Andem Big Data Strategy @ IATA Technology Roadmap
Vikram Andem Big Data Strategy @ IATA Technology Roadmap IT Strategy Group
 
Transform your DBMS to drive engagement innovation with Big Data
Transform your DBMS to drive engagement innovation with Big DataTransform your DBMS to drive engagement innovation with Big Data
Transform your DBMS to drive engagement innovation with Big DataAshnikbiz
 
Big Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RKBig Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RKRajesh Jayarman
 
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...Precisely
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureDATAVERSITY
 
Seagate: Sensor Overload! Taming The Raging Manufacturing Big Data Torrent
Seagate: Sensor Overload! Taming The Raging Manufacturing Big Data TorrentSeagate: Sensor Overload! Taming The Raging Manufacturing Big Data Torrent
Seagate: Sensor Overload! Taming The Raging Manufacturing Big Data TorrentSeeling Cheung
 
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)Denodo
 
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRobertsWP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRobertsJane Roberts
 
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...DATAVERSITY
 
Skillwise Big Data part 2
Skillwise Big Data part 2Skillwise Big Data part 2
Skillwise Big Data part 2Skillwise Group
 
Traditional data word
Traditional data wordTraditional data word
Traditional data wordorcoxsm
 
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
Enabling Next Gen Analytics with Azure Data Lake and StreamSetsEnabling Next Gen Analytics with Azure Data Lake and StreamSets
Enabling Next Gen Analytics with Azure Data Lake and StreamSetsStreamsets Inc.
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database RoundtableEric Kavanagh
 
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization Denodo
 
Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?Precisely
 
Lecture4 big data technology foundations
Lecture4 big data technology foundationsLecture4 big data technology foundations
Lecture4 big data technology foundationshktripathy
 

Similar to Data Warehouse Modernization: Accelerating Time-To-Action (20)

Enabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data CaptureEnabling Real-Time Business with Change Data Capture
Enabling Real-Time Business with Change Data Capture
 
Accelerating Big Data Analytics
Accelerating Big Data AnalyticsAccelerating Big Data Analytics
Accelerating Big Data Analytics
 
Vikram Andem Big Data Strategy @ IATA Technology Roadmap
Vikram Andem Big Data Strategy @ IATA Technology Roadmap Vikram Andem Big Data Strategy @ IATA Technology Roadmap
Vikram Andem Big Data Strategy @ IATA Technology Roadmap
 
Transform your DBMS to drive engagement innovation with Big Data
Transform your DBMS to drive engagement innovation with Big DataTransform your DBMS to drive engagement innovation with Big Data
Transform your DBMS to drive engagement innovation with Big Data
 
Big Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RKBig Data Practice_Planning_steps_RK
Big Data Practice_Planning_steps_RK
 
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
Engineering Machine Learning Data Pipelines Series: Streaming New Data as It ...
 
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data ArchitectureADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
ADV Slides: When and How Data Lakes Fit into a Modern Data Architecture
 
Seagate: Sensor Overload! Taming The Raging Manufacturing Big Data Torrent
Seagate: Sensor Overload! Taming The Raging Manufacturing Big Data TorrentSeagate: Sensor Overload! Taming The Raging Manufacturing Big Data Torrent
Seagate: Sensor Overload! Taming The Raging Manufacturing Big Data Torrent
 
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
Logical Data Lakes: From Single Purpose to Multipurpose Data Lakes (APAC)
 
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRobertsWP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
WP_Impetus_2016_Guide_to_Modernize_Your_Enterprise_Data_Warehouse_JRoberts
 
Skilwise Big data
Skilwise Big dataSkilwise Big data
Skilwise Big data
 
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
ADV Slides: The Evolution of the Data Platform and What It Means to Enterpris...
 
Skillwise Big Data part 2
Skillwise Big Data part 2Skillwise Big Data part 2
Skillwise Big Data part 2
 
Traditional data word
Traditional data wordTraditional data word
Traditional data word
 
Hadoop and Your Enterprise Data Warehouse
Hadoop and Your Enterprise Data WarehouseHadoop and Your Enterprise Data Warehouse
Hadoop and Your Enterprise Data Warehouse
 
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
Enabling Next Gen Analytics with Azure Data Lake and StreamSetsEnabling Next Gen Analytics with Azure Data Lake and StreamSets
Enabling Next Gen Analytics with Azure Data Lake and StreamSets
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
 
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
DAMA & Denodo Webinar: Modernizing Data Architecture Using Data Virtualization
 
Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?Which Change Data Capture Strategy is Right for You?
Which Change Data Capture Strategy is Right for You?
 
Lecture4 big data technology foundations
Lecture4 big data technology foundationsLecture4 big data technology foundations
Lecture4 big data technology foundations
 

More from MapR Technologies

Converging your data landscape
Converging your data landscapeConverging your data landscape
Converging your data landscapeMapR Technologies
 
ML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationMapR Technologies
 
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataSelf-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataMapR Technologies
 
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...MapR Technologies
 
ML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning LogisticsML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning LogisticsMapR Technologies
 
Machine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMachine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMapR Technologies
 
Live Tutorial – Streaming Real-Time Events Using Apache APIs
Live Tutorial – Streaming Real-Time Events Using Apache APIsLive Tutorial – Streaming Real-Time Events Using Apache APIs
Live Tutorial – Streaming Real-Time Events Using Apache APIsMapR Technologies
 
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale StorageBringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale StorageMapR Technologies
 
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionLive Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionMapR Technologies
 
An Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data PlatformAn Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data PlatformMapR Technologies
 
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...MapR Technologies
 
Best Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in HealthcareBest Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in HealthcareMapR Technologies
 
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsGeo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsMapR Technologies
 
MapR Product Update - Spring 2017
MapR Product Update - Spring 2017MapR Product Update - Spring 2017
MapR Product Update - Spring 2017MapR Technologies
 
3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data Analytics3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data AnalyticsMapR Technologies
 
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA DeploymentsCisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA DeploymentsMapR Technologies
 
MapR and Cisco Make IT Better
MapR and Cisco Make IT BetterMapR and Cisco Make IT Better
MapR and Cisco Make IT BetterMapR Technologies
 
Evolving from RDBMS to NoSQL + SQL
Evolving from RDBMS to NoSQL + SQLEvolving from RDBMS to NoSQL + SQL
Evolving from RDBMS to NoSQL + SQLMapR Technologies
 
Evolving Beyond the Data Lake: A Story of Wind and Rain
Evolving Beyond the Data Lake: A Story of Wind and RainEvolving Beyond the Data Lake: A Story of Wind and Rain
Evolving Beyond the Data Lake: A Story of Wind and RainMapR Technologies
 
Open Source Innovations in the MapR Ecosystem Pack 2.0
Open Source Innovations in the MapR Ecosystem Pack 2.0Open Source Innovations in the MapR Ecosystem Pack 2.0
Open Source Innovations in the MapR Ecosystem Pack 2.0MapR Technologies
 

More from MapR Technologies (20)

Converging your data landscape
Converging your data landscapeConverging your data landscape
Converging your data landscape
 
ML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & EvaluationML Workshop 2: Machine Learning Model Comparison & Evaluation
ML Workshop 2: Machine Learning Model Comparison & Evaluation
 
Self-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your DataSelf-Service Data Science for Leveraging ML & AI on All of Your Data
Self-Service Data Science for Leveraging ML & AI on All of Your Data
 
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
Machine Learning for Chickens, Autonomous Driving and a 3-year-old Who Won’t ...
 
ML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning LogisticsML Workshop 1: A New Architecture for Machine Learning Logistics
ML Workshop 1: A New Architecture for Machine Learning Logistics
 
Machine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model ManagementMachine Learning Success: The Key to Easier Model Management
Machine Learning Success: The Key to Easier Model Management
 
Live Tutorial – Streaming Real-Time Events Using Apache APIs
Live Tutorial – Streaming Real-Time Events Using Apache APIsLive Tutorial – Streaming Real-Time Events Using Apache APIs
Live Tutorial – Streaming Real-Time Events Using Apache APIs
 
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale StorageBringing Structure, Scalability, and Services to Cloud-Scale Storage
Bringing Structure, Scalability, and Services to Cloud-Scale Storage
 
Live Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn PredictionLive Machine Learning Tutorial: Churn Prediction
Live Machine Learning Tutorial: Churn Prediction
 
An Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data PlatformAn Introduction to the MapR Converged Data Platform
An Introduction to the MapR Converged Data Platform
 
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
How to Leverage the Cloud for Business Solutions | Strata Data Conference Lon...
 
Best Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in HealthcareBest Practices for Data Convergence in Healthcare
Best Practices for Data Convergence in Healthcare
 
Geo-Distributed Big Data and Analytics
Geo-Distributed Big Data and AnalyticsGeo-Distributed Big Data and Analytics
Geo-Distributed Big Data and Analytics
 
MapR Product Update - Spring 2017
MapR Product Update - Spring 2017MapR Product Update - Spring 2017
MapR Product Update - Spring 2017
 
3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data Analytics3 Benefits of Multi-Temperature Data Management for Data Analytics
3 Benefits of Multi-Temperature Data Management for Data Analytics
 
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA DeploymentsCisco & MapR bring 3 Superpowers to SAP HANA Deployments
Cisco & MapR bring 3 Superpowers to SAP HANA Deployments
 
MapR and Cisco Make IT Better
MapR and Cisco Make IT BetterMapR and Cisco Make IT Better
MapR and Cisco Make IT Better
 
Evolving from RDBMS to NoSQL + SQL
Evolving from RDBMS to NoSQL + SQLEvolving from RDBMS to NoSQL + SQL
Evolving from RDBMS to NoSQL + SQL
 
Evolving Beyond the Data Lake: A Story of Wind and Rain
Evolving Beyond the Data Lake: A Story of Wind and RainEvolving Beyond the Data Lake: A Story of Wind and Rain
Evolving Beyond the Data Lake: A Story of Wind and Rain
 
Open Source Innovations in the MapR Ecosystem Pack 2.0
Open Source Innovations in the MapR Ecosystem Pack 2.0Open Source Innovations in the MapR Ecosystem Pack 2.0
Open Source Innovations in the MapR Ecosystem Pack 2.0
 

Recently uploaded

Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Researchmichael115558
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...amitlee9823
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxolyaivanovalion
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfMarinCaroMartnezBerg
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...amitlee9823
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusTimothy Spann
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Valters Lauzums
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Standamitlee9823
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...only4webmaster01
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteedamy56318795
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...amitlee9823
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...amitlee9823
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxolyaivanovalion
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceDelhi Call girls
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls
 

Recently uploaded (20)

Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
Chintamani Call Girls: 🍓 7737669865 🍓 High Profile Model Escorts | Bangalore ...
 
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night StandCall Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Bellandur ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Anomaly detection and data imputation within time series
Anomaly detection and data imputation within time seriesAnomaly detection and data imputation within time series
Anomaly detection and data imputation within time series
 
BigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptxBigBuy dropshipping via API with DroFx.pptx
BigBuy dropshipping via API with DroFx.pptx
 
FESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdfFESE Capital Markets Fact Sheet 2024 Q1.pdf
FESE Capital Markets Fact Sheet 2024 Q1.pdf
 
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night StandCall Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Attibele ☎ 7737669865 🥵 Book Your One night Stand
 
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
Vip Mumbai Call Girls Thane West Call On 9920725232 With Body to body massage...
 
Generative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and MilvusGenerative AI on Enterprise Cloud with NiFi and Milvus
Generative AI on Enterprise Cloud with NiFi and Milvus
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night StandCall Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
Call Girls In Doddaballapur Road ☎ 7737669865 🥵 Book Your One night Stand
 
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
Call Girls Indiranagar Just Call 👗 9155563397 👗 Top Class Call Girl Service B...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
(NEHA) Call Girls Katra Call Now 8617697112 Katra Escorts 24x7
 
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
Mg Road Call Girls Service: 🍓 7737669865 🍓 High Profile Model Escorts | Banga...
 
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
Call Girls Bommasandra Just Call 👗 7737669865 👗 Top Class Call Girl Service B...
 
Midocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFxMidocean dropshipping via API with DroFx
Midocean dropshipping via API with DroFx
 
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort ServiceBDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
BDSM⚡Call Girls in Mandawali Delhi >༒8448380779 Escort Service
 
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
 

Data Warehouse Modernization: Accelerating Time-To-Action

  • 1. © 2017 MapR Technologies 1 + Clarke Patterson, Head of Product Marketing at StreamSets Ankur Desai, Sr. Product Marketing Manager at MapR Data Warehouse Modernization: Accelerating Time-to-Action
  • 2. © 2017 MapR Technologies 2 Traditional Data Warehouse Architecture: Batch Processing User Interaction Message Bus Operational Database ETL Analytics/Big Data Cluster Analytics Batch analysis (e.g., 24 hours) Runs periodically to minimize load on DBMS Data collection
  • 3. © 2017 MapR Technologies 3 More Complexity: Multiple Data Sources and Pipelines User Interaction Message Bus Operational Database ETL Analytics/Big Data Cluster Analytics Batch analysis (e.g., 24 hours)
  • 4. © 2017 MapR Technologies 4 Challenges with Traditional Data Warehouse Based Architectures Ever increasing cost Expensive licensing models combined with the massive volumes of data being created means the cost of data warehousing is significantly rising. Inability to scale-out Data warehouses cannot scale-out linearly using commodity hardware. Buying new expensive hardware is straining IT budgets. Unused data driving cost up 70% of data in DW is unused, i.e. never queried in past 1 year. Misuse of CPU capacity Almost 60% of CPU capacity is used for ETL/ELT. 15% of CPU consumed by ETL to load unused data. This affects performance of queries. Inability to support non-relational data Designed for relational data, data warehouses are not suitable for unstructured data coming from sensors, logs, devices, social media etc. Inability to support modern analytics DWs do not support modern analytics technologies such as machine learning and stream processing. >$10K Cost/TB 70% Data Unused 60% CPU used for ETL
  • 5. © 2017 MapR Technologies 5 Optimizing Your Data Warehouse with MapR
  • 6. © 2017 MapR Technologies 6 Select the ideal offload candidates MapR experts will help you select the data and ETL workload ideal for offload. Keep the frequently queried data in the data warehouse. Select unused data (often up to 70% of the data in the DW) for offloading into the MapR Converged Data Platform, Build the data pipeline Data migration can be performed using batch methods using NFS or Sqoop or real-time methods using tools such as Kafka Connect for MapR-ES. Many data warehouses provide connectors for Hadoop that help to simplify the migration. Upon migration, the data can be stored in MapR-DB or Hive tables, or Parquet or Avro files depending on requirements. Deliver the data to the stakeholders Utilize SQL engines such as Apache Drill, Hive, or Spark SQL to deliver data to traditional BI tools. You can continue using your favorite BI tools such as Tableau, Qlik, Microstrategy etc. The existing BI teams can continue querying the offloaded data using SQL. This solution ensures smooth and continuous operation for BI teams. Optimizing your Data Warehouse: 3 Step Solution from MapR 1 2 3 3 Steps 5 Weeks Time-to-Value 40% Offload Target
  • 7. © 2017 MapR Technologies 7 Data Warehouse Optimization Reference Architecture
  • 8. © 2017 MapR Technologies 8 MapR Converged Data Platform: Key Features Interactive SQL analysis Apache Drill on MapR platform allows you to use ANSI SQL to query any data. BI teams can continue using SQL and the same BI tools. Multi-temperature storage MapR provides multi-temperature storage. Store your hot, warm, and cold data within MapR on hardware of your choice, further optimizing for cost. Streaming for real-time insights MapR Streams allows you to bring data for analysis as soon as the data is created. In contrast, legacy DW solutions are batch oriented. Multi-tenant big data platform MapR is the only big data platform that provides multi-tenancy on the data placement level, helping you meet the regulatory requirements. Enterprise grade data governance The MapR platform provides enterprise grade security, auditing, and lineage to meet your data governance needs. Converge SQL and Machine Learning Single platform for storage, database, and streaming. Your choice of compute engine on top (Spark, Hadoop, SQL, Machine learning) 1st Rank for Data Warehousing among Big Data Solutions* 1st Rank for Price/Performance** 1st Rank for Streaming*** *2016 Gartner Critical Capabilities for DW **TPCx-HS Performance Benchmarks ***DBTA Reader’s Choice Awards
  • 9. © 2017 MapR Technologies 9 Data Exploration Using Drill Event Streaming with MapR-ES Transformations Real-Time Analytics and Dashboards with Stream Processing Operational Database Stream Processing Operational Database Operational Database Change Data Capture Change Data Capture Change Data Capture Real-time Business Intelligence Static Data – Inserts only Frequently updated data
  • 10. © 2017 MapR Technologies 10 StreamSets Solves the Data Drift
  • 11. Top challenges for the big data warehouse 68% 60% 52% 47% 40% 32% 1% 0% 18% 35% 53% 70% 88% Ensuring the quality of the data (accuracy, completeness, consistency) Complying with security and data privacy policies Keeping data flow pipelines operating effectively Building pipelines for getting data into the data store Upgrading big data infrastructure components (Kafka, Hadoop, etc.). Adapting pipelines to meet new requirements We have no challenges What challenges does your company face when managing your big data flows?
  • 12. What’s the impact? Yes 87% No 13% Yes 74% No 26% Does ‘bad data’ occasionally get into your data stores? Do you believe there is any ‘bad data’ in your data stores currently? In response… 53% change data flow pipelines at least several times a month
  • 13. New standards for data warehousing ETL ETL Ingest Analyze Past (ETL) ➢ Fixed schema ETL for Data warehouses ➢ Source Data structured and rigid transaction data Data Sources Data Stores Data Consumers Emerging (Ingest) ➢ Explosion of Data Stores – fluid infrastructure ➢ Source Data predominantly multi-structured interaction data ➢ Data Drift: Structure, Semantic, Infrastructure
  • 14. Delayed and False Insights Solving Data Drift Tools Applications Data Stores Data ConsumersData Sources Poor Data Trust & Quality Data Drift Custom code Fixed-schema
  • 15. Trusted & Timely Insights Data KPIs (Trusted High Quality Data) Solving Data Drift Tools Applications Data Stores Data ConsumersData Sources Data Drift Intent-Driven Drift-Handling
  • 16. Think of dataflows as cyclical processes Build Development processes are far more complex and drawn out than they need to be Execute The economics of data have changed, giving way to a choice of executing and deployment options Operate Architectures are constantly changing and have more stringent SLA’s
  • 17. Build Not all developers are created equally >_ Integrations are abundant and unnecessarily rigid Build-to-deploy takes far longer than necessary
  • 18. Execute Multiple deployment options exist yet constraints limit making use of them Mixed workloads are the norm, must handle both batch and streaming 11001001001001101001 00101010010010010010 10100100100101010101 01001001001010100100 11010001110010100100 10010010100101110101 Scalability is a must, both today and into the future
  • 19. Operate Increasingly, the business expects SLA’s on the quality and timeliness of data Architectures are constantly evolving, with new versions or new projects regularly being added Data, and it’s structure, will inevitably change, causing wide spread impact
  • 20. StreamSets Data Operations Platform EFFICIENCY Intent Driven Flows Batch & Streaming Ingest In-stream Sanitization MASTER Availability & Accuracy Proactive Remediation MEASURE Any Path Any Time MAP Dataflow Lineage Live Data Architecture CONTROL Drift Handling Stage & Flow Metrics Lineage & Impact Analysis AGILITY Flexible deployment Exception Handling Seamless Evolution EVOLVE (Proactive) REMEDIATE (Reactive) DEVELOP OPERATE CloudClusterStandalone StreamSets Data Collector Dataflow Performance Manager Edge
  • 21. StreamSets & MapR optimize the EDW
  • 22. StreamSets & MapR enable real- time streams Operational Database Operational Database Operational Database Change Data Capture Change Data Capture Change Data Capture Data Exploration Using Drill Event Streaming with MapR-ES Transformations Stream Processing Real-time Business Intelligence Static Data – Inserts only Frequently updated data
  • 23. © 2017 MapR Technologies 23 Business Benefits
  • 24. © 2017 MapR Technologies 24 MapR Converged Data Platform: Key Business Benefits $10M In Savings per 1 PB over 5 years 53% Lower CapEx for the combined DW + MapR Solution 90% Lower CapEX for MapR vs Legacy DW Reduce TCO of data analysis Sharply reduce the cost of data management and analytics. The cost saving can be utilized towards revenue generating innovations. Maximize value of current investment Increase available “headroom” and avoid or minimize new CapEx. Improve performance of your existing data warehousing assets. Answer new questions Use analytics tools unavailable in legacy data warehousing (Drill, Parquet, Spark, Machine Learning, others). Leverage existing skillsets BI teams can continue using familiar tools such as Tableau, Qlik, Microstrategy etc. on both DW and the MapR data lake. Leverage hybrid deployment model MapR provides single global namespace to help you create a homogeneous data fabric across on-premises and cloud hosted data. Get results fasters The MapR Quick Start Solution for Data Warehousing will help you get value for optimization project within 5 weeks.
  • 25. © 2017 MapR Technologies 25 Beyond Cost Reduction: IDC discovers 4X ROI for MapR Customers 4X ROI 8.2 Months Payback Time 39% Higher Developer Productivity $19.44M Avg. Business Benefits 42% Lower TCO vs. Other Big Data Vendors 31% Higher Data Scientist Productivity
  • 26. © 2017 MapR Technologies 26 Case Studies
  • 27. © 2017 MapR Technologies 27 Cisco was able to analyze sales opportunities in 1/10 the time, at 1/10 the cost, and generated $40 million in incremental service bookings in the first year. Cisco uses integrated customer data to increase revenues • Create shared view of customer & operations across 75,000 employees • Increase revenue opportunities with sales partners • Customer information was siloed in different divisions • Customer interactions were inconsistent and not satisfying • Missed opportunities for upselling/cross selling • Use MapR to collect customer information across touch points • Integrate billing, support, manufacturing, social media, websites, dial-in data • Generate new sales leads internally and for partners OBJECTIVES CHALLENGES SOLUTION Architecture for Sales Partner Opportunities Business Impact
  • 28. © 2017 MapR Technologies 28 Zions Bank builds cost effective security analytics and fraud detection on one platform • Fraud Operations and Security Analytics team at Zions maintains data stores, builds statistical models to detect fraud, and then uses these models to data mine and evaluate suspicious activity “We initially got into centralizing all of our data from an information security perspective. We then saw that we could use this same environment to help with fraud detection” Michael Fowkes - SVP Fraud Operations and Security Analytics • Existing technology infrastructure could not scale • Timeliness of reports degraded over the last several years • Chose MapR and cut storage costs by 50% • Querying time reduced from 24 hours to 30 min on 1.2 PB of data • Leverage MapR scale for increased model accuracy and deeper insights OBJECTIVES CHALLENGES SOLUTION Business Impact
  • 29. Rich history of industry recognition Cool Vendor in Data Management, 2017 Best Open Source Tool, 2016 10 Coolest Big Data Startups of 2016
  • 30. © 2017 MapR Technologies 30 Thank You
  • 31. © 2017 MapR Technologies 31 Q&A ENGAGE WITH US Contact us at: 855-NOW-MAPR Or maprisr@mapr.com https://twitter.com/mapr https://www.linkedin.com/company/mapr- technologies Follow us at:
  • 32. © 2017 MapR Technologies 32 Additional Resources • Learn more at our Solution Page at: https://mapr.com/dwo • Try MapR at https://mapr.com/download/ • Blog: Best Practices on Migrating from a Data Warehouse to a Big Data Platform

Editor's Notes

  1. Now, let’s look at how these events are generally analyzed today. Many customers are using batch oriented analysis for many critical business decisions.
  2. History is repeating itself Past: 70% of data warehouse projects used to fail. Fixed-schema ETL technology came along and automated what was previously a manual and brittle task. Future: Explosion of Big Data apps, tools and techniques. Tied to specific data stores (fluid & multiplying). Inherent schema-centricity of legacy ETL tools prevent them from being used in the extracting and loading (now called ingest) of semi-structured data. Organizations have resorted to manually-coded data ingest pipelines. Manually-coded pipelines are unsustainable, but more importantly fail due to drift.
  3. You have been hearing about the business impact of big data applications for half a decade now. Commonality is in the source of data…while the previous decades of applications focused on transaction data, the emerging use case focuses on event and interaction data...these sources are not just databases and apps, but rather logs, devices, and device data. Big data sources (e.g. systems, sensors) suffer from data drift--the unending, unpredictable and unannounced mutation of data caused by the operations, maintenance and modernization of data sources Today data is delivered to data stores by writing low-level code to transport mechanisms such as Sqoop, Flume and Kafka.…create big problems in data stores Brittle: Data flows break frequently because low-level code can’t adapt when structure changes. Opaque: These problems manifest themselves as surprises because there is no visibility into the health of data flows or the data being delivered. Adhoc: Data integrity corrodes as the meaning of the data changes without detection and new or changed fields do not get properly processed. …with serious business implications Poor business decisions get made based on incomplete, inaccurate or late data Trust in the data is lost as these errors are discovered post hoc Productivity and agility is sacrificed as data engineers and scientists spend all of their time fixing pipelines, doing janitorial work and forensics.
  4. You have been hearing about the business impact of big data applications for half a decade now. Commonality is in the source of data…while the previous decades of applications focused on transaction data, the emerging use case focuses on event and interaction data...these sources are not just databases and apps, but rather logs, devices, and device data. Big data sources (e.g. systems, sensors) suffer from data drift--the unending, unpredictable and unannounced mutation of data caused by the operations, maintenance and modernization of data sources Today data is delivered to data stores by writing low-level code to transport mechanisms such as Sqoop, Flume and Kafka.…create big problems in data stores Brittle: Data flows break frequently because low-level code can’t adapt when structure changes. Opaque: These problems manifest themselves as surprises because there is no visibility into the health of data flows or the data being delivered. Adhoc: Data integrity corrodes as the meaning of the data changes without detection and new or changed fields do not get properly processed. …with serious business implications Poor business decisions get made based on incomplete, inaccurate or late data Trust in the data is lost as these errors are discovered post hoc Productivity and agility is sacrificed as data engineers and scientists spend all of their time fixing pipelines, doing janitorial work and forensics.
  5. Key point: With the right approach, ingest can happen far more effectively and efficiently that before Sub point 1: Not everyone is a developer On one hand we’re extremely lucky: we’re in a market where there’s seemingly an endless number of choices for solving our various data problems. The tricky part is many of them are rather technical in nature, requiring developing new skills or seeking out hard to find resources (ie.personnel) to make use of them. While many folks thrive on being a hard core developer, many others do not, a lot of times simply because it’s faster to use simplified tooling in order to complete a project faster. The point here is you should not be constrained from taking advantage of new technologies if you lack the skills, and your adoption doesn’t need to take as long as it is if you don’t want it to. Sub point 2: Integrations are abundant and unnecessarily rigid
  6. Capturing all existing data on customers into the data lake. ERP, SFDC, etc. Mapping out a picture of the customer to feed other use cases. This feeds “lead information” to SFDC for sales reps to understand where the opportunities are. Supply chain analytics is another use case with Cisco – they subscontract information out and use this to manage quality of the supply chain process. Fix issues early in the manufacturing process from sub-contractors and get in front of it for the customer. Have saved $Million in SCM efficiency.
  7. Global bank fraud costs $200B annually) Zions Bank Fights Fraud, Gains Insights and Cuts Data Storage Costs with MapR   The Business Zions Bank, based in Salt Lake City, Utah, is a subsidiary of Zions Bancorporation that operates more than 500 offices and 600 ATMs in 10 Western U.S. states. As a full-service bank, Zions offers commercial, installment and mortgage loans; trust services; foreign banking services; electronic and online banking services; automatic deposit and nationwide banking and transfer services; as well as checking and savings programs.   Challenge “Being a financial institution, we have a bull’s-eye painted on our backs,” says Michael Fowkes, Zions Bank SVP Fraud Operations and Security Analytics. “Crooks want to steal money, and banks are often a target, so fraud protection is critical to our business. If fraud gets out of control, it eats into our profitability.”   The Zions Bank Fraud Operations and Security Analytics team maintains data stores, builds statistical models to detect fraud, and then uses these models to data mine and evaluate suspicious activity.   Zions has been refining their solution over the past 8-9 years. Fowkes explains that about eight years ago they found that when they loaded in a lot of data, performance degraded significantly when they tried to do reporting.   “We always kept our eye out for new data stores. When it came time to refresh our data stores, we decided to go to Hadoop,” says Fowkes.   MapR Solution Zions Bank chose MapR for its security features, NFS mountable file system, high availability, ease of management and its superior performance capabilities, which allow for a more efficient use of hardware and a better ROI.   The bank relies on MapR for a critical part of their security architecture. MapR helps Zions predict phishing behavior and payments fraud in real time and minimize their impact. With MapR, Zions can run more detailed analytics and forensics.   Benefits The bank has seen multiple benefits from their MapR solution:   Cuts storage costs in half Zions is seeing significant benefits from a storage perspective. With their other data sources, they had to hold on to source data sets so they still have the original data. MapR eliminates the need to have multiple data sources.   “When we cut over to MapR, we cut our expenses in half from a data storage perspective,” says Fowkes. (Michael, do you need to get clearance on this quote?)   Cost effective to scale Since MapR scales linearly, capacity planning is much easier. “We know that growth won’t be incredibly expensive like with distributed database platforms which charge per terabyte of storage. This can get quite expensive,” says Fowkes. “The others cost a lot more to scale. MapR allows us to scale at a reasonable price.” <Michael, can you provide any specific metrics about the difference in cost to scale with the MapR solution? > Increases accuracy, speed and insights Fowkes explains that before, when you created a statistical model, you had to use sample data. “MapR allows you to wrangle large amounts of data,” he says. “You can use all of your data and create a more accurate model. This is also used in forensics so we have one place to research what happened.”   Two years of data add up to about 1.2 petabytes of data. Wrangling this amount of data used to be daunting. “In the past, it could take a full day. Now we can do a data query of two years of data in 30 minutes,” he says.   Multiple uses for data stores Centralizing data stores serves multiple uses—from data security to fraud detection to risk management to customer marketing. “We initially got into centralizing all of our data from an information security perspective. We then saw that we could use this same environment to help with fraud detection,” he says. “Now that we have this data we know we can do more with it. Right now we’re working on a business project on the marketing side, completely outside of fraud and info security. It’s the same data to look at on the business side for customer analytics,” he says. “And our risk group leverages data that’s used in the system too. Having a more granular view of data, you get additional insights.”   Summary MapR is enabling Zions Bank to improve its security infrastructure while reducing costs. They’ve been able to cut storage costs in half, scale their solution cost-effectively, make more efficient use of hardware, make statistical models more accurate, increase the performance and speed of high volume data queries, generate deeper insights and help them leverage their data stores across several aspects of the business.