SlideShare une entreprise Scribd logo
1  sur  41
Télécharger pour lire hors ligne
Big Data && Fast Data
Vitaliy Rudnytskiy, SAP
August 2014, Budapest
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 2
Disclaimer
This presentation outlines our general product direction and should not be relied on in making a
purchase decision. This presentation is not subject to your license agreement or any other agreement
with SAP. SAP has no obligation to pursue any course of business outlined in this presentation or to
develop or release any functionality mentioned in this presentation. This presentation and SAP's
strategy and possible future developments are subject to change and may be changed by SAP at any
time for any reason without notice. This document is provided without a warranty of any kind, either
express or implied, including but not limited to, the implied warranties of merchantability, fitness for a
particular purpose, or non-infringement. SAP assumes no responsibility for errors or omissions in this
document, except if such damages were caused by SAP intentionally or grossly negligent.
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 3
Let’s start with…
… me :) Vitaliy Rudnytskiy
@Sygyzmundovych
SAP’s Developer Center team
- developers.sap.com
- Data Management and Analytics
Live in Wrocław, Poland
What is …
Big Data?
4
What is Big Data?
5
3xV: Volume, Variety, Velocity
4xV: + Veracity
5xV: + Value
6xV: + Vitaliy 
„One Terabyte Club” sounds like terayears ago!
7
Big Data Definitions accordingly to CIOs
8
Instant Access to Data
9
SAP Road to In-Memory Technology
11
Source: “SAP HANA Essentials”, Jeffrey Word
Principle #1
Keep all required data (aka “hot data”) in computers’ main memory
Rarely accessed data (aka “cold data”) can be moved to cheaper storage
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 13
Data Access Latencies and Bandwidth
Source: Intel
Principle #2
Compress data to minimize the footprint
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 17
Columnar and Row Based Data Storage
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 18
Data Compression in Column Store
1. Dictionary Compression
2. Compression of the Value ID
Sequence
• Prefix encoding
• Run Length encoding
• Cluster encoding
• Sparse encoding
• Indirect encoding
3. Further Compression
for Main Dictionary
Principle #3
Cache-sensitive data layout and cache-aware algorithms
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 21
„RAM Locality is King”
Source: Microsoft Research: research.microsoft.com/en.../Flash_is_Good.ppt
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 22
Bandwidth between CPU and Main Memory
(a) Intel Shared Front Side Bus (UMA Architecture)
(b) Intel Quick Path Interconnect (NUMA Architecture)
Source: “In-Memory Data Management: An Inflection Point for Enterprise Applications” by Hasso Plattner, Alexander Zeier
Principle #4
Software performance growth is in the parallel processing,
not in increasing clock speed
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 25
Examples of Parallelization in a Column Store
Principle #5
Move data-intensive operations to the data layer
Integrate data processing engines (relational, text, graph, streaming…)
Include built-in libraries (geospecial, predictive, 3rd parties)
Have you seen…
1
Petabyte? 31
The SAP HANA One Petabyte Test
33
Source: http://www.saphana.com/community/blogs/blog/2012/11/12/the-sap-hana-one-petabyte-test
Guinness World Record: 12.1 PB DW
34
Source: http://www.guinnessworldrecords.com/world-records/5000/largest-data-warehouse
Here comes Hadoop
2.8 ZB in 2012
85% from New Data Types
15x Machine Data by 2020
40 ZB by 2020
New Sources (Sentiment,
Clickstream, Geo, Sensor)
Ref:Hortonworks
37
SAP HANA && Hadoop: Comparison
38Source: „How to Use Hadoop with Your SAP® Software Landscape”, http://www.saphana.com/docs/DOC-3777
Big Data && Fast Data
39
Query FederationTwo-Speed Analytics
Big Data is Strategic to SAP and Partners
40
http://www.cloudera.com/content/cloudera/en/
solutions/partner/SAP.html http://hortonworks.com/partner/sap/
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 41
Planned Innovations Future DirectionToday
SAP HANA and Hadoop
Product road map overview – key themes and capabilities
This is the current state of planning and may be changed by SAP at any time.
Integrate:
 Usability
– Data Virtualization (Smart Data Access ) to Hive
via ODBC connectivity
– Richer SQL access from SAP HANA studio to
Hive Tables
 Co-processing
– Compute SQL operation between Hive and HANA
 Performance
– SDA optimizations: function completion
– Remote caching of HIVE jobs
 Enterprise readiness
– Hadoop Distributions Offering from our Partners
a. Co-innovation & Reselling with Intel and
Hortonworks
b. Open for other distro’s using HIVE odbc drivers
Optimize:
 Usability
– Execute custom map reduce methods from SAP
HANA and consume the results from HANA queries
via SQL User Defined Function (UDF).
– SDA support for Data Provisioning for the SAP HANA
Service/Adapter Framework
 Performance
– SDA optimization for connectivity, semi joins &
relocation when large amounts of data is queried
from SAP HANA and the number of Hadoop nodes
are increased
– Concurrent execution of HIVE jobs from SAP HANA
 Enterprise readiness
– * Support for HANA Authorization/ Authentication
and encryption capabilities with partner Hadoop
Disributions
– * Integrate and validate Cloudera CDH-5.1 and
Hortonworks HDP-2.0 Hadoop distributions with
latest SAP HANA platform
– Support Apache Spark SQL via partners distro
See Appendix for abbreviations
Synthesize:
 Usability
– Trigger and consume output of PIG jobs from HANA
– Support HBase as a store for HANA federated queries
 Co-processing
– Advanced SDA for Hadoop windowing, table
partitioned functions
– Hadoop as extended data store for HANA (Big Unified
Table)
– Tiered storage: hot/cold data controlled by HANA
 Performance
– Distributed in-memory caching of hot HDFS data
– Parallel connections from HANA to Hadoop for loading
and queries
– Extend HANA compute engine to Hadoop
 Enterprise readiness
– Integration of Partner Hadoop Distro features into SAP
HANA Cockpit
– Send Hadoop logs to SAP Solution Manager
– HANA snapshot to HDFS with restore
– Common authentication and authorization
Want to learn and code in…
SAP
HANA? 42
43
Free online education: open.sap.com
45
Community and developer access:
http://developers.sap.com/hana
• Free developer editions
hosted in the cloud
(AWS, Azure, CloudShare…)
• How-to guides
• Community support
• Blogs
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 46
Examples of some cool ideas from the community
„HADOOP HDFS Explorer built with HANA XS and SAPUI5” by Aron MacDonald
http://scn.sap.com/community/developer-center/hana/blog/2014/07/03/hadoop-hdfs-explorer-built-with-hana-xs-and-sapui5
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 47
Examples of some cool ideas from the community
„Experiences with SAP HANA Geo-Spatial Features” by Trinoy Hazarika
http://scn.sap.com/community/developer-center/hana/blog/2014/02/25/experiences-with-sap-hana-geo-spatial-features-part-1
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 48
Examples of some cool ideas from the community
„Predicting My Next Twitter Follower with SAP HANA PAL” by Lucas Sparvieri
*PAL – Predictive Analysis Library
http://scn.sap.com/community/developer-center/hana/blog/2013/09/02/predicting-my-next-twitter-follower-with-sap-hana-pal
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 49
Examples of some cool ideas from the community
„Detecting World Cup GOAL using Twitter and SAP HANA” by Stevanic Artana
http://scn.sap.com/community/developer-center/hana/blog/2014/07/03/goal-detection-using-twitter-and-sap-hana
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 50
Examples of some cool ideas from the community
„A Simple Door Monitoring System with HANA XS and Raspberry Pi” by Ferry Gunawan
http://scn.sap.com/community/developer-center/hana/blog/2014/07/09/build-a-door-sensor-with-raspberry-pi-and-hana
1000+ Startups working with us
51More info and how to join: http://startups.saphana.com
Semantic Vision from Czech Rep.
52
More: http://www.saphana.com/community/learn/startups/news-views/blog/2013/10/04/beyond-the-big-hype-
using-semantic-search-to-predict-the-way-the-universe-will-behave
Startups in SAP Startup Forum Program
53
2/3rds in
these 4
categories
Area of Startup Focus % of Startups
Visualization / BI / Market Insight 25%
Social Media / Collaboration / Gaming 17%
Predictive Analytics / Complex Analytics 13%
Sensor Network Data / Internet of Things 10%
Geospatial / Geo-Location / 3D Analysis 8%
Primarily a Mobile Solution 7%
Big Data Infrastructure / Appliance + SAP HANA 6%
Enterprise Process Acceleration 6%
Extension of specific SAP Solutions and SAP Expertise 4%
Energy Management / Sustainability 3%
54
Community and developer access:
developers.sap.com
Contact information:
Vitaliy Rudnytskiy
vitaliy.rudnytskiy@sap.com
@Sygyzmundovych
© 2013 SAP AG or an SAP affiliate company. All rights reserved. 55
© 2013 SAP AG or an SAP affiliate company. All rights reserved.
No part of this publication may be reproduced or transmitted in any form or for any purpose without the express permission of SAP AG.
The information contained herein may be changed without prior notice.
Some software products marketed by SAP AG and its distributors contain proprietary software components of other software vendors.
National product specifications may vary.
These materials are provided by SAP AG and its affiliated companies ("SAP Group") for informational purposes only, without representation or warranty of any kind, and
SAP Group shall not be liable for errors or omissions with respect to the materials. The only warranties for SAP Group products and services are those that are set forth
in the express warranty statements accompanying such products and services, if any. Nothing herein should be construed as constituting an additional warranty.
SAP and other SAP products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of SAP AG in Germany and
other countries.
Please see http://www.sap.com/corporate-en/legal/copyright/index.epx#trademark for additional trademark information and notices.

Contenu connexe

Tendances

Sap hana l1 -reinventing real-time businesses through innovation, value & si...
Sap hana l1  -reinventing real-time businesses through innovation, value & si...Sap hana l1  -reinventing real-time businesses through innovation, value & si...
Sap hana l1 -reinventing real-time businesses through innovation, value & si...
Daniel Lahl
 
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Pentaho
 

Tendances (20)

SAP Inside Track Belgium 2018 - SAP Leonardo Machine Learning Demystified
SAP Inside Track Belgium 2018 - SAP Leonardo Machine Learning DemystifiedSAP Inside Track Belgium 2018 - SAP Leonardo Machine Learning Demystified
SAP Inside Track Belgium 2018 - SAP Leonardo Machine Learning Demystified
 
SAP EIM Overview
SAP EIM OverviewSAP EIM Overview
SAP EIM Overview
 
SAP Predictive Analytics
SAP Predictive AnalyticsSAP Predictive Analytics
SAP Predictive Analytics
 
Business intelligence in the era of big data
Business intelligence in the era of big dataBusiness intelligence in the era of big data
Business intelligence in the era of big data
 
In-Memory Analytics - SAP Big Data - Analytics Tools Selection - SAP HANA & ...
In-Memory Analytics - SAP Big Data - Analytics Tools Selection  - SAP HANA & ...In-Memory Analytics - SAP Big Data - Analytics Tools Selection  - SAP HANA & ...
In-Memory Analytics - SAP Big Data - Analytics Tools Selection - SAP HANA & ...
 
SAP HANA in Healthcare: Real-Time Big Data Analysis
SAP HANA in Healthcare: Real-Time Big Data AnalysisSAP HANA in Healthcare: Real-Time Big Data Analysis
SAP HANA in Healthcare: Real-Time Big Data Analysis
 
How PepsiCo's Big Data Strategy is Disrupting CPG Retail Analytics
How PepsiCo's Big Data Strategy is Disrupting CPG Retail AnalyticsHow PepsiCo's Big Data Strategy is Disrupting CPG Retail Analytics
How PepsiCo's Big Data Strategy is Disrupting CPG Retail Analytics
 
SAP HANA Vora SITMTY 20160707
SAP HANA Vora SITMTY 20160707SAP HANA Vora SITMTY 20160707
SAP HANA Vora SITMTY 20160707
 
Enterprise data science - What it takes to build?
Enterprise data science - What it takes to build?Enterprise data science - What it takes to build?
Enterprise data science - What it takes to build?
 
Revolutionizing Executive Insight - The SAP Digital Boardroom
Revolutionizing Executive Insight - The SAP Digital BoardroomRevolutionizing Executive Insight - The SAP Digital Boardroom
Revolutionizing Executive Insight - The SAP Digital Boardroom
 
SAP HANA Interactive Use Case Map
SAP HANA Interactive Use Case MapSAP HANA Interactive Use Case Map
SAP HANA Interactive Use Case Map
 
Sap hana l1 -reinventing real-time businesses through innovation, value & si...
Sap hana l1  -reinventing real-time businesses through innovation, value & si...Sap hana l1  -reinventing real-time businesses through innovation, value & si...
Sap hana l1 -reinventing real-time businesses through innovation, value & si...
 
Hadoop, Spark and Big Data Summit presentation with SAP HANA Vora and a path ...
Hadoop, Spark and Big Data Summit presentation with SAP HANA Vora and a path ...Hadoop, Spark and Big Data Summit presentation with SAP HANA Vora and a path ...
Hadoop, Spark and Big Data Summit presentation with SAP HANA Vora and a path ...
 
SAP Data Services
SAP Data ServicesSAP Data Services
SAP Data Services
 
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
Big Data Integration Webinar: Reducing Implementation Efforts of Hadoop, NoSQ...
 
Accelerate Your Move to an Intelligent Enterprise with SAP Cloud Platform and...
Accelerate Your Move to an Intelligent Enterprise with SAP Cloud Platform and...Accelerate Your Move to an Intelligent Enterprise with SAP Cloud Platform and...
Accelerate Your Move to an Intelligent Enterprise with SAP Cloud Platform and...
 
Predictive Analytics with SAP HANA
Predictive Analytics with SAP HANA Predictive Analytics with SAP HANA
Predictive Analytics with SAP HANA
 
#askSAP Analytics Innovations Community Call: Delivering Big Data Inisghts wi...
#askSAP Analytics Innovations Community Call: Delivering Big Data Inisghts wi...#askSAP Analytics Innovations Community Call: Delivering Big Data Inisghts wi...
#askSAP Analytics Innovations Community Call: Delivering Big Data Inisghts wi...
 
20100430 introduction to business objects data services
20100430 introduction to business objects data services20100430 introduction to business objects data services
20100430 introduction to business objects data services
 
SAP Helps Reduce Silos Between Business and Spatial Data
SAP Helps Reduce Silos Between Business and Spatial DataSAP Helps Reduce Silos Between Business and Spatial Data
SAP Helps Reduce Silos Between Business and Spatial Data
 

Similaire à SAP HANA - Big Data and Fast Data

How is sap data services unique for sap hana integration
How is sap data services unique for sap hana integrationHow is sap data services unique for sap hana integration
How is sap data services unique for sap hana integration
Flavio Alejandro Corradini
 

Similaire à SAP HANA - Big Data and Fast Data (20)

SUSE Technical Webinar: Build HANA Apps in the Framework of the SAP and SUSE ...
SUSE Technical Webinar: Build HANA Apps in the Framework of the SAP and SUSE ...SUSE Technical Webinar: Build HANA Apps in the Framework of the SAP and SUSE ...
SUSE Technical Webinar: Build HANA Apps in the Framework of the SAP and SUSE ...
 
Analytics Products L2 public 2020-23 Black.pptx
Analytics Products L2 public 2020-23 Black.pptxAnalytics Products L2 public 2020-23 Black.pptx
Analytics Products L2 public 2020-23 Black.pptx
 
SAP HANA Cloud Platform Community BOF @ Devoxx 2013
SAP HANA Cloud Platform Community BOF @ Devoxx 2013SAP HANA Cloud Platform Community BOF @ Devoxx 2013
SAP HANA Cloud Platform Community BOF @ Devoxx 2013
 
SAP HANA Cloud – Virtual Bootcamp: How to use the HANA Persistence Se…
SAP HANA Cloud – Virtual Bootcamp: How to use the HANA Persistence Se…SAP HANA Cloud – Virtual Bootcamp: How to use the HANA Persistence Se…
SAP HANA Cloud – Virtual Bootcamp: How to use the HANA Persistence Se…
 
SAP HANA SQL Data Warehousing (Sefan Linders)
SAP HANA SQL Data Warehousing (Sefan Linders)SAP HANA SQL Data Warehousing (Sefan Linders)
SAP HANA SQL Data Warehousing (Sefan Linders)
 
Sap bw4 hana
Sap bw4 hanaSap bw4 hana
Sap bw4 hana
 
Custom Development - SAP HANA
Custom Development - SAP HANACustom Development - SAP HANA
Custom Development - SAP HANA
 
SAP TechEd 2013: CD105: Extending SuccessFactors EmployeeCentral with apps on...
SAP TechEd 2013: CD105: Extending SuccessFactors EmployeeCentral with apps on...SAP TechEd 2013: CD105: Extending SuccessFactors EmployeeCentral with apps on...
SAP TechEd 2013: CD105: Extending SuccessFactors EmployeeCentral with apps on...
 
SAP HANA Cloud Platform: The void between your Datacenter and the Cloud
SAP HANA Cloud Platform: The void between your Datacenter and the CloudSAP HANA Cloud Platform: The void between your Datacenter and the Cloud
SAP HANA Cloud Platform: The void between your Datacenter and the Cloud
 
SAP HANA Cloud: From Your Datacenter to the Cloud and Back
SAP HANA Cloud: From Your Datacenter to the Cloud and Back  SAP HANA Cloud: From Your Datacenter to the Cloud and Back
SAP HANA Cloud: From Your Datacenter to the Cloud and Back
 
How is sap data services unique for sap hana integration
How is sap data services unique for sap hana integrationHow is sap data services unique for sap hana integration
How is sap data services unique for sap hana integration
 
Webinar SAP BusinessObjects Cloud (English)
Webinar SAP BusinessObjects Cloud (English)Webinar SAP BusinessObjects Cloud (English)
Webinar SAP BusinessObjects Cloud (English)
 
SAP Data Hub e SUSE Container as a Service Platform
SAP Data Hub e SUSE Container as a Service PlatformSAP Data Hub e SUSE Container as a Service Platform
SAP Data Hub e SUSE Container as a Service Platform
 
Developing and Deploying Applications on the SAP HANA Platform
Developing and Deploying Applications on the SAP HANA PlatformDeveloping and Deploying Applications on the SAP HANA Platform
Developing and Deploying Applications on the SAP HANA Platform
 
Data Warehouse Cloud - Das Ende von SAP BW?
Data Warehouse Cloud - Das Ende von SAP BW?Data Warehouse Cloud - Das Ende von SAP BW?
Data Warehouse Cloud - Das Ende von SAP BW?
 
SAP HANA and SAP Vora
SAP HANA and SAP VoraSAP HANA and SAP Vora
SAP HANA and SAP Vora
 
SAP HANA Data Center Intelligence Overview
SAP HANA Data Center Intelligence OverviewSAP HANA Data Center Intelligence Overview
SAP HANA Data Center Intelligence Overview
 
The Power of Collective Insight with SAP BI
The Power of Collective Insight with SAP BIThe Power of Collective Insight with SAP BI
The Power of Collective Insight with SAP BI
 
SAP HANA SPS10- Multitenant Database Containers
SAP HANA SPS10- Multitenant Database ContainersSAP HANA SPS10- Multitenant Database Containers
SAP HANA SPS10- Multitenant Database Containers
 
In-Memory Database Platform for Big Data
In-Memory Database Platform for Big DataIn-Memory Database Platform for Big Data
In-Memory Database Platform for Big Data
 

Plus de Vitaliy Rudnytskiy

Plus de Vitaliy Rudnytskiy (20)

SIT Wrocław 2019 - Intro
SIT Wrocław 2019 - IntroSIT Wrocław 2019 - Intro
SIT Wrocław 2019 - Intro
 
Wroclaw SAP Meetup 2019/02
Wroclaw SAP Meetup 2019/02Wroclaw SAP Meetup 2019/02
Wroclaw SAP Meetup 2019/02
 
Wrocław SAP Meetup - 2018/02
Wrocław SAP Meetup - 2018/02Wrocław SAP Meetup - 2018/02
Wrocław SAP Meetup - 2018/02
 
Gentle Introduction into Geospatial (using SQL in SAP HANA)
Gentle Introduction into Geospatial (using SQL in SAP HANA)Gentle Introduction into Geospatial (using SQL in SAP HANA)
Gentle Introduction into Geospatial (using SQL in SAP HANA)
 
IoT at Scale
IoT at ScaleIoT at Scale
IoT at Scale
 
Welcome to SAP Community of Developers!
Welcome to SAP Community of Developers!Welcome to SAP Community of Developers!
Welcome to SAP Community of Developers!
 
Wroclaw SAP Meetup 2017/10
Wroclaw SAP Meetup 2017/10Wroclaw SAP Meetup 2017/10
Wroclaw SAP Meetup 2017/10
 
SAP Vora CodeJam
SAP Vora CodeJamSAP Vora CodeJam
SAP Vora CodeJam
 
Mobile of People and Internet of Things: State of the Union
Mobile of People and Internet of Things: State of the UnionMobile of People and Internet of Things: State of the Union
Mobile of People and Internet of Things: State of the Union
 
Wroclaw SAP Meetup - 2017/01
Wroclaw SAP Meetup - 2017/01Wroclaw SAP Meetup - 2017/01
Wroclaw SAP Meetup - 2017/01
 
Wroclaw SAP Meetup - 2016/10
Wroclaw SAP Meetup - 2016/10Wroclaw SAP Meetup - 2016/10
Wroclaw SAP Meetup - 2016/10
 
Quantify your drive: IoT on a personal scale with SAP technologies
Quantify your drive: IoT on a personal scale with SAP technologiesQuantify your drive: IoT on a personal scale with SAP technologies
Quantify your drive: IoT on a personal scale with SAP technologies
 
Overview of SAP HANA Cloud Platform
Overview of SAP HANA Cloud PlatformOverview of SAP HANA Cloud Platform
Overview of SAP HANA Cloud Platform
 
OpenUI5
OpenUI5OpenUI5
OpenUI5
 
Welcome to SAP Community of Developers!
Welcome to SAP Community of Developers!Welcome to SAP Community of Developers!
Welcome to SAP Community of Developers!
 
SAP Developer Center - March 2016 update
SAP Developer Center - March 2016 updateSAP Developer Center - March 2016 update
SAP Developer Center - March 2016 update
 
SAP Tech Innovation for Business - 2014.05
SAP Tech Innovation for Business - 2014.05SAP Tech Innovation for Business - 2014.05
SAP Tech Innovation for Business - 2014.05
 
SAP CodeJam Mobile - Poland 2013
SAP CodeJam Mobile - Poland 2013SAP CodeJam Mobile - Poland 2013
SAP CodeJam Mobile - Poland 2013
 
SAP Store (in Polish / po polsku)
SAP Store (in Polish / po polsku)SAP Store (in Polish / po polsku)
SAP Store (in Polish / po polsku)
 
SAP Runs SAP Mobile
SAP Runs SAP MobileSAP Runs SAP Mobile
SAP Runs SAP Mobile
 

Dernier

Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
vu2urc
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
Enterprise Knowledge
 

Dernier (20)

presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
Raspberry Pi 5: Challenges and Solutions in Bringing up an OpenGL/Vulkan Driv...
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
Histor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slideHistor y of HAM Radio presentation slide
Histor y of HAM Radio presentation slide
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
IAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI SolutionsIAC 2024 - IA Fast Track to Search Focused AI Solutions
IAC 2024 - IA Fast Track to Search Focused AI Solutions
 
Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
GenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdfGenAI Risks & Security Meetup 01052024.pdf
GenAI Risks & Security Meetup 01052024.pdf
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 

SAP HANA - Big Data and Fast Data

  • 1. Big Data && Fast Data Vitaliy Rudnytskiy, SAP August 2014, Budapest
  • 2. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 2 Disclaimer This presentation outlines our general product direction and should not be relied on in making a purchase decision. This presentation is not subject to your license agreement or any other agreement with SAP. SAP has no obligation to pursue any course of business outlined in this presentation or to develop or release any functionality mentioned in this presentation. This presentation and SAP's strategy and possible future developments are subject to change and may be changed by SAP at any time for any reason without notice. This document is provided without a warranty of any kind, either express or implied, including but not limited to, the implied warranties of merchantability, fitness for a particular purpose, or non-infringement. SAP assumes no responsibility for errors or omissions in this document, except if such damages were caused by SAP intentionally or grossly negligent.
  • 3. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 3 Let’s start with… … me :) Vitaliy Rudnytskiy @Sygyzmundovych SAP’s Developer Center team - developers.sap.com - Data Management and Analytics Live in Wrocław, Poland
  • 4. What is … Big Data? 4
  • 5. What is Big Data? 5 3xV: Volume, Variety, Velocity 4xV: + Veracity 5xV: + Value 6xV: + Vitaliy 
  • 6. „One Terabyte Club” sounds like terayears ago! 7
  • 7. Big Data Definitions accordingly to CIOs 8
  • 9. SAP Road to In-Memory Technology 11 Source: “SAP HANA Essentials”, Jeffrey Word
  • 10. Principle #1 Keep all required data (aka “hot data”) in computers’ main memory Rarely accessed data (aka “cold data”) can be moved to cheaper storage
  • 11. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 13 Data Access Latencies and Bandwidth Source: Intel
  • 12. Principle #2 Compress data to minimize the footprint
  • 13. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 17 Columnar and Row Based Data Storage
  • 14. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 18 Data Compression in Column Store 1. Dictionary Compression 2. Compression of the Value ID Sequence • Prefix encoding • Run Length encoding • Cluster encoding • Sparse encoding • Indirect encoding 3. Further Compression for Main Dictionary
  • 15. Principle #3 Cache-sensitive data layout and cache-aware algorithms
  • 16. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 21 „RAM Locality is King” Source: Microsoft Research: research.microsoft.com/en.../Flash_is_Good.ppt
  • 17. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 22 Bandwidth between CPU and Main Memory (a) Intel Shared Front Side Bus (UMA Architecture) (b) Intel Quick Path Interconnect (NUMA Architecture) Source: “In-Memory Data Management: An Inflection Point for Enterprise Applications” by Hasso Plattner, Alexander Zeier
  • 18. Principle #4 Software performance growth is in the parallel processing, not in increasing clock speed
  • 19. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 25 Examples of Parallelization in a Column Store
  • 20. Principle #5 Move data-intensive operations to the data layer Integrate data processing engines (relational, text, graph, streaming…) Include built-in libraries (geospecial, predictive, 3rd parties)
  • 22. The SAP HANA One Petabyte Test 33 Source: http://www.saphana.com/community/blogs/blog/2012/11/12/the-sap-hana-one-petabyte-test
  • 23. Guinness World Record: 12.1 PB DW 34 Source: http://www.guinnessworldrecords.com/world-records/5000/largest-data-warehouse
  • 24. Here comes Hadoop 2.8 ZB in 2012 85% from New Data Types 15x Machine Data by 2020 40 ZB by 2020 New Sources (Sentiment, Clickstream, Geo, Sensor) Ref:Hortonworks 37
  • 25. SAP HANA && Hadoop: Comparison 38Source: „How to Use Hadoop with Your SAP® Software Landscape”, http://www.saphana.com/docs/DOC-3777
  • 26. Big Data && Fast Data 39 Query FederationTwo-Speed Analytics
  • 27. Big Data is Strategic to SAP and Partners 40 http://www.cloudera.com/content/cloudera/en/ solutions/partner/SAP.html http://hortonworks.com/partner/sap/
  • 28. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 41 Planned Innovations Future DirectionToday SAP HANA and Hadoop Product road map overview – key themes and capabilities This is the current state of planning and may be changed by SAP at any time. Integrate:  Usability – Data Virtualization (Smart Data Access ) to Hive via ODBC connectivity – Richer SQL access from SAP HANA studio to Hive Tables  Co-processing – Compute SQL operation between Hive and HANA  Performance – SDA optimizations: function completion – Remote caching of HIVE jobs  Enterprise readiness – Hadoop Distributions Offering from our Partners a. Co-innovation & Reselling with Intel and Hortonworks b. Open for other distro’s using HIVE odbc drivers Optimize:  Usability – Execute custom map reduce methods from SAP HANA and consume the results from HANA queries via SQL User Defined Function (UDF). – SDA support for Data Provisioning for the SAP HANA Service/Adapter Framework  Performance – SDA optimization for connectivity, semi joins & relocation when large amounts of data is queried from SAP HANA and the number of Hadoop nodes are increased – Concurrent execution of HIVE jobs from SAP HANA  Enterprise readiness – * Support for HANA Authorization/ Authentication and encryption capabilities with partner Hadoop Disributions – * Integrate and validate Cloudera CDH-5.1 and Hortonworks HDP-2.0 Hadoop distributions with latest SAP HANA platform – Support Apache Spark SQL via partners distro See Appendix for abbreviations Synthesize:  Usability – Trigger and consume output of PIG jobs from HANA – Support HBase as a store for HANA federated queries  Co-processing – Advanced SDA for Hadoop windowing, table partitioned functions – Hadoop as extended data store for HANA (Big Unified Table) – Tiered storage: hot/cold data controlled by HANA  Performance – Distributed in-memory caching of hot HDFS data – Parallel connections from HANA to Hadoop for loading and queries – Extend HANA compute engine to Hadoop  Enterprise readiness – Integration of Partner Hadoop Distro features into SAP HANA Cockpit – Send Hadoop logs to SAP Solution Manager – HANA snapshot to HDFS with restore – Common authentication and authorization
  • 29. Want to learn and code in… SAP HANA? 42
  • 31. 45 Community and developer access: http://developers.sap.com/hana • Free developer editions hosted in the cloud (AWS, Azure, CloudShare…) • How-to guides • Community support • Blogs
  • 32. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 46 Examples of some cool ideas from the community „HADOOP HDFS Explorer built with HANA XS and SAPUI5” by Aron MacDonald http://scn.sap.com/community/developer-center/hana/blog/2014/07/03/hadoop-hdfs-explorer-built-with-hana-xs-and-sapui5
  • 33. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 47 Examples of some cool ideas from the community „Experiences with SAP HANA Geo-Spatial Features” by Trinoy Hazarika http://scn.sap.com/community/developer-center/hana/blog/2014/02/25/experiences-with-sap-hana-geo-spatial-features-part-1
  • 34. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 48 Examples of some cool ideas from the community „Predicting My Next Twitter Follower with SAP HANA PAL” by Lucas Sparvieri *PAL – Predictive Analysis Library http://scn.sap.com/community/developer-center/hana/blog/2013/09/02/predicting-my-next-twitter-follower-with-sap-hana-pal
  • 35. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 49 Examples of some cool ideas from the community „Detecting World Cup GOAL using Twitter and SAP HANA” by Stevanic Artana http://scn.sap.com/community/developer-center/hana/blog/2014/07/03/goal-detection-using-twitter-and-sap-hana
  • 36. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 50 Examples of some cool ideas from the community „A Simple Door Monitoring System with HANA XS and Raspberry Pi” by Ferry Gunawan http://scn.sap.com/community/developer-center/hana/blog/2014/07/09/build-a-door-sensor-with-raspberry-pi-and-hana
  • 37. 1000+ Startups working with us 51More info and how to join: http://startups.saphana.com
  • 38. Semantic Vision from Czech Rep. 52 More: http://www.saphana.com/community/learn/startups/news-views/blog/2013/10/04/beyond-the-big-hype- using-semantic-search-to-predict-the-way-the-universe-will-behave
  • 39. Startups in SAP Startup Forum Program 53 2/3rds in these 4 categories Area of Startup Focus % of Startups Visualization / BI / Market Insight 25% Social Media / Collaboration / Gaming 17% Predictive Analytics / Complex Analytics 13% Sensor Network Data / Internet of Things 10% Geospatial / Geo-Location / 3D Analysis 8% Primarily a Mobile Solution 7% Big Data Infrastructure / Appliance + SAP HANA 6% Enterprise Process Acceleration 6% Extension of specific SAP Solutions and SAP Expertise 4% Energy Management / Sustainability 3%
  • 40. 54 Community and developer access: developers.sap.com Contact information: Vitaliy Rudnytskiy vitaliy.rudnytskiy@sap.com @Sygyzmundovych
  • 41. © 2013 SAP AG or an SAP affiliate company. All rights reserved. 55 © 2013 SAP AG or an SAP affiliate company. All rights reserved. No part of this publication may be reproduced or transmitted in any form or for any purpose without the express permission of SAP AG. The information contained herein may be changed without prior notice. Some software products marketed by SAP AG and its distributors contain proprietary software components of other software vendors. National product specifications may vary. These materials are provided by SAP AG and its affiliated companies ("SAP Group") for informational purposes only, without representation or warranty of any kind, and SAP Group shall not be liable for errors or omissions with respect to the materials. The only warranties for SAP Group products and services are those that are set forth in the express warranty statements accompanying such products and services, if any. Nothing herein should be construed as constituting an additional warranty. SAP and other SAP products and services mentioned herein as well as their respective logos are trademarks or registered trademarks of SAP AG in Germany and other countries. Please see http://www.sap.com/corporate-en/legal/copyright/index.epx#trademark for additional trademark information and notices.