HANA & Hadoop for
Big Data Management
Will Gardella, Senior Director
SAP Applied Research - Big Data Program
william.gardella@sap.com
Safe Harbor Statement
The information in this presentation is confidential and proprietary to SAP and may not be disclosed without
the permission of SAP. This presentation is not subject to your license agreement or any other service or
subscription agreement with SAP. SAP has no obligation to pursue any course of business outlined in this
document or any related presentation, or to develop or release any functionality mentioned therein. This
document, or any related presentation and SAP's strategy and possible future developments, products and or
platforms directions and functionality are all subject to change and may be changed by SAP at any time for
any reason without notice. The information on this document is not a commitment, promise or legal obligation
to deliver any material, code or functionality. This document is provided without a warranty of any kind, either
express or implied, including but not limited to, the implied warranties of merchantability, fitness for a
particular purpose, or non-infringement. This document is for informational purposes and may not be
incorporated into a contract. SAP assumes no responsibility for errors or omissions in this document, except
if such damages were caused by SAP intentionally or grossly negligent.

All forward-looking statements are subject to various risks and uncertainties that could cause actual results
to differ materially from expectations. Readers are cautioned not to place undue reliance on these forward-
looking statements, which speak only as of their dates, and they should not be relied upon in making
purchasing decisions.




 © 2012 SAP AG. All rights reserved.                                                                               2
SAP REAL-TIME DATA PLATFORM

A GAME-CHANGER
SAP real time data platform (diagram, top to bottom):
    Open APIs and Protocols
    Federated Access
    Transactional Data Management, In-Memory Data Management, Analytics EDW Data Management, Mobile Data Management
    Information Management & Real-Time Data Movement
Left side: Common Design & Modelling Environment
Right side: Common Landscape Environment
Traditional data management approaches are changing

[Figure: data management in the 1980s / 1990s compared with today]
What is Big Data?
The 3 + 1 V’s




Volume
Explosion in the amount of data

Variety
Multiple data formats; non-structured data boom

Velocity
Fast collection, processing and consumption

Value
Keep everything, not only high value data

[Chart: Data Volume in Whole World. 1.8 zettabytes in 2011, 7.9 zettabytes in 2015, and more in the future. Data sources shown: structured data, automobiles, location-based data, machine data, mobile, click stream, text data (“IMHO, it’s great!”), point of sale, social network, customer data, RFID, smart meter.]
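The two data points on the chart imply a yearly growth rate, which a few lines of Python can work out. This is a back-of-the-envelope sketch that assumes smooth compound growth between 2011 and 2015; the variable names are illustrative only.

```python
# Figures from the chart: worldwide data volume in zettabytes.
volume_2011_zb = 1.8
volume_2015_zb = 7.9
years = 2015 - 2011

# Overall growth factor and the implied compound annual growth rate (CAGR).
growth_factor = volume_2015_zb / volume_2011_zb
cagr = growth_factor ** (1 / years) - 1

print(f"growth factor: {growth_factor:.2f}x over {years} years")
print(f"implied annual growth: {cagr:.1%}")
```

Under that assumption the volume grows by a bit more than four times in four years, or roughly 45% per year.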
HANA for Big Data


Key Characteristics: in-memory, row & column, real time



How’s HANA for Big Data?
Volume: Billions of records
Variety: Text processing & search
Velocity: Real time
Value: High value data




Data management today: systems optimize for speed or capacity

[Chart: speed (vertical axis) against size (horizontal axis)]
Most enterprise systems today are not too large for real time
Near 0 marginal storage cost
New storage and processing techniques required


Storage techniques shown: Row, Distributed, Columnar, In-memory

Real-time queries
High value data
Targeted data read

Batch queries
Flexible data sets
All data read
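One way to picture why a columnar layout allows a targeted data read is to hold the same records in both layouts. The sketch below is a toy in-process illustration, not HANA's storage engine; the records and field names are made up.

```python
# Hypothetical sales records; the field names are illustrative only.
rows = [
    {"store": "A", "product": "tea", "amount": 3.50},
    {"store": "B", "product": "coffee", "amount": 4.25},
    {"store": "A", "product": "coffee", "amount": 4.25},
]

def total_from_rows(records):
    """Row layout: each record is kept together, so the scan visits every record."""
    return sum(record["amount"] for record in records)

# Column layout: each field is kept as its own contiguous list.
columns = {
    "store": [r["store"] for r in rows],
    "product": [r["product"] for r in rows],
    "amount": [r["amount"] for r in rows],
}

def total_from_columns(cols):
    """Column layout: the aggregate touches only the one column it needs."""
    return sum(cols["amount"])

print(total_from_rows(rows), total_from_columns(columns))
```

Both functions return the same total; the difference is how much data each has to touch to get there.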
Building an IT landscape for Big Data


          Business Intelligence   Insight Discovery                                   Real-time Operations
Present   BI Tools                Analytic Tools, Custom Data Analysis Applications   Business Applications & Processes
Process   Information Views       Data Mining / Predictive Analysis                   -
Store     EDW / Data Marts        Analytic Data Warehouse                             Real-time Database
Ingest    ETL, Data Quality       Text Analysis                                       Real-time Loading
Hadoop
What is Apache Hadoop?


Apache Hadoop is open source software that enables reliable, scalable, distributed computing on
clusters of inexpensive servers

Reliable
 Software is fault tolerant; it expects and handles hardware and software failures

Scalable
 Designed for massive scale of processors, memory, and locally attached storage

Distributed
 Handles replication. Offers massively parallel programming model, MapReduce

Hadoop framework handles: partitioning, scheduling, dispatch, execution, communication, failure
handling, monitoring, reporting and more
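
The MapReduce programming model can be sketched in plain Python with a word count. This is a single-process illustration of the map, shuffle and reduce steps, not Hadoop's API; the function names are chosen for this example, and everything the framework handles (partitioning, scheduling, failure handling) is left out.

```python
from collections import defaultdict

def map_phase(line):
    """Map: emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in line.split()]

def shuffle(pairs):
    """Shuffle: group all emitted values by key."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped

def reduce_phase(key, values):
    """Reduce: combine the values for one key into a single result."""
    return key, sum(values)

def word_count(lines):
    # On a cluster the map calls run in parallel on different machines;
    # here they run one after another in a single process.
    pairs = [pair for line in lines for pair in map_phase(line)]
    return dict(reduce_phase(k, v) for k, v in shuffle(pairs).items())

counts = word_count(["big data", "Big Data management", "data"])
print(counts)  # {'big': 2, 'data': 3, 'management': 1}
```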


The Apache Hadoop technology family
logical view*



Non-Relational DB (fine-grained data handling)
    Hive: “Data warehouse” that provides SQL interface. Data structure is projected ad hoc onto unstructured underlying data
    HBase: Column oriented, schema-less, distributed database modeled after Google’s BigTable. Random realtime read/write

Scripting
    Pig: Platform for manipulating and analyzing large data sets. Scripting language for analysts

Machine Learning
    Mahout: Machine learning libraries for recommendations, clustering, classification and itemsets

MapReduce (parallel programming, large block data handling, e.g. 64MB)
    Hadoop Common
    HDFS: Distributes & replicates data across machines
    MapReduce: Distributes & monitors tasks, restarts failed work

* For simplicity, the mapping to servers is omitted
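How HDFS distributes and replicates large blocks can be sketched as a small planning function. This is a simplified model, not HDFS code: the 64 MB block size comes from the slide, while the replication factor of 3, the round-robin placement and the node names are assumptions made for illustration (a real cluster uses a more elaborate placement policy).

```python
import math

BLOCK_SIZE_MB = 64   # block size mentioned on the slide
REPLICATION = 3      # assumed replication factor

def plan_blocks(file_size_mb, nodes):
    """Return {block_index: [node, ...]} for a file of the given size."""
    if len(nodes) < REPLICATION:
        raise ValueError("need at least as many nodes as replicas")
    block_count = math.ceil(file_size_mb / BLOCK_SIZE_MB)
    plan = {}
    for block in range(block_count):
        # Round-robin placement so that each replica lands on a different node.
        plan[block] = [nodes[(block + r) % len(nodes)] for r in range(REPLICATION)]
    return plan

plan = plan_blocks(200, ["node1", "node2", "node3", "node4"])
for block, replicas in plan.items():
    print(block, replicas)
```

A 200 MB file becomes four blocks here, and losing any single node still leaves two copies of every block.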
What does Hadoop bring to the table?


Cost efficient data storage and processing for large volumes of structured, semi-structured, and
unstructured data such as web logs, machine data, text data, call data records (CDRs), audio, video data

Batch Processing
Where fast response times are less critical than reliability and scalability

Complex Information Processing
Enables heavily recursive algorithms, machine learning, and queries that cannot be easily expressed in SQL

Low Value Data Archive
Data stays available, though access is slower

Post-hoc Analysis
Mine raw data that is either schema-less or where schema changes over time
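
Post-hoc analysis of schema-less data amounts to applying a schema when the data is read rather than when it is written. The sketch below shows the idea in plain Python under stated assumptions: the JSON log lines, the field names and the `project` helper are hypothetical, and no Hadoop component is involved.

```python
import json

# Raw, schema-less log lines kept exactly as collected (hypothetical data).
raw_events = [
    '{"ts": "2012-05-01T10:00:00", "user": "u1", "url": "/home"}',
    '{"ts": "2012-05-01T10:00:05", "user": "u2", "url": "/cart", "referrer": "ad"}',
    '{"ts": "2012-05-01T10:00:09", "user": "u1"}',
]

def project(line, fields):
    """Apply a schema at read time: pick the wanted fields, default to None."""
    record = json.loads(line)
    return {field: record.get(field) for field in fields}

# Two different analyses project two different schemas onto the same raw data.
page_views = [project(line, ["user", "url"]) for line in raw_events]
referrals = [project(line, ["user", "referrer"]) for line in raw_events]

print(page_views[2])  # {'user': 'u1', 'url': None}
print(referrals[1])   # {'user': 'u2', 'referrer': 'ad'}
```

Because the raw lines are never rewritten, a field that appears later (such as `referrer`) can be analyzed without reloading older data.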


Example: Retail Point of Sales Demo Scenario



Video: http://youtu.be/HmmPje38e1k

[Diagram: SAP BusinessObjects Explorer on top of HANA, with Hadoop alongside]
HANA: 1.1 Billion POS Records
Hadoop: 9 TB Web Logs
Between HANA and Hadoop: 18 Million Visitor Session Records
Apache Hadoop bottom line


Strengths                             Weaknesses
+ Huge data volumes                   - Not efficient at small scale
+ Unstructured data                   - Real time is challenging at best, typically not possible
+ Reliable                            - Requires skilled engineering, operation and analyst resources
+ Scalable                            - Hiring qualified talent
+ Lowest cost                         - Less mature than SQL
+ Open source                         - Governance
+ No hardware lock-in                 - Lack of user role support in access model
+ Batch processing




Hadoop & Enterprise Information Management


[Diagram: the IT landscape for Big Data (Present, Process, Store and Ingest layers across Business Intelligence, Insight Discovery and Real-time Operations), with an area marked "A"]
IT administrator: Extract, transform, and load data quickly


[Diagram: BW (through Open Hub) or any source feeds SAP Data Integrator, which loads SAP HANA or SAP Sybase IQ]
SAP Data Integrator: Repository, Server, Designer and Management Console
SAP HANA or SAP Sybase IQ: Modeler, Database Engine
Connections between them: Metadata, Data Load
Loading data from Hadoop into your database

1. Based on target, SAP Data Services translates queries into:
      o Hive Query Language (HQL) → Hive
      o Pig script → HDFS
2. Hive/Pig converts queries to Map/Reduce jobs
3. Result data files are generated on the HDFS system
4. SAP Data Services uses multiple threads to process data from Hive/Pig
5. Optional transforms: Data quality operations
6. Load results into database

[Diagram: inside the SAP Data Services job process, the HQL Generator and Pig Generator (1) hand work to Hive and Pig, which run M/R jobs (2) to join tables, order / filter data, apply functions and do text data processing; the result set lands on HDFS (3); the HDFS FileReader (4) reads it; Process Data (5) applies transforms; the Database Loader (6) writes through an ODBC/JDBC driver]
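Steps 4 to 6 can be sketched in Python as follows. This is only an illustration under stated assumptions, not how SAP Data Services is implemented: local temporary files stand in for result files on HDFS, an in-memory SQLite database stands in for the target database, and the function names, the `sales` table and the cleansing rule are hypothetical.

```python
import csv
import sqlite3
import tempfile
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path

def read_part(path):
    """Step 4: parse one result file into (store, amount) rows."""
    with open(path, newline="") as handle:
        return [(store, float(amount)) for store, amount in csv.reader(handle)]

def cleanse(rows):
    """Step 5 (optional): a toy data quality rule that trims and upper-cases keys."""
    return [(store.strip().upper(), amount) for store, amount in rows]

def load(paths, connection):
    """Step 6: read the parts in parallel, cleanse them, and bulk insert."""
    with ThreadPoolExecutor(max_workers=4) as pool:
        parts = list(pool.map(read_part, paths))
    rows = [row for part in parts for row in cleanse(part)]
    connection.execute("CREATE TABLE IF NOT EXISTS sales (store TEXT, amount REAL)")
    connection.executemany("INSERT INTO sales VALUES (?, ?)", rows)
    connection.commit()
    return len(rows)

# Stand-in for result files that a Hive or Pig job would leave on HDFS.
workdir = Path(tempfile.mkdtemp())
(workdir / "part-00000").write_text("a,10.0\nb,5.5\n")
(workdir / "part-00001").write_text(" a ,2.5\n")

db = sqlite3.connect(":memory:")
loaded = load(sorted(workdir.glob("part-*")), db)
totals = db.execute(
    "SELECT store, SUM(amount) FROM sales GROUP BY store ORDER BY store"
).fetchall()
print(loaded, totals)
```

Reading the part files in a thread pool mirrors the multi-threaded read in step 4, while the insert stays on one connection so the load itself remains a single bulk operation.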
SAP Data Services: Simple GUI to build and run ETL processes

Push down to Hadoop through HiveSQL or Pig Scripts
Bulk load into EDW
Processing text to extract relevant data from Hadoop

1 Use SAP Data Services to extract:
      Core entities (who, what, when, where, etc.)
      Domains (voice of customer, public sector, enterprise, etc.)
      Sentiment analysis (strong positive, weak positive, neutral, weak negative, strong negative)

2 Perform transformations
      Map text into pre-defined structures
      Cleanse, match, de-duplicate data

3 Load results quickly into EDW
      Map text to structure
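Mapping free text into a pre-defined structure with one of the five sentiment classes can be pictured with a toy example. The word lists, scoring rule and record layout below are invented for illustration and have nothing to do with the text analysis that SAP Data Services actually performs.

```python
# Tiny illustrative word lists; real text analysis uses far richer models.
POSITIVE = {"great": 2, "good": 1}
NEGATIVE = {"terrible": -2, "bad": -1}

def sentiment_label(score):
    """Map a numeric score onto the five classes named on the slide."""
    if score >= 2:
        return "strong positive"
    if score == 1:
        return "weak positive"
    if score == 0:
        return "neutral"
    if score == -1:
        return "weak negative"
    return "strong negative"

def to_record(source, text):
    """Map free text into a pre-defined structure (a flat dict here)."""
    words = [w.strip(".,!?").lower() for w in text.split()]
    score = sum(POSITIVE.get(w, 0) + NEGATIVE.get(w, 0) for w in words)
    return {"source": source, "text": text, "sentiment": sentiment_label(score)}

record = to_record("forum", "IMHO, it's great!")
print(record["sentiment"])  # strong positive
```

Once text is in a flat record like this, it can be cleansed, de-duplicated and loaded like any other structured row.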
Hadoop Analytics


[Diagram: the IT landscape for Big Data (Present, Process, Store and Ingest layers across Business Intelligence, Insight Discovery and Real-time Operations), with an area marked "B" between Business Intelligence and Insight Discovery]
Business Analyst: Viewing data in Hadoop using GUI tools




                         Automatically generates HiveQL statements that are executed on a Hadoop cluster
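
What "automatically generates HiveQL" can look like is sketched below: a small function turns a GUI-style selection of dimensions and a measure into an aggregate query string. The function, table and column names are hypothetical, and the sketch does not reproduce how the SAP tools build their statements.

```python
def build_hiveql(table, dimensions, measure, limit=None):
    """Build a simple aggregate query from a GUI-style selection."""
    for name in [table, measure, *dimensions]:
        # Accept plain identifiers only, since names are pasted into the query text.
        if not name.replace("_", "").isalnum():
            raise ValueError(f"unsupported identifier: {name!r}")
    columns = ", ".join(dimensions)
    query = (
        f"SELECT {columns}, SUM({measure}) AS total_{measure} "
        f"FROM {table} GROUP BY {columns}"
    )
    if limit is not None:
        query += f" LIMIT {int(limit)}"
    return query

print(build_hiveql("web_logs", ["country", "device"], "page_views", limit=10))
```

The analyst only picks fields in the GUI; the generated statement is what would then be submitted to the Hadoop cluster.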
SAP BusinessObjects BI: Hadoop for Business Analysts


Empower all analysts, enable all workflows
    Common user experience for all front-end tools: Web Intelligence, Crystal Reports, Dashboards, Explorer

High performance, feature rich, secure
    Best access method for each specific data source: Universe Access, Direct Access

Extract, define, & manipulate metadata
    All data sources: Hadoop Hive, SAP BW, Sybase databases, SAP HANA, 3rd party databases, Files, Web Services
Simple Tools for the BI administrator to define data access


      Build a Data Foundation against a Hive schema
      ■ Draw joins between Hive tables, aliases, derived tables,
        Hive views and Hive partitioned tables




Data Scientist: Flexibility is of the essence

Chooses the variables that offer the most promise
Chooses the best tool based on data mining technique
Chooses best analysis engine based on algorithm & data
Data Scientist: Open integration is key




[Diagram labels: M/R algorithms; In-database Analytics (appears twice)]
Building an IT landscape for Big Data

          Business Intelligence   Insight Discovery                                   Real-time Operations
Engage    BI Tools                Analytic Tools, Custom Data Analysis Applications   Business Applications & Processes
Process   Information Views       Data Mining / Predictive Analysis                   -
Store     EDW / Data Marts        Analytic Data Warehouse                             Real-time Database
Ingest    ETL, Data Quality       Text Analysis                                       Real-time Loading
Questions?
william.gardella@sap.com
und Xcode sind Marken oder eingetragene Marken der Apple Inc.                                                  Text sind unverbindlich und dienen lediglich zu Informationszwecken. Produkte können länderspezifische
                                                                                                               Unterschiede aufweisen.
IOS ist eine eingetragene Marke von Cisco Systems Inc.
                                                                                                               Die in dieser Publikation enthaltene Information ist Eigentum der SAP. Weitergabe und Vervielfältigung dieser
RIM, BlackBerry, BBM, BlackBerry Curve, BlackBerry Bold, BlackBerry Pearl, BlackBerry Torch, BlackBerry
                                                                                                               Publikation oder von Teilen daraus sind, zu welchem Zweck und in welcher Form auch immer, nur mit
Storm, BlackBerry Storm2, BlackBerry PlayBook und BlackBerry App World sind Marken oder eingetragene
                                                                                                               ausdrücklicher schriftlicher Genehmigung durch SAP AG gestattet.
Marken von Research in Motion Limited.





SAP HANA and Apache Hadoop for Big Data Management (SF Scalable Systems Meetup)

  • 1. HANA & Hadoop for Big Data Management Will Gardella, Senior Director SAP Applied Research - Big Data Program william.gardella@sap.com
  • 2. Safe Harbor Statement The information in this presentation is confidential and proprietary to SAP and may not be disclosed without the permission of SAP. This presentation is not subject to your license agreement or any other service or subscription agreement with SAP. SAP has no obligation to pursue any course of business outlined in this document or any related presentation, or to develop or release any functionality mentioned therein. This document, or any related presentation and SAP's strategy and possible future developments, products and/or platforms directions and functionality are all subject to change and may be changed by SAP at any time for any reason without notice. The information on this document is not a commitment, promise or legal obligation to deliver any material, code or functionality. This document is provided without a warranty of any kind, either express or implied, including but not limited to, the implied warranties of merchantability, fitness for a particular purpose, or non-infringement. This document is for informational purposes and may not be incorporated into a contract. SAP assumes no responsibility for errors or omissions in this document, except if such damages were caused by SAP intentionally or grossly negligent. All forward-looking statements are subject to various risks and uncertainties that could cause actual results to differ materially from expectations. Readers are cautioned not to place undue reliance on these forward-looking statements, which speak only as of their dates, and they should not be relied upon in making purchasing decisions.
  • 3. SAP Real-Time Data Platform: A Game-Changer
      [Diagram: SAP real-time data platform. Recoverable labels: open APIs and protocols; federated access; common landscape; common design and modelling environment; transactional, in-memory, analytics, mobile, and EDW data management; information management and real-time data movement.]
  • 4. Traditional data management approaches are changing
      [Figure: data management in the 1980s / 1990s compared with today.]
  • 5. What is Big Data? The 3 + 1 V's
      • Volume: explosion in the amount of data
      • Variety: multiple data formats; non-structured data boom
      • Velocity: fast collection, processing and consumption
      • Value: keep everything, not only high value data
      [Figure: data volume in the whole world, 1.8 zettabytes and 7.9 zettabytes on a 2011 / 2015 / future timeline. Data sources shown: structured data, automobiles, location-based data, machine data, mobile, click stream, text data (for example "IMHO, it's great!"), point of sale, social network, customer data, RFID, smart meter.]
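
The two figures on slide 5 imply a growth rate that is easy to work out. The sketch below assumes the chart pairs 1.8 zettabytes with 2011 and 7.9 zettabytes with 2015 (the flattened transcript does not state the pairing explicitly); the function name is illustrative.

```python
def compound_annual_growth(start, end, years):
    """Return the constant yearly growth rate that turns `start` into `end`."""
    return (end / start) ** (1 / years) - 1


# Assumed reading of the slide: 1.8 ZB in 2011, 7.9 ZB in 2015.
rate = compound_annual_growth(1.8, 7.9, 2015 - 2011)
print(f"Implied yearly growth: {rate:.1%}")
```

Under that reading, worldwide data volume grows by roughly 45% per year, which is the "explosion" the Volume bullet refers to.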
  • 6. HANA for Big Data
      Key characteristics: in-memory, row & column, real time
      How's HANA for Big Data?
      • Volume: billions of records
      • Variety: text processing & search
      • Velocity: real time
      • Value: high value data
  • 7. Data management today: systems optimize for speed or capacity
      • Most enterprise systems today are not too large for real time
      • Near-zero marginal storage cost
      [Chart: speed plotted against size.]
  • 8. New storage and processing techniques required
      • Real-time queries, high value data, targeted data read: in-memory (columnar, row)
      • Batch queries, flexible data sets, all data read: distributed
  • 9. Building an IT landscape for Big Data
      [Diagram: four layers across three columns (Business Intelligence | Insight Discovery | Real-time Operations)]
      • Present: BI Tools | Analytic Tools, Custom Data Analysis Applications | Business Applications & Processes
      • Process: Information Views | Data Mining / Predictive Analysis
      • Store: EDW / Data Marts | Analytic Data Warehouse | Real-time Database
      • Ingest: ETL, Data Quality | Text Analysis | Real-time Loading
  • 11. What is Apache Hadoop?
      Apache Hadoop is open source software that enables reliable, scalable, distributed computing on clusters of inexpensive servers.
      • Reliable: software is fault tolerant; it expects and handles hardware and software failures
      • Scalable: designed for massive scale of processors, memory, and local attached storage
      • Distributed: handles replication; offers a massively parallel programming model, MapReduce
      The Hadoop framework handles partitioning, scheduling, dispatch, execution, communication, failure handling, monitoring, reporting and more.
  • 12. The Apache Hadoop technology family: logical view*
      Fine-grained data handling:
      • Hive: "data warehouse" that provides a SQL interface. Data structure is projected ad hoc onto unstructured underlying data
      • HBase (non-relational DB): column oriented, schema-less, distributed database modeled after Google's BigTable. Random realtime read/write
      • Pig (scripting): platform for manipulating and analyzing large data sets. Scripting language for analysts
      • Mahout (machine learning): machine learning libraries for recommendations, clustering, classification and itemsets
      Hadoop Common:
      • MapReduce: parallel programming. Distributes & monitors tasks, restarts failed work
      • HDFS: large block data handling (e.g. 64MB). Distributes & replicates data across machines
      * For simplicity, the mapping to servers is omitted
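
As a rough illustration of the MapReduce programming model named on slides 11 and 12, here is a single-process word count in plain Python. It is only a sketch of the map, shuffle, and reduce phases; it does not use Hadoop, and the function names are made up for this example. On a real cluster the framework runs these phases in parallel across machines and handles the partitioning, scheduling, and failure recovery listed on slide 11.

```python
from collections import defaultdict


def map_phase(record):
    """Emit a (word, 1) pair for every word in one input line."""
    return [(word.lower(), 1) for word in record.split()]


def shuffle(pairs):
    """Group emitted values by key, as the framework does between phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Collapse all values for one key into a single count."""
    return key, sum(values)


def word_count(records):
    """Run map, shuffle, and reduce in sequence over a list of text lines."""
    pairs = [pair for record in records for pair in map_phase(record)]
    return dict(reduce_phase(key, values) for key, values in shuffle(pairs).items())


print(word_count(["big data", "Big Hadoop"]))
```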
  • 13. What does Hadoop bring to the table?
      Cost efficient data storage and processing for large volumes of structured, semi-structured, and unstructured data such as web logs, machine data, text data, call data records (CDRs), audio, video data.
      • Batch processing: where fast response times are less critical than reliability and scalability
      • Complex information processing: enable heavily recursive algorithms, machine learning, & queries that cannot be easily expressed in SQL
      • Low value data archive: data stays available, though access is slower
      • Post-hoc analysis: mine raw data that is either schema-less or where schema changes over time
  • 14. Example: Retail Point of Sales Demo Scenario
      http://youtu.be/HmmPje38e1k
      [Diagram: SAP BusinessObjects Explorer on top of HANA and Hadoop. Data volumes shown: 18 million visitor session records, 9 TB web logs, 1.1 billion POS records.]
  • 15. Apache Hadoop bottom line
      Strengths:
      • Huge data volumes
      • Unstructured data
      • Reliable
      • Scalable
      • Lowest cost
      • Open source
      • No hardware lock in
      • Batch processing
      Weaknesses:
      • Not efficient at small scale
      • Real time is best case challenging, typically not possible
      • Requires skilled engineering, operation and analyst resources
      • Hiring qualified talent
      • Less mature than SQL
      • Governance
      • Lack of user role support in access model
  • 16. Hadoop & Enterprise Information Management
      [Diagram: the Big Data IT landscape from slide 9, with a marker "A" next to the Real-time Database]
      • Present: BI Tools | Analytic Tools, Custom Data Analysis Applications | Business Applications & Processes
      • Process: Information Views | Data Mining / Predictive Analysis
      • Store: EDW / Data Marts | Analytic Data Warehouse | Real-time Database
      • Ingest: ETL, Data Quality | Text Analysis | Real-time Loading
  • 17. IT administrator: Extract, transform, and load data quickly
      [Diagram: SAP Data Integrator moving data from any source into SAP HANA or SAP Sybase IQ. Component labels: Metadata Repository, Modeler, Server, Open Hub, Data Load, Database Engine, BW, Designer and Management Console.]
  • 18. Loading data from Hadoop into your database
      1. Based on target, SAP Data Services translates queries into:
         • Hive Query Language (HQL) for Hive
         • Pig script for HDFS
      2. Hive/Pig converts queries to Map/Reduce jobs
      3. Result data files are generated on the HDFS system
      4. SAP Data Services uses multiple threads to process data from Hive/Pig
      5. Optional transforms: data quality operations; join tables, order / filter data, apply functions
      6. Load results into database
      [Diagram: SAP Data Services job process (HQL generator, Pig generator, HDFS FileReader, text data processing, data loader) connected to Hive, M/R jobs, and the HDFS result set, and to the database process through an ODBC/JDBC driver. The numbers 1 to 6 mark the steps above.]
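
Steps 4 to 6 on slide 18 (multi-threaded read of result files, an optional transform, and a load into the database) can be sketched with the Python standard library. This is not how SAP Data Services is implemented: local temporary files stand in for HDFS result files, an in-memory SQLite database stands in for the target database, and the table name `pos_totals`, the file layout, and all function names are assumptions made for illustration.

```python
import csv
import sqlite3
import tempfile
from concurrent.futures import ThreadPoolExecutor
from pathlib import Path


def read_part_file(path):
    """Step 4: parse one tab-separated result file into (store, amount) rows."""
    with open(path, newline="") as handle:
        return [(store, float(amount)) for store, amount in csv.reader(handle, delimiter="\t")]


def load_results(part_files, connection, min_amount=0.0):
    """Steps 4-6: read files in parallel, filter rows, bulk-insert them."""
    with ThreadPoolExecutor(max_workers=4) as pool:
        chunks = list(pool.map(read_part_file, part_files))
    # Step 5 (optional transform): keep only rows at or above the threshold.
    rows = [row for chunk in chunks for row in chunk if row[1] >= min_amount]
    # Step 6: load the surviving rows into the stand-in database.
    connection.execute("CREATE TABLE IF NOT EXISTS pos_totals (store TEXT, amount REAL)")
    connection.executemany("INSERT INTO pos_totals VALUES (?, ?)", rows)
    connection.commit()
    return len(rows)


def demo():
    """Write two fake result files, load them, and return (row count, total)."""
    with tempfile.TemporaryDirectory() as tmp:
        parts = []
        for index, lines in enumerate(["north\t120.5\nsouth\t0.5\n", "east\t80.0\n"]):
            path = Path(tmp) / f"part-{index:05d}"
            path.write_text(lines)
            parts.append(path)
        connection = sqlite3.connect(":memory:")
        loaded = load_results(parts, connection, min_amount=1.0)
        total = connection.execute("SELECT SUM(amount) FROM pos_totals").fetchone()[0]
        connection.close()
        return loaded, total


print(demo())
```

Reading the part files in a thread pool mirrors the parallel read in step 4; the filter is one example of the optional transforms in step 5.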
  • 19. SAP Data Services: Simple GUI to build and run ETL processes
      • Push down to Hadoop through HiveQL or Pig scripts
      • Bulk load into EDW
  • 20. Processing text to extract relevant data from Hadoop
      1. Use SAP Data Services to extract:
         • Core entities (who, what, when, where, etc.)
         • Domains (voice of customer, public sector, enterprise, etc.)
         • Sentiment analysis (strong positive, weak positive, neutral, weak negative, strong negative)
      2. Perform transformations
         • Map text into pre-defined structures
         • Cleanse, match, de-duplicate data
      3. Load results quickly into EDW
         • Map text to structure
  • 21. Hadoop Analytics
      [Diagram: the Big Data IT landscape from slide 9, with a marker "B" next to Business Intelligence]
      • Present: BI Tools | Analytic Tools, Custom Data Analysis Applications | Business Applications & Processes
      • Process: Information Views | Data Mining / Predictive Analysis
      • Store: EDW / Data Marts | Analytic Data Warehouse | Real-time Database
      • Ingest: ETL, Data Quality | Text Analysis | Real-time Loading
  • 22. Business Analyst: Viewing data in Hadoop using GUI tools
      Automatically generates HiveQL statements that are executed on a Hadoop cluster
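
To make the idea of generated HiveQL concrete, the sketch below turns a selection of dimensions and measures into a query string. It is a hypothetical helper, not the generator used by the SAP tools; the table and column names are invented, and it does no identifier validation or quoting.

```python
def build_hiveql(table, dimensions, measures):
    """Build a simple aggregate HiveQL query.

    `dimensions` is a list of column names to group by; `measures` is a list
    of (column, aggregate_function) pairs such as ("revenue", "sum").
    """
    select_items = list(dimensions) + [
        f"{func.upper()}({column}) AS {func.lower()}_{column}" for column, func in measures
    ]
    query = f"SELECT {', '.join(select_items)} FROM {table}"
    if dimensions:
        query += f" GROUP BY {', '.join(dimensions)}"
    return query


# Hypothetical table and columns, for illustration only.
print(build_hiveql("pos_sales", ["region"], [("revenue", "sum")]))
```

A GUI tool would build the `dimensions` and `measures` lists from what the analyst drags onto a report, then submit the resulting statement to Hive for execution on the cluster.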
  • 23. SAP BusinessObjects BI: Hadoop for Business Analysts
      • Common user experience for all front-end tools: empower all analysts, enable all workflows (Web Intelligence, Crystal Reports, Dashboards, Explorer)
      • Best access method for each specific data source: high performance, feature rich, secure (Universe Access, Direct Access)
      • All data sources: extract, define, & manipulate metadata (Hadoop via Hive, SAP BW, Sybase databases, SAP HANA, 3rd party databases, files, web services)
  • 24. Simple tools for the BI administrator to define data access
      Build a Data Foundation against a Hive schema
      • Draw joins between Hive tables, aliases, derived tables, Hive views and Hive partitioned tables
  • 25. Data Scientist: Flexibility is of the essence
      • Chooses the variables that offer the most promise
      • Chooses the best tool based on data mining technique
      • Chooses best analysis engine based on algorithm & data
  • 26. Data Scientist: Open integration is key
      [Diagram labels: M/R algorithms; in-database analytics (shown twice).]
  • 27. Building an IT landscape for Big Data
      [Diagram: four layers across three columns (Business Intelligence | Insight Discovery | Real-time Operations)]
      • Engage: BI Tools | Analytic Tools, Custom Data Analysis Applications | Business Applications & Processes
      • Process: Information Views | Data Mining / Predictive Analysis
      • Store: EDW / Data Marts | Analytic Data Warehouse | Real-time Database
      • Ingest: ETL, Data Quality | Text Analysis | Real-time Loading