SlideShare a Scribd company logo
1 of 31
Download to read offline
Greenplum Database
                                            Overview


                                               Michael Crutcher
                                               Greenplum Product Management




© Copyright 2012 EMC Corporation. All rights reserved.                        1
© Copyright 2012 EMC Corporation. All rights reserved.   2
© Copyright 2012 EMC Corporation. All rights reserved.   3
© Copyright 2012 EMC Corporation. All rights reserved.   4
Greenplum Unified Analytic Platform




© Copyright 2012 EMC Corporation. All rights reserved.   5
GREENPLUM DATABASE


                                                         Industry Leading Database with
                                                         Massively Parallel Performance
                                                         To Empower your Analytics




© Copyright 2012 EMC Corporation. All rights reserved.                                    6
GREENPLUM DATABASE

Extreme Performance for Analytics
                                                Optimized for BI and analytics
                                                         – Deep integration with statistical packages
                                                         – High performance parallel implementations
                                               • Simple and automatic
                                                         – Just load and query like any database
                                                         – Tables are automatically distributed
                                                           across nodes
                                               • Extremely scalable
                                                         – MPP shared-nothing architecture
                                                         – All nodes can scan and process in parallel
                                                         – Linear scalability by adding nodes




© Copyright 2012 EMC Corporation. All rights reserved.                                                  7
GREENPLUM DATABASE

Performance Through Parallelism

              Master
              Servers                                    ...   ...
           Query planning &
               dispatch


            Network
          Interconnect


             Segment
             Servers                 ...                             ...
           Query processing
            & data storage




              External
              Sources
                Loading,
            streaming, etc.




© Copyright 2012 EMC Corporation. All rights reserved.                     8
GREENPLUM DATABASE

Greenplum Delivers Choice & Flexibility

                           Greenplum Data                Greenplum
                           Computing Appliance           Software Solutions
                           Choose Greenplum              Greenplum
                           Database and/or                 Database, Hadoop,
                           Hadoop modules in               & Chorus on your
                           ¼ rack increments               x86 hardware
                           Scale up by adding            Flexibility for any
                           your choice of                  workload or
                           additional modules              environment
                           Minimal time to value         Perpetual or
                                                           subscription licenses




© Copyright 2012 EMC Corporation. All rights reserved.                             9
Core Functionality
                                       GREENPLUM DATABASE




© Copyright 2012 EMC Corporation. All rights reserved.      10
GREENPLUM DATABASE

Component Overview
                                           CLIENT ACCESS                          3rd PARTY TOOLS                          ADMIN TOOLS
       CLIENT ACCESS                     ODBC, JDBC, OLEDB,                        BI Tools, ETL Tools               Greenplum Command Center
          & TOOLS                           MapReduce, etc.                          Data Mining, etc                Greenplum Package Manager




                                      LOADING & EXT. ACCESS                   STORAGE & DATA ACCESS                     LANGUAGE SUPPORT
                                        Petabyte-Scale Loading                 Hybrid Storage & Execution                 Comprehensive SQL
                                                                               (Row- & Column-Oriented)
                                         Trickle Micro-Batching                                                            Native MapReduce
          PRODUCT                        Anywhere Data Access
                                                                               In-Database Compression
                                                                                                                       SQL 2003 OLAP Extensions
          FEATURES                                                               Multi-Level Partitioning
                                                                                                                        Programmable Analytics
                                                                              Indexes – Btree, Bitmap, etc.
                                                                                                                         Analytics Extensions
                                                                                 External Table Support               (GeoSpatial, PR/R, PL/Java,
                                                                                                                         PL/Python, PL/Perl)



      GREENPLUM                        Multi-Level Fault Tolerance
   DATABASE ADAPTIVE                   (RAID, Mirroring, DR with                Online System Expansion                 Workload Management
       SERVICES                           Data Domain Boost)



                                                         Shared-Nothing MPP                                   Parallel Dataflow Engine
         CORE MPP
                                                    Parallel Query Optimizer                                gNet™ Software Interconnect
       ARCHITECTURE
                                                  Polymorphic Data Storage™                         Scatter/Gather Streaming™ Data Loading




© Copyright 2012 EMC Corporation. All rights reserved.                                                                                              11
GREENPLUM DATABASE

Most Powerful Data Loading Capabilities
                                                          SINGLE RACK COMPARISON
 Industry leading performance
  at 10+TB per-hour per-rack
 Scatter-Gather Streaming™
  provides true linear scaling
 Support for both large-batch and
  continuous real-time loading
  strategies                                             Greenplum    Oracle   Netezza   Teradata
                                                                     Exadata
 Enable complex data
  transformations ―in-flight‖                            Greenplum load rates scale linearly with
                                                           the number of racks, others do not.
 Transparent interfaces to loading                        For example, two racks = >20TB/H

  via support files, application, and
  services


© Copyright 2012 EMC Corporation. All rights reserved.                                              12
GREENPLUM DATABASE

Polymorphic Table StorageTM
                                                         TABLE ‗CUSTOMER‘
               Mar             Apr           May          Jun   Jul   Aug    Sept     Oct      Nov
               ‗11             ‗11           ‗11          ‗11   ‗11   ‗11     ‗11     ‗11      ‗11




                               Column-oriented for COLD DATA                Row-oriented for HOT DATA

   • Storage types can be mixed within a table or database
            – Four table types: heap, row-oriented AO, column-oriented AO,
              external
   • Rich compression functionality, definable column by column
            – Block compression: Gzip (levels 1-9), QuickLZ
            – Stream compression: RLE (levels 1-4)
   • Flexible indexing, partitioning, and more


© Copyright 2012 EMC Corporation. All rights reserved.                                                  13
GREENPLUM DATABASE

gNet Software Interconnect
 A supercomputing-based ―soft-switch‖
  responsible for
        – Efficiently pumping streams of data between motion
          nodes during query-plan execution
        – Delivers messages, moves data, collects results, and
          coordinates work among the segments in the system

                   gNet Software
                    Interconnect




© Copyright 2012 EMC Corporation. All rights reserved.           14
GREENPLUM DATABASE

Parallel Query Optimizer
                                                                         PHYSICAL EXECUTION PLAN
  Cost-based optimization                                                FROM SQL OR MAPREDUCE

   looks for the most                                                                Gather Motion

   efficient plan
                                                                                      4:1(Slice 3)


                                                                                          Sort
  Physical plan contains
   scans, joins, sorts,
                                                                                     HashAggregate



   aggregations, etc.                                                                   HashJoin



  Global planning avoids                                     Redistribute Motion
                                                                 4:4(Slice 1)                                    Hash

   sub-optimal ‘SQL
                                                                   HashJoin                                  HashJoin
   pushing’ to segments
                                                         Seq Scan on
  Directly inserts ‘motion’
                                                                                                   Seq Scan on
                                                           lineitem           Hash                                         Hash
                                                                                                    customer


   nodes for inter-segment                                                Seq Scan on
                                                                             orders
                                                                                                                   Broadcast Motion
                                                                                                                     4:4(Slice 2)

   communication                                                                                                        Seq Scan on
                                                                                                                          motion




© Copyright 2012 EMC Corporation. All rights reserved.                                                                                15
Analytics Overview
                                       GREENPLUM DATABASE




© Copyright 2012 EMC Corporation. All rights reserved.      16
GREENPLUM DATABASE

Analytical Capabilities Overview
Data Access & Query Layer                           ODBC            JDBC




                                                         SQL

       Stored                   SQL 2003                                   In-Database
                                                               MapReduce
     Procedures                   OLAP                                       Analytics
                                                                                         GREENPLUM
                                                                                            HD
                                        Polymorphic Storage



                                 GREENPLUM DATABASE

                                                           Greenplum gNet




© Copyright 2012 EMC Corporation. All rights reserved.                                               17
GREENPLUM DATABASE

In-Database Analytics: Categories
Data Access & Query Layer                           ODBC        JDBC




                                                                SQL

  In-Database Analytics

        Embedded
                                                           SAS Scoring
                                                           Accelerator
          Partner                       GPDB                                                User-Written
                                                                              Open Source
                                        Embedded                                            Analytical
                                                                              Extensions
                                        Analytics          SAS/HPA                          Algorithms
     Open-Source                                           High Performance
                                                           Analytics
      User-written


                                                    GREENPLUM DATABASE



© Copyright 2012 EMC Corporation. All rights reserved.                                                     18
GREENPLUM DATABASE

Analytics Highlight: MADlib
 Scalable in-database
  analytics
 Data-parallel
        –    Mathematical Algorithms
        –    Statistical Algorithms
        –    Machine learning Algorithms
        –    Supports structured and
             unstructured data.

 Open-source software
        – Source Accessibility
        – Converge business,
          academic, and open-source
          communities




© Copyright 2012 EMC Corporation. All rights reserved.   19
Manageability, Extensions
                                       GREENPLUM DATABASE




© Copyright 2012 EMC Corporation. All rights reserved.      20
GREENPLUM DATABASE

Easy Manageability for Big Data
 Single console for both Database and Hadoop
 Administration
        – Start, Stop Database
        – Recover, Rebalance Segments
 Interactive view of System Metrics
        – Real-time
        – Historic (Configurable by time period)
 In-depth view for System Health
        – Hardware health
        – Software (Database, Hadoop)
 Query Monitoring
        – Search, Prioritize, Cancel Queries
        – View Query‘s Execution Plan
 Workload Management
        – Configure Resource Queues
        – Prioritize Users




© Copyright 2012 EMC Corporation. All rights reserved.   21
GREENPLUM DATABASE

Easy Extension Installation
Greenplum Package Manager
                                                         Greenplum supports easy deployment
                                                         of numerous extensions like Madlib,
                                                         PL/Perl, PL/Java, PostGIS, etc.



                  Master
                  Servers




                Segment                                                         ...
                Servers          ...




© Copyright 2012 EMC Corporation. All rights reserved.                                         22
GREENPLUM DATABASE

High Performance gNet for Hadoop
Parallel Query Access
                                                                    Connect any data set in Hadoop to
                                                                     GP DB‘s SQL Engine
                                                                    Process Hadoop data in place
                                                                    Parallelize import/export data
                                                                     from/to Hadoop thanks to GP DB‘s
                                                                     market leading data sharing
                                                                     performance

                           gNet for Hadoop                          Supported formats:
                                                                      – Text (compressed and
                                                                        uncompressed)
                                                                      – binary
                                                          User-
                        Text            Binary
                                                         Defined
                                                                      – proprietary/user-defined

                                                                    GP HD 1.x, GP MR 1.x, CDH3u2



© Copyright 2012 EMC Corporation. All rights reserved.                                                   23
High Availability,
                                  Back up, Support
                                        GREENPLUM DATABASE




© Copyright 2012 EMC Corporation. All rights reserved.       24
GREENPLUM DATABASE

High Availability
 GPDB cluster
        – 2 Master servers
        – Multiple Segment servers
 Segment servers support
  multiple database
  instances
        – Primary instances that
          actively process queries
        – Standby mirror instances
 Block level mirroring
        – Low resource
                                                            Set of Active
          consumption                                    Segment Instances
        – Differential resynch
          capable for fast recovery



© Copyright 2012 EMC Corporation. All rights reserved.                       25
GREENPLUM DATABASE

Backup/Restore with EMC Data Domain
                                                          Integration options
                                                            – NFS: Data Domain device mounted
                                Full
                             Appliance
                                                              as NFS storage
                                 +
                            Data Domain
                                                            – DD Boost: Native, client-side
                                                              deduplication. Supported in GPDB
                                                              4.2 and higher

                            Boost or NFS
                                                          Drastic reduction in backup storage
                                                           requirement
                             2 X 10GBit IP
                                                          Backup all segment servers in
                                                           parallel directly to Data Domain
                                                          Data Domain Integrates seamlessly
                                                           into standard Greenplum full
                                                           backup data export and data
                                                           restore procedures




© Copyright 2012 EMC Corporation. All rights reserved.                                           26
GREENPLUM DATABASE

Backup/Restore with EMC Data Domain
Backup and restore between remote and primary sites

         Greenplum DCA                                                        Greenplum DCA


                                   Data Domain                        Data Domain

                                                            LAN/WAN



                                                         Data Domain
                                                          Replication


 Ideal for configurations with RPO and RTO requirements that can be specified in hours
 Supports:
    – Collection Replication for DD Boost backup
    – Directory-level replication for NFS backup
    – Encryption over the WAN




© Copyright 2012 EMC Corporation. All rights reserved.                                        27
GREENPLUM DATABASE

Customer Support Services
                                                     • Remote Technical Support
                                                         –   24x7 technical support and remote troubleshooting
                                                         –   Customer-managed case severity level
                                                         –   Four-hour response objective
                                                     • Onsite Support (DCA Only)
                                                         –   Installation of replacement parts
                                                         –   Replacement parts shipped for next business day arrival
                                                         –   GP SW upgrade included
                                                     • Proactive Service
                                                         –   Secure remote monitoring for hardware (DCA)
                                                         –   Notification of engineering technical advisories
                                                         –   Built-in tools maximize stability and performance
                                                     • Secure Self-Help
                                                         –   24x7 access to eService support tools including
                                                             knowledgebase, forums, and appropriately licensed
                                                             software updates




© Copyright 2012 EMC Corporation. All rights reserved.                                                             28
GREENPLUM DATABASE

Other Relevant Greenplum Sessions
Session                                                  Presenter         Times
Unified Analytics Platform Introduction                  Brian Wilson      Tues 10:00-11:00   Thurs 1:00-2:00
Greenplum Hadoop Overview                                Susheel Kaushik   Mon 10:00-11:00    Wed 4:15-5:15
Greenplum DCA Overview                                   Hanxi Chen        Mon 4:00-5:00      Thurs 10:00-11:00
Greenplum Analytics Workbench                            Apurva Desai      Wed 8:30-9:30      Thurs 10:00-11:00
Analytics on Hadoop                                      Don Miner         Tues 11:30-12:30   Thurs 8:30-9:30
Big Data Driven Businesses in Action:                    Mike Maxey        Wed 4:15-5:15      Thurs 11:30-12:30
Creating Real Business Value Using
Greenplum UAP (Panel w/4 Customers)
Analytics for Business Value: Collaboration              Josh Klahr        Mon 10:00-11:00    Wed 2:45-3:45
Disruptive Data Science — How Data                       Annika Jimenez    Tues 4:15-5:15     Thurs 11:30-12:30
Science and Big Data are Transforming                    David Dietrich
Business, IT and People




© Copyright 2012 EMC Corporation. All rights reserved.                                                            29
Thank You




© Copyright 2012 EMC Corporation. All rights reserved.        30
Greenplum Database Overview

More Related Content

What's hot

Best practices for MySQL High Availability Tutorial
Best practices for MySQL High Availability TutorialBest practices for MySQL High Availability Tutorial
Best practices for MySQL High Availability TutorialColin Charles
 
Wars of MySQL Cluster ( InnoDB Cluster VS Galera )
Wars of MySQL Cluster ( InnoDB Cluster VS Galera ) Wars of MySQL Cluster ( InnoDB Cluster VS Galera )
Wars of MySQL Cluster ( InnoDB Cluster VS Galera ) Mydbops
 
MySQL_MariaDB-성능개선-202201.pptx
MySQL_MariaDB-성능개선-202201.pptxMySQL_MariaDB-성능개선-202201.pptx
MySQL_MariaDB-성능개선-202201.pptxNeoClova
 
Migration to Oracle Multitenant
Migration to Oracle MultitenantMigration to Oracle Multitenant
Migration to Oracle MultitenantJitendra Singh
 
What to Expect From Oracle database 19c
What to Expect From Oracle database 19cWhat to Expect From Oracle database 19c
What to Expect From Oracle database 19cMaria Colgan
 
What's new in Oracle 19c & 18c Recovery Manager (RMAN)
What's new in Oracle 19c & 18c Recovery Manager (RMAN)What's new in Oracle 19c & 18c Recovery Manager (RMAN)
What's new in Oracle 19c & 18c Recovery Manager (RMAN)Satishbabu Gunukula
 
MySQL Operator for Kubernetes
MySQL Operator for KubernetesMySQL Operator for Kubernetes
MySQL Operator for KubernetesKenny Gryp
 
MySQL Group Replication
MySQL Group ReplicationMySQL Group Replication
MySQL Group ReplicationUlf Wendel
 
Percona Live 2022 - MySQL Shell for Visual Studio Code
Percona Live 2022 - MySQL Shell for Visual Studio CodePercona Live 2022 - MySQL Shell for Visual Studio Code
Percona Live 2022 - MySQL Shell for Visual Studio CodeFrederic Descamps
 
Why oracle data guard new features in oracle 18c, 19c
Why oracle data guard new features in oracle 18c, 19cWhy oracle data guard new features in oracle 18c, 19c
Why oracle data guard new features in oracle 18c, 19cSatishbabu Gunukula
 
Redo log improvements MYSQL 8.0
Redo log improvements MYSQL 8.0Redo log improvements MYSQL 8.0
Redo log improvements MYSQL 8.0Mydbops
 
Oracle RAC 19c and Later - Best Practices #OOWLON
Oracle RAC 19c and Later - Best Practices #OOWLONOracle RAC 19c and Later - Best Practices #OOWLON
Oracle RAC 19c and Later - Best Practices #OOWLONMarkus Michalewicz
 
Hadoop & Greenplum: Why Do Such a Thing?
Hadoop & Greenplum: Why Do Such a Thing?Hadoop & Greenplum: Why Do Such a Thing?
Hadoop & Greenplum: Why Do Such a Thing?Ed Kohlwey
 
Solving PostgreSQL wicked problems
Solving PostgreSQL wicked problemsSolving PostgreSQL wicked problems
Solving PostgreSQL wicked problemsAlexander Korotkov
 
MySQL Connectors 8.0.19 & DNS SRV
MySQL Connectors 8.0.19 & DNS SRVMySQL Connectors 8.0.19 & DNS SRV
MySQL Connectors 8.0.19 & DNS SRVKenny Gryp
 
DataOpsbarcelona 2019: Deep dive into MySQL Group Replication... the magic e...
DataOpsbarcelona 2019:  Deep dive into MySQL Group Replication... the magic e...DataOpsbarcelona 2019:  Deep dive into MySQL Group Replication... the magic e...
DataOpsbarcelona 2019: Deep dive into MySQL Group Replication... the magic e...Frederic Descamps
 

What's hot (20)

Best practices for MySQL High Availability Tutorial
Best practices for MySQL High Availability TutorialBest practices for MySQL High Availability Tutorial
Best practices for MySQL High Availability Tutorial
 
Wars of MySQL Cluster ( InnoDB Cluster VS Galera )
Wars of MySQL Cluster ( InnoDB Cluster VS Galera ) Wars of MySQL Cluster ( InnoDB Cluster VS Galera )
Wars of MySQL Cluster ( InnoDB Cluster VS Galera )
 
MyRocks Deep Dive
MyRocks Deep DiveMyRocks Deep Dive
MyRocks Deep Dive
 
MySQL_MariaDB-성능개선-202201.pptx
MySQL_MariaDB-성능개선-202201.pptxMySQL_MariaDB-성능개선-202201.pptx
MySQL_MariaDB-성능개선-202201.pptx
 
Migration to Oracle Multitenant
Migration to Oracle MultitenantMigration to Oracle Multitenant
Migration to Oracle Multitenant
 
ZFS appliance
ZFS applianceZFS appliance
ZFS appliance
 
What to Expect From Oracle database 19c
What to Expect From Oracle database 19cWhat to Expect From Oracle database 19c
What to Expect From Oracle database 19c
 
Data Guard Architecture & Setup
Data Guard Architecture & SetupData Guard Architecture & Setup
Data Guard Architecture & Setup
 
What's new in Oracle 19c & 18c Recovery Manager (RMAN)
What's new in Oracle 19c & 18c Recovery Manager (RMAN)What's new in Oracle 19c & 18c Recovery Manager (RMAN)
What's new in Oracle 19c & 18c Recovery Manager (RMAN)
 
MySQL Operator for Kubernetes
MySQL Operator for KubernetesMySQL Operator for Kubernetes
MySQL Operator for Kubernetes
 
MySQL Group Replication
MySQL Group ReplicationMySQL Group Replication
MySQL Group Replication
 
Percona Live 2022 - MySQL Shell for Visual Studio Code
Percona Live 2022 - MySQL Shell for Visual Studio CodePercona Live 2022 - MySQL Shell for Visual Studio Code
Percona Live 2022 - MySQL Shell for Visual Studio Code
 
Why oracle data guard new features in oracle 18c, 19c
Why oracle data guard new features in oracle 18c, 19cWhy oracle data guard new features in oracle 18c, 19c
Why oracle data guard new features in oracle 18c, 19c
 
Redo log improvements MYSQL 8.0
Redo log improvements MYSQL 8.0Redo log improvements MYSQL 8.0
Redo log improvements MYSQL 8.0
 
Oracle RAC 19c and Later - Best Practices #OOWLON
Oracle RAC 19c and Later - Best Practices #OOWLONOracle RAC 19c and Later - Best Practices #OOWLON
Oracle RAC 19c and Later - Best Practices #OOWLON
 
Hadoop & Greenplum: Why Do Such a Thing?
Hadoop & Greenplum: Why Do Such a Thing?Hadoop & Greenplum: Why Do Such a Thing?
Hadoop & Greenplum: Why Do Such a Thing?
 
Solving PostgreSQL wicked problems
Solving PostgreSQL wicked problemsSolving PostgreSQL wicked problems
Solving PostgreSQL wicked problems
 
LSM Trees
LSM TreesLSM Trees
LSM Trees
 
MySQL Connectors 8.0.19 & DNS SRV
MySQL Connectors 8.0.19 & DNS SRVMySQL Connectors 8.0.19 & DNS SRV
MySQL Connectors 8.0.19 & DNS SRV
 
DataOpsbarcelona 2019: Deep dive into MySQL Group Replication... the magic e...
DataOpsbarcelona 2019:  Deep dive into MySQL Group Replication... the magic e...DataOpsbarcelona 2019:  Deep dive into MySQL Group Replication... the magic e...
DataOpsbarcelona 2019: Deep dive into MySQL Group Replication... the magic e...
 

Similar to Greenplum Database Overview

EMC Greenplum Database version 4.2
EMC Greenplum Database version 4.2 EMC Greenplum Database version 4.2
EMC Greenplum Database version 4.2 EMC
 
Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You?
Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You?  Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You?
Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You? EMC
 
Accel Partners New Data Workshop 7-14-10
Accel Partners New Data Workshop 7-14-10Accel Partners New Data Workshop 7-14-10
Accel Partners New Data Workshop 7-14-10keirdo1
 
Green Plum IIIT- Allahabad
Green Plum IIIT- Allahabad Green Plum IIIT- Allahabad
Green Plum IIIT- Allahabad IIIT ALLAHABAD
 
Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...
Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...
Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...Cloudera, Inc.
 
Big Data and HPC
Big Data and HPCBig Data and HPC
Big Data and HPCNetApp
 
Ugif 04 2011 france ug04042011-jroy_part1
Ugif 04 2011   france ug04042011-jroy_part1Ugif 04 2011   france ug04042011-jroy_part1
Ugif 04 2011 france ug04042011-jroy_part1UGIF
 
The IBM Netezza Data Warehouse Appliance
The IBM Netezza Data Warehouse ApplianceThe IBM Netezza Data Warehouse Appliance
The IBM Netezza Data Warehouse ApplianceIBM Sverige
 
Netezza vs teradata
Netezza vs teradataNetezza vs teradata
Netezza vs teradataAsis Mohanty
 
Ugif 12 2011-informix iwa
Ugif 12 2011-informix iwaUgif 12 2011-informix iwa
Ugif 12 2011-informix iwaUGIF
 
Tools for developing and monitoring SQL in DB2 for z/OS
Tools for developing and monitoring SQL in DB2 for z/OSTools for developing and monitoring SQL in DB2 for z/OS
Tools for developing and monitoring SQL in DB2 for z/OSSurekha Parekh
 
Business Intelligence and Data Analytics Revolutionized with Apache Hadoop
Business Intelligence and Data Analytics Revolutionized with Apache HadoopBusiness Intelligence and Data Analytics Revolutionized with Apache Hadoop
Business Intelligence and Data Analytics Revolutionized with Apache HadoopCloudera, Inc.
 
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...Amr Awadallah
 
Netapp Evento Virtual Business Breakfast 20110616
Netapp Evento  Virtual  Business  Breakfast 20110616Netapp Evento  Virtual  Business  Breakfast 20110616
Netapp Evento Virtual Business Breakfast 20110616Bruno Banha
 
Netezza Deep Dives
Netezza Deep DivesNetezza Deep Dives
Netezza Deep DivesRush Shah
 
Oracle Systems _ David Baker _ Best Practices for Simplifying Implementation ...
Oracle Systems _ David Baker _ Best Practices for Simplifying Implementation ...Oracle Systems _ David Baker _ Best Practices for Simplifying Implementation ...
Oracle Systems _ David Baker _ Best Practices for Simplifying Implementation ...InSync2011
 
Integrating hadoop - Big Data TechCon 2013
Integrating hadoop - Big Data TechCon 2013Integrating hadoop - Big Data TechCon 2013
Integrating hadoop - Big Data TechCon 2013Jonathan Seidman
 

Similar to Greenplum Database Overview (20)

Greenplum feature
Greenplum featureGreenplum feature
Greenplum feature
 
EMC Greenplum Database version 4.2
EMC Greenplum Database version 4.2 EMC Greenplum Database version 4.2
EMC Greenplum Database version 4.2
 
Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You?
Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You?  Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You?
Greenplum Analytics Workbench - What Can a Private Hadoop Cloud Do For You?
 
Accel Partners New Data Workshop 7-14-10
Accel Partners New Data Workshop 7-14-10Accel Partners New Data Workshop 7-14-10
Accel Partners New Data Workshop 7-14-10
 
Green Plum IIIT- Allahabad
Green Plum IIIT- Allahabad Green Plum IIIT- Allahabad
Green Plum IIIT- Allahabad
 
Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...
Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...
Hadoop World 2011: How Hadoop Revolutionized Business Intelligence and Advanc...
 
Big Data and HPC
Big Data and HPCBig Data and HPC
Big Data and HPC
 
Ugif 04 2011 france ug04042011-jroy_part1
Ugif 04 2011   france ug04042011-jroy_part1Ugif 04 2011   france ug04042011-jroy_part1
Ugif 04 2011 france ug04042011-jroy_part1
 
The IBM Netezza Data Warehouse Appliance
The IBM Netezza Data Warehouse ApplianceThe IBM Netezza Data Warehouse Appliance
The IBM Netezza Data Warehouse Appliance
 
Netezza vs teradata
Netezza vs teradataNetezza vs teradata
Netezza vs teradata
 
Ugif 12 2011-informix iwa
Ugif 12 2011-informix iwaUgif 12 2011-informix iwa
Ugif 12 2011-informix iwa
 
Tools for developing and monitoring SQL in DB2 for z/OS
Tools for developing and monitoring SQL in DB2 for z/OSTools for developing and monitoring SQL in DB2 for z/OS
Tools for developing and monitoring SQL in DB2 for z/OS
 
Greenplum Architecture
Greenplum ArchitectureGreenplum Architecture
Greenplum Architecture
 
Business Intelligence and Data Analytics Revolutionized with Apache Hadoop
Business Intelligence and Data Analytics Revolutionized with Apache HadoopBusiness Intelligence and Data Analytics Revolutionized with Apache Hadoop
Business Intelligence and Data Analytics Revolutionized with Apache Hadoop
 
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
How Apache Hadoop is Revolutionizing Business Intelligence and Data Analytics...
 
Netapp Evento Virtual Business Breakfast 20110616
Netapp Evento  Virtual  Business  Breakfast 20110616Netapp Evento  Virtual  Business  Breakfast 20110616
Netapp Evento Virtual Business Breakfast 20110616
 
Netezza Deep Dives
Netezza Deep DivesNetezza Deep Dives
Netezza Deep Dives
 
Oracle Systems _ David Baker _ Best Practices for Simplifying Implementation ...
Oracle Systems _ David Baker _ Best Practices for Simplifying Implementation ...Oracle Systems _ David Baker _ Best Practices for Simplifying Implementation ...
Oracle Systems _ David Baker _ Best Practices for Simplifying Implementation ...
 
Integrating hadoop - Big Data TechCon 2013
Integrating hadoop - Big Data TechCon 2013Integrating hadoop - Big Data TechCon 2013
Integrating hadoop - Big Data TechCon 2013
 
EMC Unified Analytics Platform. Gintaras Pelenis
EMC Unified Analytics Platform. Gintaras PelenisEMC Unified Analytics Platform. Gintaras Pelenis
EMC Unified Analytics Platform. Gintaras Pelenis
 

More from EMC

INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUDINDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUDEMC
 
Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote EMC
 
EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX EMC
 
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIOTransforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIOEMC
 
Citrix ready-webinar-xtremio
Citrix ready-webinar-xtremioCitrix ready-webinar-xtremio
Citrix ready-webinar-xtremioEMC
 
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES EMC
 
EMC with Mirantis Openstack
EMC with Mirantis OpenstackEMC with Mirantis Openstack
EMC with Mirantis OpenstackEMC
 
Modern infrastructure for business data lake
Modern infrastructure for business data lakeModern infrastructure for business data lake
Modern infrastructure for business data lakeEMC
 
Force Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop ElsewhereForce Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop ElsewhereEMC
 
Pivotal : Moments in Container History
Pivotal : Moments in Container History Pivotal : Moments in Container History
Pivotal : Moments in Container History EMC
 
Data Lake Protection - A Technical Review
Data Lake Protection - A Technical ReviewData Lake Protection - A Technical Review
Data Lake Protection - A Technical ReviewEMC
 
Mobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or FoeMobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or FoeEMC
 
Virtualization Myths Infographic
Virtualization Myths Infographic Virtualization Myths Infographic
Virtualization Myths Infographic EMC
 
Intelligence-Driven GRC for Security
Intelligence-Driven GRC for SecurityIntelligence-Driven GRC for Security
Intelligence-Driven GRC for SecurityEMC
 
The Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure AgeThe Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure AgeEMC
 
EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015EMC
 
EMC Academic Summit 2015
EMC Academic Summit 2015EMC Academic Summit 2015
EMC Academic Summit 2015EMC
 
Data Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesData Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesEMC
 
Using EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere EnvironmentsUsing EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere EnvironmentsEMC
 
Using EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBookUsing EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBookEMC
 

More from EMC (20)

INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUDINDUSTRY-LEADING  TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
INDUSTRY-LEADING TECHNOLOGY FOR LONG TERM RETENTION OF BACKUPS IN THE CLOUD
 
Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote Cloud Foundry Summit Berlin Keynote
Cloud Foundry Summit Berlin Keynote
 
EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX EMC GLOBAL DATA PROTECTION INDEX
EMC GLOBAL DATA PROTECTION INDEX
 
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIOTransforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
Transforming Desktop Virtualization with Citrix XenDesktop and EMC XtremIO
 
Citrix ready-webinar-xtremio
Citrix ready-webinar-xtremioCitrix ready-webinar-xtremio
Citrix ready-webinar-xtremio
 
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
EMC FORUM RESEARCH GLOBAL RESULTS - 10,451 RESPONSES ACROSS 33 COUNTRIES
 
EMC with Mirantis Openstack
EMC with Mirantis OpenstackEMC with Mirantis Openstack
EMC with Mirantis Openstack
 
Modern infrastructure for business data lake
Modern infrastructure for business data lakeModern infrastructure for business data lake
Modern infrastructure for business data lake
 
Force Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop ElsewhereForce Cyber Criminals to Shop Elsewhere
Force Cyber Criminals to Shop Elsewhere
 
Pivotal : Moments in Container History
Pivotal : Moments in Container History Pivotal : Moments in Container History
Pivotal : Moments in Container History
 
Data Lake Protection - A Technical Review
Data Lake Protection - A Technical ReviewData Lake Protection - A Technical Review
Data Lake Protection - A Technical Review
 
Mobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or FoeMobile E-commerce: Friend or Foe
Mobile E-commerce: Friend or Foe
 
Virtualization Myths Infographic
Virtualization Myths Infographic Virtualization Myths Infographic
Virtualization Myths Infographic
 
Intelligence-Driven GRC for Security
Intelligence-Driven GRC for SecurityIntelligence-Driven GRC for Security
Intelligence-Driven GRC for Security
 
The Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure AgeThe Trust Paradox: Access Management and Trust in an Insecure Age
The Trust Paradox: Access Management and Trust in an Insecure Age
 
EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015EMC Technology Day - SRM University 2015
EMC Technology Day - SRM University 2015
 
EMC Academic Summit 2015
EMC Academic Summit 2015EMC Academic Summit 2015
EMC Academic Summit 2015
 
Data Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education ServicesData Science and Big Data Analytics Book from EMC Education Services
Data Science and Big Data Analytics Book from EMC Education Services
 
Using EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere EnvironmentsUsing EMC Symmetrix Storage in VMware vSphere Environments
Using EMC Symmetrix Storage in VMware vSphere Environments
 
Using EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBookUsing EMC VNX storage with VMware vSphereTechBook
Using EMC VNX storage with VMware vSphereTechBook
 

Recently uploaded

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...apidays
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfOrbitshub
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FMESafe Software
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024The Digital Insurer
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...apidays
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusZilliz
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native ApplicationsWSO2
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MIND CTI
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKJago de Vreede
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdfSandro Moreira
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProduct Anonymous
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024The Digital Insurer
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...Zilliz
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 

Recently uploaded (20)

Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024Axa Assurance Maroc - Insurer Innovation Award 2024
Axa Assurance Maroc - Insurer Innovation Award 2024
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
Apidays New York 2024 - APIs in 2030: The Risk of Technological Sleepwalk by ...
 
Exploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with MilvusExploring Multimodal Embeddings with Milvus
Exploring Multimodal Embeddings with Milvus
 
Architecting Cloud Native Applications
Architecting Cloud Native ApplicationsArchitecting Cloud Native Applications
Architecting Cloud Native Applications
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUKSpring Boot vs Quarkus the ultimate battle - DevoxxUK
Spring Boot vs Quarkus the ultimate battle - DevoxxUK
 
[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf[BuildWithAI] Introduction to Gemini.pdf
[BuildWithAI] Introduction to Gemini.pdf
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024Manulife - Insurer Transformation Award 2024
Manulife - Insurer Transformation Award 2024
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 

Greenplum Database Overview

  • 1. Greenplum Database Overview Michael Crutcher Greenplum Product Management © Copyright 2012 EMC Corporation. All rights reserved. 1
  • 2. © Copyright 2012 EMC Corporation. All rights reserved. 2
  • 3. © Copyright 2012 EMC Corporation. All rights reserved. 3
  • 4. © Copyright 2012 EMC Corporation. All rights reserved. 4
  • 5. Greenplum Unified Analytic Platform © Copyright 2012 EMC Corporation. All rights reserved. 5
  • 6. GREENPLUM DATABASE Industry Leading Database with Massively Parallel Performance To Empower your Analytics © Copyright 2012 EMC Corporation. All rights reserved. 6
  • 7. GREENPLUM DATABASE Extreme Performance for Analytics  Optimized for BI and analytics – Deep integration with statistical packages – High performance parallel implementations • Simple and automatic – Just load and query like any database – Tables are automatically distributed across nodes • Extremely scalable – MPP shared-nothing architecture – All nodes can scan and process in parallel – Linear scalability by adding nodes © Copyright 2012 EMC Corporation. All rights reserved. 7
  • 8. GREENPLUM DATABASE Performance Through Parallelism Master Servers ... ... Query planning & dispatch Network Interconnect Segment Servers ... ... Query processing & data storage External Sources Loading, streaming, etc. © Copyright 2012 EMC Corporation. All rights reserved. 8
  • 9. GREENPLUM DATABASE Greenplum Delivers Choice & Flexibility Greenplum Data Greenplum Computing Appliance Software Solutions Choose Greenplum  Greenplum Database and/or Database, Hadoop, Hadoop modules in & Chorus on your ¼ rack increments x86 hardware Scale up by adding  Flexibility for any your choice of workload or additional modules environment Minimal time to value  Perpetual or subscription licenses © Copyright 2012 EMC Corporation. All rights reserved. 9
  • 10. Core Functionality GREENPLUM DATABASE © Copyright 2012 EMC Corporation. All rights reserved. 10
  • 11. GREENPLUM DATABASE Component Overview CLIENT ACCESS 3rd PARTY TOOLS ADMIN TOOLS CLIENT ACCESS ODBC, JDBC, OLEDB, BI Tools, ETL Tools Greenplum Command Center & TOOLS MapReduce, etc. Data Mining, etc Greenplum Package Manager LOADING & EXT. ACCESS STORAGE & DATA ACCESS LANGUAGE SUPPORT Petabyte-Scale Loading Hybrid Storage & Execution Comprehensive SQL (Row- & Column-Oriented) Trickle Micro-Batching Native MapReduce PRODUCT Anywhere Data Access In-Database Compression SQL 2003 OLAP Extensions FEATURES Multi-Level Partitioning Programmable Analytics Indexes – Btree, Bitmap, etc. Analytics Extensions External Table Support (GeoSpatial, PR/R, PL/Java, PL/Python, PL/Perl) GREENPLUM Multi-Level Fault Tolerance DATABASE ADAPTIVE (RAID, Mirroring, DR with Online System Expansion Workload Management SERVICES Data Domain Boost) Shared-Nothing MPP Parallel Dataflow Engine CORE MPP Parallel Query Optimizer gNet™ Software Interconnect ARCHITECTURE Polymorphic Data Storage™ Scatter/Gather Streaming™ Data Loading © Copyright 2012 EMC Corporation. All rights reserved. 11
  • 12. GREENPLUM DATABASE Most Powerful Data Loading Capabilities SINGLE RACK COMPARISON  Industry leading performance at 10+TB per-hour per-rack  Scatter-Gather Streaming™ provides true linear scaling  Support for both large-batch and continuous real-time loading strategies Greenplum Oracle Netezza Teradata Exadata  Enable complex data transformations ―in-flight‖ Greenplum load rates scale linearly with the number of racks, others do not.  Transparent interfaces to loading For example, two racks = >20TB/H via support files, application, and services © Copyright 2012 EMC Corporation. All rights reserved. 12
  • 13. GREENPLUM DATABASE Polymorphic Table StorageTM TABLE ‗CUSTOMER‘ Mar Apr May Jun Jul Aug Sept Oct Nov ‗11 ‗11 ‗11 ‗11 ‗11 ‗11 ‗11 ‗11 ‗11 Column-oriented for COLD DATA Row-oriented for HOT DATA • Storage types can be mixed within a table or database – Four table types: heap, row-oriented AO, column-oriented AO, external • Rich compression functionality, definable column by column – Block compression: Gzip (levels 1-9), QuickLZ – Stream compression: RLE (levels 1-4) • Flexible indexing, partitioning, and more © Copyright 2012 EMC Corporation. All rights reserved. 13
  • 14. GREENPLUM DATABASE gNet Software Interconnect  A supercomputing-based ―soft-switch‖ responsible for – Efficiently pumping streams of data between motion nodes during query-plan execution – Delivers messages, moves data, collects results, and coordinates work among the segments in the system gNet Software Interconnect © Copyright 2012 EMC Corporation. All rights reserved. 14
  • 15. GREENPLUM DATABASE Parallel Query Optimizer PHYSICAL EXECUTION PLAN  Cost-based optimization FROM SQL OR MAPREDUCE looks for the most Gather Motion efficient plan 4:1(Slice 3) Sort  Physical plan contains scans, joins, sorts, HashAggregate aggregations, etc. HashJoin  Global planning avoids Redistribute Motion 4:4(Slice 1) Hash sub-optimal ‘SQL HashJoin HashJoin pushing’ to segments Seq Scan on  Directly inserts ‘motion’ Seq Scan on lineitem Hash Hash customer nodes for inter-segment Seq Scan on orders Broadcast Motion 4:4(Slice 2) communication Seq Scan on motion © Copyright 2012 EMC Corporation. All rights reserved. 15
  • 16. Analytics Overview GREENPLUM DATABASE © Copyright 2012 EMC Corporation. All rights reserved. 16
  • 17. GREENPLUM DATABASE Analytical Capabilities Overview Data Access & Query Layer ODBC JDBC SQL Stored SQL 2003 In-Database MapReduce Procedures OLAP Analytics GREENPLUM HD Polymorphic Storage GREENPLUM DATABASE Greenplum gNet © Copyright 2012 EMC Corporation. All rights reserved. 17
  • 18. GREENPLUM DATABASE In-Database Analytics: Categories Data Access & Query Layer ODBC JDBC SQL In-Database Analytics Embedded SAS Scoring Accelerator Partner GPDB User-Written Open Source Embedded Analytical Extensions Analytics SAS/HPA Algorithms Open-Source High Performance Analytics User-written GREENPLUM DATABASE © Copyright 2012 EMC Corporation. All rights reserved. 18
  • 19. GREENPLUM DATABASE Analytics Highlight: MADlib  Scalable in-database analytics  Data-parallel – Mathematical Algorithms – Statistical Algorithms – Machine learning Algorithms – Supports structured and unstructured data.  Open-source software – Source Accessibility – Converge business, academic, and open-source communities © Copyright 2012 EMC Corporation. All rights reserved. 19
  • 20. Manageability, Extensions GREENPLUM DATABASE © Copyright 2012 EMC Corporation. All rights reserved. 20
  • 21. GREENPLUM DATABASE Easy Manageability for Big Data  Single console for both Database and Hadoop  Administration – Start, Stop Database – Recover, Rebalance Segments  Interactive view of System Metrics – Real-time – Historic (Configurable by time period)  In-depth view for System Health – Hardware health – Software (Database, Hadoop)  Query Monitoring – Search, Prioritize, Cancel Queries – View Query‘s Execution Plan  Workload Management – Configure Resource Queues – Prioritize Users © Copyright 2012 EMC Corporation. All rights reserved. 21
  • 22. GREENPLUM DATABASE Easy Extension Installation Greenplum Package Manager Greenplum supports easy deployment of numerous extensions like Madlib, PL/Perl, PL/Java, PostGIS, etc. Master Servers Segment ... Servers ... © Copyright 2012 EMC Corporation. All rights reserved. 22
  • 23. GREENPLUM DATABASE High Performance gNet for Hadoop Parallel Query Access  Connect any data set in Hadoop to GP DB‘s SQL Engine  Process Hadoop data in place  Parallelize import/export data from/to Hadoop thanks to GP DB‘s market leading data sharing performance gNet for Hadoop  Supported formats: – Text (compressed and uncompressed) – binary User- Text Binary Defined – proprietary/user-defined  GP HD 1.x, GP MR 1.x, CDH3u2 © Copyright 2012 EMC Corporation. All rights reserved. 23
  • 24. High Availability, Back up, Support GREENPLUM DATABASE © Copyright 2012 EMC Corporation. All rights reserved. 24
  • 25. GREENPLUM DATABASE High Availability  GPDB cluster – 2 Master servers – Multiple Segment servers  Segment servers support multiple database instances – Primary instances that actively process queries – Standby mirror instances  Block level mirroring – Low resource Set of Active consumption Segment Instances – Differential resynch capable for fast recovery © Copyright 2012 EMC Corporation. All rights reserved. 25
  • 26. GREENPLUM DATABASE Backup/Restore with EMC Data Domain  Integration options – NFS: Data Domain device mounted Full Appliance as NFS storage + Data Domain – DD Boost: Native, client-side deduplication. Supported in GPDB 4.2 and higher Boost or NFS  Drastic reduction in backup storage requirement 2 X 10GBit IP  Backup all segment servers in parallel directly to Data Domain  Data Domain Integrates seamlessly into standard Greenplum full backup data export and data restore procedures © Copyright 2012 EMC Corporation. All rights reserved. 26
  • 27. GREENPLUM DATABASE Backup/Restore with EMC Data Domain Backup and restore between remote and primary sites Greenplum DCA Greenplum DCA Data Domain Data Domain LAN/WAN Data Domain Replication  Ideal for configurations with RPO and RTO requirements that can be specified in hours  Supports: – Collection Replication for DD Boost backup – Directory-level replication for NFS backup – Encryption over the WAN © Copyright 2012 EMC Corporation. All rights reserved. 27
  • 28. GREENPLUM DATABASE Customer Support Services • Remote Technical Support – 24x7 technical support and remote troubleshooting – Customer-managed case severity level – Four-hour response objective • Onsite Support (DCA Only) – Installation of replacement parts – Replacement parts shipped for next business day arrival – GP SW upgrade included • Proactive Service – Secure remote monitoring for hardware (DCA) – Notification of engineering technical advisories – Built-in tools maximize stability and performance • Secure Self-Help – 24x7 access to eService support tools including knowledgebase, forums, and appropriately licensed software updates © Copyright 2012 EMC Corporation. All rights reserved. 28
  • 29. GREENPLUM DATABASE Other Relevant Greenplum Sessions Session Presenter Times Unified Analytics Platform Introduction Brian Wilson Tues 10:00-11:00 Thurs 1:00-2:00 Greenplum Hadoop Overview Susheel Kaushik Mon 10:00-11:00 Wed 4:15-5:15 Greenplum DCA Overview Hanxi Chen Mon 4:00-5:00 Thurs 10:00-11:00 Greenplum Analytics Workbench Apurva Desai Wed 8:30-9:30 Thurs 10:00-11:00 Analytics on Hadoop Don Miner Tues 11:30-12:30 Thurs 8:30-9:30 Big Data Driven Businesses in Action: Mike Maxey Wed 4:15-5:15 Thurs 11:30-12:30 Creating Real Business Value Using Greenplum UAP (Panel w/4 Customers) Analytics for Business Value: Collaboration Josh Klahr Mon 10:00-11:00 Wed 2:45-3:45 Disruptive Data Science — How Data Annika Jimenez Tues 4:15-5:15 Thurs 11:30-12:30 Science and Big Data are Transforming David Dietrich Business, IT and People © Copyright 2012 EMC Corporation. All rights reserved. 29
  • 30. Thank You © Copyright 2012 EMC Corporation. All rights reserved. 30