SlideShare une entreprise Scribd logo
1  sur  18
About Presenter




0                     Karan Gulati (SSAS Maestro)
SQL - Parallel Data Warehouse (PDW)




    Let’s figure out……….




1                          Karan Gulati (SSAS Maestro)
What are we covering


    •   World of Appliance
    •   Introducing SQL Parallel Data Warehouse (PDW)
    •   Different Kinds of Nodes in PDW
    •   Hub and Spoke Architecture




2                                   Karan Gulati (SSAS Maestro)
What’s an Appliance?


    A  re we talking about a refrigerator or an oven?




3                                   Karan Gulati (SSAS Maestro)
Appliance World…….

    Appliance is nothing but preconfigured machine which is dedicated for specific
    use in contrast to general use.

    In Computer world - An appliance comes with hardware, with pre-installed OS,
    and Software, keeping all best practices or guideline in mind while building an
    Appliance.

    What this means to users?
    Just plug and play…... and ready to use just like a refrigerator or an oven.




4                                      Karan Gulati (SSAS Maestro)
Have you heard about SQL PDW



    Microsoft SQL Server Parallel Data Warehouse (SQL Server PDW) is:

        •   Massively Parallel Processing Appliance (MPP)
        •   Simple to deploy
        •   Pre-built Appliance with software, hardware and networking components
        •   Highly scalable data storage, and high-speed data transfer
        •   One answer to largest data warehouse workloads




5                                   Karan Gulati (SSAS Maestro)
Symmetric Multi Processing

    First, lets understand Symmetric multi processing(SMP)

    In SMP each CPU core can work with any section of memory or disk, and all
    memory and all disk available to each core.

    Problem starts when too many CPUs making requests same time for data on the
    system bus which creates a traffic jam and that results in queue consequently
    slowness and limited amount of processing can take place on SMP creates
    limitation as the usage grows System Bus.




6                                    Karan Gulati (SSAS Maestro)
Solution to SMP Problem lies in MPP

    Massively Parallel Processing Architecture refers to the use of a large number of
    separate computes to perform a set of a job.

    In simple words MPP is:
    Multiple boxes with their own CPUs, Memory and other resources to perform
    given task; this way we are using the power of all machines / nodes in one go.




7                                     Karan Gulati (SSAS Maestro)
SQL PDW: Flow of Query Execution




                          Control node break                                  When the compute
                            the Query into                   DMS or Data       nodes are finished,
                            multiple parallel              Movement Service      control nodes
     Query hits control     operations and                  coordinates any      handles post-
          node            distribute them out                needed data       processing and re-
                          to compute nodes                 movement among     integration of result
                           where the actual                     nodes           sets for delivery
                              data resides                                      back to the users




8                                      Karan Gulati (SSAS Maestro)
SQL PDW: Nodes and Services


       Control Node


       Compute Node


       Administrative Service Nodes


       Data Movement Services

9                         Karan Gulati (SSAS Maestro)
Control Node
     An Control node that is the central point of control for processing queries on the
     SQL Server PDW appliance. The Control node receives the user query, creates a
     distributed query plan, communicates relevant plan operations and data to
     Compute nodes, receives Compute node results, performs any necessary
     aggregation of results, and then returns the query results to the user.




10                                     Karan Gulati (SSAS Maestro)
Compute Node
     An Compute node that is the basic unit of scalability and storage. Each Compute
     node in the SQL Server PDW appliance uses its own user-data and computing
     resources to perform a portion of each parallel query.




11                                    Karan Gulati (SSAS Maestro)
Administrative Service Nodes
     •   Landing Zone node: An appliance node that provides temporary storage and
         processing for loading data onto the appliance.
     •   Management node: An appliance node that performs multiple functions
         related to managing the hardware and software in the appliance. This node is
         the hub for software deployment and servicing, authentication within the
         appliance (not login authentication), and monitoring system health and
         performance
     •   Backup Node: The Backup Node provides high-speed integrated backup at
         the database level. This is tied to the organization’s overall backup strategy
         and systems.




12                                     Karan Gulati (SSAS Maestro)
Data Movement Services




                      • When a query is submitted to a control node, it is the
                        PDW Engine that determines what the query plan will
                        be on each individual compute node, then submits the
                        query to all the compute nodes through the DMS


      DMS             • Further DMS coordinates any needed data movement
                        among nodes taking place between and handles any
                        functions that needed to be resolved centrally

                      • In simple words DMS is the brain that ties all the
                        nodes together




13                            Karan Gulati (SSAS Maestro)
Hub and Spoke Architecture

     Data warehousing architecture with a central hub data warehouse that provides a
     flexible and high speed ability to move or copy EDW data to spokes.

     A spoke is typically a data mart in an optimized physical storage for a particular
     user group or organization.

      A data mart is usually a much smaller subset of the data in the EDW and specific
     to the reporting and analytic needs of a specific user community.




14                                     Karan Gulati (SSAS Maestro)
SQL PDW – Act as Hub
     Using a true hub-and-spoke architecture, all enterprise data can be
     maintained on a SQL Server 2008 R2 Parallel Data Warehouse hub while
     departments or business units keep their existing data marts to suit their
     needs. High-speed data transfer relieves traditional barriers to hub and
     spoke. Power users can even deploy a dedicated MPP appliance as a
     spoke so they can autonomously manage resources, while IT can enforce
     enterprise standards across all data.




15                                 Karan Gulati (SSAS Maestro)
Recommended Reading
       SQL Server 2008 R2 Parallel Data Warehouse
       ITIC: Comparison of Oracle Database Appliance to Microsoft SQL Server
       Implementing a SQL Server PDW Using the Kimball Approach
       Implementing Data Warehouse 2.0 by Immon




16                                  Karan Gulati (SSAS Maestro)
Thanks

       Contact Speaker -

                 http://karanspeaks.com

                 http://blogs.msdn.com/karang

                 https://twitter.com/karangspeaks

               http://in.linkedin.com/in/karanspeaks




17                          Karan Gulati (SSAS Maestro)

Contenu connexe

Tendances

Tendances (20)

Extreme SSAS - Part I
Extreme SSAS  - Part IExtreme SSAS  - Part I
Extreme SSAS - Part I
 
Migrating on premises workload to azure sql database
Migrating on premises workload to azure sql databaseMigrating on premises workload to azure sql database
Migrating on premises workload to azure sql database
 
Microsoft cloud big data strategy
Microsoft cloud big data strategyMicrosoft cloud big data strategy
Microsoft cloud big data strategy
 
Sql server 2016 new features
Sql server 2016 new featuresSql server 2016 new features
Sql server 2016 new features
 
Designing a modern data warehouse in azure
Designing a modern data warehouse in azure   Designing a modern data warehouse in azure
Designing a modern data warehouse in azure
 
Introduction to Cortana Analytics
Introduction to Cortana AnalyticsIntroduction to Cortana Analytics
Introduction to Cortana Analytics
 
Data warehouse con azure synapse analytics
Data warehouse con azure synapse analyticsData warehouse con azure synapse analytics
Data warehouse con azure synapse analytics
 
Azure SQL Database Managed Instance
Azure SQL Database Managed InstanceAzure SQL Database Managed Instance
Azure SQL Database Managed Instance
 
Azure Analysis Services (Azure Bootcamp 2018)
Azure Analysis Services (Azure Bootcamp 2018)Azure Analysis Services (Azure Bootcamp 2018)
Azure Analysis Services (Azure Bootcamp 2018)
 
SQL to Azure Migrations
SQL to Azure MigrationsSQL to Azure Migrations
SQL to Azure Migrations
 
Delta Lake with Azure Databricks
Delta Lake with Azure DatabricksDelta Lake with Azure Databricks
Delta Lake with Azure Databricks
 
SQL Server 2016 new features
SQL Server 2016 new featuresSQL Server 2016 new features
SQL Server 2016 new features
 
Synapse for mere mortals
Synapse for mere mortalsSynapse for mere mortals
Synapse for mere mortals
 
Introduction to Azure Data Lake
Introduction to Azure Data LakeIntroduction to Azure Data Lake
Introduction to Azure Data Lake
 
Data visualization with sql analytics
Data visualization with sql analyticsData visualization with sql analytics
Data visualization with sql analytics
 
Azure Lowlands: An intro to Azure Data Lake
Azure Lowlands: An intro to Azure Data LakeAzure Lowlands: An intro to Azure Data Lake
Azure Lowlands: An intro to Azure Data Lake
 
SQL Server 2019 Big Data Cluster
SQL Server 2019 Big Data ClusterSQL Server 2019 Big Data Cluster
SQL Server 2019 Big Data Cluster
 
Azure SQL DB Managed Instances Built to easily modernize application data layer
Azure SQL DB Managed Instances Built to easily modernize application data layerAzure SQL DB Managed Instances Built to easily modernize application data layer
Azure SQL DB Managed Instances Built to easily modernize application data layer
 
Introducing Azure SQL Database
Introducing Azure SQL DatabaseIntroducing Azure SQL Database
Introducing Azure SQL Database
 
Microsoft SQL Server 2016 - Everything Built In
Microsoft SQL Server 2016 - Everything Built InMicrosoft SQL Server 2016 - Everything Built In
Microsoft SQL Server 2016 - Everything Built In
 

En vedette

Hortonworks roadshow
Hortonworks roadshowHortonworks roadshow
Hortonworks roadshow
Accenture
 
Bi303 data warehousing with fast track and pdw - Assaf Fraenkel
Bi303 data warehousing with fast track and pdw - Assaf FraenkelBi303 data warehousing with fast track and pdw - Assaf Fraenkel
Bi303 data warehousing with fast track and pdw - Assaf Fraenkel
sqlserver.co.il
 
Sql server 2012_parallel_data_warehouse_breakthrough_platform_white_paper
Sql server 2012_parallel_data_warehouse_breakthrough_platform_white_paperSql server 2012_parallel_data_warehouse_breakthrough_platform_white_paper
Sql server 2012_parallel_data_warehouse_breakthrough_platform_white_paper
Wendy Frodyma
 
Data Provisioning & Optimization
Data Provisioning & OptimizationData Provisioning & Optimization
Data Provisioning & Optimization
Ambareesh Kulkarni
 
V4 qlik view-datastorage
V4 qlik view-datastorageV4 qlik view-datastorage
V4 qlik view-datastorage
naresh akki
 
KPerry - 20463 Implementing a Data Warehouse with Microsoft® SQL Server (2)
KPerry - 20463 Implementing a Data Warehouse with Microsoft® SQL Server (2)KPerry - 20463 Implementing a Data Warehouse with Microsoft® SQL Server (2)
KPerry - 20463 Implementing a Data Warehouse with Microsoft® SQL Server (2)
Kwame M. Perry
 
Different ways to load data in qlikview
Different ways to load data in qlikviewDifferent ways to load data in qlikview
Different ways to load data in qlikview
Swamy Danthuri
 
IOD 2009 ECM Specific Breakouts
IOD 2009 ECM Specific BreakoutsIOD 2009 ECM Specific Breakouts
IOD 2009 ECM Specific Breakouts
Ranjun Chauhan
 

En vedette (20)

Accelerating Big Data Analytics
Accelerating Big Data AnalyticsAccelerating Big Data Analytics
Accelerating Big Data Analytics
 
SSAS Reference Architecture
SSAS Reference ArchitectureSSAS Reference Architecture
SSAS Reference Architecture
 
Hortonworks roadshow
Hortonworks roadshowHortonworks roadshow
Hortonworks roadshow
 
Bi303 data warehousing with fast track and pdw - Assaf Fraenkel
Bi303 data warehousing with fast track and pdw - Assaf FraenkelBi303 data warehousing with fast track and pdw - Assaf Fraenkel
Bi303 data warehousing with fast track and pdw - Assaf Fraenkel
 
Sql server 2012_parallel_data_warehouse_breakthrough_platform_white_paper
Sql server 2012_parallel_data_warehouse_breakthrough_platform_white_paperSql server 2012_parallel_data_warehouse_breakthrough_platform_white_paper
Sql server 2012_parallel_data_warehouse_breakthrough_platform_white_paper
 
Attunity Solutions for Teradata
Attunity Solutions for TeradataAttunity Solutions for Teradata
Attunity Solutions for Teradata
 
A19 amis
A19 amisA19 amis
A19 amis
 
Attunity Hortonworks Webinar- Sept 22, 2016
Attunity Hortonworks Webinar- Sept 22, 2016Attunity Hortonworks Webinar- Sept 22, 2016
Attunity Hortonworks Webinar- Sept 22, 2016
 
How to Operationalise Real-Time Hadoop in the Cloud
How to Operationalise Real-Time Hadoop in the CloudHow to Operationalise Real-Time Hadoop in the Cloud
How to Operationalise Real-Time Hadoop in the Cloud
 
Data Provisioning & Optimization
Data Provisioning & OptimizationData Provisioning & Optimization
Data Provisioning & Optimization
 
V4 qlik view-datastorage
V4 qlik view-datastorageV4 qlik view-datastorage
V4 qlik view-datastorage
 
Llorance New Horizons 20768 Developing SQL Data Models
Llorance New Horizons 20768 Developing SQL Data ModelsLlorance New Horizons 20768 Developing SQL Data Models
Llorance New Horizons 20768 Developing SQL Data Models
 
TIQ Solutions - QlikView Data Integration in a Java World
TIQ Solutions - QlikView Data Integration in a Java WorldTIQ Solutions - QlikView Data Integration in a Java World
TIQ Solutions - QlikView Data Integration in a Java World
 
KPerry - 20463 Implementing a Data Warehouse with Microsoft® SQL Server (2)
KPerry - 20463 Implementing a Data Warehouse with Microsoft® SQL Server (2)KPerry - 20463 Implementing a Data Warehouse with Microsoft® SQL Server (2)
KPerry - 20463 Implementing a Data Warehouse with Microsoft® SQL Server (2)
 
Different ways to load data in qlikview
Different ways to load data in qlikviewDifferent ways to load data in qlikview
Different ways to load data in qlikview
 
Microsoft SSAS: Should I Use Tabular or Multidimensional?
Microsoft SSAS: Should I Use Tabular or Multidimensional?Microsoft SSAS: Should I Use Tabular or Multidimensional?
Microsoft SSAS: Should I Use Tabular or Multidimensional?
 
Versa Shore Microsoft APS PDW webinar
Versa Shore Microsoft APS PDW webinarVersa Shore Microsoft APS PDW webinar
Versa Shore Microsoft APS PDW webinar
 
PDW value proposition
PDW value propositionPDW value proposition
PDW value proposition
 
Resume abhishek deloitte_informatica
Resume abhishek deloitte_informaticaResume abhishek deloitte_informatica
Resume abhishek deloitte_informatica
 
IOD 2009 ECM Specific Breakouts
IOD 2009 ECM Specific BreakoutsIOD 2009 ECM Specific Breakouts
IOD 2009 ECM Specific Breakouts
 

Similaire à SQL - Parallel Data Warehouse (PDW)

Big_SQL_3.0_Whitepaper
Big_SQL_3.0_WhitepaperBig_SQL_3.0_Whitepaper
Big_SQL_3.0_Whitepaper
Scott Gray
 
Financial, Retail And Shopping Domains
Financial, Retail And Shopping DomainsFinancial, Retail And Shopping Domains
Financial, Retail And Shopping Domains
Sonia Sanchez
 
Sybase IQ ile Muhteşem Performans
Sybase IQ ile Muhteşem PerformansSybase IQ ile Muhteşem Performans
Sybase IQ ile Muhteşem Performans
Sybase Türkiye
 

Similaire à SQL - Parallel Data Warehouse (PDW) (20)

Exploring Microsoft Azure Infrastructures
Exploring Microsoft Azure InfrastructuresExploring Microsoft Azure Infrastructures
Exploring Microsoft Azure Infrastructures
 
Whats New Sql Server 2008 R2 Cw
Whats New Sql Server 2008 R2 CwWhats New Sql Server 2008 R2 Cw
Whats New Sql Server 2008 R2 Cw
 
Whats New Sql Server 2008 R2
Whats New Sql Server 2008 R2Whats New Sql Server 2008 R2
Whats New Sql Server 2008 R2
 
Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)
Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)
Azure Days 2019: Grösser und Komplexer ist nicht immer besser (Meinrad Weiss)
 
DataCluster
DataClusterDataCluster
DataCluster
 
Fast Analytics
Fast Analytics Fast Analytics
Fast Analytics
 
Introducing Azure SQL Data Warehouse
Introducing Azure SQL Data WarehouseIntroducing Azure SQL Data Warehouse
Introducing Azure SQL Data Warehouse
 
SDN Multi-Controller Domain.pptx
SDN Multi-Controller Domain.pptxSDN Multi-Controller Domain.pptx
SDN Multi-Controller Domain.pptx
 
Big_SQL_3.0_Whitepaper
Big_SQL_3.0_WhitepaperBig_SQL_3.0_Whitepaper
Big_SQL_3.0_Whitepaper
 
Sql Sever Presentation.pptx
Sql Sever Presentation.pptxSql Sever Presentation.pptx
Sql Sever Presentation.pptx
 
Rise of NewSQL
Rise of NewSQLRise of NewSQL
Rise of NewSQL
 
IBM Pure Data System for Analytics (Netezza)
IBM Pure Data System for Analytics (Netezza)IBM Pure Data System for Analytics (Netezza)
IBM Pure Data System for Analytics (Netezza)
 
AZURE Data Related Services
AZURE Data Related ServicesAZURE Data Related Services
AZURE Data Related Services
 
Tales From The Front: An Architecture For Multi-Data Center Scalable Applicat...
Tales From The Front: An Architecture For Multi-Data Center Scalable Applicat...Tales From The Front: An Architecture For Multi-Data Center Scalable Applicat...
Tales From The Front: An Architecture For Multi-Data Center Scalable Applicat...
 
Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...
Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...
Artur Borycki - Beyond Lambda - how to get from logical to physical - code.ta...
 
No sql
No sqlNo sql
No sql
 
Financial, Retail And Shopping Domains
Financial, Retail And Shopping DomainsFinancial, Retail And Shopping Domains
Financial, Retail And Shopping Domains
 
Azure Data Factory ETL Patterns in the Cloud
Azure Data Factory ETL Patterns in the CloudAzure Data Factory ETL Patterns in the Cloud
Azure Data Factory ETL Patterns in the Cloud
 
Session 1.docx
Session 1.docxSession 1.docx
Session 1.docx
 
Sybase IQ ile Muhteşem Performans
Sybase IQ ile Muhteşem PerformansSybase IQ ile Muhteşem Performans
Sybase IQ ile Muhteşem Performans
 

Dernier

Dernier (20)

Handwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed textsHandwritten Text Recognition for manuscripts and early printed texts
Handwritten Text Recognition for manuscripts and early printed texts
 
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Evaluating the top large language models.pdf
Evaluating the top large language models.pdfEvaluating the top large language models.pdf
Evaluating the top large language models.pdf
 
2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...2024: Domino Containers - The Next Step. News from the Domino Container commu...
2024: Domino Containers - The Next Step. News from the Domino Container commu...
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdfUnderstanding Discord NSFW Servers A Guide for Responsible Users.pdf
Understanding Discord NSFW Servers A Guide for Responsible Users.pdf
 
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
08448380779 Call Girls In Diplomatic Enclave Women Seeking Men
 
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time AutomationFrom Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
 
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men08448380779 Call Girls In Greater Kailash - I Women Seeking Men
08448380779 Call Girls In Greater Kailash - I Women Seeking Men
 
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
04-2024-HHUG-Sales-and-Marketing-Alignment.pptx
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
Scaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organizationScaling API-first – The story of a global engineering organization
Scaling API-first – The story of a global engineering organization
 
08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men08448380779 Call Girls In Civil Lines Women Seeking Men
08448380779 Call Girls In Civil Lines Women Seeking Men
 
What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?What Are The Drone Anti-jamming Systems Technology?
What Are The Drone Anti-jamming Systems Technology?
 

SQL - Parallel Data Warehouse (PDW)

  • 1. About Presenter 0 Karan Gulati (SSAS Maestro)
  • 2. SQL - Parallel Data Warehouse (PDW) Let’s figure out………. 1 Karan Gulati (SSAS Maestro)
  • 3. What are we covering • World of Appliance • Introducing SQL Parallel Data Warehouse (PDW) • Different Kinds of Nodes in PDW • Hub and Spoke Architecture 2 Karan Gulati (SSAS Maestro)
  • 4. What’s an Appliance? A re we talking about a refrigerator or an oven? 3 Karan Gulati (SSAS Maestro)
  • 5. Appliance World……. Appliance is nothing but preconfigured machine which is dedicated for specific use in contrast to general use. In Computer world - An appliance comes with hardware, with pre-installed OS, and Software, keeping all best practices or guideline in mind while building an Appliance. What this means to users? Just plug and play…... and ready to use just like a refrigerator or an oven. 4 Karan Gulati (SSAS Maestro)
  • 6. Have you heard about SQL PDW Microsoft SQL Server Parallel Data Warehouse (SQL Server PDW) is: • Massively Parallel Processing Appliance (MPP) • Simple to deploy • Pre-built Appliance with software, hardware and networking components • Highly scalable data storage, and high-speed data transfer • One answer to largest data warehouse workloads 5 Karan Gulati (SSAS Maestro)
  • 7. Symmetric Multi Processing First, lets understand Symmetric multi processing(SMP) In SMP each CPU core can work with any section of memory or disk, and all memory and all disk available to each core. Problem starts when too many CPUs making requests same time for data on the system bus which creates a traffic jam and that results in queue consequently slowness and limited amount of processing can take place on SMP creates limitation as the usage grows System Bus. 6 Karan Gulati (SSAS Maestro)
  • 8. Solution to SMP Problem lies in MPP Massively Parallel Processing Architecture refers to the use of a large number of separate computes to perform a set of a job. In simple words MPP is: Multiple boxes with their own CPUs, Memory and other resources to perform given task; this way we are using the power of all machines / nodes in one go. 7 Karan Gulati (SSAS Maestro)
  • 9. SQL PDW: Flow of Query Execution Control node break When the compute the Query into DMS or Data nodes are finished, multiple parallel Movement Service control nodes Query hits control operations and coordinates any handles post- node distribute them out needed data processing and re- to compute nodes movement among integration of result where the actual nodes sets for delivery data resides back to the users 8 Karan Gulati (SSAS Maestro)
  • 10. SQL PDW: Nodes and Services Control Node Compute Node Administrative Service Nodes Data Movement Services 9 Karan Gulati (SSAS Maestro)
  • 11. Control Node An Control node that is the central point of control for processing queries on the SQL Server PDW appliance. The Control node receives the user query, creates a distributed query plan, communicates relevant plan operations and data to Compute nodes, receives Compute node results, performs any necessary aggregation of results, and then returns the query results to the user. 10 Karan Gulati (SSAS Maestro)
  • 12. Compute Node An Compute node that is the basic unit of scalability and storage. Each Compute node in the SQL Server PDW appliance uses its own user-data and computing resources to perform a portion of each parallel query. 11 Karan Gulati (SSAS Maestro)
  • 13. Administrative Service Nodes • Landing Zone node: An appliance node that provides temporary storage and processing for loading data onto the appliance. • Management node: An appliance node that performs multiple functions related to managing the hardware and software in the appliance. This node is the hub for software deployment and servicing, authentication within the appliance (not login authentication), and monitoring system health and performance • Backup Node: The Backup Node provides high-speed integrated backup at the database level. This is tied to the organization’s overall backup strategy and systems. 12 Karan Gulati (SSAS Maestro)
  • 14. Data Movement Services • When a query is submitted to a control node, it is the PDW Engine that determines what the query plan will be on each individual compute node, then submits the query to all the compute nodes through the DMS DMS • Further DMS coordinates any needed data movement among nodes taking place between and handles any functions that needed to be resolved centrally • In simple words DMS is the brain that ties all the nodes together 13 Karan Gulati (SSAS Maestro)
  • 15. Hub and Spoke Architecture Data warehousing architecture with a central hub data warehouse that provides a flexible and high speed ability to move or copy EDW data to spokes. A spoke is typically a data mart in an optimized physical storage for a particular user group or organization. A data mart is usually a much smaller subset of the data in the EDW and specific to the reporting and analytic needs of a specific user community. 14 Karan Gulati (SSAS Maestro)
  • 16. SQL PDW – Act as Hub Using a true hub-and-spoke architecture, all enterprise data can be maintained on a SQL Server 2008 R2 Parallel Data Warehouse hub while departments or business units keep their existing data marts to suit their needs. High-speed data transfer relieves traditional barriers to hub and spoke. Power users can even deploy a dedicated MPP appliance as a spoke so they can autonomously manage resources, while IT can enforce enterprise standards across all data. 15 Karan Gulati (SSAS Maestro)
  • 17. Recommended Reading SQL Server 2008 R2 Parallel Data Warehouse ITIC: Comparison of Oracle Database Appliance to Microsoft SQL Server Implementing a SQL Server PDW Using the Kimball Approach Implementing Data Warehouse 2.0 by Immon 16 Karan Gulati (SSAS Maestro)
  • 18. Thanks Contact Speaker - http://karanspeaks.com http://blogs.msdn.com/karang https://twitter.com/karangspeaks http://in.linkedin.com/in/karanspeaks 17 Karan Gulati (SSAS Maestro)