SlideShare a Scribd company logo
1 of 43
Download to read offline
© 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved.
Timothy DiLauro, AWS Solutions Architect
Julien Lépine, AWS Solutions Architect
October 2015
CMP306
On-Demand Windows HPC on AWS
Windows Clusters for Dynamic Needs
What to Expect from the Session
HPC on AWS
AWS Architecture for Windows HPC
AWS Architecture for HPC
Best Practices for Windows HPC
Demonstration
HPC on AWS
Low cost with flexible pricing Efficient clusters
Unlimited infrastructure
Faster time to results
Concurrent Clusters on-demand
Increased collaboration
Why AWS for HPC?
Popular HPC workloads on AWS
Genome
processing
Modeling and
Simulation
Government and
Educational Research
Monte Carlo
Simulations
Transcoding and
Encoding
Computational
Chemistry
Benefits of Agility
Elastic Cloud-Based Resources
Actual demand
Resources scaled to demand
Waste Customer
Dissatisfaction
Actual Demand
Predicted Demand
Rigid On-Premises Resources
Cost Benefits of HPC in the Cloud
Pay As You Go Model
Use only what you need
Multiple pricing models
On-Premises
Capital Expense Model
High upfront capital cost
High cost of ongoing support
AWS Journey for HPC Customer
Dev, Test, Eval True Production Mission Critical All-in
Build new production apps
Migrate production apps
Build mission-critical apps
Migrate mission-critical apps
Development and test
Eval and training
Corporate standard
“Cloud First”
AWS Architecture for HPC
On-Demand HPC on AWS
With AWS, deploy multiple clusters
running at the same time and match the
architectures to the jobs
AWS Architecture for HPC
Amazon
Virtual Private
Cloud
Amazon
Simple Storage
Service
Amazon
Elastic Block
Store
Amazon
Elastic Compute
Cloud
Amazon
CloudWatch
AWS
CloudFormation
Auto Scaling
2006 2007 2008 2009 2010 2011 2012-2013 2014
m1.small
m1.xlarge
m1.large
m1.small
m2.2xlarge
m2.4xlarge
c1.medium
c1.xlarge
m1.xlarge
m1.large
m1.small
cc2.8xlarge
cc1.4xlarge
cg1.4xlarge
t1.micro
m2.xlarge
m2.2xlarge
m2.4xlarge
c1.medium
c1.xlarge
m1.xlarge
m1.large
m1.small
cr1.8xlarge
hs1.8xlarge
m3.xlarge
m3.2xlarge
hi1.4xlarge
m1.medium
cc2.8xlarge
cg1.4xlarge
t1.micro
m2.xlarge
m2.2xlarge
m2.4xlarge
c1.medium
c1.xlarge
m1.xlarge
m1.large
m1.small
cc1.4xlarge
cg1.4xlarge
t1.micro
m2.xlarge
m2.2xlarge
m2.4xlarge
c1.medium
c1.xlarge
m1.xlarge
m1.large
m1.small
t2.micro
t2.small
t2.medium
t1.micro
hs1.8xlarge
m3.xlarge
m3.2xlarge
hi1.4xlarge
m1.medium
cc2.8xlarge
cr1.8xlarge
cg1.4xlarge
m2.xlarge
m2.2xlarge
m2.4xlarge
c1.medium
c1.xlarge
m1.xlarge
m1.large
m1.small
c1.medium
c1.xlarge
m1.xlarge
m1.large
m1.small
new
existing
Amazon Elastic Compute Cloud
g2.2xlarge
hs1.xlarge
hs1.2xlarge
hs1.4xlarge
c3.large
c3.xlarge
c3.2xlarge
c3.4xlarge
c3.8xlarge
m3.medium
m3.large
i2.large
i2.xlarge
i2.4xlarge
i2.8xlarge
r3.large
r3.xlarge
r3.2xlarge
r3.4xlarge
r3.8xlarge
Continuing to enable customer choice and right sizing of clusters
m4.large
m4.xlarge
m4.2xlarge
d2.xlarge
d2.2xlarge
d2.4xlarge
d2.8xlarge
t2.micro
t2.small
t2.medium
t2.large
t1.micro
hs1.8xlarge
m3.xlarge
m3.2xlarge
hi1.4xlarge
m1.medium
cc2.8xlarge
cr1.8xlarge
cg1.4xlarge
m2.xlarge
m2.2xlarge
m2.4xlarge
c1.medium
c1.xlarge
m1.xlarge
m1.large
m1.small
m4.4xlarge
m4.10xlarge
c4.xlarge
c4.2xlarge
c4.4xlarge
c4.8xlarge
g2.8xlarge
g2.2xlarge
hs1.xlarge
hs1.2xlarge
hs1.4xlarge
c3.large
c3.xlarge
c3.2xlarge
c3.4xlarge
c3.8xlarge
m3.medium
m3.large
i2.large
i2.xlarge
i2.4xlarge
i2.8xlarge
r3.large
r3.xlarge
r3.2xlarge
r3.4xlarge
r3.8xlarge
2015
Auto Scaling and Amazon CloudWatch
Match demands of cluster queue with appropriate compute needs
CloudWatch
Auto Scaling group
Windows HPC Job Manager
Amazon Elastic Block Store
• Designed for five nines of availability
• Attaches to Amazon EC2 within the same Availability Zone
• Point-in-time snapshots to Amazon S3
• Checkbox enabled encryption
Magnetic
General Purpose
(SSD)
Provisioned IOPS
(SSD)
Volume types
When performance
matters, use SSD-
backed volumes!
Network attached persistent block storage volumes for Amazon EC2
Amazon EBS
• Default 30 GB volume
• Gets initial I/O credit of 5.4M
• Burst for up to 30 mins @ 3000 IOPS
• Accumulate 90 I/O credits/second
Windows Boot Volume
Decrease launch time of instances by leveraging General Purpose SSD
Amazon Simple Storage Service
Store input and result datasets for dynamic and transitive Windows HPC clusters
Redundancy
Durability: designed for 99.999999999%
Availability: designed for 99.9%
Capacity
Consumption-based storage model
Virtually unlimited capacity
Security
Encryption in Transit: HTTPS/TLS
Encryption at Rest: SSE, SSE-C, SSE-KMS
Ease of use
Storage Classes: Standard, RRS, Glacier
Lifecycle Policies: archive, expiration
Amazon S3
Copy data to Amazon S3 and enable SSE
Write-S3Object –BucketName mybucket -Folder .Scripts -KeyPrefix SampleScripts -ServerSideEncryption
Copy data from Amazon S3 to a local folder
Read-S3Object –BucketName mybucket -KeyPrefix SampleScripts –Folder .
• Bucket: mybucket
• Keyname Space: SampleScripts
• Local Folder: .Scripts
Migrate data to AWS and Windows HPC clusters with AWS Tools for PowerShell
AWS CloudFormation
• Create templates to describe the AWS resources used to run your
application
• Provision identical copies of a stack
• Templates can be stored in a source control system
• Track all changes made to your infrastructure stack
• Modify and update resources in a controlled and predictable way
• Just choose what resources and configurations you need
• Customize your template via parameters
Consistently and easily deploy Windows HPC clusters based on workflow needs
Templated resource provisioning
Infrastructure as code
Declarative and flexible
AWS Architecture for HPC
• Users directory
• Bastion host
• Head node
• Compute nodes
Core Infrastructure Cluster Infrastructure
Amazon VPC
Users
Bastion
Core
Head
Compute
Compute
Compute
Compute
Compute
Compute
Compute
Compute
Cluster
AWS Architecture for HPC
Hybrid or “burst” All-in AWS
Choose the right deployment architecture for the use case
Core infrastructure:
Users directory
Bastion host
On-premises
AWS
AWS Directory Service
Amazon EC2
Cluster infrastructure:
Head node
Compute node
Storage
AWS
AWS
On-premises/AWS
Amazon EC2
Amazon EC2
Amazon S3
User workstations On-premises Amazon WorkSpaces
AWS Architecture for HPC
“Burst” to virtually unlimited compute capacity in AWS
Amazon VPC
Users
Bastion
Core
Head
Compute
Compute
Compute
Compute
Compute
Compute
Compute
Compute
ClusterWorkstations
Head
HPC
Users
CoreCluster
On-Premise
HPC
HPC HPC
AWS Architecture for HPC
Deploy users, infrastructure, and cluster all in AWS
Amazon VPC
Core
Head
Compute
Compute
Compute
Compute
Compute
Compute
Compute
Compute
ClusterWorkstations
Users
Bastion
AWS Architecture for Windows HPC
Windows Server on AWS
Easy Licensing
OS $/Hr
BYOL
Optimized AWS
Software for
Windows
EC2Config, drivers
Experience
October 2008
Every use case
Every industry
OS Choice
2003R2
2008, 2008R2
2012, 2012R2
Microsoft Portfolio
SQL Server
SharePoint
Exchange, Lync
Customize Systems
50+ EC2 instances
32, 64 bits
CPU, GPU
AWS Architecture for Windows HPC
Networking best practices for Windows HPC clusters
• Network Design- Leverage both public and private subnets, manage sizing
• Availability – Use multi-AZ design
• Access Control – use VPC endpoint and NAT for external accesses
Availability Zone A
Availability Zone B
Private Subnet
10.0.10.0/24
Public Subnet
10.0.0.0/24
Core
Private Subnet 2
10.0.11.0/24
VPC
Endpoint
NAT
Public Subnet
10.0.1.0/24
NAT
AWS Architecture for Windows HPC
• Domain Controller – Highly available extension of your existing environment
• Remote Desktop Gateway - Increase security posture
Core infrastructure best practices for Windows HPC clusters
Availability Zone A
Availability Zone B
Private Subnet
10.0.10.0/24
Public Subnet
10.0.0.0/24
DC
Core
Private Subnet 2
10.0.11.0/24
DC
RDGW
Public Subnet
10.0.1.0/24
AWS Architecture for Windows HPC
• Head Node – Size independent of Compute Node, General Purpose family
• Compute Nodes – use Auto Scaling groups and cluster instances
• S3 Bucket – Persistent, secure, available storage of cluster input and results
Cluster infrastructure best practices for Windows HPC clusters
Availability Zone B
Availability Zone A
Private Subnet
10.0.10.0/24
Public Subnet
10.0.0.0/24
Core
Private Subnet 2
10.0.11.0/24
Head
Compute
Compute
Compute
Compute
Compute
Compute
Compute
Compute
Cluster
Public Subnet
10.0.1.0/24
S3
Bucket
VPC
Endpoint
AWS Architecture for Windows HPC
All at once, complete Windows HPC infrastructure on AWS
Availability Zone B
Availability Zone A
Private Subnet
10.0.10.0/24
Public Subnet
10.0.0.0/24
DC
S3
Bucket
Core
Private Subnet 2
10.0.11.0/24
DC
Head
Compute
Compute
Compute
Compute
Compute
Compute
Compute
Compute
Cluster
VPC
Endpoint
RDGW
NAT
Public Subnet
10.0.1.0/24
NAT
AWS Architecture for Windows HPC
Launch multiple clusters right-sized to complete work in amount of time specified
Private Subnet
10.0.10.0/24
Public Subnet
10.0.0.0/24
DC
Core
Private Subnet 2
10.0.11.0/24
DC
Head
Compute
Compute
Compute
Compute
Compute
Compute
Compute
Compute
Cluster
Head
Compute
Compute
Compute
Compute
Compute
Compute
Compute
Compute
Head
Compute
Compute
Compute
Compute
Compute
Compute
Compute Compute Compute Compute
RDGW
NAT
Public Subnet
10.0.1.0/24
NAT
Availability Zone A
Availability Zone B
S3
Bucket
VPC
Endpoint
Best Practices for Windows HPC
Secure Windows HPC Workloads on AWS
AWS Resource Access: Enable access to AWS resource through
policies in IAM roles
Encryption at Rest: Enable encryption on EBS volumes and specify
server side encryption for objects in Amazon S3
Create private access to input and output results stored in Amazon S3
via VPC endpoints
Ensure auditability of AWS account by enabling AWS CloudTrail
Leverage native AWS security features to enhance the
security posture of Windows HPC
Optimized network for Windows HPC
Enhanced Networking: SR-IOV feature provides higher PPS
performance, lower latencies, and very low network jitter
Placement Groups: All instances get low latency, full bisection,
10Gbps bandwidth between instances
EBS Optimization: Get up to 4000Mbps additional dedicated
throughput dedicated to your storage needs
AWS PV Drivers / Intel Drivers: Make sure you stay current with
the latest
Get the most of AWS networking for your HPC workloads
Optimized processing with Windows HPC
Hyper-threading: Most current generation AWS instances provide
hyper-threading, keep it or deactivate it based on your needs
Turbo Boost: Latest generation of instances leave you control C-
state and P-state registers for your processors
The right instance: Choose your constraints (price, CPU, GPU,
RAM, network) and get the instance type that fits your use case
The right storage: Choose the amount and support of instance
storage or Amazon EBS storage required, and leverage storage
services such as Amazon S3
Get the most of your instances for your HPC workloads
Automated Windows HPC computing
Windows PowerShell®: You can get all the installation and
configuration of the instances done automatically
AWS Tools for Windows PowerShell: Your cluster can become
aware of the infrastructure it is running on
Auto Scaling: Automate provisioning and scaling of your cluster to
have your workloads finished when you need them
AWS CloudFormation: Deploy your clusters in a few clicks, create
test clusters in minutes
Get your cluster as code, running in minutes from scratch
Demonstration
Windows HPC AWS CloudFormation Template
Enable automated deployments of clusters with pre-built template
Amazon VPC
DC
RDGW
Core
Head
Compute
Compute
Compute
Compute
Compute
Compute
Compute
Compute
Cluster
AWS CloudFormation Templates: Prerequisites
Things to do before starting the template
Select your region and base image
• VPC + Subnet: Just input selected CIDR
• Instance Types: for all instances
• (Optional) Placement Group: Create a VPC placement group
Prepare installation media then snapshot
• Download Microsoft HPC Pack and unzip to HPCPack2012R2-Full
• Extract SQL Server installation to SQLInstall
• Download Intel SR-IOV drivers and extract to PROWinx64
• Download latest AWS PV drivers and extract to AWSPVDriverSetup
Select installation configuration:
• Define domain configuration and credentials
AWS CloudFormation Template: Core
Building the core Windows infrastructure
Base Network
• VPC + Public Subnet: Select your CIDR
• DHCP Option Set: Configured to use DC
• Security Groups: For bastion and cluster
Core Infrastructure:
• Domain Controller in new forest
• Remote Desktop Bastion Host (outside of domain)
• Domain User “Join Computer to Domain” privileges
AWS CloudFormation Template: Cluster
Building the Microsoft HPC cluster on AWS
Head-Node
• Multi-role: database, HPC Head node, Share
• Monitored: Amazon CloudWatch Custom metrics
Compute Nodes:
• Automated: Automatic configuration to join the cluster
• Scalable: Auto Scaling group resizing the cluster based on load
• Up-to-date: auto upgrade of AWS and Intel Drivers
Windows HPC AWS CloudFormation Template
In < 30 minutes, your cluster will be ready to accept jobs.
Getting Started Collateral
QwikLAB: Launching Microsoft HPC Pack on AWS:
https://www.qwiklab.com/focuses/preview/1604?search=19103
Reference CloudFormation Template:
https://github.com/awslabs/aws-cfn-windows-hpc--template
Remember to complete
your evaluations!
Thank you!

More Related Content

Viewers also liked

AWS APAC Webinar Week - Understanding AWS Storage Options
AWS APAC Webinar Week - Understanding AWS Storage OptionsAWS APAC Webinar Week - Understanding AWS Storage Options
AWS APAC Webinar Week - Understanding AWS Storage OptionsAmazon Web Services
 
AWS March 2016 Webinar Series Getting Started with Serverless Architectures
AWS March 2016 Webinar Series   Getting Started with Serverless ArchitecturesAWS March 2016 Webinar Series   Getting Started with Serverless Architectures
AWS March 2016 Webinar Series Getting Started with Serverless ArchitecturesAmazon Web Services
 
AWS Mobile Services & SDK Introduction & Demo
AWS Mobile Services & SDK Introduction & DemoAWS Mobile Services & SDK Introduction & Demo
AWS Mobile Services & SDK Introduction & DemoAmazon Web Services
 
The Pace of Innovation - Pop-up Loft Tel Aviv
The Pace of Innovation - Pop-up Loft Tel AvivThe Pace of Innovation - Pop-up Loft Tel Aviv
The Pace of Innovation - Pop-up Loft Tel AvivAmazon Web Services
 
Compute Without Servers – Building Applications with AWS Lambda - Technical 301
Compute Without Servers – Building Applications with AWS Lambda - Technical 301Compute Without Servers – Building Applications with AWS Lambda - Technical 301
Compute Without Servers – Building Applications with AWS Lambda - Technical 301Amazon Web Services
 
(NET307) Pinterest: The road from EC2-Classic To EC2-VPC
(NET307) Pinterest: The road from EC2-Classic To EC2-VPC(NET307) Pinterest: The road from EC2-Classic To EC2-VPC
(NET307) Pinterest: The road from EC2-Classic To EC2-VPCAmazon Web Services
 
(DEV204) Building High-Performance Native Cloud Apps In C++
(DEV204) Building High-Performance Native Cloud Apps In C++(DEV204) Building High-Performance Native Cloud Apps In C++
(DEV204) Building High-Performance Native Cloud Apps In C++Amazon Web Services
 
Workshop: AWS Lamda Signal Corps vs Zombies
Workshop: AWS Lamda Signal Corps vs ZombiesWorkshop: AWS Lamda Signal Corps vs Zombies
Workshop: AWS Lamda Signal Corps vs ZombiesAmazon Web Services
 
Security Day IAM Recommended Practices
Security Day IAM Recommended PracticesSecurity Day IAM Recommended Practices
Security Day IAM Recommended PracticesAmazon Web Services
 
Ansible on aws - Pop-up Loft Tel Aviv
Ansible on aws - Pop-up Loft Tel AvivAnsible on aws - Pop-up Loft Tel Aviv
Ansible on aws - Pop-up Loft Tel AvivAmazon Web Services
 
Grid Computing for Financial Services
Grid Computing for Financial ServicesGrid Computing for Financial Services
Grid Computing for Financial ServicesAmazon Web Services
 
Getting Started with Amazon Enterprise Applications
Getting Started with Amazon Enterprise ApplicationsGetting Started with Amazon Enterprise Applications
Getting Started with Amazon Enterprise ApplicationsAmazon Web Services
 
AWS July Webinar Series: Amazon redshift migration and load data 20150722
AWS July Webinar Series: Amazon redshift migration and load data 20150722AWS July Webinar Series: Amazon redshift migration and load data 20150722
AWS July Webinar Series: Amazon redshift migration and load data 20150722Amazon Web Services
 
HPC Cluster Computing from 64 to 156,000 Cores 
HPC Cluster Computing from 64 to 156,000 Cores HPC Cluster Computing from 64 to 156,000 Cores 
HPC Cluster Computing from 64 to 156,000 Cores inside-BigData.com
 
Unlocking Open Data in the Cloud
Unlocking Open Data in the CloudUnlocking Open Data in the Cloud
Unlocking Open Data in the CloudAmazon Web Services
 

Viewers also liked (18)

Agile BI - Pop-up Loft Tel Aviv
Agile BI - Pop-up Loft Tel AvivAgile BI - Pop-up Loft Tel Aviv
Agile BI - Pop-up Loft Tel Aviv
 
AWS APAC Webinar Week - Understanding AWS Storage Options
AWS APAC Webinar Week - Understanding AWS Storage OptionsAWS APAC Webinar Week - Understanding AWS Storage Options
AWS APAC Webinar Week - Understanding AWS Storage Options
 
AWS March 2016 Webinar Series Getting Started with Serverless Architectures
AWS March 2016 Webinar Series   Getting Started with Serverless ArchitecturesAWS March 2016 Webinar Series   Getting Started with Serverless Architectures
AWS March 2016 Webinar Series Getting Started with Serverless Architectures
 
AWS Mobile Services & SDK Introduction & Demo
AWS Mobile Services & SDK Introduction & DemoAWS Mobile Services & SDK Introduction & Demo
AWS Mobile Services & SDK Introduction & Demo
 
Deep Dive: Hybrid Architectures
Deep Dive: Hybrid ArchitecturesDeep Dive: Hybrid Architectures
Deep Dive: Hybrid Architectures
 
The Pace of Innovation - Pop-up Loft Tel Aviv
The Pace of Innovation - Pop-up Loft Tel AvivThe Pace of Innovation - Pop-up Loft Tel Aviv
The Pace of Innovation - Pop-up Loft Tel Aviv
 
Compute Without Servers – Building Applications with AWS Lambda - Technical 301
Compute Without Servers – Building Applications with AWS Lambda - Technical 301Compute Without Servers – Building Applications with AWS Lambda - Technical 301
Compute Without Servers – Building Applications with AWS Lambda - Technical 301
 
(NET307) Pinterest: The road from EC2-Classic To EC2-VPC
(NET307) Pinterest: The road from EC2-Classic To EC2-VPC(NET307) Pinterest: The road from EC2-Classic To EC2-VPC
(NET307) Pinterest: The road from EC2-Classic To EC2-VPC
 
(DEV204) Building High-Performance Native Cloud Apps In C++
(DEV204) Building High-Performance Native Cloud Apps In C++(DEV204) Building High-Performance Native Cloud Apps In C++
(DEV204) Building High-Performance Native Cloud Apps In C++
 
Workshop: AWS Lamda Signal Corps vs Zombies
Workshop: AWS Lamda Signal Corps vs ZombiesWorkshop: AWS Lamda Signal Corps vs Zombies
Workshop: AWS Lamda Signal Corps vs Zombies
 
Security Day IAM Recommended Practices
Security Day IAM Recommended PracticesSecurity Day IAM Recommended Practices
Security Day IAM Recommended Practices
 
Ansible on aws - Pop-up Loft Tel Aviv
Ansible on aws - Pop-up Loft Tel AvivAnsible on aws - Pop-up Loft Tel Aviv
Ansible on aws - Pop-up Loft Tel Aviv
 
My First Big Data Application
My First Big Data ApplicationMy First Big Data Application
My First Big Data Application
 
Grid Computing for Financial Services
Grid Computing for Financial ServicesGrid Computing for Financial Services
Grid Computing for Financial Services
 
Getting Started with Amazon Enterprise Applications
Getting Started with Amazon Enterprise ApplicationsGetting Started with Amazon Enterprise Applications
Getting Started with Amazon Enterprise Applications
 
AWS July Webinar Series: Amazon redshift migration and load data 20150722
AWS July Webinar Series: Amazon redshift migration and load data 20150722AWS July Webinar Series: Amazon redshift migration and load data 20150722
AWS July Webinar Series: Amazon redshift migration and load data 20150722
 
HPC Cluster Computing from 64 to 156,000 Cores 
HPC Cluster Computing from 64 to 156,000 Cores HPC Cluster Computing from 64 to 156,000 Cores 
HPC Cluster Computing from 64 to 156,000 Cores 
 
Unlocking Open Data in the Cloud
Unlocking Open Data in the CloudUnlocking Open Data in the Cloud
Unlocking Open Data in the Cloud
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Recently uploaded

From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfIngrid Airi González
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Scott Andery
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 

Recently uploaded (20)

From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Generative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdfGenerative Artificial Intelligence: How generative AI works.pdf
Generative Artificial Intelligence: How generative AI works.pdf
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
Enhancing User Experience - Exploring the Latest Features of Tallyman Axis Lo...
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 

(CMP306) Dynamic, On-Demand Windows HPC Clusters On AWS

  • 1. © 2015, Amazon Web Services, Inc. or its Affiliates. All rights reserved. Timothy DiLauro, AWS Solutions Architect Julien Lépine, AWS Solutions Architect October 2015 CMP306 On-Demand Windows HPC on AWS Windows Clusters for Dynamic Needs
  • 2. What to Expect from the Session HPC on AWS AWS Architecture for Windows HPC AWS Architecture for HPC Best Practices for Windows HPC Demonstration
  • 4. Low cost with flexible pricing Efficient clusters Unlimited infrastructure Faster time to results Concurrent Clusters on-demand Increased collaboration Why AWS for HPC?
  • 5. Popular HPC workloads on AWS Genome processing Modeling and Simulation Government and Educational Research Monte Carlo Simulations Transcoding and Encoding Computational Chemistry
  • 6. Benefits of Agility Elastic Cloud-Based Resources Actual demand Resources scaled to demand Waste Customer Dissatisfaction Actual Demand Predicted Demand Rigid On-Premises Resources
  • 7. Cost Benefits of HPC in the Cloud Pay As You Go Model Use only what you need Multiple pricing models On-Premises Capital Expense Model High upfront capital cost High cost of ongoing support
  • 8. AWS Journey for HPC Customer Dev, Test, Eval True Production Mission Critical All-in Build new production apps Migrate production apps Build mission-critical apps Migrate mission-critical apps Development and test Eval and training Corporate standard “Cloud First”
  • 10. On-Demand HPC on AWS With AWS, deploy multiple clusters running at the same time and match the architectures to the jobs
  • 11. AWS Architecture for HPC Amazon Virtual Private Cloud Amazon Simple Storage Service Amazon Elastic Block Store Amazon Elastic Compute Cloud Amazon CloudWatch AWS CloudFormation Auto Scaling
  • 12. 2006 2007 2008 2009 2010 2011 2012-2013 2014 m1.small m1.xlarge m1.large m1.small m2.2xlarge m2.4xlarge c1.medium c1.xlarge m1.xlarge m1.large m1.small cc2.8xlarge cc1.4xlarge cg1.4xlarge t1.micro m2.xlarge m2.2xlarge m2.4xlarge c1.medium c1.xlarge m1.xlarge m1.large m1.small cr1.8xlarge hs1.8xlarge m3.xlarge m3.2xlarge hi1.4xlarge m1.medium cc2.8xlarge cg1.4xlarge t1.micro m2.xlarge m2.2xlarge m2.4xlarge c1.medium c1.xlarge m1.xlarge m1.large m1.small cc1.4xlarge cg1.4xlarge t1.micro m2.xlarge m2.2xlarge m2.4xlarge c1.medium c1.xlarge m1.xlarge m1.large m1.small t2.micro t2.small t2.medium t1.micro hs1.8xlarge m3.xlarge m3.2xlarge hi1.4xlarge m1.medium cc2.8xlarge cr1.8xlarge cg1.4xlarge m2.xlarge m2.2xlarge m2.4xlarge c1.medium c1.xlarge m1.xlarge m1.large m1.small c1.medium c1.xlarge m1.xlarge m1.large m1.small new existing Amazon Elastic Compute Cloud g2.2xlarge hs1.xlarge hs1.2xlarge hs1.4xlarge c3.large c3.xlarge c3.2xlarge c3.4xlarge c3.8xlarge m3.medium m3.large i2.large i2.xlarge i2.4xlarge i2.8xlarge r3.large r3.xlarge r3.2xlarge r3.4xlarge r3.8xlarge Continuing to enable customer choice and right sizing of clusters m4.large m4.xlarge m4.2xlarge d2.xlarge d2.2xlarge d2.4xlarge d2.8xlarge t2.micro t2.small t2.medium t2.large t1.micro hs1.8xlarge m3.xlarge m3.2xlarge hi1.4xlarge m1.medium cc2.8xlarge cr1.8xlarge cg1.4xlarge m2.xlarge m2.2xlarge m2.4xlarge c1.medium c1.xlarge m1.xlarge m1.large m1.small m4.4xlarge m4.10xlarge c4.xlarge c4.2xlarge c4.4xlarge c4.8xlarge g2.8xlarge g2.2xlarge hs1.xlarge hs1.2xlarge hs1.4xlarge c3.large c3.xlarge c3.2xlarge c3.4xlarge c3.8xlarge m3.medium m3.large i2.large i2.xlarge i2.4xlarge i2.8xlarge r3.large r3.xlarge r3.2xlarge r3.4xlarge r3.8xlarge 2015
  • 13. Auto Scaling and Amazon CloudWatch Match demands of cluster queue with appropriate compute needs CloudWatch Auto Scaling group Windows HPC Job Manager
  • 14. Amazon Elastic Block Store • Designed for five nines of availability • Attaches to Amazon EC2 within the same Availability Zone • Point-in-time snapshots to Amazon S3 • Checkbox enabled encryption Magnetic General Purpose (SSD) Provisioned IOPS (SSD) Volume types When performance matters, use SSD- backed volumes! Network attached persistent block storage volumes for Amazon EC2
  • 15. Amazon EBS • Default 30 GB volume • Gets initial I/O credit of 5.4M • Burst for up to 30 mins @ 3000 IOPS • Accumulate 90 I/O credits/second Windows Boot Volume Decrease launch time of instances by leveraging General Purpose SSD
  • 16. Amazon Simple Storage Service Store input and result datasets for dynamic and transitive Windows HPC clusters Redundancy Durability: designed for 99.999999999% Availability: designed for 99.9% Capacity Consumption-based storage model Virtually unlimited capacity Security Encryption in Transit: HTTPS/TLS Encryption at Rest: SSE, SSE-C, SSE-KMS Ease of use Storage Classes: Standard, RRS, Glacier Lifecycle Policies: archive, expiration
  • 17. Amazon S3 Copy data to Amazon S3 and enable SSE Write-S3Object –BucketName mybucket -Folder .Scripts -KeyPrefix SampleScripts -ServerSideEncryption Copy data from Amazon S3 to a local folder Read-S3Object –BucketName mybucket -KeyPrefix SampleScripts –Folder . • Bucket: mybucket • Keyname Space: SampleScripts • Local Folder: .Scripts Migrate data to AWS and Windows HPC clusters with AWS Tools for PowerShell
  • 18. AWS CloudFormation • Create templates to describe the AWS resources used to run your application • Provision identical copies of a stack • Templates can be stored in a source control system • Track all changes made to your infrastructure stack • Modify and update resources in a controlled and predictable way • Just choose what resources and configurations you need • Customize your template via parameters Consistently and easily deploy Windows HPC clusters based on workflow needs Templated resource provisioning Infrastructure as code Declarative and flexible
  • 19. AWS Architecture for HPC • Users directory • Bastion host • Head node • Compute nodes Core Infrastructure Cluster Infrastructure Amazon VPC Users Bastion Core Head Compute Compute Compute Compute Compute Compute Compute Compute Cluster
  • 20. AWS Architecture for HPC Hybrid or “burst” All-in AWS Choose the right deployment architecture for the use case Core infrastructure: Users directory Bastion host On-premises AWS AWS Directory Service Amazon EC2 Cluster infrastructure: Head node Compute node Storage AWS AWS On-premises/AWS Amazon EC2 Amazon EC2 Amazon S3 User workstations On-premises Amazon WorkSpaces
  • 21. AWS Architecture for HPC “Burst” to virtually unlimited compute capacity in AWS Amazon VPC Users Bastion Core Head Compute Compute Compute Compute Compute Compute Compute Compute ClusterWorkstations Head HPC Users CoreCluster On-Premise HPC HPC HPC
  • 22. AWS Architecture for HPC Deploy users, infrastructure, and cluster all in AWS Amazon VPC Core Head Compute Compute Compute Compute Compute Compute Compute Compute ClusterWorkstations Users Bastion
  • 23. AWS Architecture for Windows HPC
  • 24. Windows Server on AWS Easy Licensing OS $/Hr BYOL Optimized AWS Software for Windows EC2Config, drivers Experience October 2008 Every use case Every industry OS Choice 2003R2 2008, 2008R2 2012, 2012R2 Microsoft Portfolio SQL Server SharePoint Exchange, Lync Customize Systems 50+ EC2 instances 32, 64 bits CPU, GPU
  • 25. AWS Architecture for Windows HPC Networking best practices for Windows HPC clusters • Network Design- Leverage both public and private subnets, manage sizing • Availability – Use multi-AZ design • Access Control – use VPC endpoint and NAT for external accesses Availability Zone A Availability Zone B Private Subnet 10.0.10.0/24 Public Subnet 10.0.0.0/24 Core Private Subnet 2 10.0.11.0/24 VPC Endpoint NAT Public Subnet 10.0.1.0/24 NAT
  • 26. AWS Architecture for Windows HPC • Domain Controller – Highly available extension of your existing environment • Remote Desktop Gateway - Increase security posture Core infrastructure best practices for Windows HPC clusters Availability Zone A Availability Zone B Private Subnet 10.0.10.0/24 Public Subnet 10.0.0.0/24 DC Core Private Subnet 2 10.0.11.0/24 DC RDGW Public Subnet 10.0.1.0/24
  • 27. AWS Architecture for Windows HPC • Head Node – Size independent of Compute Node, General Purpose family • Compute Nodes – use Auto Scaling groups and cluster instances • S3 Bucket – Persistent, secure, available storage of cluster input and results Cluster infrastructure best practices for Windows HPC clusters Availability Zone B Availability Zone A Private Subnet 10.0.10.0/24 Public Subnet 10.0.0.0/24 Core Private Subnet 2 10.0.11.0/24 Head Compute Compute Compute Compute Compute Compute Compute Compute Cluster Public Subnet 10.0.1.0/24 S3 Bucket VPC Endpoint
  • 28. AWS Architecture for Windows HPC All at once, complete Windows HPC infrastructure on AWS Availability Zone B Availability Zone A Private Subnet 10.0.10.0/24 Public Subnet 10.0.0.0/24 DC S3 Bucket Core Private Subnet 2 10.0.11.0/24 DC Head Compute Compute Compute Compute Compute Compute Compute Compute Cluster VPC Endpoint RDGW NAT Public Subnet 10.0.1.0/24 NAT
  • 29. AWS Architecture for Windows HPC Launch multiple clusters right-sized to complete work in amount of time specified Private Subnet 10.0.10.0/24 Public Subnet 10.0.0.0/24 DC Core Private Subnet 2 10.0.11.0/24 DC Head Compute Compute Compute Compute Compute Compute Compute Compute Cluster Head Compute Compute Compute Compute Compute Compute Compute Compute Head Compute Compute Compute Compute Compute Compute Compute Compute Compute Compute RDGW NAT Public Subnet 10.0.1.0/24 NAT Availability Zone A Availability Zone B S3 Bucket VPC Endpoint
  • 30. Best Practices for Windows HPC
  • 31. Secure Windows HPC Workloads on AWS AWS Resource Access: Enable access to AWS resource through policies in IAM roles Encryption at Rest: Enable encryption on EBS volumes and specify server side encryption for objects in Amazon S3 Create private access to input and output results stored in Amazon S3 via VPC endpoints Ensure auditability of AWS account by enabling AWS CloudTrail Leverage native AWS security features to enhance the security posture of Windows HPC
  • 32. Optimized network for Windows HPC Enhanced Networking: SR-IOV feature provides higher PPS performance, lower latencies, and very low network jitter Placement Groups: All instances get low latency, full bisection, 10Gbps bandwidth between instances EBS Optimization: Get up to 4000Mbps additional dedicated throughput dedicated to your storage needs AWS PV Drivers / Intel Drivers: Make sure you stay current with the latest Get the most of AWS networking for your HPC workloads
  • 33. Optimized processing with Windows HPC Hyper-threading: Most current generation AWS instances provide hyper-threading, keep it or deactivate it based on your needs Turbo Boost: Latest generation of instances leave you control C- state and P-state registers for your processors The right instance: Choose your constraints (price, CPU, GPU, RAM, network) and get the instance type that fits your use case The right storage: Choose the amount and support of instance storage or Amazon EBS storage required, and leverage storage services such as Amazon S3 Get the most of your instances for your HPC workloads
  • 34. Automated Windows HPC computing Windows PowerShell®: You can get all the installation and configuration of the instances done automatically AWS Tools for Windows PowerShell: Your cluster can become aware of the infrastructure it is running on Auto Scaling: Automate provisioning and scaling of your cluster to have your workloads finished when you need them AWS CloudFormation: Deploy your clusters in a few clicks, create test clusters in minutes Get your cluster as code, running in minutes from scratch
  • 36. Windows HPC AWS CloudFormation Template Enable automated deployments of clusters with pre-built template Amazon VPC DC RDGW Core Head Compute Compute Compute Compute Compute Compute Compute Compute Cluster
  • 37. AWS CloudFormation Templates: Prerequisites Things to do before starting the template Select your region and base image • VPC + Subnet: Just input selected CIDR • Instance Types: for all instances • (Optional) Placement Group: Create a VPC placement group Prepare installation media then snapshot • Download Microsoft HPC Pack and unzip to HPCPack2012R2-Full • Extract SQL Server installation to SQLInstall • Download Intel SR-IOV drivers and extract to PROWinx64 • Download latest AWS PV drivers and extract to AWSPVDriverSetup Select installation configuration: • Define domain configuration and credentials
  • 38. AWS CloudFormation Template: Core Building the core Windows infrastructure Base Network • VPC + Public Subnet: Select your CIDR • DHCP Option Set: Configured to use DC • Security Groups: For bastion and cluster Core Infrastructure: • Domain Controller in new forest • Remote Desktop Bastion Host (outside of domain) • Domain User “Join Computer to Domain” privileges
  • 39. AWS CloudFormation Template: Cluster Building the Microsoft HPC cluster on AWS Head-Node • Multi-role: database, HPC Head node, Share • Monitored: Amazon CloudWatch Custom metrics Compute Nodes: • Automated: Automatic configuration to join the cluster • Scalable: Auto Scaling group resizing the cluster based on load • Up-to-date: auto upgrade of AWS and Intel Drivers
  • 40. Windows HPC AWS CloudFormation Template In < 30 minutes, your cluster will be ready to accept jobs.
  • 41. Getting Started Collateral QwikLAB: Launching Microsoft HPC Pack on AWS: https://www.qwiklab.com/focuses/preview/1604?search=19103 Reference CloudFormation Template: https://github.com/awslabs/aws-cfn-windows-hpc--template