SlideShare une entreprise Scribd logo
1  sur  32
Télécharger pour lire hors ligne
How to get the most out of your AWS Redshift
Investment, while keeping cost down
WEBINAR SERIES : AWS OPTIMIZATION
Agilisium Innovation Labs
• Tens of thousands of customers and growing
• 3x faster than other CDWs
• 200+ new features in last 18 months
AWS Redshift : A Shift towards the Future
• Keep up with the rapid pace of innovation
• Lack of time to experiment
• Extend knowledge on best practices
But what seems to be the challenge ?
All above have impeded organizations ability to extract maximum value from
their existing Redshift investments
In the next 30-35 mins…
• Key design/architectural considerations of AWS Redshift
• Strategies to optimize AWS Redshift for Cost & Performance
• Success Story : Reducing Redshift run cost by 40%
• How we can help you
What we would like to talk about today
Agilisium – Overview
U.S (60+) : Los Angeles(HQ), Chicago, Texas with global presence in
India (250+), Canada, Costa Rica, Netherlands and UK (30+)
We are a Big Data and Analytics company with clear focus
on helping organizations take the
“Data-to-Insights-Leap”
Our Data & Analytics Customers
Our Redshift Experience
400-level AWS ExpertsProven Expertise
Top 3 AWS Redshift Competency
Partner in the U.S with razor focus
on AWS Data & Analytics solutions
Demonstrated Capability
15+ PB migrated to AWS through
$ 50 MN worth of successful Big
Data Analytics projects
55+ AWS Certified Experts. Our SAs
are regular attendees of AOD
training by Redshift Product team
MEET THE SPEAKERS
Jay Palaniappan
CTO & Head of Innovation Labs
Smitha Basavaraju
Big Data Architect
Arun Chinnadurai
Associate Director – BD
shukvina@amazon.com
AWS Redshift – Key Design
Considerations
AWS Redshift – Well Architected
Reliability
• High Availability
• Disaster Recovery & Backup
Operational Excellence
• Automation of operations – CI/CD
• Centralized monitoring and logs
• Learn from operational events and failures
Cost Optimization
• Understand Consumption
• Right sizing & Pricing strategies
Performance Efficiency
• Measure Performance of workloads
• Optimize
• Scale on Demand
Security
• Protect data in transit and at rest
• Strong identity foundation
• Enable monitoring and auditing
Optimization Strategies
Optimizing Strategy
Impact
Effort/ Complexity
2
Pause &
Resume
1
6
4
5
3
Reserved
Instances
Elastic Resize
Concurrency
Scaling
Moving to RA3
Right Sizing
7
Design &
Architecture
1. Reserved Instances
Reserved Instance
Immediate Low Up to 70%
Cost Savings
Reserved Instances :
Duration- 1Yr / 3Yr
Payment Option: No Upfront
Partial Upfront
All Upfront
2. Pause & Resume
Pause & Resume
Immediate Low Up to 50%
Cost Savings
Pause nonproduction instances
Pay only for storage
Applicable only for on-demand instances
3. Elastic Resize
Elastic Resize
Immediate Low Surge in Data/
Performance
Scales redshift clusters up and down clusters in minutes
Automate cluster resize on predictable loads
Optimize cost and plan for capacity
Schedule cluster resize using management console or API
4. Concurrency Scaling
Concurrency Scaling
Immediate Low Scalable
capacity
Automatically adds transient clusters
Serves spike in concurrent requests
For 24hrs of cluster in use, 1 hr. of concurrency scaling is free
Ability to set usage limit
5. Moving to RA3 Instance
RA3
Immediate Zero 2x performance
uplift | 2x
storage
Scale data warehouse based on workload and scale on peak
demand
Pay separately compute and storage independently
2X performance and 2X Storage capacity in comparison to
DS2.XLarge
6. Right Sizing
• Instance Types
• Dense Compute (DC2)
• Dense Storage (DS2)
• RA3
• Sizing
• Size based on workload: CPU, disk, I/O
• Scale up by adding nodes to check
linear performance
• Move to Higher instance groups
7. Table Design Considerations
Sort Key Column EncodingDistribution Key
• ANALYZE COMPRESSION
• Compress all columns except
for first sort key column
• AZ64 is new encoding
• Improves performance 2X-4X by
reducing I/O
• Use PG_TABLE_DEF
• Zone maps stores min and
max values of block
• Order columns by low to high
cardinality
• No of Sort Columns < 4
• Interleaved Sort key– BE
CAUTIOUS
• More columns in interleaved
sort key = Longer Vacuum
• Use STL_TABLE_INFO
• Distributions keys should
have high cardinality to
avoid data skew and “hot”
nodes
• Use Date Columns only if
cardinality is high
• DISTSTYLE AUTO is a great
go-to for all tables < ~5
million rows.
Moving Towards AUTO Management
Table Stats
WLM
• Ensure that AUTO ANALYZE , AUTO SORT &
AUTO VACUUM is enabled
• INTERLEAVED SORT KEYS - Run
VACUUM REINDEX command scheduled
• Use STL_TABLE_INFO for stats
• Use Auto WLM with SQA Enabled
Manual WLM
• Number of queues < 4
• Use QMR to monitor performance from bad queries
• Max concurrency level for all user <=15
• Leave ~5% of memory unallocated
Success Story
AWS WAF-based Redshift assessment for M&E Giant
Technologies:
S3, Redshift, Redshift Spectrum
Source System:
25 TB
Team:
Cloud Solutions Architect, Sr. Big
Data Architect
Fast FactsSolution
• Comprehensive assessment of the candidate Redshift workload across 5
pillars of the AWS WAF, using Agilisium’s Redshift Inspector
• Several observations across all 5 pillars were made based on Findings &
Recommendations report (Redshift Inspector) and workshop
Recommendations
• Security : Database Encryption, Redshift in private cluster, Port
Obfuscation, S3 VPC endpoint
• Performance : Time series data model, Concurrency Scaling, Limited use
of Interleaved sort keys, right-size column width
• Cost Optimization : Data placement, RA3, Deletion of redundant backup
& Reserved Instances
• Reliability : Cross-region backup, Avoid Temp & Staging backup
• Operational Excellence : Audit Logging, Cloud Watch Alerts and Auto
Update
Client requested a holistic assessment of their 25TB Redshift workload to identify avenues for improvement across all dimensions
Objective
Value delivered
30% faster
query
performance
More secured and resilient
Redshift workload
40% Cost
reduction
How we can help?
How we can help?
• AWS WAF-based Redshift
Assessment
✓Findings &
Recommendations Report
✓Remediation Plan (Pre-
cursor for next phase –
Optimize)
• Performance Optimization
(Compelling pricing options)
• Cost Optimization (Outcome-
based pricing)
• Extend customer’s knowledge
on new features and best
practices
• Custom trend report with top
10 metrics for ongoing
maintenance
Diagnose – 3 Days Optimize – 2+ Weeks Maintain – Quarterly
Diagnose – AWS WAF-based Redshift Assessment – 3 Days
Customer Contribution
Identification of business-critical
Redshift workload
Availability of Business & Tech SMEs
tied to Redshift workload for
workshop
Availability of Client DBA to run
Redshift diagnostic queries
Read-only access to your Redshift
cluster for additional investigation,
if any
Automated fact-
based assessment
Rich Corpus of Best practices
Agilisium’s Redshift Inspector
Holistic 60-point check of your Redshift
workload across 5 pillars of AWS WAF
Toolkit is based on 100+ Redshift best practices identified
from migrating 15+ PB to AWS in the last 7+ years
AWS WAF-based
Assessment – Deliverables
Findings & Recommendations Report –
Get accurate observations by criticality
(Critical, Needs Improvement, Well-
Architected)
Actionable Remediation Plan – Plan to
implement top observations from the
Findings & Recommendations report.
Clients can choose to implement the plan
internally or involve Agilisium
Automated Redshift WAF-based Assessment Toolkit
Agilisium’s Redshift Inspector – Key facets covered
Cost Optimization
• Right-fit cluster size
• On-demand to restore snapshots
• Underutilized/unused clusters
• Choice of Reserved Instances (stand vs convertible)
• Hot/cold/warm data strategy
• Intra-region Data Transfer
• Snapshot lifecycle management
Performance Efficiency
• Compression/Encoding of large datasets to improve network throughput
• Avoid Data Skew through right Distribution & Sort Keys
• Up-to-date stats via ANALYZE & VACUUM
• VACUUM strategies (Pre-sort & load)
• Data loads Optimize strategies (COPY commands)
• Track query performance (Integrity constraints as hints)
• Auto WLM vs Custom WLM
• Time-series data model for larger datasets
Security
• SSO & IAM Federations
• Ingress policy – Port 5832 open for internal IPs only
• All traffic routed via private subnets/VPCs
• Encrypt data-at-rest – KMS/HSM
• Encrypt data-in-motion – SSL/TLS
Reliability
• Multi-region cluster setup
• SLA-based manual backup – Restore data lost due to accidental
deletion
• Cross-region backups for HA
• Continuous monitoring of key metrics for HA (Disk utilization,
ReadIOPS, WriteOPS, CPU utilization etc.)
• Redshift user activity logged for RCA
Operational Excellence
• Deferred maintenance
• Redshift Advisor recommendations
• RA3 – Intelligent Data offload
Findings & Recommendations – Sample Report
Findings & Recommendations – Sample Report (contd..)
Findings & Recommendations – Sample Report (contd..)
Contact us at
sales@agilisium.com
Questions?
Thank You

Contenu connexe

Tendances

Analytics in a Day Virtual Workshop
Analytics in a Day Virtual WorkshopAnalytics in a Day Virtual Workshop
Analytics in a Day Virtual Workshop
CCG
 
Empowering Real Time Patient Care Through Spark Streaming
Empowering Real Time Patient Care Through Spark StreamingEmpowering Real Time Patient Care Through Spark Streaming
Empowering Real Time Patient Care Through Spark Streaming
Databricks
 

Tendances (20)

Bad Data is Polluting Big Data
Bad Data is Polluting Big DataBad Data is Polluting Big Data
Bad Data is Polluting Big Data
 
Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...
Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...
Data Mesh in Practice: How Europe’s Leading Online Platform for Fashion Goes ...
 
Benefits of the Azure Cloud
Benefits of the Azure CloudBenefits of the Azure Cloud
Benefits of the Azure Cloud
 
Analyzing Billions of Data Rows with Alteryx, Amazon Redshift, and Tableau
Analyzing Billions of Data Rows with Alteryx, Amazon Redshift, and TableauAnalyzing Billions of Data Rows with Alteryx, Amazon Redshift, and Tableau
Analyzing Billions of Data Rows with Alteryx, Amazon Redshift, and Tableau
 
Altis Webinar: Use Cases For The Modern Data Platform
Altis Webinar: Use Cases For The Modern Data PlatformAltis Webinar: Use Cases For The Modern Data Platform
Altis Webinar: Use Cases For The Modern Data Platform
 
Big Data Analytics: Reference Architectures and Case Studies by Serhiy Haziye...
Big Data Analytics: Reference Architectures and Case Studies by Serhiy Haziye...Big Data Analytics: Reference Architectures and Case Studies by Serhiy Haziye...
Big Data Analytics: Reference Architectures and Case Studies by Serhiy Haziye...
 
Analytics in a Day Virtual Workshop
Analytics in a Day Virtual WorkshopAnalytics in a Day Virtual Workshop
Analytics in a Day Virtual Workshop
 
Empowering Real Time Patient Care Through Spark Streaming
Empowering Real Time Patient Care Through Spark StreamingEmpowering Real Time Patient Care Through Spark Streaming
Empowering Real Time Patient Care Through Spark Streaming
 
How to build a successful Data Lake
How to build a successful Data LakeHow to build a successful Data Lake
How to build a successful Data Lake
 
Creating Agility Through Data Governance and Self-service Integration with S...
Creating Agility Through Data Governance and Self-service Integration with S...Creating Agility Through Data Governance and Self-service Integration with S...
Creating Agility Through Data Governance and Self-service Integration with S...
 
End to End Supply Chain Control Tower
End to End Supply Chain Control TowerEnd to End Supply Chain Control Tower
End to End Supply Chain Control Tower
 
Modern Applications for Practical Business Transformation | Inovar Consulting
Modern Applications for Practical Business Transformation | Inovar ConsultingModern Applications for Practical Business Transformation | Inovar Consulting
Modern Applications for Practical Business Transformation | Inovar Consulting
 
Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...
Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...
Webinar: DataStax and Microsoft Azure: Empowering the Right-Now Enterprise wi...
 
InfoTrellis Corporate
InfoTrellis CorporateInfoTrellis Corporate
InfoTrellis Corporate
 
Designing a Distributed Cloud Database for Dummies
Designing a Distributed Cloud Database for DummiesDesigning a Distributed Cloud Database for Dummies
Designing a Distributed Cloud Database for Dummies
 
What’s New in Syncsort’s Trillium Software System (TSS) 15.7
What’s New in Syncsort’s Trillium Software System (TSS) 15.7What’s New in Syncsort’s Trillium Software System (TSS) 15.7
What’s New in Syncsort’s Trillium Software System (TSS) 15.7
 
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...
Citizens Bank: Data Lake Implementation – Selecting BigInsights ViON Spark/Ha...
 
Talend MDM
Talend MDMTalend MDM
Talend MDM
 
Mapping Manager
Mapping ManagerMapping Manager
Mapping Manager
 
Data Services and the Modern Data Ecosystem (ASEAN)
Data Services and the Modern Data Ecosystem (ASEAN)Data Services and the Modern Data Ecosystem (ASEAN)
Data Services and the Modern Data Ecosystem (ASEAN)
 

Similaire à Get the most out of your AWS Redshift investment while keeping cost down

AWS Summit Tel Aviv - Enterprise Track - Cost Optimization & TCO
AWS Summit Tel Aviv - Enterprise Track - Cost Optimization & TCOAWS Summit Tel Aviv - Enterprise Track - Cost Optimization & TCO
AWS Summit Tel Aviv - Enterprise Track - Cost Optimization & TCO
Amazon Web Services
 

Similaire à Get the most out of your AWS Redshift investment while keeping cost down (20)

AWS re:Invent 2016: Best Practices for Data Warehousing with Amazon Redshift ...
AWS re:Invent 2016: Best Practices for Data Warehousing with Amazon Redshift ...AWS re:Invent 2016: Best Practices for Data Warehousing with Amazon Redshift ...
AWS re:Invent 2016: Best Practices for Data Warehousing with Amazon Redshift ...
 
FSI201 FINRA’s Managed Data Lake – Next Gen Analytics in the Cloud
FSI201 FINRA’s Managed Data Lake – Next Gen Analytics in the CloudFSI201 FINRA’s Managed Data Lake – Next Gen Analytics in the Cloud
FSI201 FINRA’s Managed Data Lake – Next Gen Analytics in the Cloud
 
Building and Deploying Large Scale SSRS using Lessons Learned from Customer D...
Building and Deploying Large Scale SSRS using Lessons Learned from Customer D...Building and Deploying Large Scale SSRS using Lessons Learned from Customer D...
Building and Deploying Large Scale SSRS using Lessons Learned from Customer D...
 
AWS Summit Tel Aviv - Enterprise Track - Cost Optimization & TCO
AWS Summit Tel Aviv - Enterprise Track - Cost Optimization & TCOAWS Summit Tel Aviv - Enterprise Track - Cost Optimization & TCO
AWS Summit Tel Aviv - Enterprise Track - Cost Optimization & TCO
 
AWS Summit Berlin 2013 - Optimizing your AWS applications and usage to reduce...
AWS Summit Berlin 2013 - Optimizing your AWS applications and usage to reduce...AWS Summit Berlin 2013 - Optimizing your AWS applications and usage to reduce...
AWS Summit Berlin 2013 - Optimizing your AWS applications and usage to reduce...
 
Best Practices for Supercharging Cloud Analytics on Amazon Redshift
Best Practices for Supercharging Cloud Analytics on Amazon RedshiftBest Practices for Supercharging Cloud Analytics on Amazon Redshift
Best Practices for Supercharging Cloud Analytics on Amazon Redshift
 
Why Scale Matters and How the Cloud Really is Different
Why Scale Matters and How the Cloud Really is Different Why Scale Matters and How the Cloud Really is Different
Why Scale Matters and How the Cloud Really is Different
 
Optimizing your cloud
Optimizing your cloudOptimizing your cloud
Optimizing your cloud
 
Cost Optimization on AWS
Cost Optimization on AWSCost Optimization on AWS
Cost Optimization on AWS
 
Cost Optimization on AWS
Cost Optimization on AWSCost Optimization on AWS
Cost Optimization on AWS
 
AWS Summit 2013 | Auckland - Building Web Scale Applications with AWS
AWS Summit 2013 | Auckland - Building Web Scale Applications with AWSAWS Summit 2013 | Auckland - Building Web Scale Applications with AWS
AWS Summit 2013 | Auckland - Building Web Scale Applications with AWS
 
Database and Analytics on the AWS Cloud
Database and Analytics on the AWS CloudDatabase and Analytics on the AWS Cloud
Database and Analytics on the AWS Cloud
 
Amazon Redshift with Full 360 Inc.
Amazon Redshift with Full 360 Inc.Amazon Redshift with Full 360 Inc.
Amazon Redshift with Full 360 Inc.
 
Running Lean Architectures: How to Optimize for Cost Efficiency
Running Lean Architectures: How to Optimize for Cost Efficiency Running Lean Architectures: How to Optimize for Cost Efficiency
Running Lean Architectures: How to Optimize for Cost Efficiency
 
Managing Performance Globally with MySQL
Managing Performance Globally with MySQLManaging Performance Globally with MySQL
Managing Performance Globally with MySQL
 
How to Set Up a Cloud Cost Optimization Process for your Enterprise
How to Set Up a Cloud Cost Optimization Process for your EnterpriseHow to Set Up a Cloud Cost Optimization Process for your Enterprise
How to Set Up a Cloud Cost Optimization Process for your Enterprise
 
Data warehousing in the era of Big Data: Deep Dive into Amazon Redshift
Data warehousing in the era of Big Data: Deep Dive into Amazon RedshiftData warehousing in the era of Big Data: Deep Dive into Amazon Redshift
Data warehousing in the era of Big Data: Deep Dive into Amazon Redshift
 
Benchmark Showdown: Which Relational Database is the Fastest on AWS?
Benchmark Showdown: Which Relational Database is the Fastest on AWS?Benchmark Showdown: Which Relational Database is the Fastest on AWS?
Benchmark Showdown: Which Relational Database is the Fastest on AWS?
 
AWS re:Invent 2016: Billions of Rows Transformed in Record Time Using Matilli...
AWS re:Invent 2016: Billions of Rows Transformed in Record Time Using Matilli...AWS re:Invent 2016: Billions of Rows Transformed in Record Time Using Matilli...
AWS re:Invent 2016: Billions of Rows Transformed in Record Time Using Matilli...
 
MariaDB AX: Solución analítica con ColumnStore
MariaDB AX: Solución analítica con ColumnStoreMariaDB AX: Solución analítica con ColumnStore
MariaDB AX: Solución analítica con ColumnStore
 

Plus de Agilisium Consulting

Plus de Agilisium Consulting (8)

BI & Analytics
BI & Analytics BI & Analytics
BI & Analytics
 
Big data services slideshare - agilisium 2.0 - v1.0
Big data services   slideshare - agilisium 2.0 - v1.0Big data services   slideshare - agilisium 2.0 - v1.0
Big data services slideshare - agilisium 2.0 - v1.0
 
Big data governance slideshare - v0.5
Big data governance   slideshare - v0.5Big data governance   slideshare - v0.5
Big data governance slideshare - v0.5
 
Big data engineering slideshare - v0.4
Big data engineering   slideshare - v0.4Big data engineering   slideshare - v0.4
Big data engineering slideshare - v0.4
 
Big data consulting slideshare - v0.4
Big data consulting   slideshare - v0.4Big data consulting   slideshare - v0.4
Big data consulting slideshare - v0.4
 
Why Data Lake should be the foundation of Enterprise Data Architecture
Why Data Lake should be the foundation of Enterprise Data ArchitectureWhy Data Lake should be the foundation of Enterprise Data Architecture
Why Data Lake should be the foundation of Enterprise Data Architecture
 
Exploiting Data Lakes: Architecture, Capabilities & Future
Exploiting Data Lakes: Architecture, Capabilities & FutureExploiting Data Lakes: Architecture, Capabilities & Future
Exploiting Data Lakes: Architecture, Capabilities & Future
 
Extending Analytic Reach
Extending Analytic ReachExtending Analytic Reach
Extending Analytic Reach
 

Dernier

In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
ahmedjiabur940
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Klinik kandungan
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Bertram Ludäscher
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
nirzagarg
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
gajnagarg
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
gajnagarg
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Riyadh +966572737505 get cytotec
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
HyderabadDolls
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
nirzagarg
 

Dernier (20)

In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi ArabiaIn Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
In Riyadh ((+919101817206)) Cytotec kit @ Abortion Pills Saudi Arabia
 
7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt7. Epi of Chronic respiratory diseases.ppt
7. Epi of Chronic respiratory diseases.ppt
 
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Surabaya ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
Gulbai Tekra * Cheap Call Girls In Ahmedabad Phone No 8005736733 Elite Escort...
 
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
Nirala Nagar / Cheap Call Girls In Lucknow Phone No 9548273370 Elite Escort S...
 
Kings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about themKings of Saudi Arabia, information about them
Kings of Saudi Arabia, information about them
 
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
Jual obat aborsi Bandung ( 085657271886 ) Cytote pil telat bulan penggugur ka...
 
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...Reconciling Conflicting Data Curation Actions:  Transparency Through Argument...
Reconciling Conflicting Data Curation Actions: Transparency Through Argument...
 
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
5CL-ADBA,5cladba, Chinese supplier, safety is guaranteed
 
Discover Why Less is More in B2B Research
Discover Why Less is More in B2B ResearchDiscover Why Less is More in B2B Research
Discover Why Less is More in B2B Research
 
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In Begusarai [ 7014168258 ] Call Me For Genuine Models...
 
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...Fun all Day Call Girls in Jaipur   9332606886  High Profile Call Girls You Ca...
Fun all Day Call Girls in Jaipur 9332606886 High Profile Call Girls You Ca...
 
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
Top profile Call Girls In bhavnagar [ 7014168258 ] Call Me For Genuine Models...
 
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
Digital Advertising Lecture for Advanced Digital & Social Media Strategy at U...
 
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
Top profile Call Girls In Vadodara [ 7014168258 ] Call Me For Genuine Models ...
 
Abortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get CytotecAbortion pills in Jeddah | +966572737505 | Get Cytotec
Abortion pills in Jeddah | +966572737505 | Get Cytotec
 
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
Sonagachi * best call girls in Kolkata | ₹,9500 Pay Cash 8005736733 Free Home...
 
Statistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbersStatistics notes ,it includes mean to index numbers
Statistics notes ,it includes mean to index numbers
 
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
Top profile Call Girls In Satna [ 7014168258 ] Call Me For Genuine Models We ...
 
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
Charbagh + Female Escorts Service in Lucknow | Starting ₹,5K To @25k with A/C...
 

Get the most out of your AWS Redshift investment while keeping cost down

  • 1. How to get the most out of your AWS Redshift Investment, while keeping cost down WEBINAR SERIES : AWS OPTIMIZATION Agilisium Innovation Labs
  • 2. • Tens of thousands of customers and growing • 3x faster than other CDWs • 200+ new features in last 18 months AWS Redshift : A Shift towards the Future
  • 3. • Keep up with the rapid pace of innovation • Lack of time to experiment • Extend knowledge on best practices But what seems to be the challenge ? All above have impeded organizations ability to extract maximum value from their existing Redshift investments
  • 4. In the next 30-35 mins… • Key design/architectural considerations of AWS Redshift • Strategies to optimize AWS Redshift for Cost & Performance • Success Story : Reducing Redshift run cost by 40% • How we can help you What we would like to talk about today
  • 5. Agilisium – Overview U.S (60+) : Los Angeles(HQ), Chicago, Texas with global presence in India (250+), Canada, Costa Rica, Netherlands and UK (30+) We are a Big Data and Analytics company with clear focus on helping organizations take the “Data-to-Insights-Leap”
  • 6. Our Data & Analytics Customers
  • 7. Our Redshift Experience 400-level AWS ExpertsProven Expertise Top 3 AWS Redshift Competency Partner in the U.S with razor focus on AWS Data & Analytics solutions Demonstrated Capability 15+ PB migrated to AWS through $ 50 MN worth of successful Big Data Analytics projects 55+ AWS Certified Experts. Our SAs are regular attendees of AOD training by Redshift Product team
  • 8. MEET THE SPEAKERS Jay Palaniappan CTO & Head of Innovation Labs Smitha Basavaraju Big Data Architect Arun Chinnadurai Associate Director – BD shukvina@amazon.com
  • 9. AWS Redshift – Key Design Considerations
  • 10. AWS Redshift – Well Architected Reliability • High Availability • Disaster Recovery & Backup Operational Excellence • Automation of operations – CI/CD • Centralized monitoring and logs • Learn from operational events and failures Cost Optimization • Understand Consumption • Right sizing & Pricing strategies Performance Efficiency • Measure Performance of workloads • Optimize • Scale on Demand Security • Protect data in transit and at rest • Strong identity foundation • Enable monitoring and auditing
  • 12. Optimizing Strategy Impact Effort/ Complexity 2 Pause & Resume 1 6 4 5 3 Reserved Instances Elastic Resize Concurrency Scaling Moving to RA3 Right Sizing 7 Design & Architecture
  • 13. 1. Reserved Instances Reserved Instance Immediate Low Up to 70% Cost Savings Reserved Instances : Duration- 1Yr / 3Yr Payment Option: No Upfront Partial Upfront All Upfront
  • 14. 2. Pause & Resume Pause & Resume Immediate Low Up to 50% Cost Savings Pause nonproduction instances Pay only for storage Applicable only for on-demand instances
  • 15. 3. Elastic Resize Elastic Resize Immediate Low Surge in Data/ Performance Scales redshift clusters up and down clusters in minutes Automate cluster resize on predictable loads Optimize cost and plan for capacity Schedule cluster resize using management console or API
  • 16. 4. Concurrency Scaling Concurrency Scaling Immediate Low Scalable capacity Automatically adds transient clusters Serves spike in concurrent requests For 24hrs of cluster in use, 1 hr. of concurrency scaling is free Ability to set usage limit
  • 17. 5. Moving to RA3 Instance RA3 Immediate Zero 2x performance uplift | 2x storage Scale data warehouse based on workload and scale on peak demand Pay separately compute and storage independently 2X performance and 2X Storage capacity in comparison to DS2.XLarge
  • 18. 6. Right Sizing • Instance Types • Dense Compute (DC2) • Dense Storage (DS2) • RA3 • Sizing • Size based on workload: CPU, disk, I/O • Scale up by adding nodes to check linear performance • Move to Higher instance groups
  • 19. 7. Table Design Considerations Sort Key Column EncodingDistribution Key • ANALYZE COMPRESSION • Compress all columns except for first sort key column • AZ64 is new encoding • Improves performance 2X-4X by reducing I/O • Use PG_TABLE_DEF • Zone maps stores min and max values of block • Order columns by low to high cardinality • No of Sort Columns < 4 • Interleaved Sort key– BE CAUTIOUS • More columns in interleaved sort key = Longer Vacuum • Use STL_TABLE_INFO • Distributions keys should have high cardinality to avoid data skew and “hot” nodes • Use Date Columns only if cardinality is high • DISTSTYLE AUTO is a great go-to for all tables < ~5 million rows.
  • 20. Moving Towards AUTO Management Table Stats WLM • Ensure that AUTO ANALYZE , AUTO SORT & AUTO VACUUM is enabled • INTERLEAVED SORT KEYS - Run VACUUM REINDEX command scheduled • Use STL_TABLE_INFO for stats • Use Auto WLM with SQA Enabled Manual WLM • Number of queues < 4 • Use QMR to monitor performance from bad queries • Max concurrency level for all user <=15 • Leave ~5% of memory unallocated
  • 22. AWS WAF-based Redshift assessment for M&E Giant Technologies: S3, Redshift, Redshift Spectrum Source System: 25 TB Team: Cloud Solutions Architect, Sr. Big Data Architect Fast FactsSolution • Comprehensive assessment of the candidate Redshift workload across 5 pillars of the AWS WAF, using Agilisium’s Redshift Inspector • Several observations across all 5 pillars were made based on Findings & Recommendations report (Redshift Inspector) and workshop Recommendations • Security : Database Encryption, Redshift in private cluster, Port Obfuscation, S3 VPC endpoint • Performance : Time series data model, Concurrency Scaling, Limited use of Interleaved sort keys, right-size column width • Cost Optimization : Data placement, RA3, Deletion of redundant backup & Reserved Instances • Reliability : Cross-region backup, Avoid Temp & Staging backup • Operational Excellence : Audit Logging, Cloud Watch Alerts and Auto Update Client requested a holistic assessment of their 25TB Redshift workload to identify avenues for improvement across all dimensions Objective Value delivered 30% faster query performance More secured and resilient Redshift workload 40% Cost reduction
  • 23. How we can help?
  • 24. How we can help? • AWS WAF-based Redshift Assessment ✓Findings & Recommendations Report ✓Remediation Plan (Pre- cursor for next phase – Optimize) • Performance Optimization (Compelling pricing options) • Cost Optimization (Outcome- based pricing) • Extend customer’s knowledge on new features and best practices • Custom trend report with top 10 metrics for ongoing maintenance Diagnose – 3 Days Optimize – 2+ Weeks Maintain – Quarterly
  • 25. Diagnose – AWS WAF-based Redshift Assessment – 3 Days Customer Contribution Identification of business-critical Redshift workload Availability of Business & Tech SMEs tied to Redshift workload for workshop Availability of Client DBA to run Redshift diagnostic queries Read-only access to your Redshift cluster for additional investigation, if any Automated fact- based assessment Rich Corpus of Best practices Agilisium’s Redshift Inspector Holistic 60-point check of your Redshift workload across 5 pillars of AWS WAF Toolkit is based on 100+ Redshift best practices identified from migrating 15+ PB to AWS in the last 7+ years AWS WAF-based Assessment – Deliverables Findings & Recommendations Report – Get accurate observations by criticality (Critical, Needs Improvement, Well- Architected) Actionable Remediation Plan – Plan to implement top observations from the Findings & Recommendations report. Clients can choose to implement the plan internally or involve Agilisium Automated Redshift WAF-based Assessment Toolkit
  • 26. Agilisium’s Redshift Inspector – Key facets covered Cost Optimization • Right-fit cluster size • On-demand to restore snapshots • Underutilized/unused clusters • Choice of Reserved Instances (stand vs convertible) • Hot/cold/warm data strategy • Intra-region Data Transfer • Snapshot lifecycle management Performance Efficiency • Compression/Encoding of large datasets to improve network throughput • Avoid Data Skew through right Distribution & Sort Keys • Up-to-date stats via ANALYZE & VACUUM • VACUUM strategies (Pre-sort & load) • Data loads Optimize strategies (COPY commands) • Track query performance (Integrity constraints as hints) • Auto WLM vs Custom WLM • Time-series data model for larger datasets Security • SSO & IAM Federations • Ingress policy – Port 5832 open for internal IPs only • All traffic routed via private subnets/VPCs • Encrypt data-at-rest – KMS/HSM • Encrypt data-in-motion – SSL/TLS Reliability • Multi-region cluster setup • SLA-based manual backup – Restore data lost due to accidental deletion • Cross-region backups for HA • Continuous monitoring of key metrics for HA (Disk utilization, ReadIOPS, WriteOPS, CPU utilization etc.) • Redshift user activity logged for RCA Operational Excellence • Deferred maintenance • Redshift Advisor recommendations • RA3 – Intelligent Data offload
  • 27. Findings & Recommendations – Sample Report
  • 28. Findings & Recommendations – Sample Report (contd..)
  • 29. Findings & Recommendations – Sample Report (contd..)