SlideShare a Scribd company logo
1 of 82
Masterclass 
Amazon Redshift 
Ian Massingham – Technical Evangelist 
@IanMmmm
Masterclass 
A technical deep dive beyond the basics 
Help educate you on how to get the best from AWS technologies 
Show you how things work and how to get things done 
Broaden your knowledge in ~45 mins
Amazon Redshift 
A fast, fully managed, petabyte-scale data warehouse 
Makes it simple & cost-effective to analyse all your data using existing tools 
Start small & scale to a petabyte or more for less than $1,000/terabyte/year
Optimised for 
Data Warehousing 
Scalable 
No Up-Front Costs 
Amazon Redshift 
Compatible 
Secure 
Simple
Agenda 
Why Run Your Data Warehouse on AWS? 
Getting Started 
Table Design 
Data Loading 
Working with Data 
Backup and Restoration 
Upgrading & Scaling
Why Run Your Data 
Warehouse on AWS?
Why Managed Databases? 
documentation, 
licensing & training 
backup & recovery, 
data load & unload 
performance 
tuning 
5% 
25% 
20% 
40% 
5% 5% 
scripting & 
coding 
security 
planning 
install, upgrade, 
patch and migrate 
differentiated effort increases the 
uniqueness of an application
Our customers asked us for data warehousing 
done the AWS way 
Easy to provision and scale up massively 
No upfront costs, pay as you go 
Really fast performance at a really low price 
Open and flexible with support for popular BI tools
Amazon Redshift 
parallelizes and distributes everything 
Query 
Common BI Tools 
JDBC/ODBC 
Load 
Backup 
Compute 
Compute 
Restore 
Node 
Node 
Resize 
Compute 
Node 
Leader 
Node 
10GigE Mesh
Redshift lets you start small and grow big 
Extra Large Node (dw1.xl & dw2.xl) 
3 spindles, 15GiB RAM 
2 virtual cores, 10GigE 
Single Node (160GB SSD or 2TB 
Magnetic) 
Cluster 2-32 Nodes (320GB SSD – 64TB 
Magnetic) 
8 Extra Large Node (dw1.8xl & dw2.8xl) 
24 spindles, 120GiB RAM, 1.2TB SSD or 16TB 
Magnetic, 16 virtual cores, 10GigE 
Cluster 2-100 Nodes (2.4TB SSD – 1.6PB Magnetic) 
8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL XL 
8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL XL XL XL XL XL XL XL XL 
8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL XL XL XL XL XL XL XL XL 
8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL XL XL XL XL XL XL XL XL 
XL XL XL XL XL XL XL XL 
8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL
Redshift simplifies operations 
• Built-in security in transit, at rest, when backed up 
• Backup to S3 is continuous, incremental, and automatic 
• Streaming restores let you resume querying faster 
• Disk failures are transparent; nodes recover automatically 
Amazon 
S3 
Clients 
(Optional) SSL 
Continuous, Automatic 
Backup 
Streaming Restore 
Amazon 
Redshift 
Amazon Redshift
Amazon Redshift dramatically reduces IO 
Column storage 
Data compression 
Zone maps 
Direct-attached storage 
Large data block sizes 
Id Age State 
123 20 CA 
345 25 WA 
678 40 FL 
Row storage Column storage
Current production environment 
32 nodes, 128 CPUs, 4.2TB RAM, 1.6 PB disk 
Tested 2B row data set, 6 representative queries on 
2-node Amazon Redshift cluster 
queries ran > 10x faster
TEST: 
2 BILLION RECORDS 
6 REPRESENTATIVE REQUESTS
AMAZON REDSHIFT 2x HS1.8XL 
Vs. 
32 NODES, 4.2TB RAM, 1.6PB
12x TO 150x FASTER
• Needed a way to increase speed, 
performance and flexibility of data 
analysis at a low cost 
• Using AWS enabled FT to run queries 
98% faster than previously—helping FT 
make business decisions quickly 
• Easier to track and analyze trends 
• Reduced infrastructure costs by 80% 
over traditional data center model
Getting 
Started
Getting Started 
Create a Data Warehouse in 
Minutes with Amazon Redshift
Creating a 
Redshift Data 
Warehouse via 
the AWS Console
Creating a 
Redshift Data 
Warehouse via 
the AWS Console
Creating a 
Redshift Data 
Warehouse via 
the AWS Console
Creating a 
Redshift Data 
Warehouse via 
the AWS Console
Creating a 
Redshift Data 
Warehouse via 
the AWS Console
Creating a 
Redshift Data 
Warehouse via 
the AWS Console
Creating a 
Redshift Data 
Warehouse via 
the AWS Console
Creating a 
Redshift Data 
Warehouse via 
the AWS Console
Creating a 
Redshift Data 
Warehouse via 
the AWS Console
Creating a 
Redshift Data 
Warehouse via 
the AWS Console
Creating a 
Redshift Data 
Warehouse via 
the AWS Console
Creating a 
Redshift Data 
Warehouse via 
the AWS Console
Creating a 
Redshift Data 
Warehouse via 
the AWS Console
Creating a 
Redshift Data 
Warehouse via 
the AWS Console
More Getting Started Resources 
2 Month Free Trial 
Comprehensive Getting Started Guide 
Table Design & Data Loading Tutorials 
Find out more about getting started with Amazon Redshift: aws.amazon.com/redshift/getting-started/
Table Design
Table Design 
▶ How to choose optimal table design ? 
▶ Choosing the right data type 
▶ Distributing and sorting data
The VARCHAR data type supports multi-byte characters up to a maximum of three bytes. 
Four-byte or longer characters are not supported. 
For example, VARCHAR(12) can support four Chinese characters (three bytes / character): 
4 characters x 3 bytes per character = 12 bytes 
Data Types
Your Schema 
▶ 3NF 
Airport 
Search Type 
Destination Enquiry Airport Group Map 
Airport Group 
Enquiry Airport 
Enquiry Airport 
Holiday Type Group
Distributing Data 
▶ Redshift is a distributed system: 
▶ A cluster contains a leader node & compute nodes 
▶ A compute node contains slices (one per core) 
▶ A slice contains data 
▶ Slices are chosen based on two types of distribution: 
▶ Round Robin (automated) 
▶ Based on a distribution key (hash of a defined column) 
▶ Queries run on all slices in parallel: optimal query throughput can be 
achieved when data is evenly spread across slices
Unoptimised Distribution 
ORDERS ITEMS Order 1: Dave Smith, Total £195 
Item 1.1: Order 1, Kindle Fire HD 7”, £159 
Item 1.2: Order 1, Kindle Fire Case, £36 
Node 1 
Slice 1 | Slice 2 
Node 2 
Slice 1 | Slice 2 
Node 3 
Slice 1 | Slice 2 
Order 1 Item 2.1 Order 2 Item 1.1 Order 3 Item 1.2 
Item 3.1 Item 2.2 
Default (No Distribution Key, Round Robin Order)
Optimised Distribution 
ORDERS ITEMS 
Order 1: Dave Smith, Total £195 
Item 1.1: Order 1, Kindle Fire HD 7”, £159 
Item 1.2: Order 1, Kindle Fire Case, £36 
Node 1 
Slice 1 | Slice 2 
Node 2 
Slice 1 | Slice 2 
Node 3 
Slice 1 | Slice 2 
Order 1 Order 2 Order 3 
Item 1.1 Item 2.1 
Item 1.2 Item 2.2 
Item 3.1 
Customised (ORDERS.ORDER_ID DISTKEY, ITEMS.ORDER_ID DISTKEY)
Choosing a Distribution Key 
▶ Frequently Joined 
▶ By most commonly run queries 
▶ By queries which consume the most CPU 
▶ High Cardinality 
▶ Large number of discrete values 
▶ Low Skew 
▶ Uniform Distribution 
▶ No Hotspots 
▶ Query STV_BLOCKLIST for Skew Factor 
http://docs.aws.amazon.com/redshift/latest/dg/c_analyzing-table-design.html
Sorting Data 
▶ In the slices (on disk), the data is sorted by a sort key (if none 
uses the insertion order) 
▶ Choose a sort key that is frequently used in your queries 
▶ As a query predicate (date, identifier, …) 
▶ As a join parameter (it can also be the hash key) 
▶ The sort key allows Redshift to avoid reading entire blocks 
based on predicates 
▶ E.g.: a table containing a timestamp sort key, and where only 
recent data is accessed, will skip blocks containing “old” data
Schema Design 
▶ Optimizing a database for querying 
▶ Analyse using Automatic Compression to ensure optimal IO 
▶ Co-locate frequently joined tables: use distribution key wisely 
(avoids data transfers between nodes) 
▶ For joined tables, use sort keys on the joined columns, 
allowing fast merge joins 
▶ Compression allows you to de-normalize without penalizing 
storage, simplifying queries and limiting joins
Possible Schema Optimisations 
▶ Denormalise commonly used join attributes onto large tables 
Search Type 
Destination Enquiry Map 
DISTKEY, SORTKEY = EnquiryID 
Airport 
Airport Group 
Airport Group 
Enquiry Airport 
Enquiry Airport 
Holiday Type Group
Possible Schema Optimisations 
▶ Cache Small Tables with DISTYLE ALL 
Search Type 
Destination Enquiry Map 
DISTKEY, SORTKEY = EnquiryID 
Airport 
Airport Group 
Airport Group 
Enquiry Airport 
Enquiry Airport 
Holiday Type Group
More on Table Design 
Table Design Documentation 
http://docs.aws.amazon.com/redshift/latest/dg/t_Creating_tables.html 
Table Design Tutorial 
http://docs.aws.amazon.com/redshift/latest/dg/tutorial-tuning-tables.html
Data Loading
Amazon Redshift Loading Data Overview 
Data Volume Amazon Elastic 
MapReduce 
Amazon S3 DynamoDB 
Amazon RDS 
Amazon 
Redshift 
Amazon 
Glacier 
Source DBs 
logs / files 
AWS Direct 
Connect 
VPN 
Connection 
S3 Multipart 
Upload 
Corporate Data center AWS Cloud 
AWS Import/ 
Export
Sharing Your Data 
▶ Data Dumped from DB 
▶ CSV Files Encrypted 
▶ Review Encryption Code 
▶ Uploaded to S3 
▶ Bucket <bucket> 
▶ Data shared with AWS 
{ 
"Id": 
"Policy1234567890", 
"Statement": 
[ 
{ 
"Action": 
[ 
"s3:Get*", 
"s3:List*" 
], 
"Effect": 
"Allow", 
"Principal": 
{ 
"AWS": 
["arn:aws:iam::887210671223:root"] 
}, 
"Resource": 
[ 
"arn:aws:s3:::<bucket-­‐name>", 
"arn:aws:s3:::<bucket-­‐name>/*" 
], 
"Sid": 
"Stmt0987654321" 
} 
] 
}
Slice 0 
Slice 1 
Slice 0 
Slice 1 
Splitting Data Files 
Client.txt.1 
Client.txt.2 
Client.txt.3 
Client.txt.4 
Node 0 
Node 1 
2 XL Compute Nodes 
Copy customer from ‘s3://mydata/client.txt’ 
Credentials ‘aws_access_key_id=<your-access-key>; aws_secret_access_key=<your_secret_key>’ 
Delimiter ‘|’; 
mydata
Loading Encrypted Data Files 
Client.txt 
Encrypt the file using 
the envelope key 
Generate a one time 
use envelope 
symmetric key 
Encrypt the envelope 
key using a master 
public key or 
symmetric key 
Amazon 
Redshift 
Copy customer from ‘s3://mydata/client.txt’ 
Credentials ‘aws_access_key_id=<your-access-key>; 
aws_secret_access_key=<your_secret_key>; 
Master_symmetric_key=<master_key>’ Delimiter ‘|’ encrypted;
Typical Load Issues 
▶ Mismatch between data types in table and values in input data fields 
▶ Mismatch between number of columns in table and number of fields in input data 
▶ Mismatched quotes 
▶ Redshift supports both single and double quotes; however, these quotes must be balanced appropriately 
▶ Incorrect format for date/time data in input files 
▶ Use DATEFORMAT and TIMEFORMAT to control 
▶ Out-of-range values in input files (for numeric columns) 
▶ Number of distinct values for a column exceeds the limitation for its compression 
encoding
Direct SQL 
▶ Redshift supports standard DML commands 
▶ INSERT, UPDATE, DELETE 
▶ Redshift does not support single-command merge (upsert) statement 
▶ Load data into a staging table 
▶ Joining the staging table with the target table 
▶ UPDATE data where row exists 
▶ INSERT where no row exists 
▶ All Direct SQL Commands go via Leader Node
Data Loading Best Practices 
▶ Use a COPY Command to load data 
▶ Use a single COPY command 
▶ Split your data into multiple files 
▶ Compress your data files with GZIP 
▶ Use multi-row inserts if COPY is not possible 
▶ Bulk insert operations (INSERT INTO…SELECT and 
CREATE TABLE AS) provide high performance data 
insertion
Data Loading Best Practices 
▶ Load your data in sort key order to avoid needing to vacuum 
▶ Organize your data as a sequence of time-series tables, where each table is 
identical but contains data for different time ranges 
▶ Use staging tables to perform an upsert 
▶ Run the VACUUM command whenever you add, delete, or modify a large 
number of rows, unless you load your data in sort key order 
▶ Increase the memory available to a COPY or VACUUM by increasing 
wlm_query_slot_count 
▶ Run the ANALYZE command whenever you’ve made a non-trivial number of 
changes to your data to ensure your table statistics are current
Working with data in 
Amazon Redshift
RDBMS 
Redshift 
OLTP 
ERP 
Reporting 
& BI
RDBMS 
Redshift 
OLTP 
ERP 
Reporting 
& BI
JDBC/ODBC 
Amazon Redshift
Backup and 
Restoration
BACKUP AND 
RESTORATION Backups to Amazon S3 are 
Co1m6 copresu te 
Node 
Amazon S3 
128GB RAM 
Co1m6 copresu te 
Node 
16TB disk 
128GB RAM 
16TB disk 
128GB RAM 
Co1m6 copresu te 
Node 
16TB disk 
automatic, continuous and 
incremental 
Configurable system snapshot 
retention period 
User snapshots on-demand 
Streaming restores enable you 
to resume querying faster
CLUSTER DATA REPLICATION 
+ 
AUTOMATED BACKUPS ON 
AMAZON S3 
+ 
NODE MONITORING
Upgrading and 
Scaling
16 cores 
Compute Node 
L16e caodrese r 
Node 
16 cores 
Compute Node 
16 cores 
Compute Node 
Resize while remaining online 
Provision a new cluster in the 
background 
16 cores 
Compute Node 
16 cores 
Compute Node 
16 cores 
Compute Node 
16 cores 
Compute Node 
L16e caodrese r 
Node 
Copy data in parallel from node to 
node 
Only charged for source cluster 
SQL Clients/BI Tools 
128GB RAM 
48TB disk 
128GB RAM 
48TB disk 
128GB RAM 
48TB disk 
128GB RAM 
48TB disk 
128GB RAM 
48TB disk 
128GB RAM 
48TB disk 
128GB RAM 
48TB disk 
128GB RAM 
48TB disk 
128GB RAM 
48TB disk
128GB RAM 
16 cores 
Compute Node 
48TB disk 
128GB RAM 
16 cores 
Compute Node 
48TB disk 
128GB RAM 
16 cores 
Compute Node 
48TB disk 
128GB RAM 
16 cores 
Compute Node 
48TB disk 
128GB RAM 
L16e caodrese r 
Node 
48TB disk 
Automatic SQL endpoint switchover via DNS 
Decommission the source cluster 
SQL Clients/BI Tools
Resizing a 
Redshift Data 
Warehouse via 
the AWS Console
Resizing a 
Redshift Data 
Warehouse via 
the AWS Console
Resizing a 
Redshift Data 
Warehouse via 
the AWS Console
Some other things…
Airbnb 
Nokia 
Find out more about these and other AWS case studies here: aws.amazon.com/solutions/case-studies/
Resources to learn more 
Financial Times Case Study 
aws.amazon.com/solutions/case-studies/financial-times 
Getting Started with Amazon Redshift 
aws.amazon.com/redshift/getting-started/ 
Uses & Best Practices for Amazon Redshift 
AWS Summit 2014 San Francisco (video) 
http://youtu.be/reQtXquDpzo 
Your local AWS Solutions Architect J
Summary
Optimised for 
Data Warehousing 
Scalable 
No Up-Front Costs 
Amazon Redshift 
Compatible 
Secure 
Simple
Find out more: 
aws.amazon.com/redshift
AWS Training & Certification 
Certification 
Demonstrate your skills, 
knowledge, and expertise 
with the AWS platform 
aws.amazon.com/certification 
Self-Paced Labs 
Try products, gain new 
skills, and get hands-on 
practice working with 
AWS technologies 
aws.amazon.com/training/ 
self-paced-labs 
Training 
Skill up and gain 
confidence to design, 
develop, deploy and 
manage your applications 
on AWS 
aws.amazon.com/training
Ian Massingham – Technical Evangelist 
@IanMmmm 
@AWS_UKI for local AWS events & news 
@AWScloud for Global AWS News and Announcements 
©Amazon.com, Inc. and its affiliates. All rights reserved.
We typically see customers start by trying our 
services 
Get started now at : aws.amazon.com/getting-started
Design your application for the AWS Cloud 
More details on the AWS Architecture Center at : aws.amazon.com/architecture

More Related Content

What's hot

Introduction to NoSQL Databases
Introduction to NoSQL DatabasesIntroduction to NoSQL Databases
Introduction to NoSQL DatabasesDerek Stainer
 
Application Load Balancer and the integration with AutoScaling and ECS - Pop-...
Application Load Balancer and the integration with AutoScaling and ECS - Pop-...Application Load Balancer and the integration with AutoScaling and ECS - Pop-...
Application Load Balancer and the integration with AutoScaling and ECS - Pop-...Amazon Web Services
 
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...Amazon Web Services
 
A Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and HudiA Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and HudiDatabricks
 
database migration simple, cross-engine and cross-platform migrations with ...
database migration   simple, cross-engine and cross-platform migrations with ...database migration   simple, cross-engine and cross-platform migrations with ...
database migration simple, cross-engine and cross-platform migrations with ...Amazon Web Services
 
Deep Dive on Amazon RDS (Relational Database Service)
Deep Dive on Amazon RDS (Relational Database Service)Deep Dive on Amazon RDS (Relational Database Service)
Deep Dive on Amazon RDS (Relational Database Service)Amazon Web Services
 
Amazon EMR Deep Dive & Best Practices
Amazon EMR Deep Dive & Best PracticesAmazon EMR Deep Dive & Best Practices
Amazon EMR Deep Dive & Best PracticesAmazon Web Services
 
Azure Data Factory ETL Patterns in the Cloud
Azure Data Factory ETL Patterns in the CloudAzure Data Factory ETL Patterns in the Cloud
Azure Data Factory ETL Patterns in the CloudMark Kromer
 
Amazon Relational Database Service (Amazon RDS)
Amazon Relational Database Service (Amazon RDS)Amazon Relational Database Service (Amazon RDS)
Amazon Relational Database Service (Amazon RDS)Amazon Web Services
 
KINX와 함께 하는 AWS Direct Connect 도입 - 남시우 매니저, KINX :: AWS Summit Seoul 2019
KINX와 함께 하는 AWS Direct Connect 도입 - 남시우 매니저, KINX :: AWS Summit Seoul 2019KINX와 함께 하는 AWS Direct Connect 도입 - 남시우 매니저, KINX :: AWS Summit Seoul 2019
KINX와 함께 하는 AWS Direct Connect 도입 - 남시우 매니저, KINX :: AWS Summit Seoul 2019Amazon Web Services Korea
 
Amazon RDS: Deep Dive - SRV310 - Chicago AWS Summit
Amazon RDS: Deep Dive - SRV310 - Chicago AWS SummitAmazon RDS: Deep Dive - SRV310 - Chicago AWS Summit
Amazon RDS: Deep Dive - SRV310 - Chicago AWS SummitAmazon Web Services
 

What's hot (20)

Introduction to Amazon Aurora
Introduction to Amazon AuroraIntroduction to Amazon Aurora
Introduction to Amazon Aurora
 
Introduction to NoSQL Databases
Introduction to NoSQL DatabasesIntroduction to NoSQL Databases
Introduction to NoSQL Databases
 
Introduction to Amazon EC2
Introduction to Amazon EC2Introduction to Amazon EC2
Introduction to Amazon EC2
 
Amazon Redshift Masterclass
Amazon Redshift MasterclassAmazon Redshift Masterclass
Amazon Redshift Masterclass
 
Application Load Balancer and the integration with AutoScaling and ECS - Pop-...
Application Load Balancer and the integration with AutoScaling and ECS - Pop-...Application Load Balancer and the integration with AutoScaling and ECS - Pop-...
Application Load Balancer and the integration with AutoScaling and ECS - Pop-...
 
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
Big Data Analytics Architectural Patterns and Best Practices (ANT201-R1) - AW...
 
BDA311 Introduction to AWS Glue
BDA311 Introduction to AWS GlueBDA311 Introduction to AWS Glue
BDA311 Introduction to AWS Glue
 
Introduction to Amazon DynamoDB
Introduction to Amazon DynamoDBIntroduction to Amazon DynamoDB
Introduction to Amazon DynamoDB
 
Introduction to AWS Glue
Introduction to AWS GlueIntroduction to AWS Glue
Introduction to AWS Glue
 
A Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and HudiA Thorough Comparison of Delta Lake, Iceberg and Hudi
A Thorough Comparison of Delta Lake, Iceberg and Hudi
 
Amazon Redshift
Amazon Redshift Amazon Redshift
Amazon Redshift
 
database migration simple, cross-engine and cross-platform migrations with ...
database migration   simple, cross-engine and cross-platform migrations with ...database migration   simple, cross-engine and cross-platform migrations with ...
database migration simple, cross-engine and cross-platform migrations with ...
 
AWS RDS
AWS RDSAWS RDS
AWS RDS
 
Deep Dive on Amazon RDS (Relational Database Service)
Deep Dive on Amazon RDS (Relational Database Service)Deep Dive on Amazon RDS (Relational Database Service)
Deep Dive on Amazon RDS (Relational Database Service)
 
Amazon EMR Deep Dive & Best Practices
Amazon EMR Deep Dive & Best PracticesAmazon EMR Deep Dive & Best Practices
Amazon EMR Deep Dive & Best Practices
 
Introduction to Amazon EC2
Introduction to Amazon EC2Introduction to Amazon EC2
Introduction to Amazon EC2
 
Azure Data Factory ETL Patterns in the Cloud
Azure Data Factory ETL Patterns in the CloudAzure Data Factory ETL Patterns in the Cloud
Azure Data Factory ETL Patterns in the Cloud
 
Amazon Relational Database Service (Amazon RDS)
Amazon Relational Database Service (Amazon RDS)Amazon Relational Database Service (Amazon RDS)
Amazon Relational Database Service (Amazon RDS)
 
KINX와 함께 하는 AWS Direct Connect 도입 - 남시우 매니저, KINX :: AWS Summit Seoul 2019
KINX와 함께 하는 AWS Direct Connect 도입 - 남시우 매니저, KINX :: AWS Summit Seoul 2019KINX와 함께 하는 AWS Direct Connect 도입 - 남시우 매니저, KINX :: AWS Summit Seoul 2019
KINX와 함께 하는 AWS Direct Connect 도입 - 남시우 매니저, KINX :: AWS Summit Seoul 2019
 
Amazon RDS: Deep Dive - SRV310 - Chicago AWS Summit
Amazon RDS: Deep Dive - SRV310 - Chicago AWS SummitAmazon RDS: Deep Dive - SRV310 - Chicago AWS Summit
Amazon RDS: Deep Dive - SRV310 - Chicago AWS Summit
 

Viewers also liked

Container Days Conference Plesk 2016 - How AWS, Docker and Microservices infl...
Container Days Conference Plesk 2016 - How AWS, Docker and Microservices infl...Container Days Conference Plesk 2016 - How AWS, Docker and Microservices infl...
Container Days Conference Plesk 2016 - How AWS, Docker and Microservices infl...Jan Löffler
 
AWS Webcast - Webinar Series for State and Local Government #2: Discover the ...
AWS Webcast - Webinar Series for State and Local Government #2: Discover the ...AWS Webcast - Webinar Series for State and Local Government #2: Discover the ...
AWS Webcast - Webinar Series for State and Local Government #2: Discover the ...Amazon Web Services
 
AWS Paris Summit 2014 - Closing Keynote Werner Vogels - Beyond the fridge
AWS Paris Summit 2014 - Closing Keynote Werner Vogels - Beyond the fridgeAWS Paris Summit 2014 - Closing Keynote Werner Vogels - Beyond the fridge
AWS Paris Summit 2014 - Closing Keynote Werner Vogels - Beyond the fridgeAmazon Web Services
 
AWS Webcast - Emergency Preparedness and Interagency Collaboration Webinar
AWS Webcast - Emergency Preparedness and Interagency Collaboration Webinar AWS Webcast - Emergency Preparedness and Interagency Collaboration Webinar
AWS Webcast - Emergency Preparedness and Interagency Collaboration Webinar Amazon Web Services
 
(BDT202) HPC Now Means 'High Personal Computing' | AWS re:Invent 2014
(BDT202) HPC Now Means 'High Personal Computing' | AWS re:Invent 2014(BDT202) HPC Now Means 'High Personal Computing' | AWS re:Invent 2014
(BDT202) HPC Now Means 'High Personal Computing' | AWS re:Invent 2014Amazon Web Services
 
Journey Through the Cloud - Digital Media
Journey Through the Cloud - Digital MediaJourney Through the Cloud - Digital Media
Journey Through the Cloud - Digital MediaAmazon Web Services
 
(BIZ306) Migrating Trimble Sketchup 3D Warehouse to AWS | AWS re:Invent 2014
(BIZ306) Migrating Trimble Sketchup 3D Warehouse to AWS | AWS re:Invent 2014(BIZ306) Migrating Trimble Sketchup 3D Warehouse to AWS | AWS re:Invent 2014
(BIZ306) Migrating Trimble Sketchup 3D Warehouse to AWS | AWS re:Invent 2014Amazon Web Services
 
Fostering a Culture of Innovation with Cloud
Fostering a Culture of Innovation with CloudFostering a Culture of Innovation with Cloud
Fostering a Culture of Innovation with CloudAmazon Web Services
 
(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014
(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014
(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014Amazon Web Services
 
Updating Security Operations for the Cloud - AWS Symposium 2014 - Washington ...
Updating Security Operations for the Cloud - AWS Symposium 2014 - Washington ...Updating Security Operations for the Cloud - AWS Symposium 2014 - Washington ...
Updating Security Operations for the Cloud - AWS Symposium 2014 - Washington ...Amazon Web Services
 
AWS Public Sector Symposium | Effective Security Response in the Cloud - Sess...
AWS Public Sector Symposium | Effective Security Response in the Cloud - Sess...AWS Public Sector Symposium | Effective Security Response in the Cloud - Sess...
AWS Public Sector Symposium | Effective Security Response in the Cloud - Sess...Amazon Web Services
 
AWS Government, Education, and Nonprofits Symposium London, United Kingdom L...
 AWS Government, Education, and Nonprofits Symposium London, United Kingdom L... AWS Government, Education, and Nonprofits Symposium London, United Kingdom L...
AWS Government, Education, and Nonprofits Symposium London, United Kingdom L...Amazon Web Services
 
Deployment with Elastic Beanstalk at Edinburgh Startup Event
Deployment with Elastic Beanstalk at Edinburgh Startup EventDeployment with Elastic Beanstalk at Edinburgh Startup Event
Deployment with Elastic Beanstalk at Edinburgh Startup EventAmazon Web Services
 
(PFC307) Auto Scaling: A Machine Learning Approach | AWS re:Invent 2014
(PFC307) Auto Scaling: A Machine Learning Approach | AWS re:Invent 2014(PFC307) Auto Scaling: A Machine Learning Approach | AWS re:Invent 2014
(PFC307) Auto Scaling: A Machine Learning Approach | AWS re:Invent 2014Amazon Web Services
 
(AFF201) What the Top 50 Games Do with In-App Purchasing That the Rest of Us ...
(AFF201) What the Top 50 Games Do with In-App Purchasing That the Rest of Us ...(AFF201) What the Top 50 Games Do with In-App Purchasing That the Rest of Us ...
(AFF201) What the Top 50 Games Do with In-App Purchasing That the Rest of Us ...Amazon Web Services
 
T2 – Continuous integration on aws
T2 – Continuous integration on awsT2 – Continuous integration on aws
T2 – Continuous integration on awsAmazon Web Services
 
What is Cloud Computing with AWS?
What is Cloud Computing with AWS?What is Cloud Computing with AWS?
What is Cloud Computing with AWS?Amazon Web Services
 
(WEB305) Migrating Your Website to AWS | AWS re:Invent 2014
(WEB305) Migrating Your Website to AWS | AWS re:Invent 2014(WEB305) Migrating Your Website to AWS | AWS re:Invent 2014
(WEB305) Migrating Your Website to AWS | AWS re:Invent 2014Amazon Web Services
 

Viewers also liked (20)

Container Days Conference Plesk 2016 - How AWS, Docker and Microservices infl...
Container Days Conference Plesk 2016 - How AWS, Docker and Microservices infl...Container Days Conference Plesk 2016 - How AWS, Docker and Microservices infl...
Container Days Conference Plesk 2016 - How AWS, Docker and Microservices infl...
 
AWS Webcast - Webinar Series for State and Local Government #2: Discover the ...
AWS Webcast - Webinar Series for State and Local Government #2: Discover the ...AWS Webcast - Webinar Series for State and Local Government #2: Discover the ...
AWS Webcast - Webinar Series for State and Local Government #2: Discover the ...
 
AWS Paris Summit 2014 - Closing Keynote Werner Vogels - Beyond the fridge
AWS Paris Summit 2014 - Closing Keynote Werner Vogels - Beyond the fridgeAWS Paris Summit 2014 - Closing Keynote Werner Vogels - Beyond the fridge
AWS Paris Summit 2014 - Closing Keynote Werner Vogels - Beyond the fridge
 
Workshop part2 – Big Data
Workshop part2 – Big DataWorkshop part2 – Big Data
Workshop part2 – Big Data
 
AWS Webcast - Emergency Preparedness and Interagency Collaboration Webinar
AWS Webcast - Emergency Preparedness and Interagency Collaboration Webinar AWS Webcast - Emergency Preparedness and Interagency Collaboration Webinar
AWS Webcast - Emergency Preparedness and Interagency Collaboration Webinar
 
(BDT202) HPC Now Means 'High Personal Computing' | AWS re:Invent 2014
(BDT202) HPC Now Means 'High Personal Computing' | AWS re:Invent 2014(BDT202) HPC Now Means 'High Personal Computing' | AWS re:Invent 2014
(BDT202) HPC Now Means 'High Personal Computing' | AWS re:Invent 2014
 
Journey Through the Cloud - Digital Media
Journey Through the Cloud - Digital MediaJourney Through the Cloud - Digital Media
Journey Through the Cloud - Digital Media
 
(BIZ306) Migrating Trimble Sketchup 3D Warehouse to AWS | AWS re:Invent 2014
(BIZ306) Migrating Trimble Sketchup 3D Warehouse to AWS | AWS re:Invent 2014(BIZ306) Migrating Trimble Sketchup 3D Warehouse to AWS | AWS re:Invent 2014
(BIZ306) Migrating Trimble Sketchup 3D Warehouse to AWS | AWS re:Invent 2014
 
Fostering a Culture of Innovation with Cloud
Fostering a Culture of Innovation with CloudFostering a Culture of Innovation with Cloud
Fostering a Culture of Innovation with Cloud
 
(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014
(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014
(SPOT207) State of the Union: AWS Database Services | AWS re:Invent 2014
 
Updating Security Operations for the Cloud - AWS Symposium 2014 - Washington ...
Updating Security Operations for the Cloud - AWS Symposium 2014 - Washington ...Updating Security Operations for the Cloud - AWS Symposium 2014 - Washington ...
Updating Security Operations for the Cloud - AWS Symposium 2014 - Washington ...
 
AWS Service Drill Downs
AWS Service Drill DownsAWS Service Drill Downs
AWS Service Drill Downs
 
AWS Public Sector Symposium | Effective Security Response in the Cloud - Sess...
AWS Public Sector Symposium | Effective Security Response in the Cloud - Sess...AWS Public Sector Symposium | Effective Security Response in the Cloud - Sess...
AWS Public Sector Symposium | Effective Security Response in the Cloud - Sess...
 
AWS Government, Education, and Nonprofits Symposium London, United Kingdom L...
 AWS Government, Education, and Nonprofits Symposium London, United Kingdom L... AWS Government, Education, and Nonprofits Symposium London, United Kingdom L...
AWS Government, Education, and Nonprofits Symposium London, United Kingdom L...
 
Deployment with Elastic Beanstalk at Edinburgh Startup Event
Deployment with Elastic Beanstalk at Edinburgh Startup EventDeployment with Elastic Beanstalk at Edinburgh Startup Event
Deployment with Elastic Beanstalk at Edinburgh Startup Event
 
(PFC307) Auto Scaling: A Machine Learning Approach | AWS re:Invent 2014
(PFC307) Auto Scaling: A Machine Learning Approach | AWS re:Invent 2014(PFC307) Auto Scaling: A Machine Learning Approach | AWS re:Invent 2014
(PFC307) Auto Scaling: A Machine Learning Approach | AWS re:Invent 2014
 
(AFF201) What the Top 50 Games Do with In-App Purchasing That the Rest of Us ...
(AFF201) What the Top 50 Games Do with In-App Purchasing That the Rest of Us ...(AFF201) What the Top 50 Games Do with In-App Purchasing That the Rest of Us ...
(AFF201) What the Top 50 Games Do with In-App Purchasing That the Rest of Us ...
 
T2 – Continuous integration on aws
T2 – Continuous integration on awsT2 – Continuous integration on aws
T2 – Continuous integration on aws
 
What is Cloud Computing with AWS?
What is Cloud Computing with AWS?What is Cloud Computing with AWS?
What is Cloud Computing with AWS?
 
(WEB305) Migrating Your Website to AWS | AWS re:Invent 2014
(WEB305) Migrating Your Website to AWS | AWS re:Invent 2014(WEB305) Migrating Your Website to AWS | AWS re:Invent 2014
(WEB305) Migrating Your Website to AWS | AWS re:Invent 2014
 

Similar to Masterclass - Redshift

Leveraging Amazon Redshift for Your Data Warehouse
Leveraging Amazon Redshift for Your Data WarehouseLeveraging Amazon Redshift for Your Data Warehouse
Leveraging Amazon Redshift for Your Data WarehouseAmazon Web Services
 
Building Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon RedshiftBuilding Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon RedshiftAmazon Web Services
 
Deploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWSDeploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWSAmazon Web Services
 
Best Practices for Migrating Your Data Warehouse to Amazon Redshift
Best Practices for Migrating Your Data Warehouse to Amazon RedshiftBest Practices for Migrating Your Data Warehouse to Amazon Redshift
Best Practices for Migrating Your Data Warehouse to Amazon RedshiftAmazon Web Services
 
Amazon RedShift - Ianni Vamvadelis
Amazon RedShift - Ianni VamvadelisAmazon RedShift - Ianni Vamvadelis
Amazon RedShift - Ianni Vamvadelishuguk
 
Best Practices for Migrating your Data Warehouse to Amazon Redshift
Best Practices for Migrating your Data Warehouse to Amazon RedshiftBest Practices for Migrating your Data Warehouse to Amazon Redshift
Best Practices for Migrating your Data Warehouse to Amazon RedshiftAmazon Web Services
 
Getting started with Amazon Redshift
Getting started with Amazon RedshiftGetting started with Amazon Redshift
Getting started with Amazon RedshiftAmazon Web Services
 
Best Practices for Migrating your Data Warehouse to Amazon Redshift
Best Practices for Migrating your Data Warehouse to Amazon Redshift Best Practices for Migrating your Data Warehouse to Amazon Redshift
Best Practices for Migrating your Data Warehouse to Amazon Redshift Amazon Web Services
 
Best Practices for Migrating your Data Warehouse to Amazon Redshift
Best Practices for Migrating your Data Warehouse to Amazon RedshiftBest Practices for Migrating your Data Warehouse to Amazon Redshift
Best Practices for Migrating your Data Warehouse to Amazon RedshiftAmazon Web Services
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon RedshiftAmazon Web Services
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon RedshiftAmazon Web Services
 
23 October 2013 - AWS 201 - A Walk through the AWS Cloud: Introduction to Ama...
23 October 2013 - AWS 201 - A Walk through the AWS Cloud: Introduction to Ama...23 October 2013 - AWS 201 - A Walk through the AWS Cloud: Introduction to Ama...
23 October 2013 - AWS 201 - A Walk through the AWS Cloud: Introduction to Ama...Amazon Web Services
 
Amazon Athena (April 2017)
Amazon Athena (April 2017)Amazon Athena (April 2017)
Amazon Athena (April 2017)Julien SIMON
 
AWS June Webinar Series - Getting Started: Amazon Redshift
AWS June Webinar Series - Getting Started: Amazon RedshiftAWS June Webinar Series - Getting Started: Amazon Redshift
AWS June Webinar Series - Getting Started: Amazon RedshiftAmazon Web Services
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon RedshiftAmazon Web Services
 
Building Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon RedshiftBuilding Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon RedshiftAmazon Web Services
 
Getting Started with Amazon Redshift
 Getting Started with Amazon Redshift Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftAmazon Web Services
 

Similar to Masterclass - Redshift (20)

Leveraging Amazon Redshift for Your Data Warehouse
Leveraging Amazon Redshift for Your Data WarehouseLeveraging Amazon Redshift for Your Data Warehouse
Leveraging Amazon Redshift for Your Data Warehouse
 
Processing and Analytics
Processing and AnalyticsProcessing and Analytics
Processing and Analytics
 
Building Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon RedshiftBuilding Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon Redshift
 
Deploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWSDeploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWS
 
Best Practices for Migrating Your Data Warehouse to Amazon Redshift
Best Practices for Migrating Your Data Warehouse to Amazon RedshiftBest Practices for Migrating Your Data Warehouse to Amazon Redshift
Best Practices for Migrating Your Data Warehouse to Amazon Redshift
 
Amazon RedShift - Ianni Vamvadelis
Amazon RedShift - Ianni VamvadelisAmazon RedShift - Ianni Vamvadelis
Amazon RedShift - Ianni Vamvadelis
 
Best Practices for Migrating your Data Warehouse to Amazon Redshift
Best Practices for Migrating your Data Warehouse to Amazon RedshiftBest Practices for Migrating your Data Warehouse to Amazon Redshift
Best Practices for Migrating your Data Warehouse to Amazon Redshift
 
Getting started with Amazon Redshift
Getting started with Amazon RedshiftGetting started with Amazon Redshift
Getting started with Amazon Redshift
 
Best Practices for Migrating your Data Warehouse to Amazon Redshift
Best Practices for Migrating your Data Warehouse to Amazon Redshift Best Practices for Migrating your Data Warehouse to Amazon Redshift
Best Practices for Migrating your Data Warehouse to Amazon Redshift
 
Best Practices for Migrating your Data Warehouse to Amazon Redshift
Best Practices for Migrating your Data Warehouse to Amazon RedshiftBest Practices for Migrating your Data Warehouse to Amazon Redshift
Best Practices for Migrating your Data Warehouse to Amazon Redshift
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon Redshift
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon Redshift
 
23 October 2013 - AWS 201 - A Walk through the AWS Cloud: Introduction to Ama...
23 October 2013 - AWS 201 - A Walk through the AWS Cloud: Introduction to Ama...23 October 2013 - AWS 201 - A Walk through the AWS Cloud: Introduction to Ama...
23 October 2013 - AWS 201 - A Walk through the AWS Cloud: Introduction to Ama...
 
Amazon Athena (April 2017)
Amazon Athena (April 2017)Amazon Athena (April 2017)
Amazon Athena (April 2017)
 
AWS June Webinar Series - Getting Started: Amazon Redshift
AWS June Webinar Series - Getting Started: Amazon RedshiftAWS June Webinar Series - Getting Started: Amazon Redshift
AWS June Webinar Series - Getting Started: Amazon Redshift
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon Redshift
 
Building Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon RedshiftBuilding Your Data Warehouse with Amazon Redshift
Building Your Data Warehouse with Amazon Redshift
 
Deep Dive on Amazon DynamoDB
Deep Dive on Amazon DynamoDBDeep Dive on Amazon DynamoDB
Deep Dive on Amazon DynamoDB
 
Getting Started with Amazon Redshift
 Getting Started with Amazon Redshift Getting Started with Amazon Redshift
Getting Started with Amazon Redshift
 
Redshift overview
Redshift overviewRedshift overview
Redshift overview
 

More from Amazon Web Services

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Amazon Web Services
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Amazon Web Services
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateAmazon Web Services
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSAmazon Web Services
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Amazon Web Services
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Amazon Web Services
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...Amazon Web Services
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsAmazon Web Services
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareAmazon Web Services
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSAmazon Web Services
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAmazon Web Services
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareAmazon Web Services
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWSAmazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckAmazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without serversAmazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...Amazon Web Services
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceAmazon Web Services
 

More from Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Recently uploaded

Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentPim van der Noll
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsSergiu Bodiu
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesThousandEyes
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterMydbops
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxLoriGlavin3
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...Wes McKinney
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Farhan Tariq
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxLoriGlavin3
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 

Recently uploaded (20)

Emixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native developmentEmixa Mendix Meetup 11 April 2024 about Mendix Native development
Emixa Mendix Meetup 11 April 2024 about Mendix Native development
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
DevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platformsDevEX - reference for building teams, processes, and platforms
DevEX - reference for building teams, processes, and platforms
 
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyesAssure Ecommerce and Retail Operations Uptime with ThousandEyes
Assure Ecommerce and Retail Operations Uptime with ThousandEyes
 
Scale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL RouterScale your database traffic with Read & Write split using MySQL Router
Scale your database traffic with Read & Write split using MySQL Router
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptxMerck Moving Beyond Passwords: FIDO Paris Seminar.pptx
Merck Moving Beyond Passwords: FIDO Paris Seminar.pptx
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
The Future Roadmap for the Composable Data Stack - Wes McKinney - Data Counci...
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...Genislab builds better products and faster go-to-market with Lean project man...
Genislab builds better products and faster go-to-market with Lean project man...
 
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptxUse of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
Use of FIDO in the Payments and Identity Landscape: FIDO Paris Seminar.pptx
 
Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 

Masterclass - Redshift

  • 1. Masterclass Amazon Redshift Ian Massingham – Technical Evangelist @IanMmmm
  • 2. Masterclass A technical deep dive beyond the basics Help educate you on how to get the best from AWS technologies Show you how things work and how to get things done Broaden your knowledge in ~45 mins
  • 3. Amazon Redshift A fast, fully managed, petabyte-scale data warehouse Makes it simple & cost-effective to analyse all your data using existing tools Start small & scale to a petabyte or more for less than $1,000/terabyte/year
  • 4. Optimised for Data Warehousing Scalable No Up-Front Costs Amazon Redshift Compatible Secure Simple
  • 5. Agenda Why Run Your Data Warehouse on AWS? Getting Started Table Design Data Loading Working with Data Backup and Restoration Upgrading & Scaling
  • 6. Why Run Your Data Warehouse on AWS?
  • 7. Why Managed Databases? documentation, licensing & training backup & recovery, data load & unload performance tuning 5% 25% 20% 40% 5% 5% scripting & coding security planning install, upgrade, patch and migrate differentiated effort increases the uniqueness of an application
  • 8. Our customers asked us for data warehousing done the AWS way Easy to provision and scale up massively No upfront costs, pay as you go Really fast performance at a really low price Open and flexible with support for popular BI tools
  • 9. Amazon Redshift parallelizes and distributes everything Query Common BI Tools JDBC/ODBC Load Backup Compute Compute Restore Node Node Resize Compute Node Leader Node 10GigE Mesh
  • 10. Redshift lets you start small and grow big Extra Large Node (dw1.xl & dw2.xl) 3 spindles, 15GiB RAM 2 virtual cores, 10GigE Single Node (160GB SSD or 2TB Magnetic) Cluster 2-32 Nodes (320GB SSD – 64TB Magnetic) 8 Extra Large Node (dw1.8xl & dw2.8xl) 24 spindles, 120GiB RAM, 1.2TB SSD or 16TB Magnetic, 16 virtual cores, 10GigE Cluster 2-100 Nodes (2.4TB SSD – 1.6PB Magnetic) 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL XL XL XL XL XL XL XL XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL XL XL XL XL XL XL XL XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL XL XL XL XL XL XL XL XL XL XL XL XL XL XL XL XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL 8XL
  • 11. Redshift simplifies operations • Built-in security in transit, at rest, when backed up • Backup to S3 is continuous, incremental, and automatic • Streaming restores let you resume querying faster • Disk failures are transparent; nodes recover automatically Amazon S3 Clients (Optional) SSL Continuous, Automatic Backup Streaming Restore Amazon Redshift Amazon Redshift
  • 12. Amazon Redshift dramatically reduces IO Column storage Data compression Zone maps Direct-attached storage Large data block sizes Id Age State 123 20 CA 345 25 WA 678 40 FL Row storage Column storage
  • 13. Current production environment 32 nodes, 128 CPUs, 4.2TB RAM, 1.6 PB disk Tested 2B row data set, 6 representative queries on 2-node Amazon Redshift cluster queries ran > 10x faster
  • 14. TEST: 2 BILLION RECORDS 6 REPRESENTATIVE REQUESTS
  • 15. AMAZON REDSHIFT 2x HS1.8XL Vs. 32 NODES, 4.2TB RAM, 1.6PB
  • 16. 12x TO 150x FASTER
  • 17. • Needed a way to increase speed, performance and flexibility of data analysis at a low cost • Using AWS enabled FT to run queries 98% faster than previously—helping FT make business decisions quickly • Easier to track and analyze trends • Reduced infrastructure costs by 80% over traditional data center model
  • 19. Getting Started Create a Data Warehouse in Minutes with Amazon Redshift
  • 20. Creating a Redshift Data Warehouse via the AWS Console
  • 21. Creating a Redshift Data Warehouse via the AWS Console
  • 22. Creating a Redshift Data Warehouse via the AWS Console
  • 23. Creating a Redshift Data Warehouse via the AWS Console
  • 24. Creating a Redshift Data Warehouse via the AWS Console
  • 25. Creating a Redshift Data Warehouse via the AWS Console
  • 26. Creating a Redshift Data Warehouse via the AWS Console
  • 27. Creating a Redshift Data Warehouse via the AWS Console
  • 28. Creating a Redshift Data Warehouse via the AWS Console
  • 29. Creating a Redshift Data Warehouse via the AWS Console
  • 30. Creating a Redshift Data Warehouse via the AWS Console
  • 31. Creating a Redshift Data Warehouse via the AWS Console
  • 32. Creating a Redshift Data Warehouse via the AWS Console
  • 33. Creating a Redshift Data Warehouse via the AWS Console
  • 34. More Getting Started Resources 2 Month Free Trial Comprehensive Getting Started Guide Table Design & Data Loading Tutorials Find out more about getting started with Amazon Redshift: aws.amazon.com/redshift/getting-started/
  • 36. Table Design ▶ How to choose optimal table design ? ▶ Choosing the right data type ▶ Distributing and sorting data
  • 37. The VARCHAR data type supports multi-byte characters up to a maximum of three bytes. Four-byte or longer characters are not supported. For example, VARCHAR(12) can support four Chinese characters (three bytes / character): 4 characters x 3 bytes per character = 12 bytes Data Types
  • 38. Your Schema ▶ 3NF Airport Search Type Destination Enquiry Airport Group Map Airport Group Enquiry Airport Enquiry Airport Holiday Type Group
  • 39. Distributing Data ▶ Redshift is a distributed system: ▶ A cluster contains a leader node & compute nodes ▶ A compute node contains slices (one per core) ▶ A slice contains data ▶ Slices are chosen based on two types of distribution: ▶ Round Robin (automated) ▶ Based on a distribution key (hash of a defined column) ▶ Queries run on all slices in parallel: optimal query throughput can be achieved when data is evenly spread across slices
  • 40. Unoptimised Distribution ORDERS ITEMS Order 1: Dave Smith, Total £195 Item 1.1: Order 1, Kindle Fire HD 7”, £159 Item 1.2: Order 1, Kindle Fire Case, £36 Node 1 Slice 1 | Slice 2 Node 2 Slice 1 | Slice 2 Node 3 Slice 1 | Slice 2 Order 1 Item 2.1 Order 2 Item 1.1 Order 3 Item 1.2 Item 3.1 Item 2.2 Default (No Distribution Key, Round Robin Order)
  • 41. Optimised Distribution ORDERS ITEMS Order 1: Dave Smith, Total £195 Item 1.1: Order 1, Kindle Fire HD 7”, £159 Item 1.2: Order 1, Kindle Fire Case, £36 Node 1 Slice 1 | Slice 2 Node 2 Slice 1 | Slice 2 Node 3 Slice 1 | Slice 2 Order 1 Order 2 Order 3 Item 1.1 Item 2.1 Item 1.2 Item 2.2 Item 3.1 Customised (ORDERS.ORDER_ID DISTKEY, ITEMS.ORDER_ID DISTKEY)
  • 42. Choosing a Distribution Key ▶ Frequently Joined ▶ By most commonly run queries ▶ By queries which consume the most CPU ▶ High Cardinality ▶ Large number of discrete values ▶ Low Skew ▶ Uniform Distribution ▶ No Hotspots ▶ Query STV_BLOCKLIST for Skew Factor http://docs.aws.amazon.com/redshift/latest/dg/c_analyzing-table-design.html
  • 43. Sorting Data ▶ In the slices (on disk), the data is sorted by a sort key (if none uses the insertion order) ▶ Choose a sort key that is frequently used in your queries ▶ As a query predicate (date, identifier, …) ▶ As a join parameter (it can also be the hash key) ▶ The sort key allows Redshift to avoid reading entire blocks based on predicates ▶ E.g.: a table containing a timestamp sort key, and where only recent data is accessed, will skip blocks containing “old” data
  • 44. Schema Design ▶ Optimizing a database for querying ▶ Analyse using Automatic Compression to ensure optimal IO ▶ Co-locate frequently joined tables: use distribution key wisely (avoids data transfers between nodes) ▶ For joined tables, use sort keys on the joined columns, allowing fast merge joins ▶ Compression allows you to de-normalize without penalizing storage, simplifying queries and limiting joins
  • 45. Possible Schema Optimisations ▶ Denormalise commonly used join attributes onto large tables Search Type Destination Enquiry Map DISTKEY, SORTKEY = EnquiryID Airport Airport Group Airport Group Enquiry Airport Enquiry Airport Holiday Type Group
  • 46. Possible Schema Optimisations ▶ Cache Small Tables with DISTYLE ALL Search Type Destination Enquiry Map DISTKEY, SORTKEY = EnquiryID Airport Airport Group Airport Group Enquiry Airport Enquiry Airport Holiday Type Group
  • 47. More on Table Design Table Design Documentation http://docs.aws.amazon.com/redshift/latest/dg/t_Creating_tables.html Table Design Tutorial http://docs.aws.amazon.com/redshift/latest/dg/tutorial-tuning-tables.html
  • 49. Amazon Redshift Loading Data Overview Data Volume Amazon Elastic MapReduce Amazon S3 DynamoDB Amazon RDS Amazon Redshift Amazon Glacier Source DBs logs / files AWS Direct Connect VPN Connection S3 Multipart Upload Corporate Data center AWS Cloud AWS Import/ Export
  • 50. Sharing Your Data ▶ Data Dumped from DB ▶ CSV Files Encrypted ▶ Review Encryption Code ▶ Uploaded to S3 ▶ Bucket <bucket> ▶ Data shared with AWS { "Id": "Policy1234567890", "Statement": [ { "Action": [ "s3:Get*", "s3:List*" ], "Effect": "Allow", "Principal": { "AWS": ["arn:aws:iam::887210671223:root"] }, "Resource": [ "arn:aws:s3:::<bucket-­‐name>", "arn:aws:s3:::<bucket-­‐name>/*" ], "Sid": "Stmt0987654321" } ] }
  • 51. Slice 0 Slice 1 Slice 0 Slice 1 Splitting Data Files Client.txt.1 Client.txt.2 Client.txt.3 Client.txt.4 Node 0 Node 1 2 XL Compute Nodes Copy customer from ‘s3://mydata/client.txt’ Credentials ‘aws_access_key_id=<your-access-key>; aws_secret_access_key=<your_secret_key>’ Delimiter ‘|’; mydata
  • 52. Loading Encrypted Data Files Client.txt Encrypt the file using the envelope key Generate a one time use envelope symmetric key Encrypt the envelope key using a master public key or symmetric key Amazon Redshift Copy customer from ‘s3://mydata/client.txt’ Credentials ‘aws_access_key_id=<your-access-key>; aws_secret_access_key=<your_secret_key>; Master_symmetric_key=<master_key>’ Delimiter ‘|’ encrypted;
  • 53. Typical Load Issues ▶ Mismatch between data types in table and values in input data fields ▶ Mismatch between number of columns in table and number of fields in input data ▶ Mismatched quotes ▶ Redshift supports both single and double quotes; however, these quotes must be balanced appropriately ▶ Incorrect format for date/time data in input files ▶ Use DATEFORMAT and TIMEFORMAT to control ▶ Out-of-range values in input files (for numeric columns) ▶ Number of distinct values for a column exceeds the limitation for its compression encoding
  • 54. Direct SQL ▶ Redshift supports standard DML commands ▶ INSERT, UPDATE, DELETE ▶ Redshift does not support single-command merge (upsert) statement ▶ Load data into a staging table ▶ Joining the staging table with the target table ▶ UPDATE data where row exists ▶ INSERT where no row exists ▶ All Direct SQL Commands go via Leader Node
  • 55. Data Loading Best Practices ▶ Use a COPY Command to load data ▶ Use a single COPY command ▶ Split your data into multiple files ▶ Compress your data files with GZIP ▶ Use multi-row inserts if COPY is not possible ▶ Bulk insert operations (INSERT INTO…SELECT and CREATE TABLE AS) provide high performance data insertion
  • 56. Data Loading Best Practices ▶ Load your data in sort key order to avoid needing to vacuum ▶ Organize your data as a sequence of time-series tables, where each table is identical but contains data for different time ranges ▶ Use staging tables to perform an upsert ▶ Run the VACUUM command whenever you add, delete, or modify a large number of rows, unless you load your data in sort key order ▶ Increase the memory available to a COPY or VACUUM by increasing wlm_query_slot_count ▶ Run the ANALYZE command whenever you’ve made a non-trivial number of changes to your data to ensure your table statistics are current
  • 57. Working with data in Amazon Redshift
  • 58. RDBMS Redshift OLTP ERP Reporting & BI
  • 59. RDBMS Redshift OLTP ERP Reporting & BI
  • 61.
  • 62.
  • 63.
  • 65. BACKUP AND RESTORATION Backups to Amazon S3 are Co1m6 copresu te Node Amazon S3 128GB RAM Co1m6 copresu te Node 16TB disk 128GB RAM 16TB disk 128GB RAM Co1m6 copresu te Node 16TB disk automatic, continuous and incremental Configurable system snapshot retention period User snapshots on-demand Streaming restores enable you to resume querying faster
  • 66. CLUSTER DATA REPLICATION + AUTOMATED BACKUPS ON AMAZON S3 + NODE MONITORING
  • 68. 16 cores Compute Node L16e caodrese r Node 16 cores Compute Node 16 cores Compute Node Resize while remaining online Provision a new cluster in the background 16 cores Compute Node 16 cores Compute Node 16 cores Compute Node 16 cores Compute Node L16e caodrese r Node Copy data in parallel from node to node Only charged for source cluster SQL Clients/BI Tools 128GB RAM 48TB disk 128GB RAM 48TB disk 128GB RAM 48TB disk 128GB RAM 48TB disk 128GB RAM 48TB disk 128GB RAM 48TB disk 128GB RAM 48TB disk 128GB RAM 48TB disk 128GB RAM 48TB disk
  • 69. 128GB RAM 16 cores Compute Node 48TB disk 128GB RAM 16 cores Compute Node 48TB disk 128GB RAM 16 cores Compute Node 48TB disk 128GB RAM 16 cores Compute Node 48TB disk 128GB RAM L16e caodrese r Node 48TB disk Automatic SQL endpoint switchover via DNS Decommission the source cluster SQL Clients/BI Tools
  • 70. Resizing a Redshift Data Warehouse via the AWS Console
  • 71. Resizing a Redshift Data Warehouse via the AWS Console
  • 72. Resizing a Redshift Data Warehouse via the AWS Console
  • 74. Airbnb Nokia Find out more about these and other AWS case studies here: aws.amazon.com/solutions/case-studies/
  • 75. Resources to learn more Financial Times Case Study aws.amazon.com/solutions/case-studies/financial-times Getting Started with Amazon Redshift aws.amazon.com/redshift/getting-started/ Uses & Best Practices for Amazon Redshift AWS Summit 2014 San Francisco (video) http://youtu.be/reQtXquDpzo Your local AWS Solutions Architect J
  • 77. Optimised for Data Warehousing Scalable No Up-Front Costs Amazon Redshift Compatible Secure Simple
  • 78. Find out more: aws.amazon.com/redshift
  • 79. AWS Training & Certification Certification Demonstrate your skills, knowledge, and expertise with the AWS platform aws.amazon.com/certification Self-Paced Labs Try products, gain new skills, and get hands-on practice working with AWS technologies aws.amazon.com/training/ self-paced-labs Training Skill up and gain confidence to design, develop, deploy and manage your applications on AWS aws.amazon.com/training
  • 80. Ian Massingham – Technical Evangelist @IanMmmm @AWS_UKI for local AWS events & news @AWScloud for Global AWS News and Announcements ©Amazon.com, Inc. and its affiliates. All rights reserved.
  • 81. We typically see customers start by trying our services Get started now at : aws.amazon.com/getting-started
  • 82. Design your application for the AWS Cloud More details on the AWS Architecture Center at : aws.amazon.com/architecture