SlideShare une entreprise Scribd logo
1  sur  29
Michael Labib, Specialist Solutions Architect
August 2016
Managing IoT and Time Series
Data with Amazon ElastiCache
for Redis
2
Learning Objectives
 Understand Time Series Data & Challenges
 Learn about Amazon ElastiCache for Time
Series Data
 Learn how to build Time Series Solutions
with Amazon ElastiCache
 Explore a Sensor Data Demonstration
Time Series Data
Time Series Data – What is it?
 Device / Sensor Data
 Website Clickstream events
 Logging and metrics
 Social Media and sentiment
analysis
 Can be analyzed upon
collection or batches
4
Time Series Data – Challenges
 What type of database should you use?
 Relational or NoSQL
 How should you model the data to support the queries and analysis
needed?
 What types of aggregations will you need?
 What type of information would do you need to gather?
 How will the applications access the data?
 How do you build the solution in a cost effective manner?
 How long should you retain the data?
 How do you manage scalability given the amount of data and sensors grow?
 And so on…
5
Database Families
Relational
+ Mature Technology
+ SQL is widely adopted
+ Durable and Consistent
- Inflexible data model
- Hard to scale
- Performance limitations
- Updates and other changes can be slow
- Limited to speed of storage technology
1 = Row
2 = Column
6
Database Families
NoSQL
+ ’Schema-less’ = flexible data model
+ Quickly adapt to new requirements
+ Easier to scale
+ Performance
+ Generally higher throughput and
+ lower latency than Relational DB
- Can be less durable and consistent
- Query tools less mature
7
Database Families
NoSQL: In-Memory Key-Value
+ Very fast performance
+ Sub millisecond reads and writes
+ High throughput
+ Schema-Less – Flexible data model
+ Redis - Advanced Data Structures
- Still emerging technology
- Query and other tools less mature
- Less connectors than other database
options
- Less durable
8
Request Rate
High Low
Latency
Low High
Structure
Low
High
9
Data Volume
Low High
Amazon
RDS
Amazon S3
Amazon
Glacier
Amazon
CloudSearch and
Amazon
Elasticsearch
Amazon
DynamoDB
Amazon
ElastiCache
HDFS
Ingesting Time Series Data with Amazon
ElastiCache
AWS
Lambda
Amazon
Kinesis
Streams
AWS
IoT
Data
Processor
Streaming
Real-time
Data
Device
Data
Amazon
ElastiCache for Redis
Datastore
1
Amazon
EC2
10
Amazon ElastiCache Overview
In-Memory Key-Value Store
High-performance
Redis and Memcached
Fully managed; Zero admin
Highly Available and Reliable
Hardened by Amazon
12
Redis – The In-Memory Leader
Powerful
~200 commands + Lua scripting
In-memory data structure server
Utility data structures
strings, lists, hashes, sets, sorted
sets, bitmaps & HyperLogLogs
Simple
Atomic operations
supports transactions
has ACID properties
Ridiculously fast!
<1ms latency for most commands
Highly Available
replication
Persistent
snapshots or append-only log
Open Source
13
Fully managed service = Automated Operations
Redis datastore hosted on Amazon EC2 Amazon ElastiCache for Redis
14
ElastiCache - Customer Value
Extreme
Performance
Sub-millisecond
access latencies
Engineered for
Cloud Scale
Open Source
Compatible
Compatible with
Redis and
Memcached
Existing code
will work when
you update node
end points
Fully
Managed
Automates tasks
such as failed
node
replacement,
software
patching,
upgrades and
backups
CloudWatch
enables you to
monitor cache
performance
metrics
Secure and
Hardened
Supports
Amazon VPC
and IAM for
secure and fine
grained access.
Monitors your
nodes and
applies security
patches when
necessary
Highly
Available
and Scalable
Multi-AZ with
automatic
failover to a
read replica, no
human
intervention
required.
Easily scale
your Redis
(vertically) and
Memcached
(horizontally)
environments
Cost
Effective
Pay as low as
US$0.017 per
hour. Get
started with 750
free hours per
month of a
micro node for a
year.
No cross
availability zone
data transfer
costs.
15
Building Time Series Solutions
with Amazon ElastiCache
Streaming Data
Amazon
ElastiCache
Amazon
EC2
Data
Sources
AWS
Lambda
(Dashboard)
1
Amazon
Kinesis
Streams
Amazon
DynamoDB
Hot Data
Longer
Retention
17
Streaming Data Analytics
Data
Sources
1
Amazon
Kinesis
Streams
Amazon
EMR
(Spark Streaming)
Amazon
ElastiCache
Amazon
S3
Amazon
EC2
(Dashboard)
Amazon
Redshift
Spark Redis Connector
Data Lake
18
Capturing Moving Data
 Equity prices, clickstreams, sensor streams …
 Amazon Kinesis and AWS Lambda are serverless
 Amazon Kinesis + Amazon ElastiCache for Redis ensure
order among incoming data
 Kinesis provides an ordered Stream
 Redis provides a Sorted Set
 Fault Tolerant
 Data Retention – Data accessible in Kinesis 24 hours after its
been added to the stream and can be extended up to 7 days.
 ElastiCache provides HA with automatic failover to a read replica
 ElastiCache is fast and flexible
 Can query and retrieve hot data quickly
19
ElastiCache for Streaming Data Analytics
 Access all Redis Data structures directly from
Spark
 Significantly accelerate Spark performance over
disk-based solutions
 Simplify analytics by leveraging Redis Data
Structures reducing processing overhead and
complexity
 Improve application performance by accessing
aggregations and hot data from Redis
20
Event Driven Data
Capture time ordered changes from a
DynamoDB table using DynamoDB Streams
and propagate to ElastiCache
AWS
Lambda
Amazon
DynamoDB
Streams
Amazon
SES
Amazon
S3
Amazon
Cognito
Amazon
SNS
Amazon
CloudWatch
Amazon
Kinesis
Streams
Amazon
API
Gateway
Event Sources
21
Hot Data
Amazon
ElastiCache
AWS
LambdaLonger
Retention
Amazon
DynamoDB
Responding to Events
 Can cover a huge variety of different data sources
 Security events, application events, messaging events, user actions…
 For IoT, often created when a sensor crosses a threshold
 Lambda captures the incoming event data and routes it
 Hot data to ElastiCache
 Cold data to slower, disk-based databases
 Lambda function to keep databases synchronized
 Cost effective
 Use the speed and flexibility of ElastiCache for hot data queries
‒ No incremental cost per query – price is driven by amount of
memory available
 Use the DynamoDB or RDS for longer data retention
22
Device Data
AWS
IoT
AWS
IoT Device
Amazon
EC2
(Dashboard)
AWS
Lambda
Hot Data
Amazon
ElastiCache
Amazon
DynamoDB
Longer
Retention
Historical |
Data Lake
Amazon
S3
Amazon
Glacier
Cold Data
Amazon
Kinesis
Firehose
23
Sensors and Data Bursts
 “We are all glorified motion sensors. Some things only become
visible to us when they undergo change” – Vera Nazarian, author
 Sensors enable detailed understanding of conditions
 Quickly perceive and react to changing conditions
 Device data can be streaming (time-based delivery) or event
driven (threshold-based delivery)
 Or both! A temperature sensor might deliver regular readings
and also send an alert when it becomes too hot or too cold
 Sensor data solutions must scale to meet loads
 Sensor clusters can generate large data bursts
 AWS IoT auto scales
 ElastiCache for Redis allows flexible sizing to meet changing
workloads
24
Data Modeling for Sensors
 Sensor data is usually of high variety
 Different sensors, and different versions of the same
sensor, collect different data in different formats
 A single device can have many sensors
 The flexible schema of ElastiCache for Redis allows non-
disruptive evolution of data models
 Quickly test and use new or modified data models with
no need to migrate schemas, rebalance shards or other
onerous administrative tasks
25
Sensor Demo
Sensor Demo
AWS
IoT
AWS
IoT Device
Amazon
EC2
(Dashboard)
AWS
Lambda
Amazon
ElastiCache
http://www.seeedstudio.com/wiki/Intel%C2%AE_Edison_and_Grove_IoT_Starter_Kit_Powered_by_AWS
Intel Edison –
Grove Temperature Sensor
27
Sensor Demo - Data Model
SensorData
timestamp
deviceId
temperature
humidity
climate
timestamp
Pub/Sub
deviceId
Hash
If (temperature> 95 F)
publish notification
deviceIP
requestId
Strings (INCR/DECR)
climate:warm
climate:cool
climate:hot
climate:cold
Time-Series events
Sort by timestamp (score)
Aggregations
For dashboards, analytics, etc.
Event details
Data structures updated within a
transaction block using Redis MULTI
Which events occurred at a particular time range?
What were the last N events?
Which devices match a particular pattern?
What is the temperature for a specific device?
What is all the data for a specific device?
What are the totals for each
climate type?
get climate:warm
get climate:cool
get climate:cold
get climate:hot
zrevrangebyscore SensorData +inf -inf WITHSCORES LIMIT 0 5
zrangebyscore SensorData (1410000000000
14400000000000
zscan SensorData 0 MATCH 2* COUNT 100
hget 2221 temperature
hgetall 2221
Sorted Set
28
Summary
 Time series data can present high
volume, velocity and variety challenges
 ElastiCache for Redis is scalable,
performant and cost effective service for
sensor data and other time series data
sets
29

Contenu connexe

Tendances

Getting Started with Managed Database Services on AWS
Getting Started with Managed Database Services on AWSGetting Started with Managed Database Services on AWS
Getting Started with Managed Database Services on AWS
Amazon Web Services
 
Databases in the Cloud - DevDay Austin 2017 Day 2
Databases in the Cloud - DevDay Austin 2017 Day 2Databases in the Cloud - DevDay Austin 2017 Day 2
Databases in the Cloud - DevDay Austin 2017 Day 2
Amazon Web Services
 

Tendances (20)

Managing Data with Amazon ElastiCache for Redis - August 2016 Monthly Webinar...
Managing Data with Amazon ElastiCache for Redis - August 2016 Monthly Webinar...Managing Data with Amazon ElastiCache for Redis - August 2016 Monthly Webinar...
Managing Data with Amazon ElastiCache for Redis - August 2016 Monthly Webinar...
 
AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...
AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...
AWS re:Invent 2016: Big Data Architectural Patterns and Best Practices on AWS...
 
Big Data Architectural Patterns and Best Practices on AWS
Big Data Architectural Patterns and Best Practices on AWSBig Data Architectural Patterns and Best Practices on AWS
Big Data Architectural Patterns and Best Practices on AWS
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon Redshift
 
Making (Almost) Any Database Faster and Cheaper with Caching
Making (Almost) Any Database Faster and Cheaper with CachingMaking (Almost) Any Database Faster and Cheaper with Caching
Making (Almost) Any Database Faster and Cheaper with Caching
 
Introduction to Amazon Aurora
Introduction to Amazon AuroraIntroduction to Amazon Aurora
Introduction to Amazon Aurora
 
Getting Started with Managed Database Services on AWS
Getting Started with Managed Database Services on AWSGetting Started with Managed Database Services on AWS
Getting Started with Managed Database Services on AWS
 
AWS re:Invent 2016: Event Handling at Scale: Designing an Auditable Ingestion...
AWS re:Invent 2016: Event Handling at Scale: Designing an Auditable Ingestion...AWS re:Invent 2016: Event Handling at Scale: Designing an Auditable Ingestion...
AWS re:Invent 2016: Event Handling at Scale: Designing an Auditable Ingestion...
 
Building A Modern Data Analytics Architecture on AWS
Building A Modern Data Analytics Architecture on AWSBuilding A Modern Data Analytics Architecture on AWS
Building A Modern Data Analytics Architecture on AWS
 
Deep Dive on MySQL Databases on AWS - AWS Online Tech Talks
Deep Dive on MySQL Databases on AWS - AWS Online Tech TalksDeep Dive on MySQL Databases on AWS - AWS Online Tech Talks
Deep Dive on MySQL Databases on AWS - AWS Online Tech Talks
 
How to Migrate your Startup to AWS
How to Migrate your Startup to AWSHow to Migrate your Startup to AWS
How to Migrate your Startup to AWS
 
What’s New in Amazon Aurora
What’s New in Amazon AuroraWhat’s New in Amazon Aurora
What’s New in Amazon Aurora
 
AWS re:Invent 2016: [REPEAT] How EA Leveraged Amazon Redshift and AWS Partner...
AWS re:Invent 2016: [REPEAT] How EA Leveraged Amazon Redshift and AWS Partner...AWS re:Invent 2016: [REPEAT] How EA Leveraged Amazon Redshift and AWS Partner...
AWS re:Invent 2016: [REPEAT] How EA Leveraged Amazon Redshift and AWS Partner...
 
AWS re:Invent 2016: JustGiving: Serverless Data Pipelines, Event-Driven ETL, ...
AWS re:Invent 2016: JustGiving: Serverless Data Pipelines, Event-Driven ETL, ...AWS re:Invent 2016: JustGiving: Serverless Data Pipelines, Event-Driven ETL, ...
AWS re:Invent 2016: JustGiving: Serverless Data Pipelines, Event-Driven ETL, ...
 
AWS re:Invent 2016: FINRA in the Cloud: the Big Data Enterprise (ENT313)
AWS re:Invent 2016: FINRA in the Cloud: the Big Data Enterprise (ENT313)AWS re:Invent 2016: FINRA in the Cloud: the Big Data Enterprise (ENT313)
AWS re:Invent 2016: FINRA in the Cloud: the Big Data Enterprise (ENT313)
 
Building Big Data Applications with Serverless Architectures - June 2017 AWS...
Building Big Data Applications with Serverless Architectures -  June 2017 AWS...Building Big Data Applications with Serverless Architectures -  June 2017 AWS...
Building Big Data Applications with Serverless Architectures - June 2017 AWS...
 
What’s New in Amazon RDS for Open-Source and Commercial Databases
What’s New in Amazon RDS for Open-Source and Commercial DatabasesWhat’s New in Amazon RDS for Open-Source and Commercial Databases
What’s New in Amazon RDS for Open-Source and Commercial Databases
 
Databases in the Cloud - DevDay Austin 2017 Day 2
Databases in the Cloud - DevDay Austin 2017 Day 2Databases in the Cloud - DevDay Austin 2017 Day 2
Databases in the Cloud - DevDay Austin 2017 Day 2
 
Introduction to Amazon Relational Database Service
Introduction to Amazon Relational Database ServiceIntroduction to Amazon Relational Database Service
Introduction to Amazon Relational Database Service
 
Migrating Your Databases to AWS Deep Dive on Amazon RDS and AWS
Migrating Your Databases to AWS Deep Dive on Amazon RDS and AWSMigrating Your Databases to AWS Deep Dive on Amazon RDS and AWS
Migrating Your Databases to AWS Deep Dive on Amazon RDS and AWS
 

En vedette

AWS re:Invent 2016: Operations Automation and Infrastructure Management with ...
AWS re:Invent 2016: Operations Automation and Infrastructure Management with ...AWS re:Invent 2016: Operations Automation and Infrastructure Management with ...
AWS re:Invent 2016: Operations Automation and Infrastructure Management with ...
Amazon Web Services
 

En vedette (20)

AWS re:Invent 2016: Operations Automation and Infrastructure Management with ...
AWS re:Invent 2016: Operations Automation and Infrastructure Management with ...AWS re:Invent 2016: Operations Automation and Infrastructure Management with ...
AWS re:Invent 2016: Operations Automation and Infrastructure Management with ...
 
Introduction to DevSecOps on AWS
Introduction to DevSecOps on AWSIntroduction to DevSecOps on AWS
Introduction to DevSecOps on AWS
 
Introduction to Cloud Computing with Amazon Web Services
Introduction to Cloud Computing with Amazon Web ServicesIntroduction to Cloud Computing with Amazon Web Services
Introduction to Cloud Computing with Amazon Web Services
 
Overview of Blue Medora - New Relic Plugin for MongoDB
Overview of Blue Medora - New Relic Plugin for MongoDBOverview of Blue Medora - New Relic Plugin for MongoDB
Overview of Blue Medora - New Relic Plugin for MongoDB
 
Introduction to IAM + Best Practices
Introduction to IAM + Best PracticesIntroduction to IAM + Best Practices
Introduction to IAM + Best Practices
 
Creating Velocity in Data Centre Migrations to AWS
Creating Velocity in Data Centre Migrations to AWSCreating Velocity in Data Centre Migrations to AWS
Creating Velocity in Data Centre Migrations to AWS
 
Workshop: Building Your First Big Data Application on AWS
Workshop: Building Your First Big Data Application on AWSWorkshop: Building Your First Big Data Application on AWS
Workshop: Building Your First Big Data Application on AWS
 
Webinar: Schema Patterns and Your Storage Engine
Webinar: Schema Patterns and Your Storage EngineWebinar: Schema Patterns and Your Storage Engine
Webinar: Schema Patterns and Your Storage Engine
 
XFD導入実践記
XFD導入実践記XFD導入実践記
XFD導入実践記
 
Creating Your Virtual Data Center: VPC Fundamentals and Connectivity Options
 Creating Your Virtual Data Center: VPC Fundamentals and Connectivity Options Creating Your Virtual Data Center: VPC Fundamentals and Connectivity Options
Creating Your Virtual Data Center: VPC Fundamentals and Connectivity Options
 
Deep Dive on Amazon S3
Deep Dive on Amazon S3Deep Dive on Amazon S3
Deep Dive on Amazon S3
 
Sony DAD NMS & Our Migration to the AWS Cloud
Sony DAD NMS & Our Migration to the AWS CloudSony DAD NMS & Our Migration to the AWS Cloud
Sony DAD NMS & Our Migration to the AWS Cloud
 
Getting Started with the Hybrid Cloud: Enterprise Backup and Recovery
 Getting Started with the Hybrid Cloud: Enterprise Backup and Recovery Getting Started with the Hybrid Cloud: Enterprise Backup and Recovery
Getting Started with the Hybrid Cloud: Enterprise Backup and Recovery
 
Getting started with amazon aurora - Toronto
Getting started with amazon aurora - TorontoGetting started with amazon aurora - Toronto
Getting started with amazon aurora - Toronto
 
Hack-Proof Your Cloud: Responding to 2016 Threats
Hack-Proof Your Cloud: Responding to 2016 ThreatsHack-Proof Your Cloud: Responding to 2016 Threats
Hack-Proof Your Cloud: Responding to 2016 Threats
 
Deep Dive: Developing, Deploying & Operating Mobile Apps with AWS
Deep Dive: Developing, Deploying & Operating Mobile Apps with AWS Deep Dive: Developing, Deploying & Operating Mobile Apps with AWS
Deep Dive: Developing, Deploying & Operating Mobile Apps with AWS
 
AWS Summit Auckland - Building a Server-less Data Lake on AWS
AWS Summit Auckland - Building a Server-less Data Lake on AWSAWS Summit Auckland - Building a Server-less Data Lake on AWS
AWS Summit Auckland - Building a Server-less Data Lake on AWS
 
Getting Started with the Hybrid Cloud: Enterprise Backup and Recovery
Getting Started with the Hybrid Cloud: Enterprise Backup and RecoveryGetting Started with the Hybrid Cloud: Enterprise Backup and Recovery
Getting Started with the Hybrid Cloud: Enterprise Backup and Recovery
 
AWS Summit Auckland- Developing Applications for IoT
AWS Summit Auckland-  Developing Applications for IoTAWS Summit Auckland-  Developing Applications for IoT
AWS Summit Auckland- Developing Applications for IoT
 
S'étendre à l'international
S'étendre à l'internationalS'étendre à l'international
S'étendre à l'international
 

Similaire à Managing Data with Voume Velocity, and Variety with Amazon ElastiCache for Redis

Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2
Amazon Web Services
 

Similaire à Managing Data with Voume Velocity, and Variety with Amazon ElastiCache for Redis (20)

Using Data Lakes
Using Data LakesUsing Data Lakes
Using Data Lakes
 
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
 
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
Understanding AWS Managed Database and Analytics Services | AWS Public Sector...
 
Deploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWSDeploying your Data Warehouse on AWS
Deploying your Data Warehouse on AWS
 
AWS Big Data Landscape
AWS Big Data LandscapeAWS Big Data Landscape
AWS Big Data Landscape
 
Fast Track to Your Data Lake on AWS
Fast Track to Your Data Lake on AWSFast Track to Your Data Lake on AWS
Fast Track to Your Data Lake on AWS
 
AWS Big Data Solution Days
AWS Big Data Solution DaysAWS Big Data Solution Days
AWS Big Data Solution Days
 
Architecting Data in the AWS Ecosystem
Architecting Data in the AWS EcosystemArchitecting Data in the AWS Ecosystem
Architecting Data in the AWS Ecosystem
 
AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)
AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)
AWS re:Invent 2016: How to Build a Big Data Analytics Data Lake (LFS303)
 
Finding Meaning in the Noise: Understanding Big Data with AWS Analytics
Finding Meaning in the Noise: Understanding Big Data with AWS AnalyticsFinding Meaning in the Noise: Understanding Big Data with AWS Analytics
Finding Meaning in the Noise: Understanding Big Data with AWS Analytics
 
Semplificare l'analisi dei dati con architetture "Serverless": architetture e...
Semplificare l'analisi dei dati con architetture "Serverless": architetture e...Semplificare l'analisi dei dati con architetture "Serverless": architetture e...
Semplificare l'analisi dei dati con architetture "Serverless": architetture e...
 
Using Data Lakes
Using Data Lakes Using Data Lakes
Using Data Lakes
 
Using Data Lakes: Data Analytics Week SF
Using Data Lakes: Data Analytics Week SFUsing Data Lakes: Data Analytics Week SF
Using Data Lakes: Data Analytics Week SF
 
Owning Your Own (Data) Lake House
Owning Your Own (Data) Lake HouseOwning Your Own (Data) Lake House
Owning Your Own (Data) Lake House
 
Getting Started with Amazon Redshift
Getting Started with Amazon RedshiftGetting Started with Amazon Redshift
Getting Started with Amazon Redshift
 
Big Data Session 1.pptx
Big Data Session 1.pptxBig Data Session 1.pptx
Big Data Session 1.pptx
 
Big Data Architectural Patterns and Best Practices on AWS
Big Data Architectural Patterns and Best Practices on AWSBig Data Architectural Patterns and Best Practices on AWS
Big Data Architectural Patterns and Best Practices on AWS
 
Best Practices Using Big Data on AWS | AWS Public Sector Summit 2017
Best Practices Using Big Data on AWS | AWS Public Sector Summit 2017Best Practices Using Big Data on AWS | AWS Public Sector Summit 2017
Best Practices Using Big Data on AWS | AWS Public Sector Summit 2017
 
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2
Building Data Warehouses and Data Lakes in the Cloud - DevDay Austin 2017 Day 2
 
Database and Analytics on the AWS Cloud - AWS Innovate Toronto
Database and Analytics on the AWS Cloud - AWS Innovate TorontoDatabase and Analytics on the AWS Cloud - AWS Innovate Toronto
Database and Analytics on the AWS Cloud - AWS Innovate Toronto
 

Plus de Amazon Web Services

Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
Amazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
Amazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
Amazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
Amazon Web Services
 

Plus de Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Dernier

Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
Joaquim Jorge
 

Dernier (20)

Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
 
Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024Partners Life - Insurer Innovation Award 2024
Partners Life - Insurer Innovation Award 2024
 
Boost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdfBoost Fertility New Invention Ups Success Rates.pdf
Boost Fertility New Invention Ups Success Rates.pdf
 
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps ScriptAutomating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
 
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot TakeoffStrategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
 
[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf[2024]Digital Global Overview Report 2024 Meltwater.pdf
[2024]Digital Global Overview Report 2024 Meltwater.pdf
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 
Artificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and MythsArtificial Intelligence: Facts and Myths
Artificial Intelligence: Facts and Myths
 
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...Driving Behavioral Change for Information Management through Data-Driven Gree...
Driving Behavioral Change for Information Management through Data-Driven Gree...
 
08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men08448380779 Call Girls In Friends Colony Women Seeking Men
08448380779 Call Girls In Friends Colony Women Seeking Men
 
Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024Finology Group – Insurtech Innovation Award 2024
Finology Group – Insurtech Innovation Award 2024
 
GenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day PresentationGenCyber Cyber Security Day Presentation
GenCyber Cyber Security Day Presentation
 

Managing Data with Voume Velocity, and Variety with Amazon ElastiCache for Redis

  • 1. Michael Labib, Specialist Solutions Architect August 2016 Managing IoT and Time Series Data with Amazon ElastiCache for Redis
  • 2. 2 Learning Objectives  Understand Time Series Data & Challenges  Learn about Amazon ElastiCache for Time Series Data  Learn how to build Time Series Solutions with Amazon ElastiCache  Explore a Sensor Data Demonstration
  • 4. Time Series Data – What is it?  Device / Sensor Data  Website Clickstream events  Logging and metrics  Social Media and sentiment analysis  Can be analyzed upon collection or batches 4
  • 5. Time Series Data – Challenges  What type of database should you use?  Relational or NoSQL  How should you model the data to support the queries and analysis needed?  What types of aggregations will you need?  What type of information would do you need to gather?  How will the applications access the data?  How do you build the solution in a cost effective manner?  How long should you retain the data?  How do you manage scalability given the amount of data and sensors grow?  And so on… 5
  • 6. Database Families Relational + Mature Technology + SQL is widely adopted + Durable and Consistent - Inflexible data model - Hard to scale - Performance limitations - Updates and other changes can be slow - Limited to speed of storage technology 1 = Row 2 = Column 6
  • 7. Database Families NoSQL + ’Schema-less’ = flexible data model + Quickly adapt to new requirements + Easier to scale + Performance + Generally higher throughput and + lower latency than Relational DB - Can be less durable and consistent - Query tools less mature 7
  • 8. Database Families NoSQL: In-Memory Key-Value + Very fast performance + Sub millisecond reads and writes + High throughput + Schema-Less – Flexible data model + Redis - Advanced Data Structures - Still emerging technology - Query and other tools less mature - Less connectors than other database options - Less durable 8
  • 9. Request Rate High Low Latency Low High Structure Low High 9 Data Volume Low High Amazon RDS Amazon S3 Amazon Glacier Amazon CloudSearch and Amazon Elasticsearch Amazon DynamoDB Amazon ElastiCache HDFS
  • 10. Ingesting Time Series Data with Amazon ElastiCache AWS Lambda Amazon Kinesis Streams AWS IoT Data Processor Streaming Real-time Data Device Data Amazon ElastiCache for Redis Datastore 1 Amazon EC2 10
  • 12. In-Memory Key-Value Store High-performance Redis and Memcached Fully managed; Zero admin Highly Available and Reliable Hardened by Amazon 12
  • 13. Redis – The In-Memory Leader Powerful ~200 commands + Lua scripting In-memory data structure server Utility data structures strings, lists, hashes, sets, sorted sets, bitmaps & HyperLogLogs Simple Atomic operations supports transactions has ACID properties Ridiculously fast! <1ms latency for most commands Highly Available replication Persistent snapshots or append-only log Open Source 13
  • 14. Fully managed service = Automated Operations Redis datastore hosted on Amazon EC2 Amazon ElastiCache for Redis 14
  • 15. ElastiCache - Customer Value Extreme Performance Sub-millisecond access latencies Engineered for Cloud Scale Open Source Compatible Compatible with Redis and Memcached Existing code will work when you update node end points Fully Managed Automates tasks such as failed node replacement, software patching, upgrades and backups CloudWatch enables you to monitor cache performance metrics Secure and Hardened Supports Amazon VPC and IAM for secure and fine grained access. Monitors your nodes and applies security patches when necessary Highly Available and Scalable Multi-AZ with automatic failover to a read replica, no human intervention required. Easily scale your Redis (vertically) and Memcached (horizontally) environments Cost Effective Pay as low as US$0.017 per hour. Get started with 750 free hours per month of a micro node for a year. No cross availability zone data transfer costs. 15
  • 16. Building Time Series Solutions with Amazon ElastiCache
  • 18. Streaming Data Analytics Data Sources 1 Amazon Kinesis Streams Amazon EMR (Spark Streaming) Amazon ElastiCache Amazon S3 Amazon EC2 (Dashboard) Amazon Redshift Spark Redis Connector Data Lake 18
  • 19. Capturing Moving Data  Equity prices, clickstreams, sensor streams …  Amazon Kinesis and AWS Lambda are serverless  Amazon Kinesis + Amazon ElastiCache for Redis ensure order among incoming data  Kinesis provides an ordered Stream  Redis provides a Sorted Set  Fault Tolerant  Data Retention – Data accessible in Kinesis 24 hours after its been added to the stream and can be extended up to 7 days.  ElastiCache provides HA with automatic failover to a read replica  ElastiCache is fast and flexible  Can query and retrieve hot data quickly 19
  • 20. ElastiCache for Streaming Data Analytics  Access all Redis Data structures directly from Spark  Significantly accelerate Spark performance over disk-based solutions  Simplify analytics by leveraging Redis Data Structures reducing processing overhead and complexity  Improve application performance by accessing aggregations and hot data from Redis 20
  • 21. Event Driven Data Capture time ordered changes from a DynamoDB table using DynamoDB Streams and propagate to ElastiCache AWS Lambda Amazon DynamoDB Streams Amazon SES Amazon S3 Amazon Cognito Amazon SNS Amazon CloudWatch Amazon Kinesis Streams Amazon API Gateway Event Sources 21 Hot Data Amazon ElastiCache AWS LambdaLonger Retention Amazon DynamoDB
  • 22. Responding to Events  Can cover a huge variety of different data sources  Security events, application events, messaging events, user actions…  For IoT, often created when a sensor crosses a threshold  Lambda captures the incoming event data and routes it  Hot data to ElastiCache  Cold data to slower, disk-based databases  Lambda function to keep databases synchronized  Cost effective  Use the speed and flexibility of ElastiCache for hot data queries ‒ No incremental cost per query – price is driven by amount of memory available  Use the DynamoDB or RDS for longer data retention 22
  • 23. Device Data AWS IoT AWS IoT Device Amazon EC2 (Dashboard) AWS Lambda Hot Data Amazon ElastiCache Amazon DynamoDB Longer Retention Historical | Data Lake Amazon S3 Amazon Glacier Cold Data Amazon Kinesis Firehose 23
  • 24. Sensors and Data Bursts  “We are all glorified motion sensors. Some things only become visible to us when they undergo change” – Vera Nazarian, author  Sensors enable detailed understanding of conditions  Quickly perceive and react to changing conditions  Device data can be streaming (time-based delivery) or event driven (threshold-based delivery)  Or both! A temperature sensor might deliver regular readings and also send an alert when it becomes too hot or too cold  Sensor data solutions must scale to meet loads  Sensor clusters can generate large data bursts  AWS IoT auto scales  ElastiCache for Redis allows flexible sizing to meet changing workloads 24
  • 25. Data Modeling for Sensors  Sensor data is usually of high variety  Different sensors, and different versions of the same sensor, collect different data in different formats  A single device can have many sensors  The flexible schema of ElastiCache for Redis allows non- disruptive evolution of data models  Quickly test and use new or modified data models with no need to migrate schemas, rebalance shards or other onerous administrative tasks 25
  • 28. Sensor Demo - Data Model SensorData timestamp deviceId temperature humidity climate timestamp Pub/Sub deviceId Hash If (temperature> 95 F) publish notification deviceIP requestId Strings (INCR/DECR) climate:warm climate:cool climate:hot climate:cold Time-Series events Sort by timestamp (score) Aggregations For dashboards, analytics, etc. Event details Data structures updated within a transaction block using Redis MULTI Which events occurred at a particular time range? What were the last N events? Which devices match a particular pattern? What is the temperature for a specific device? What is all the data for a specific device? What are the totals for each climate type? get climate:warm get climate:cool get climate:cold get climate:hot zrevrangebyscore SensorData +inf -inf WITHSCORES LIMIT 0 5 zrangebyscore SensorData (1410000000000 14400000000000 zscan SensorData 0 MATCH 2* COUNT 100 hget 2221 temperature hgetall 2221 Sorted Set 28
  • 29. Summary  Time series data can present high volume, velocity and variety challenges  ElastiCache for Redis is scalable, performant and cost effective service for sensor data and other time series data sets 29