SlideShare une entreprise Scribd logo
1  sur  28
Télécharger pour lire hors ligne
Scaling your Application for Growth using
Automation
November 14,2013
Ken Leung- Euclid Analytics
Greg Narain- Chute

© 2013 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.
What is Euclid?
Online Analytics for the Offline World
E-Commerce

Physical Stores
How Euclid Works
We use Wi-Fi technology to turn in-store behavior into actionable insights
XX:XX:XX:XX:XX:XX

Wi-Fi AP detects smartphone
MAC addresses

Shopper carrying smartphone
walks by or into store

Euclid analyzes data
for trends and insights

Insights on customer acquisition,
engagement and retention
Market Leader in Real World Analytics
•
•
•

•
•

First to develop proprietary Wi-Fi based analytics
–
–

Most advanced data analytics capabilities and experience in retail environments
Backed by tier 1 investors: Series A led by NEA, Series B led by Benchmark Capital

World-class executive team
–
–

Co-founder of Google Analytics, Founding team of ShopperTrak

Executive experience from Google, SAP, Ariba and Tibco

Experience with the world’s leading retailers
–

Specialty retail, QSR, department store, big box, automotive, malls and more

Largest data scale and rapidly accelerating adoption
–
–
–

Recording >5B events per day
Dataset with >100M unique devices (shoppers)
Gartner Cool Vendor 2012; Idea Innovation Award Winner: Business Technology 2012

Market leadership recognized by:
Euclid is a
Data Company
As of October, 2013, the
Euclid Network:
• Covers over 600
shopping centers, malls,
and street locations
• Processes 50 TB of raw
data
• Collects over 30 GB of
raw data daily

Acquire
Data

•Reliable
•Durable
•Scalable

Process
Data

•Efficient
•Flexible
•Scalable
•Versatile

Deliver
Data

•Richness
•Sophistication
•Value
Euclid’s Challenges
Common Challenges
• Scaling
• Performance
• Cost effectiveness
• Removing the technical
barriers for innovation
• “Failing fast”

Unique Challenges
• Recomputing the entire
history of Euclid data!
– Need fast results
– Need a lot of computational
power, sometimes greater
than 100x of regular daily
compute needs
Euclid’s Use of AWS
Euclid started with AWS from Day One
- Amazon EC2, Amazon RDS, Amazon EMR,
Amazon S3
- AWS Elastic Beanstalk
- Amazon Redshift
Heroku from Amazon Partner Network (APN)
Architecture
Data Acquisition
Elastic Beanstalk
- Multi-AZ, multi-region
- Load balancing, auto scaling
- Monitoring, notification
- Deployment Management
- Amazon EBS-backed volume for failover data recovery
- Log rotation to Amazon S3 (99.999999999% durability)
All built-in.
Data Acquisition - code
<%@ page import="java.io.*,java.util.*,com.euclid.spongebob..server.*" %><%
Properties sensorCredentials = (Properties)this.getServletContext().getAttribute("sensor_credentials");
String sensor_id = request.getParameter("sensor_id");
String credential = request.getParameter("credential");
String body = request.getParameter("body");
if (sensor_id == null || !sensorCredentials.containsKey(sensor_id) ||
!sensorCredentials.getProperty(sensor_id).equals(credential)) {
response.sendError(HttpServletResponse.SC_UNAUTHORIZED);
return;
}
java.util.logging.Logger logger = java.util.logging.Logger.getLogger("spongebob");
logger.log(java.util.logging.Level.INFO, body);
response.setStatus(HttpServletResponse.SC_OK);
%>
Data Acquisition - Principles
• Log to Amazon EBS Volume – high I/O
performance
• As “dumb” as possible: reliable
• Fork data from disk to
– Amazon S3 for batch processing
– Kafka messaging service for real time processing
Data Acquisition – System Monitor
• Low latency
• Low CPU utilization
Data Processing - Pipeline
Raw Data

Map
Reduce
(EMR)

Product
dashboard, insights

R/D
Analytics
Pipeline – Dual Purposes
Two worlds, one platform
• Big Data Engineering – noSQL
– Pig Latin with Amazon EMR (Java, Python UDFs)
– Work flows (Jenkins), shell scripting

• Analytics, Analysts, Business – SQL
– Excel
– Tableau
– Maybe some Python, etc.
Pipeline - Architecture
Amazon S3

SQL DB: MySQL, Redshift

Raw Data
Meta
Data

Aggr.
Level 1

3rd Party
Data

Some Raw Data
Analytics

Aggr.
Level 1

Direct
DB Load

Meta
Data
3rd Party
Data

Models
Algorithms

Aggr.
Level n

MapReduce
MySQL
Product
dashboard, insights

SQL

Aggr.
Level n

R&D

Models
Algorithms
SQL: MySQL, Amazon Redshift, both by AWS
• Started with MySQL, Amazon Redshift Preview Jan
2013
• MySQL 1TB limit vs Amazon Redshift PB scale
• Performance, night and day
– E.g., count distinct of 100m rows: 5h in MySQL, 2m in Amazon Redshift

• Amazon Redshift: killer data warehouse
– Low cost
– No DBA!
– Easy integration
Pipeline - Monitoring
• System monitoring provided by AWS
• Workflow monitoring with Jenkins
– Failure notification
– Dependency management

• Data quality (including acquisition) monitoring
– Also utilize Jenkins
– Scripts that check data at various stages
– Each script as a job in the Jenkins workflow
Pipeline - Workflow
Part of the Jenkins Dependency Graph
AWS Benefits
• “Apps not Ops” – Euclid does not have/need an
Ops team
• Scale up and down on demand
• Pay as we go
• Agile (innovations, time-to-market)
Chute
1. Data
2. Automation
3. Uptime
4. Monitoring
Data
● Real time analytics is hard
● Hadoop!
○ Sqoop imports SQL data to HDFS
○ Clojure
○ Scalding (github.com/twitter/scalding)
● Elasticsearch, Logstash
○ parse logs to track activity for customers
Hadoop cluster
or
EMR

Sharded Postgres

SQOOP

Server
HDFS

S3
N number of
EC2 instances
● varnish
● logstash

plugin front ends
Kibana

ELB

Redis cluster

ElasticSearch

Events Server
● nginx
● logstash

API
Automation through DevOps
● Chute has 100 servers
○ Configured many manually
○ 82? of 100 now managed by Chef
● Whirr
● Sqoop and Cron to automate data import
● route53 with Chef for urls
Uptime
● Architect applications to scale horizontally
○ AWS launches servers on demand
○ spot and reserve pricing
● Keep services running with Chef
○ Chef makes it easy to wrap programs as
a service on AWS
Monitoring
● newrelic
○ server resource monitoring
○ application monitoring
● logstash + kibana
○ elasticsearch backend
○ redis (cluster)
○ can monitor server logs
Please give us your feedback on this
presentation

CPN209
As a thank you, we will select prize
winners daily for completed surveys!

Contenu connexe

Tendances

Masterclass Webinar - Amazon Elastic MapReduce (EMR)
Masterclass Webinar - Amazon Elastic MapReduce (EMR)Masterclass Webinar - Amazon Elastic MapReduce (EMR)
Masterclass Webinar - Amazon Elastic MapReduce (EMR)
Amazon Web Services
 

Tendances (20)

Deep Dive: Amazon Elastic MapReduce
Deep Dive: Amazon Elastic MapReduceDeep Dive: Amazon Elastic MapReduce
Deep Dive: Amazon Elastic MapReduce
 
Deep Dive: Amazon Elastic MapReduce
Deep Dive: Amazon Elastic MapReduceDeep Dive: Amazon Elastic MapReduce
Deep Dive: Amazon Elastic MapReduce
 
Best Practices for Managing Hadoop Framework Based Workloads (on Amazon EMR) ...
Best Practices for Managing Hadoop Framework Based Workloads (on Amazon EMR) ...Best Practices for Managing Hadoop Framework Based Workloads (on Amazon EMR) ...
Best Practices for Managing Hadoop Framework Based Workloads (on Amazon EMR) ...
 
AWS Summit London 2014 | From One to Many - Evolving VPC Design (400)
AWS Summit London 2014 | From One to Many - Evolving VPC Design (400)AWS Summit London 2014 | From One to Many - Evolving VPC Design (400)
AWS Summit London 2014 | From One to Many - Evolving VPC Design (400)
 
AWS Summit London 2014 | Customer Stories | Just Eat
AWS Summit London 2014 | Customer Stories | Just EatAWS Summit London 2014 | Customer Stories | Just Eat
AWS Summit London 2014 | Customer Stories | Just Eat
 
Big data with amazon EMR - Pop-up Loft Tel Aviv
Big data with amazon EMR - Pop-up Loft Tel AvivBig data with amazon EMR - Pop-up Loft Tel Aviv
Big data with amazon EMR - Pop-up Loft Tel Aviv
 
AWS Webcast - Amazon Elastic Map Reduce Deep Dive and Best Practices
AWS Webcast - Amazon Elastic Map Reduce Deep Dive and Best PracticesAWS Webcast - Amazon Elastic Map Reduce Deep Dive and Best Practices
AWS Webcast - Amazon Elastic Map Reduce Deep Dive and Best Practices
 
Tune your Big Data Platform to Work at Scale: Taking Hadoop to the Next Level...
Tune your Big Data Platform to Work at Scale: Taking Hadoop to the Next Level...Tune your Big Data Platform to Work at Scale: Taking Hadoop to the Next Level...
Tune your Big Data Platform to Work at Scale: Taking Hadoop to the Next Level...
 
Getting Started with Amazon EMR
Getting Started with Amazon EMRGetting Started with Amazon EMR
Getting Started with Amazon EMR
 
AWS Summit London 2014 | Deployment Done Right (300)
AWS Summit London 2014 | Deployment Done Right (300)AWS Summit London 2014 | Deployment Done Right (300)
AWS Summit London 2014 | Deployment Done Right (300)
 
(BDT305) Amazon EMR Deep Dive and Best Practices
(BDT305) Amazon EMR Deep Dive and Best Practices(BDT305) Amazon EMR Deep Dive and Best Practices
(BDT305) Amazon EMR Deep Dive and Best Practices
 
Amazon Elastic Map Reduce: the concepts
Amazon Elastic Map Reduce: the conceptsAmazon Elastic Map Reduce: the concepts
Amazon Elastic Map Reduce: the concepts
 
Masterclass Webinar - Amazon Elastic MapReduce (EMR)
Masterclass Webinar - Amazon Elastic MapReduce (EMR)Masterclass Webinar - Amazon Elastic MapReduce (EMR)
Masterclass Webinar - Amazon Elastic MapReduce (EMR)
 
Apache Hadoop and Spark on AWS: Getting started with Amazon EMR - Pop-up Loft...
Apache Hadoop and Spark on AWS: Getting started with Amazon EMR - Pop-up Loft...Apache Hadoop and Spark on AWS: Getting started with Amazon EMR - Pop-up Loft...
Apache Hadoop and Spark on AWS: Getting started with Amazon EMR - Pop-up Loft...
 
Cost Optimisation on AWS
Cost Optimisation on AWSCost Optimisation on AWS
Cost Optimisation on AWS
 
AWS re:Invent 2016: Deep Dive: Amazon EMR Best Practices & Design Patterns (B...
AWS re:Invent 2016: Deep Dive: Amazon EMR Best Practices & Design Patterns (B...AWS re:Invent 2016: Deep Dive: Amazon EMR Best Practices & Design Patterns (B...
AWS re:Invent 2016: Deep Dive: Amazon EMR Best Practices & Design Patterns (B...
 
(SDD401) Amazon Elastic MapReduce Deep Dive and Best Practices | AWS re:Inven...
(SDD401) Amazon Elastic MapReduce Deep Dive and Best Practices | AWS re:Inven...(SDD401) Amazon Elastic MapReduce Deep Dive and Best Practices | AWS re:Inven...
(SDD401) Amazon Elastic MapReduce Deep Dive and Best Practices | AWS re:Inven...
 
AWS EMR (Elastic Map Reduce) explained
AWS EMR (Elastic Map Reduce) explainedAWS EMR (Elastic Map Reduce) explained
AWS EMR (Elastic Map Reduce) explained
 
AWS18_StartupDayToronto_KeepingYourInfraCostsLow
AWS18_StartupDayToronto_KeepingYourInfraCostsLowAWS18_StartupDayToronto_KeepingYourInfraCostsLow
AWS18_StartupDayToronto_KeepingYourInfraCostsLow
 
AWS re:Invent 2016: How to Scale and Operate Elasticsearch on AWS (DEV307)
AWS re:Invent 2016: How to Scale and Operate Elasticsearch on AWS (DEV307)AWS re:Invent 2016: How to Scale and Operate Elasticsearch on AWS (DEV307)
AWS re:Invent 2016: How to Scale and Operate Elasticsearch on AWS (DEV307)
 

En vedette

AWS Summit Tel Aviv - Enterprise Track - Backup and Disaster Recovery
AWS Summit Tel Aviv - Enterprise Track - Backup and Disaster RecoveryAWS Summit Tel Aviv - Enterprise Track - Backup and Disaster Recovery
AWS Summit Tel Aviv - Enterprise Track - Backup and Disaster Recovery
Amazon Web Services
 

En vedette (11)

AWS Summit Tel Aviv - Enterprise Track - Backup and Disaster Recovery
AWS Summit Tel Aviv - Enterprise Track - Backup and Disaster RecoveryAWS Summit Tel Aviv - Enterprise Track - Backup and Disaster Recovery
AWS Summit Tel Aviv - Enterprise Track - Backup and Disaster Recovery
 
Updating Security Operations for the Cloud
Updating Security Operations for the CloudUpdating Security Operations for the Cloud
Updating Security Operations for the Cloud
 
Edmunds.com: Migrating, Deploying & Managing On-Premises Web Property (DMG205...
Edmunds.com: Migrating, Deploying & Managing On-Premises Web Property (DMG205...Edmunds.com: Migrating, Deploying & Managing On-Premises Web Property (DMG205...
Edmunds.com: Migrating, Deploying & Managing On-Premises Web Property (DMG205...
 
Telco: Voice-Command Personal Agent Service with AWS Cloud (MBL202) | AWS re:...
Telco: Voice-Command Personal Agent Service with AWS Cloud (MBL202) | AWS re:...Telco: Voice-Command Personal Agent Service with AWS Cloud (MBL202) | AWS re:...
Telco: Voice-Command Personal Agent Service with AWS Cloud (MBL202) | AWS re:...
 
Navigating PCI Compliance in the Cloud (SEC206) | AWS re:Invent 2013
Navigating PCI Compliance in the Cloud (SEC206) | AWS re:Invent 2013Navigating PCI Compliance in the Cloud (SEC206) | AWS re:Invent 2013
Navigating PCI Compliance in the Cloud (SEC206) | AWS re:Invent 2013
 
A Venture Capitalist’s View on the Start-up Ecosystem and the Cloud (SPOT202)...
A Venture Capitalist’s View on the Start-up Ecosystem and the Cloud (SPOT202)...A Venture Capitalist’s View on the Start-up Ecosystem and the Cloud (SPOT202)...
A Venture Capitalist’s View on the Start-up Ecosystem and the Cloud (SPOT202)...
 
A Smarter World: The Mesh of Interconnected Devices and Artificial Intelligen...
A Smarter World: The Mesh of Interconnected Devices and Artificial Intelligen...A Smarter World: The Mesh of Interconnected Devices and Artificial Intelligen...
A Smarter World: The Mesh of Interconnected Devices and Artificial Intelligen...
 
Understanding AWS Security
Understanding AWS SecurityUnderstanding AWS Security
Understanding AWS Security
 
(BDT403) Best Practices for Building Real-time Streaming Applications with Am...
(BDT403) Best Practices for Building Real-time Streaming Applications with Am...(BDT403) Best Practices for Building Real-time Streaming Applications with Am...
(BDT403) Best Practices for Building Real-time Streaming Applications with Am...
 
Build Your Mobile App Faster with AWS Mobile Services
Build Your Mobile App Faster with AWS Mobile ServicesBuild Your Mobile App Faster with AWS Mobile Services
Build Your Mobile App Faster with AWS Mobile Services
 
Build Your Mobile App Faster with AWS Mobile Services (Cognito, Lambda, SNS, ...
Build Your Mobile App Faster with AWS Mobile Services (Cognito, Lambda, SNS, ...Build Your Mobile App Faster with AWS Mobile Services (Cognito, Lambda, SNS, ...
Build Your Mobile App Faster with AWS Mobile Services (Cognito, Lambda, SNS, ...
 

Similaire à Scaling your Application for Growth using Automation (CPN209) | AWS re:Invent 2013

Similaire à Scaling your Application for Growth using Automation (CPN209) | AWS re:Invent 2013 (20)

Serverless SQL
Serverless SQLServerless SQL
Serverless SQL
 
Modernizing Cloud and Hyperconverged Infrastructure monitoring
Modernizing Cloud and Hyperconverged Infrastructure monitoringModernizing Cloud and Hyperconverged Infrastructure monitoring
Modernizing Cloud and Hyperconverged Infrastructure monitoring
 
Journey Towards Scaling Your Application to 10 million users
Journey Towards Scaling Your Application to 10 million usersJourney Towards Scaling Your Application to 10 million users
Journey Towards Scaling Your Application to 10 million users
 
Performance architecture for cloud connect
Performance architecture for cloud connectPerformance architecture for cloud connect
Performance architecture for cloud connect
 
Agile infrastructure
Agile infrastructureAgile infrastructure
Agile infrastructure
 
Aws re invent 2018 recap
Aws re invent 2018 recapAws re invent 2018 recap
Aws re invent 2018 recap
 
AWS 101 December 2014
AWS 101 December 2014AWS 101 December 2014
AWS 101 December 2014
 
Vancouver keynote - AWS Innovate - Sam Elmalak
Vancouver keynote - AWS Innovate - Sam ElmalakVancouver keynote - AWS Innovate - Sam Elmalak
Vancouver keynote - AWS Innovate - Sam Elmalak
 
DevOps in the Cloud with Microsoft Azure
DevOps in the Cloud with Microsoft AzureDevOps in the Cloud with Microsoft Azure
DevOps in the Cloud with Microsoft Azure
 
Improving Availability & Lowering Costs with Auto Scaling & Amazon EC2 (CPN20...
Improving Availability & Lowering Costs with Auto Scaling & Amazon EC2 (CPN20...Improving Availability & Lowering Costs with Auto Scaling & Amazon EC2 (CPN20...
Improving Availability & Lowering Costs with Auto Scaling & Amazon EC2 (CPN20...
 
AWS Canberra WWPS Summit 2013 - Cloud Computing with AWS: Introduction to AWS
AWS Canberra WWPS Summit 2013 - Cloud Computing with AWS: Introduction to AWSAWS Canberra WWPS Summit 2013 - Cloud Computing with AWS: Introduction to AWS
AWS Canberra WWPS Summit 2013 - Cloud Computing with AWS: Introduction to AWS
 
Azure Monitoring Overview
Azure Monitoring OverviewAzure Monitoring Overview
Azure Monitoring Overview
 
Aws 101 garage+
Aws 101 garage+Aws 101 garage+
Aws 101 garage+
 
CrateDB Machine Data Platform Webinar
CrateDB Machine Data Platform Webinar CrateDB Machine Data Platform Webinar
CrateDB Machine Data Platform Webinar
 
Neev cloud services with AWS
Neev cloud services with AWSNeev cloud services with AWS
Neev cloud services with AWS
 
Building a Big Data & Analytics Platform using AWS
Building a Big Data & Analytics Platform using AWS Building a Big Data & Analytics Platform using AWS
Building a Big Data & Analytics Platform using AWS
 
Cloud computing OpenStack_discussion_2014-05
Cloud computing OpenStack_discussion_2014-05Cloud computing OpenStack_discussion_2014-05
Cloud computing OpenStack_discussion_2014-05
 
VNG/IRD - Cloud computing & Openstack discussion 3/5/2014
VNG/IRD - Cloud computing & Openstack discussion 3/5/2014VNG/IRD - Cloud computing & Openstack discussion 3/5/2014
VNG/IRD - Cloud computing & Openstack discussion 3/5/2014
 
AWS 101, London - September 2014
AWS 101, London - September 2014AWS 101, London - September 2014
AWS 101, London - September 2014
 
Webinar: SQL for Machine Data?
Webinar: SQL for Machine Data?Webinar: SQL for Machine Data?
Webinar: SQL for Machine Data?
 

Plus de Amazon Web Services

Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
Amazon Web Services
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
Amazon Web Services
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
Amazon Web Services
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
Amazon Web Services
 

Plus de Amazon Web Services (20)

Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
Come costruire servizi di Forecasting sfruttando algoritmi di ML e deep learn...
 
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
Big Data per le Startup: come creare applicazioni Big Data in modalità Server...
 
Esegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS FargateEsegui pod serverless con Amazon EKS e AWS Fargate
Esegui pod serverless con Amazon EKS e AWS Fargate
 
Costruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWSCostruire Applicazioni Moderne con AWS
Costruire Applicazioni Moderne con AWS
 
Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot Come spendere fino al 90% in meno con i container e le istanze spot
Come spendere fino al 90% in meno con i container e le istanze spot
 
Open banking as a service
Open banking as a serviceOpen banking as a service
Open banking as a service
 
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
Rendi unica l’offerta della tua startup sul mercato con i servizi Machine Lea...
 
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...OpsWorks Configuration Management: automatizza la gestione e i deployment del...
OpsWorks Configuration Management: automatizza la gestione e i deployment del...
 
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows WorkloadsMicrosoft Active Directory su AWS per supportare i tuoi Windows Workloads
Microsoft Active Directory su AWS per supportare i tuoi Windows Workloads
 
Computer Vision con AWS
Computer Vision con AWSComputer Vision con AWS
Computer Vision con AWS
 
Database Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatareDatabase Oracle e VMware Cloud on AWS i miti da sfatare
Database Oracle e VMware Cloud on AWS i miti da sfatare
 
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJSCrea la tua prima serverless ledger-based app con QLDB e NodeJS
Crea la tua prima serverless ledger-based app con QLDB e NodeJS
 
API moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e webAPI moderne real-time per applicazioni mobili e web
API moderne real-time per applicazioni mobili e web
 
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatareDatabase Oracle e VMware Cloud™ on AWS: i miti da sfatare
Database Oracle e VMware Cloud™ on AWS: i miti da sfatare
 
Tools for building your MVP on AWS
Tools for building your MVP on AWSTools for building your MVP on AWS
Tools for building your MVP on AWS
 
How to Build a Winning Pitch Deck
How to Build a Winning Pitch DeckHow to Build a Winning Pitch Deck
How to Build a Winning Pitch Deck
 
Building a web application without servers
Building a web application without serversBuilding a web application without servers
Building a web application without servers
 
Fundraising Essentials
Fundraising EssentialsFundraising Essentials
Fundraising Essentials
 
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
AWS_HK_StartupDay_Building Interactive websites while automating for efficien...
 
Introduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container ServiceIntroduzione a Amazon Elastic Container Service
Introduzione a Amazon Elastic Container Service
 

Dernier

Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
 

Dernier (20)

Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
Vector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptxVector Search -An Introduction in Oracle Database 23ai.pptx
Vector Search -An Introduction in Oracle Database 23ai.pptx
 
FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024FWD Group - Insurer Innovation Award 2024
FWD Group - Insurer Innovation Award 2024
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, AdobeApidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdfRising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
Rising Above_ Dubai Floods and the Fortitude of Dubai International Airport.pdf
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
MS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectorsMS Copilot expands with MS Graph connectors
MS Copilot expands with MS Graph connectors
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ..."I see eyes in my soup": How Delivery Hero implemented the safety system for ...
"I see eyes in my soup": How Delivery Hero implemented the safety system for ...
 
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
Apidays New York 2024 - Accelerating FinTech Innovation by Vasa Krishnan, Fin...
 
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : UncertaintyArtificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
 
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 

Scaling your Application for Growth using Automation (CPN209) | AWS re:Invent 2013

  • 1. Scaling your Application for Growth using Automation November 14,2013 Ken Leung- Euclid Analytics Greg Narain- Chute © 2013 Amazon.com, Inc. and its affiliates. All rights reserved. May not be copied, modified, or distributed in whole or in part without the express consent of Amazon.com, Inc.
  • 3. Online Analytics for the Offline World E-Commerce Physical Stores
  • 4. How Euclid Works We use Wi-Fi technology to turn in-store behavior into actionable insights XX:XX:XX:XX:XX:XX Wi-Fi AP detects smartphone MAC addresses Shopper carrying smartphone walks by or into store Euclid analyzes data for trends and insights Insights on customer acquisition, engagement and retention
  • 5. Market Leader in Real World Analytics • • • • • First to develop proprietary Wi-Fi based analytics – – Most advanced data analytics capabilities and experience in retail environments Backed by tier 1 investors: Series A led by NEA, Series B led by Benchmark Capital World-class executive team – – Co-founder of Google Analytics, Founding team of ShopperTrak Executive experience from Google, SAP, Ariba and Tibco Experience with the world’s leading retailers – Specialty retail, QSR, department store, big box, automotive, malls and more Largest data scale and rapidly accelerating adoption – – – Recording >5B events per day Dataset with >100M unique devices (shoppers) Gartner Cool Vendor 2012; Idea Innovation Award Winner: Business Technology 2012 Market leadership recognized by:
  • 6. Euclid is a Data Company As of October, 2013, the Euclid Network: • Covers over 600 shopping centers, malls, and street locations • Processes 50 TB of raw data • Collects over 30 GB of raw data daily Acquire Data •Reliable •Durable •Scalable Process Data •Efficient •Flexible •Scalable •Versatile Deliver Data •Richness •Sophistication •Value
  • 7. Euclid’s Challenges Common Challenges • Scaling • Performance • Cost effectiveness • Removing the technical barriers for innovation • “Failing fast” Unique Challenges • Recomputing the entire history of Euclid data! – Need fast results – Need a lot of computational power, sometimes greater than 100x of regular daily compute needs
  • 8. Euclid’s Use of AWS Euclid started with AWS from Day One - Amazon EC2, Amazon RDS, Amazon EMR, Amazon S3 - AWS Elastic Beanstalk - Amazon Redshift Heroku from Amazon Partner Network (APN)
  • 10. Data Acquisition Elastic Beanstalk - Multi-AZ, multi-region - Load balancing, auto scaling - Monitoring, notification - Deployment Management - Amazon EBS-backed volume for failover data recovery - Log rotation to Amazon S3 (99.999999999% durability) All built-in.
  • 11. Data Acquisition - code <%@ page import="java.io.*,java.util.*,com.euclid.spongebob..server.*" %><% Properties sensorCredentials = (Properties)this.getServletContext().getAttribute("sensor_credentials"); String sensor_id = request.getParameter("sensor_id"); String credential = request.getParameter("credential"); String body = request.getParameter("body"); if (sensor_id == null || !sensorCredentials.containsKey(sensor_id) || !sensorCredentials.getProperty(sensor_id).equals(credential)) { response.sendError(HttpServletResponse.SC_UNAUTHORIZED); return; } java.util.logging.Logger logger = java.util.logging.Logger.getLogger("spongebob"); logger.log(java.util.logging.Level.INFO, body); response.setStatus(HttpServletResponse.SC_OK); %>
  • 12. Data Acquisition - Principles • Log to Amazon EBS Volume – high I/O performance • As “dumb” as possible: reliable • Fork data from disk to – Amazon S3 for batch processing – Kafka messaging service for real time processing
  • 13. Data Acquisition – System Monitor • Low latency • Low CPU utilization
  • 14. Data Processing - Pipeline Raw Data Map Reduce (EMR) Product dashboard, insights R/D Analytics
  • 15. Pipeline – Dual Purposes Two worlds, one platform • Big Data Engineering – noSQL – Pig Latin with Amazon EMR (Java, Python UDFs) – Work flows (Jenkins), shell scripting • Analytics, Analysts, Business – SQL – Excel – Tableau – Maybe some Python, etc.
  • 16. Pipeline - Architecture Amazon S3 SQL DB: MySQL, Redshift Raw Data Meta Data Aggr. Level 1 3rd Party Data Some Raw Data Analytics Aggr. Level 1 Direct DB Load Meta Data 3rd Party Data Models Algorithms Aggr. Level n MapReduce MySQL Product dashboard, insights SQL Aggr. Level n R&D Models Algorithms
  • 17. SQL: MySQL, Amazon Redshift, both by AWS • Started with MySQL, Amazon Redshift Preview Jan 2013 • MySQL 1TB limit vs Amazon Redshift PB scale • Performance, night and day – E.g., count distinct of 100m rows: 5h in MySQL, 2m in Amazon Redshift • Amazon Redshift: killer data warehouse – Low cost – No DBA! – Easy integration
  • 18. Pipeline - Monitoring • System monitoring provided by AWS • Workflow monitoring with Jenkins – Failure notification – Dependency management • Data quality (including acquisition) monitoring – Also utilize Jenkins – Scripts that check data at various stages – Each script as a job in the Jenkins workflow
  • 19. Pipeline - Workflow Part of the Jenkins Dependency Graph
  • 20. AWS Benefits • “Apps not Ops” – Euclid does not have/need an Ops team • Scale up and down on demand • Pay as we go • Agile (innovations, time-to-market)
  • 21. Chute 1. Data 2. Automation 3. Uptime 4. Monitoring
  • 22. Data ● Real time analytics is hard ● Hadoop! ○ Sqoop imports SQL data to HDFS ○ Clojure ○ Scalding (github.com/twitter/scalding) ● Elasticsearch, Logstash ○ parse logs to track activity for customers
  • 24. N number of EC2 instances ● varnish ● logstash plugin front ends Kibana ELB Redis cluster ElasticSearch Events Server ● nginx ● logstash API
  • 25. Automation through DevOps ● Chute has 100 servers ○ Configured many manually ○ 82? of 100 now managed by Chef ● Whirr ● Sqoop and Cron to automate data import ● route53 with Chef for urls
  • 26. Uptime ● Architect applications to scale horizontally ○ AWS launches servers on demand ○ spot and reserve pricing ● Keep services running with Chef ○ Chef makes it easy to wrap programs as a service on AWS
  • 27. Monitoring ● newrelic ○ server resource monitoring ○ application monitoring ● logstash + kibana ○ elasticsearch backend ○ redis (cluster) ○ can monitor server logs
  • 28. Please give us your feedback on this presentation CPN209 As a thank you, we will select prize winners daily for completed surveys!