Using Amazon CloudWatch Events, AWS Lambda and Spark Streaming to Process EC2 Events (June 2016)
1. Julien Simon, Principal Technical Evangelist
julsimon@amazon.fr
@julsimon
3. EC2 Lifecycle Hooks – what are they good for?
• Assign Elastic IP address on launch
• Register new instances with DNS, message queues,…
• Pull down log files before instance is terminated
• Investigate issues with an instance before terminating it
• Scale containers on an ECS cluster
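As a minimal sketch of how a hook is consumed: an Auto Scaling lifecycle notification is a JSON message carrying, among other fields, LifecycleTransition and EC2InstanceId, and a consumer routes on the transition. The routing logic below is illustrative, not from the talk; the field names follow the Auto Scaling notification format.

```python
import json

# Example Auto Scaling lifecycle notification (abridged); real messages
# also carry LifecycleHookName, LifecycleActionToken, etc.
SAMPLE = json.dumps({
    "LifecycleTransition": "autoscaling:EC2_INSTANCE_TERMINATING",
    "EC2InstanceId": "i-0123456789abcdef0",
    "AutoScalingGroupName": "my-asg",
})

def route_lifecycle_event(message: str) -> str:
    """Decide what to do with an instance based on its lifecycle transition."""
    event = json.loads(message)
    transition = event.get("LifecycleTransition", "")
    if transition.endswith("EC2_INSTANCE_LAUNCHING"):
        return "register " + event["EC2InstanceId"] + " with DNS"
    if transition.endswith("EC2_INSTANCE_TERMINATING"):
        return "pull logs from " + event["EC2InstanceId"]
    return "ignore"

print(route_lifecycle_event(SAMPLE))  # pull logs from i-0123456789abcdef0
```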
7. Lambda function
import json
import boto3

def lambda_handler(event, context):
    print("Received EC2 event: " + json.dumps(event, indent=2))
    firehose = boto3.client('firehose')
    firehose.put_record(
        DeliveryStreamName="firehoseToS3",
        Record={"Data": json.dumps(event)}
    )
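This function is triggered by a CloudWatch Events rule matching EC2 instance state changes. A sketch of such an event pattern, with a toy matcher to show how a pattern selects events: the pattern fields follow the documented CloudWatch Events format, but the matcher is simplified and only handles top-level exact matches.

```python
# CloudWatch Events pattern selecting EC2 instance state-change notifications
PATTERN = {
    "source": ["aws.ec2"],
    "detail-type": ["EC2 Instance State-change Notification"],
}

def matches(pattern: dict, event: dict) -> bool:
    """Toy matcher: each pattern key lists the allowed values for that key.
    (Real CloudWatch Events matching also handles nested 'detail' fields.)"""
    return all(event.get(key) in allowed for key, allowed in pattern.items())

event = {
    "source": "aws.ec2",
    "detail-type": "EC2 Instance State-change Notification",
    "detail": {"instance-id": "i-0123456789abcdef0", "state": "running"},
}
print(matches(PATTERN, event))  # True
```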
Reminder
a CloudWatch Logs log group is created automatically for each Lambda function; each version of the function writes to its own log streams
% aws logs describe-log-streams --log-group-name /aws/lambda/CloudwatchToFirehose \
    --query "logStreams[].logStreamName"
% aws logs get-log-events --log-stream-name LOG_STREAM_NAME \
    --log-group-name /aws/lambda/CloudwatchToFirehose
8. CLI: from Kinesis Firehose to S3
aws firehose create-delivery-stream
--delivery-stream-name firehoseToS3
--s3-destination-configuration
RoleARN=FIREHOSETOS3_ROLE_ARN,
BucketARN="arn:aws:s3:::jsimon-public",
Prefix="firehose",
BufferingHints={SizeInMBs=1,IntervalInSeconds=60},
CompressionFormat="UNCOMPRESSED",
EncryptionConfiguration={NoEncryptionConfig="NoEncryption"}
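Because the Lambda function above writes raw JSON records and Firehose concatenates buffered records into each S3 object without a delimiter, a consumer has to split them back apart. One way to do that with json.JSONDecoder.raw_decode; this is a sketch, not from the talk.

```python
import json

def split_concatenated_json(blob: str):
    """Yield the JSON documents in a string where records were written
    back-to-back with no delimiter (as Firehose does by default)."""
    decoder = json.JSONDecoder()
    idx = 0
    while idx < len(blob):
        obj, end = decoder.raw_decode(blob, idx)
        yield obj
        # skip any whitespace between records
        while end < len(blob) and blob[end].isspace():
            end += 1
        idx = end

blob = '{"state": "pending"}{"state": "running"}'
print([e["state"] for e in split_concatenated_json(blob)])  # ['pending', 'running']
```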
9. Spark Streaming
import org.apache.log4j.Logger
import org.apache.log4j.Level
import org.apache.spark.streaming.Seconds
Logger.getLogger("org").setLevel(Level.ERROR)
Logger.getLogger("com").setLevel(Level.ERROR)
Logger.getLogger("akka").setLevel(Level.ERROR)
val hadoopConf = sc.hadoopConfiguration
hadoopConf.set("fs.s3.impl", "org.apache.hadoop.fs.s3native.NativeS3FileSystem")
hadoopConf.set("fs.s3.awsAccessKeyId", ACCESS_KEY_ID)
hadoopConf.set("fs.s3.awsSecretAccessKey", SECRET_ACCESS_KEY)
val ssc = new org.apache.spark.streaming.StreamingContext(sc,Seconds(10))
val lines = ssc.textFileStream("s3n://BUCKET_NAME")
lines.print()
ssc.start()
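Each line the stream prints is an EC2 state-change event in the CloudWatch Events format; downstream logic would typically pull out the instance id and state. A sketch of that extraction in plain Python (the Spark job itself is in Scala above; the event fields follow the documented format):

```python
import json

# Abridged EC2 Instance State-change Notification, as delivered by CloudWatch Events
EVENT = json.dumps({
    "source": "aws.ec2",
    "detail-type": "EC2 Instance State-change Notification",
    "detail": {"instance-id": "i-0123456789abcdef0", "state": "terminated"},
})

def instance_state(line: str):
    """Extract (instance-id, state) from one streamed event line."""
    detail = json.loads(line).get("detail", {})
    return detail.get("instance-id"), detail.get("state")

print(instance_state(EVENT))  # ('i-0123456789abcdef0', 'terminated')
```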