Improving Apache Spark for Dynamic Allocation and Spot Instances

•

1 j'aime•295 vues

This presentation will explore the new work in Spark 3.1 adding the concept of graceful decommissioning and how we can use this to improve Spark’s performance in both dynamic allocation and spot/preemptable instances. Together we’ll explore how Spark’s dynamic allocation has evolved over time, and why the different changes have been needed. We’ll also look at the multi-company collaboration that resulted in being able to deliver this feature and I’ll end with encouraging pointers on how to get more involved in Spark’s development.

Données & analyses

Who am I?
• Holden Kara
u

• She / he
r

• Apache Spark PMC
• Contributor to a lot of other projects
• co-author of High Performance
Spark, Learning Spark, and Kubeflow
for Machine Learning
• http://bit.ly/holdenSparkVideos
• https://youtube.com/user/holdenkarau

Let us start at the beginning
• Spark achieves resilience through re-computation which is part of how we go fas
• This poses challenges with removing executors that may contain dat
• We "solved" it for YARN/Mesos back in the da
• I drank waaaay too much coffee and came up with an alternativ
• But no one really liked it because we didn't need it so I closed the Google doc and
forgot about i
t

• Don’t worry, we’ll get to the code soon :)

But then….
• The "cloud" became really popula
r

• Kubernetes became popula
r

• Everything caught on fire :/

Our Protagonist Remembers
• I started drinking a lot of coffee

• We dusted off that old design and wrote
some cod
e

• And then I got hit by a ca
r

• More people wrote more cod
e

• We had a VOT
E

• We wrote waaaaay more cod
e

• Everyone lived happily ever after?
Photo by Lukas from Pexels

How did DA work on YARN?
• Scale up is "easy" (add more
resources
)

• Scale down required a stay resident
program to be on each YARN node to
serve any file
s

• Spark stored it's shuffle data as file
s

• Persist in memory data was still lost
when scaling down an executor
Photo by Markus Spiske from Pexels

Why did the cloud impact this?
• If you wanted a ~50% cost saving of
spot/preemptible instances you might
lose entire machine
s

• Yes Spark can "handle" this, but does
so by recomputing data (expensive
)

• You can't depend on leaving a program
around to serve files when the server is
just gon
e

• So we need to find a way to migrate the
data

Ok sure the cloud, but K8s?
• Kubernetes doesn't like like the idea of
scheduling a stay resident program on
every nod
e

• Also most people don't like the idea of
shared disk here either (accros jobs/
users
)

• So we need to find a way to migrate the
data

SPARK-20624
• Yee-haw
!

• Ok but more seriously how does it work? Great question lets open up the code
• BlockManagerDecomissioner.scala is where most of the magic happens

Collaboration
http://apache-spark-developers-list.1001551.n3.nabble.com/VOTE-
Decommissioning-SPIP-td29701.htm
l

https://github.com/apache/spark/pulls?q=is%3Apr+decommission+is%3Aclosed+

Ok what about the car?
Getting hit by a car sucks a lot
Slowed down dev work while I did rehab to be able
to walk & type again
Shout out to everyone who helped me recover
(from my wife, girlfriend, partners, my friends, to
the hospital staff, nursing home, PT, OT,
Ambulance, my employer for giving me time off,
the Spark community for understanding I needed
time off <3)

It’s early though so please be careful
On a Happy Note: You can try this now
• Enable the followin
g

- spark.decommission.enabled

- spark.storage.decommission.enabled

- spark.storage.decommission.rddBlocks.enabled
- spark.storage.decommission.shuffleBlocks.enabled
• Want to get fancy? Optionally enable:

- spark.shuffle.externalStorage.enabled

- And configure a storage backend ( spark.shuffle.externalStorage.backend)

Future work
• Heuristics to migrate dat
a

• Improve container pre-emption selectio
• Better heuristics around when to scale up and down containers

TM and © 2021 Apple Inc. All rights reserved.

Recommandé

The Rise of ZStandard: Apache Spark/Parquet/ORC/AvroDatabricks

Apache Sparkに手を出してヤケドしないための基本～「Apache Spark入門より」～（デブサミ 2016 講演資料）NTT DATA OSS Professional Services

Apache Arrow Flight OverviewJacques Nadeau

An Insider’s Guide to Maximizing Spark SQL PerformanceTakuya UESHIN

大量のデータ処理や分析に使えるOSS Apache Sparkのご紹介（Open Source Conference 2020 Online/Kyoto ...NTT DATA Technology & Innovation

MoP(MQTT on Pulsar) - a Powerful Tool for Apache Pulsar in IoT - Pulsar Summi...StreamNative

Apache Spark on Kubernetes入門（Open Source Conference 2021 Online Hiroshima 発表資料）NTT DATA Technology & Innovation

HDFS on Kubernetes—Lessons Learned with Kimoon KimDatabricks

Recommandé

The Rise of ZStandard: Apache Spark/Parquet/ORC/AvroDatabricks

Apache Sparkに手を出してヤケドしないための基本～「Apache Spark入門より」～（デブサミ 2016 講演資料）NTT DATA OSS Professional Services

Apache Arrow Flight OverviewJacques Nadeau

An Insider’s Guide to Maximizing Spark SQL PerformanceTakuya UESHIN

大量のデータ処理や分析に使えるOSS Apache Sparkのご紹介（Open Source Conference 2020 Online/Kyoto ...NTT DATA Technology & Innovation

MoP(MQTT on Pulsar) - a Powerful Tool for Apache Pulsar in IoT - Pulsar Summi...StreamNative

Apache Spark on Kubernetes入門（Open Source Conference 2021 Online Hiroshima 発表資料）NTT DATA Technology & Innovation

HDFS on Kubernetes—Lessons Learned with Kimoon KimDatabricks

Apache Spark on K8S Best Practice and Performance in the CloudDatabricks

Looking ahead at PostgreSQL 15Jonathan Katz

Tuning Apache Spark for Large-Scale Workloads Gaoxiang Liu and Sital KediaDatabricks

Apache Spark Data Source V2 with Wenchen Fan and Gengliang WangDatabricks

TechTalk - 서버를 해킹 당했습니다Daesung Park

Apache Bigtopによるオープンなビッグデータ処理基盤の構築（オープンデベロッパーズカンファレンス 2021 Online 発表資料）NTT DATA Technology & Innovation

Running Apache Spark on Kubernetes: Best Practices and PitfallsDatabricks

Apache Spark超入門（Hadoop / Spark Conference Japan 2016 講演資料）NTT DATA OSS Professional Services

Apache Hadoopの未来 3系になって何が変わるのか?NTT DATA OSS Professional Services

Apache Sparkのご紹介（後半：技術トピック）NTT DATA OSS Professional Services

OpenStackをさらに”使う”技術概要と基礎操作irix_jp

Apache Kudu: Technical Deep Dive  Cloudera, Inc.

Apache Bigtop3.2 (仮)（Open Source Conference 2022 Online/Hiroshima 発表資料）NTT DATA Technology & Innovation

HBaseCon 2013: Apache HBase Table SnapshotsCloudera, Inc.

Autoscaling Flink with Reactive ModeFlink Forward

Apache Spark の紹介（前半：Sparkのキホン）NTT DATA OSS Professional Services

Apache flinkpranay kumar

pg_walinspectについて調べてみた！（第37回PostgreSQLアンカンファレンス@オンライン発表資料）NTT DATA Technology & Innovation

双方向レプリケーションの(Bidirectional Replication)の利用方法QlikPresalesJapan

速習！論理レプリケーション～基礎から最新動向まで～（PostgreSQL Conference Japan 2022 発表資料）NTT DATA Technology & Innovation

Leveraging Databricks for Spark PipelinesRose Toomey

Leveraging Databricks for Spark pipelinesRose Toomey

Contenu connexe

Tendances

Apache Spark on K8S Best Practice and Performance in the CloudDatabricks

Looking ahead at PostgreSQL 15Jonathan Katz

Tuning Apache Spark for Large-Scale Workloads Gaoxiang Liu and Sital KediaDatabricks

Apache Spark Data Source V2 with Wenchen Fan and Gengliang WangDatabricks

TechTalk - 서버를 해킹 당했습니다Daesung Park

Apache Bigtopによるオープンなビッグデータ処理基盤の構築（オープンデベロッパーズカンファレンス 2021 Online 発表資料）NTT DATA Technology & Innovation

Running Apache Spark on Kubernetes: Best Practices and PitfallsDatabricks

Apache Spark超入門（Hadoop / Spark Conference Japan 2016 講演資料）NTT DATA OSS Professional Services

Apache Hadoopの未来 3系になって何が変わるのか?NTT DATA OSS Professional Services

Apache Sparkのご紹介（後半：技術トピック）NTT DATA OSS Professional Services

OpenStackをさらに”使う”技術概要と基礎操作irix_jp

Apache Kudu: Technical Deep Dive  Cloudera, Inc.

Apache Bigtop3.2 (仮)（Open Source Conference 2022 Online/Hiroshima 発表資料）NTT DATA Technology & Innovation

HBaseCon 2013: Apache HBase Table SnapshotsCloudera, Inc.

Autoscaling Flink with Reactive ModeFlink Forward

Apache Spark の紹介（前半：Sparkのキホン）NTT DATA OSS Professional Services

Apache flinkpranay kumar

pg_walinspectについて調べてみた！（第37回PostgreSQLアンカンファレンス@オンライン発表資料）NTT DATA Technology & Innovation

双方向レプリケーションの(Bidirectional Replication)の利用方法QlikPresalesJapan

速習！論理レプリケーション～基礎から最新動向まで～（PostgreSQL Conference Japan 2022 発表資料）NTT DATA Technology & Innovation

Tendances (20)

Apache Spark on K8S Best Practice and Performance in the Cloud

Looking ahead at PostgreSQL 15

Tuning Apache Spark for Large-Scale Workloads Gaoxiang Liu and Sital Kedia

Apache Spark Data Source V2 with Wenchen Fan and Gengliang Wang

TechTalk - 서버를 해킹 당했습니다

Apache Bigtopによるオープンなビッグデータ処理基盤の構築（オープンデベロッパーズカンファレンス 2021 Online 発表資料）

Running Apache Spark on Kubernetes: Best Practices and Pitfalls

Apache Spark超入門（Hadoop / Spark Conference Japan 2016 講演資料）

Apache Hadoopの未来 3系になって何が変わるのか?

Apache Sparkのご紹介（後半：技術トピック）

OpenStackをさらに”使う”技術概要と基礎操作

Apache Kudu: Technical Deep Dive  

Apache Bigtop3.2 (仮)（Open Source Conference 2022 Online/Hiroshima 発表資料）

HBaseCon 2013: Apache HBase Table Snapshots

Autoscaling Flink with Reactive Mode

Apache Spark の紹介（前半：Sparkのキホン）

Apache flink

pg_walinspectについて調べてみた！（第37回PostgreSQLアンカンファレンス@オンライン発表資料）

双方向レプリケーションの(Bidirectional Replication)の利用方法

速習！論理レプリケーション～基礎から最新動向まで～（PostgreSQL Conference Japan 2022 発表資料）

Similaire à Improving Apache Spark for Dynamic Allocation and Spot Instances

Leveraging Databricks for Spark PipelinesRose Toomey

Leveraging Databricks for Spark pipelinesRose Toomey

Deploying Apache Spark Jobs on Kubernetes with Helm and Spark OperatorDatabricks

Stackato v4Jonas Brømsø

Sharing (or stealing) the jewels of python with big data & the jvm (1)Holden Karau

Kafka Summit SF 2017 - Streaming Processing in Python – 10 ways to avoid summ...confluent

Stackato v6Jonas Brømsø

Data Science at Scale: Using Apache Spark for Data Science at BitlySarah Guido

Machine learning in real-time - the next frontierSnowplow Analytics

Apache Spark for Everyone - Women Who Code WorkshopAmanda Casari

sparkBen Liu

StackatoJonas Brømsø

Best Practice in Accelerating Data Applications with Spark+AlluxioAlluxio, Inc.

Stackato v3Jonas Brømsø

Apache Spark At Apple with Sam Maclennan and Vishwanath LakkundiDatabricks

Apache Spark - Lightning Fast Cluster Computing - Hyderabad Scalability MeetupHyderabad Scalability Meetup

Dec6 meetup spark presentationRamesh Mudunuri

LanceShivnathHadoopSummit2015Lance Co Ting Keh

12-Step Program for Scaling Web Applications on PostgreSQLKonstantin Gredeskoul

Stackato v5Jonas Brømsø

Similaire à Improving Apache Spark for Dynamic Allocation and Spot Instances (20)

Leveraging Databricks for Spark Pipelines

Leveraging Databricks for Spark pipelines

Deploying Apache Spark Jobs on Kubernetes with Helm and Spark Operator

Stackato v4

Sharing (or stealing) the jewels of python with big data & the jvm (1)

Kafka Summit SF 2017 - Streaming Processing in Python – 10 ways to avoid summ...

Stackato v6

Data Science at Scale: Using Apache Spark for Data Science at Bitly

Machine learning in real-time - the next frontier

Apache Spark for Everyone - Women Who Code Workshop

spark

Stackato

Best Practice in Accelerating Data Applications with Spark+Alluxio

Stackato v3

Apache Spark At Apple with Sam Maclennan and Vishwanath Lakkundi

Apache Spark - Lightning Fast Cluster Computing - Hyderabad Scalability Meetup

Dec6 meetup spark presentation

LanceShivnathHadoopSummit2015

12-Step Program for Scaling Web Applications on PostgreSQL

Stackato v5

Plus de Databricks

DW Migration Webinar-March 2022.pptxDatabricks

Data Lakehouse Symposium | Day 1 | Part 1Databricks

Data Lakehouse Symposium | Day 1 | Part 2Databricks

Data Lakehouse Symposium | Day 2Databricks

Data Lakehouse Symposium | Day 4Databricks

5 Critical Steps to Clean Your Data Swamp When Migrating Off of HadoopDatabricks

Democratizing Data Quality Through a Centralized PlatformDatabricks

Learn to Use Databricks for Data ScienceDatabricks

Why APM Is Not the Same As ML MonitoringDatabricks

The Function, the Context, and the Data—Enabling ML Ops at Stitch FixDatabricks

Stage Level Scheduling Improving Big Data and AI IntegrationDatabricks

Simplify Data Conversion from Spark to TensorFlow and PyTorchDatabricks

Scaling your Data Pipelines with Apache Spark on KubernetesDatabricks

Scaling and Unifying SciKit Learn and Apache Spark PipelinesDatabricks

Sawtooth Windows for Feature AggregationsDatabricks

Redis + Apache Spark = Swiss Army Knife Meets Kitchen SinkDatabricks

Re-imagine Data Monitoring with whylogs and SparkDatabricks

Raven: End-to-end Optimization of ML Prediction QueriesDatabricks

Processing Large Datasets for ADAS Applications using Apache SparkDatabricks

Massive Data Processing in Adobe Using Delta LakeDatabricks

Plus de Databricks (20)

DW Migration Webinar-March 2022.pptx

Data Lakehouse Symposium | Day 1 | Part 1

Data Lakehouse Symposium | Day 1 | Part 2

Data Lakehouse Symposium | Day 2

Data Lakehouse Symposium | Day 4

5 Critical Steps to Clean Your Data Swamp When Migrating Off of Hadoop

Democratizing Data Quality Through a Centralized Platform

Learn to Use Databricks for Data Science

Why APM Is Not the Same As ML Monitoring

The Function, the Context, and the Data—Enabling ML Ops at Stitch Fix

Stage Level Scheduling Improving Big Data and AI Integration

Simplify Data Conversion from Spark to TensorFlow and PyTorch

Scaling your Data Pipelines with Apache Spark on Kubernetes

Scaling and Unifying SciKit Learn and Apache Spark Pipelines

Sawtooth Windows for Feature Aggregations

Redis + Apache Spark = Swiss Army Knife Meets Kitchen Sink

Re-imagine Data Monitoring with whylogs and Spark

Raven: End-to-end Optimization of ML Prediction Queries

Processing Large Datasets for ADAS Applications using Apache Spark

Massive Data Processing in Adobe Using Delta Lake

Dernier

Introduction-to-Machine-Learning (1).pptxfirstjob4

Industrialised data - the key to AI success.pdfLars Albertsson

VidaXL dropshipping via API with DroFx.pptxolyaivanovalion

定制英国白金汉大学毕业证（UCB毕业证书）成绩单原版一比一ffjhghh

Sampling (random) method and Non random.pptDr. Soumendra Kumar Patra

VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure

Midocean dropshipping via API with DroFxolyaivanovalion

BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptxMohammedJunaid861692

代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改atducpo

Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...Delhi Call girls

VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...Suhani Kapoor

Data-Analysis for Chicago Crime Data 2023ymrp368

Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor

VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor

Brighton SEO | April 2024 | Data StorytellingNeil Barnes

April 2024 - Crypto Market Report's Analysismanisha194592

RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays

꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...Call Girls In Delhi Whatsup 9873940964 Enjoy Unlimited Pleasure

VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service AmravatiSuhani Kapoor

Dernier (20)

Introduction-to-Machine-Learning (1).pptx

Industrialised data - the key to AI success.pdf

VidaXL dropshipping via API with DroFx.pptx

定制英国白金汉大学毕业证（UCB毕业证书）成绩单原版一比一

Sampling (random) method and Non random.ppt

VIP Call Girls Service Charbagh { Lucknow Call Girls Service 9548273370 } Boo...

Midocean dropshipping via API with DroFx

BPAC WITH UFSBI GENERAL PRESENTATION 18_05_2017-1.pptx

代办国外大学文凭《原版美国UCLA文凭证书》加州大学洛杉矶分校毕业证制作成绩单修改

Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...

VIP High Class Call Girls Jamshedpur Anushka 8250192130 Independent Escort Se...

Data-Analysis for Chicago Crime Data 2023

Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai

VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...

Brighton SEO | April 2024 | Data Storytelling

April 2024 - Crypto Market Report's Analysis

RA-11058_IRR-COMPRESS Do 198 series of 1998

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...

VIP Call Girls in Amravati Aarohi 8250192130 Independent Escort Service Amravati

Improving Apache Spark for Dynamic Allocation and Spot Instances

1. Apple logo is a trademark of Apple Inc. Holden Karau | Data / AI Summi t @holdenkara u Improving Spark for Dynamic Allocation & Spot Instances

2. Who am I? • Holden Kara u • She / he r • Apache Spark PMC • Contributor to a lot of other projects • co-author of High Performance Spark, Learning Spark, and Kubeflow for Machine Learning • http://bit.ly/holdenSparkVideos • https://youtube.com/user/holdenkarau

3. Apple logo is a trademark of Apple Inc.

4. Let us start at the beginning • Spark achieves resilience through re-computation which is part of how we go fas • This poses challenges with removing executors that may contain dat • We "solved" it for YARN/Mesos back in the da • I drank waaaay too much coffee and came up with an alternativ • But no one really liked it because we didn't need it so I closed the Google doc and forgot about i t • Don’t worry, we’ll get to the code soon :)

5. But then…. • The "cloud" became really popula r • Kubernetes became popula r • Everything caught on fire :/

6. Our Protagonist Remembers • I started drinking a lot of coffee • We dusted off that old design and wrote some cod e • And then I got hit by a ca r • More people wrote more cod e • We had a VOT E • We wrote waaaaay more cod e • Everyone lived happily ever after? Photo by Lukas from Pexels

7. How did DA work on YARN? • Scale up is "easy" (add more resources ) • Scale down required a stay resident program to be on each YARN node to serve any file s • Spark stored it's shuffle data as file s • Persist in memory data was still lost when scaling down an executor Photo by Markus Spiske from Pexels

8. Why did the cloud impact this? • If you wanted a ~50% cost saving of spot/preemptible instances you might lose entire machine s • Yes Spark can "handle" this, but does so by recomputing data (expensive ) • You can't depend on leaving a program around to serve files when the server is just gon e • So we need to find a way to migrate the data

9. Ok sure the cloud, but K8s? • Kubernetes doesn't like like the idea of scheduling a stay resident program on every nod e • Also most people don't like the idea of shared disk here either (accros jobs/ users ) • So we need to find a way to migrate the data

10. SPARK-20624 • Yee-haw ! • Ok but more seriously how does it work? Great question lets open up the code • BlockManagerDecomissioner.scala is where most of the magic happens

11. Collaboration http://apache-spark-developers-list.1001551.n3.nabble.com/VOTE- Decommissioning-SPIP-td29701.htm l https://github.com/apache/spark/pulls?q=is%3Apr+decommission+is%3Aclosed+

12. Ok what about the car? Getting hit by a car sucks a lot Slowed down dev work while I did rehab to be able to walk & type again Shout out to everyone who helped me recover (from my wife, girlfriend, partners, my friends, to the hospital staff, nursing home, PT, OT, Ambulance, my employer for giving me time off, the Spark community for understanding I needed time off <3)

13. It’s early though so please be careful On a Happy Note: You can try this now • Enable the followin g - spark.decommission.enabled - spark.storage.decommission.enabled - spark.storage.decommission.rddBlocks.enabled - spark.storage.decommission.shuffleBlocks.enabled • Want to get fancy? Optionally enable: - spark.shuffle.externalStorage.enabled - And configure a storage backend ( spark.shuffle.externalStorage.backend)

14. Future work • Heuristics to migrate dat a • Improve container pre-emption selectio • Better heuristics around when to scale up and down containers

15. Please review this talk :)