This presentation introduces Tune and Fugue, frameworks for intuitive and scalable hyperparameter optimization (HPO). Tune supports both non-iterative and iterative HPO problems: for non-iterative problems it provides grid search, random search, and Bayesian optimization; for iterative problems it generalizes algorithms such as Hyperband and Asynchronous Successive Halving (ASHA). Tune lets you tune models both locally and in a distributed environment without code changes. The presentation demonstrates these capabilities through examples tuning Scikit-Learn and Keras models. The goal of Tune and Fugue is to make HPO development easy, testable, and scalable.
5. Questions
● Is parameter tuning a machine learning problem?
● Are there common ways to tune both classical models and deep learning models?
● Why is it so hard to do distributed parameter tuning?
6. Tuning Problems In General
[Diagram: general parameter tuning encompasses hyperparameter tuning for machine learning; some classical models pose non-iterative problems, while deep learning models and some other classical models pose iterative problems]
7. Distributed Parameter Tuning
● Not everything can be parallelized
● The tuning logic is always complex and tedious
● Popular tuning frameworks are not friendly to distributed environments
● Spark is not well suited to iterative tuning problems
9. Our Goals
● For non-iterative problems:
○ Unify grid search and random search; make other algorithms pluggable
● For iterative problems:
○ Generalize state-of-the-art (SOTA) algorithms such as Hyperband and ASHA
● For both:
○ Tune both locally and in a distributed environment without code changes
○ Make tuning development iterative and testable
○ Minimize moving parts
○ Minimize interfaces
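The "without code changes" goal builds on Fugue's engine-agnostic execution model. Below is a minimal sketch of that idea, assuming a hypothetical `evaluate` objective applied to a batch of candidates; `transform` is Fugue's real API, but the tuning logic shown here is illustrative only.

```python
import pandas as pd
from fugue import transform

# Hypothetical objective: score each candidate configuration in a batch.
def evaluate(df: pd.DataFrame) -> pd.DataFrame:
    df["score"] = (df["a"] - 0.5) ** 2  # toy objective for illustration
    return df

candidates = pd.DataFrame({"a": [0.12, 0.66, 0.32, 0.94]})

# Local execution (Pandas-based) -- no engine specified.
local_result = transform(candidates, evaluate, schema="*,score:double")

# Distributed execution -- same code, different engine, e.g.:
# spark_result = transform(candidates, evaluate,
#                          schema="*,score:double", engine="spark")
```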
12. Random Search
Search space:
a: Rand(0, 1)
b: Choice("a", "b")
c: 3.14
Candidates:
a: 0.12, b: "a", c: 3.14
a: 0.66, b: "a", c: 3.14
a: 0.32, b: "b", c: 3.14
a: 0.94, b: "a", c: 3.14
Pros: compute cost and sampling distribution are controlled; works well for continuous variables
Cons: relies on luck; not deterministic; a large number of samples is normally needed
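As a concrete illustration, here is a minimal pure-Python sketch of sampling candidates from a search space like the one above; `Rand` and `Choice` are represented by hypothetical stand-ins, not Tune's actual expression objects.

```python
import random

# Minimal sketch: each entry is a sampler mimicking the slide's
# search-space expressions (Rand, Choice, or a fixed value).
space = {
    "a": lambda: round(random.uniform(0, 1), 2),  # Rand(0, 1)
    "b": lambda: random.choice(["a", "b"]),       # Choice("a", "b")
    "c": lambda: 3.14,                            # constant
}

def sample(space):
    # Draw one candidate configuration from the space.
    return {name: draw() for name, draw in space.items()}

candidates = [sample(space) for _ in range(4)]
print(candidates)  # e.g. [{'a': 0.12, 'b': 'a', 'c': 3.14}, ...]
```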
13. Bayesian Optimization
Objective: a^2
Search space: a: Rand(-1, 1)
[Diagram: sequences of sampled candidates for a, e.g. -0.66 → 0.76 → -0.18; → 0.75 → 0.90; → 0.07 → 0.00; → 0.41 → 0.12 → 0.66, where each new sample is informed by earlier results]
Pros: fewer evaluations are needed to find near-optimal parameters
Cons: the sequential nature of the search may require more wall-clock time
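To make this concrete, here is a minimal sketch of Bayesian optimization on the toy objective above using scikit-optimize; the library choice is an assumption for illustration, not a statement about what Tune uses internally.

```python
from skopt import gp_minimize

# Minimal sketch: Bayesian optimization of the slide's toy problem.
# Each evaluation updates a Gaussian-process surrogate that proposes
# the next candidate -- hence the sequential nature of the search.
result = gp_minimize(
    func=lambda params: params[0] ** 2,  # objective: a^2
    dimensions=[(-1.0, 1.0)],            # search space: a in Rand(-1, 1)
    n_calls=10,                          # number of sequential evaluations
    random_state=0,
)
print(result.x, result.fun)  # best a found and its objective value
```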
17. Challenges
● Real-time asynchronous communication is required
● The overhead of checkpointing each iteration can be significant
● A single iterative problem can't be parallelized
● A lot of boilerplate code
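For intuition, here is a minimal, self-contained sketch of synchronous successive halving, the pattern that Hyperband and ASHA build on; `train_step` and the configurations are hypothetical stand-ins. The commented checkpoint step marks where the overhead noted above arises.

```python
import random

# Hypothetical stand-in for one training iteration of a trial;
# returns the updated model state and its current score.
def train_step(config, state):
    state = state + random.uniform(0, config["lr"])
    return state, state

def successive_halving(configs, rungs=3, steps_per_rung=2):
    trials = [{"config": c, "state": 0.0, "score": 0.0} for c in configs]
    for _ in range(rungs):
        for t in trials:  # in a distributed setting this loop runs in parallel
            for _ in range(steps_per_rung):
                t["state"], t["score"] = train_step(t["config"], t["state"])
            # Here each trial would checkpoint t["state"] so survivors can
            # resume on any worker -- the overhead noted on this slide.
        trials.sort(key=lambda t: t["score"], reverse=True)
        trials = trials[: max(1, len(trials) // 2)]  # keep the top half
    return trials[0]

best = successive_halving(
    [{"lr": 10 ** random.uniform(-3, -1)} for _ in range(8)]
)
print(best["config"], best["score"])
```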
24. Let’s Collaborate!
● Create specialized higher-level APIs for major tuning cases so users can do tuning with minimal code and without learning distributed systems
● Enable advanced users to create fully customized, platform-agnostic and scale-agnostic tuning pipelines with Tune's lower-level APIs