SlideShare une entreprise Scribd logo
1  sur  45
Télécharger pour lire hors ligne
Differential Approximation and Sprinting
for Multi-Priority Big Data Engines
Robert Birke1, Isabelly Rocha2, Juan Perez3, Valerio Schiavoni2, Pascal Felber2, Lydia Y. Chen4
ABB Research, Switzerland1
University of Neuchâtel, Switzerland2
Universidad del Rosario, Colombia3
TU Delft, The Netherlands4
December 13th, 2019
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 2
Big Data Analytics FrameworksBig Data Analytics Applications
Context
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 2
Big Data Analytics FrameworksBig Data Analytics Applications
Different requirements:
• Latency
• Accuracy
Context
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 2
Big Data Analytics FrameworksBig Data Analytics Applications
Different requirements:
• Latency
• Accuracy
Solution:
• Priority scheduling
Context
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 3
Scheduling Queue
Priority 2
Priority 1
Big Data Analytics Framework
Priority Scheduling: Preemptive
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 3
Scheduling Queue
Priority 2
Priority 1
Big Data Analytics Framework
Priority Scheduling: Preemptive
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 3
Scheduling Queue
Priority 2
Priority 1
Big Data Analytics Framework
Priority Scheduling: Preemptive
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 3
Scheduling Queue
Priority 2
Priority 1
Big Data Analytics Framework
Priority Scheduling: Preemptive
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 3
Scheduling Queue
Priority 2
Priority 1
Big Data Analytics Framework
Priority Scheduling: Preemptive
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 3
Scheduling Queue
Priority 2
Priority 1
Big Data Analytics Framework
Priority Scheduling: Preemptive
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 3
Scheduling Queue
Priority 2
Priority 1
Big Data Analytics Framework
Priority Scheduling: Preemptive
evict job!
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 3
Scheduling Queue
Priority 2
Priority 1
Big Data Analytics Framework
Priority Scheduling: Preemptive
evict job!
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 3
Scheduling Queue
Priority 2
Priority 1
Big Data Analytics Framework
Priority Scheduling: Preemptive
evict job!
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 3
Scheduling Queue
Priority 2
Priority 1
Big Data Analytics Framework
Priority Scheduling: Preemptive
evict job!
waste
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 4
machine Dme distribuDon
Success execution Failed execution Priority eviction
20 %
35 %
45 %
Priority Scheduling: Preemptive
• Consequences:
• Repetitive eviction of low priority jobs
• Significant latency degradation
• High amount of resources wasted on
subsequent evictions
Source: Demystifying Casualties of Evictions in Big Data Priority Scheduling.
ACM SIGMETRICS Performance Evaluation Review 42, 4 (2015), 12–21.
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 5
Scheduling Queue
Priority 2
Priority 1
Big Data Analytics Framework
Priority Scheduling: Non-Preemptive
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 5
Scheduling Queue
Priority 2
Priority 1
Big Data Analytics Framework
Priority Scheduling: Non-Preemptive
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 5
Scheduling Queue
accuracy
loss
Priority 2
Priority 1
Big Data Analytics Framework
Priority Scheduling: Non-Preemptive
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 5
Scheduling Queue
accuracy
loss
extra
waiting
Priority 2
Priority 1
Big Data Analytics Framework
Priority Scheduling: Non-Preemptive
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 6
DiAS
Deflator
•
Dispatching
•
Monitoring
Dropper
• Defining ratios
Scheduler
Sprinter
• Frequency scaling
Our Approach: DiAS
• Spark as big data processing engine
• Augmented Spark with the capability
of drop tasks
• Prototype implemented in Golang
• Implemented a workload generator
for evaluation purpose
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 7
Deflator DiAS
Deflator
Dropper
Scheduler
Sprinter
• Decide how much data to process base on
• Arrival time
• Size of inputs
• Current system load
• Latency model
• Given that certain inputs are dropped
• Define how the entire latency distribution will change
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 8
• Distributions:
• Phase-Type job processing times to model concurrent task times of jobs
• States:
• Matrix analytics methods, to track states
• Number of jobs in queue/engine
• Number of task queue/engine
• The job distribution across priorities
• Solver:
• MMAP[K]/PH[K]/1 priority queue from Horvath [EJOR’15]
Deflator DiAS
Deflator
Dropper
Scheduler
Sprinter
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 8
• Distributions:
• Phase-Type job processing times to model concurrent task times of jobs
• States:
• Matrix analytics methods, to track states
• Number of jobs in queue/engine
• Number of task queue/engine
• The job distribution across priorities
• Solver:
• MMAP[K]/PH[K]/1 priority queue from Horvath [EJOR’15]
Deflator DiAS
Deflator
Dropper
Scheduler
Sprinter
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 9
0
10
20
30
40
50
60
70
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8
Meanabsolutepercenterror[%]
Θm
• Trade-off between accuracy and latency
• Low dropping ratios -> low accuracy loss
Deflator DiAS
Deflator
Dropper
Scheduler
Sprinter
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 9
0
10
20
30
40
50
60
70
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8
Meanabsolutepercenterror[%]
Θm
• Trade-off between accuracy and latency
• Low dropping ratios -> low accuracy loss
Deflator DiAS
Deflator
Dropper
Scheduler
Sprinter
Dropping ratio
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 9
0
10
20
30
40
50
60
70
0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8
Meanabsolutepercenterror[%]
Θm
• Trade-off between accuracy and latency
• Low dropping ratios -> low accuracy loss
Deflator DiAS
Deflator
Dropper
Scheduler
Sprinter
Dropping ratio
Dropping ratio: 50%
MAE: 42%
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 10
0 0.2 0.4 0.6 0.8
Drop Ratio
0
100
200
300
400
MeanJobResponseTime
model - high
obs - high
model - low
obs - low
• Validation of response time for 2-priority jobs
Deflator DiAS
Deflator
Dropper
Scheduler
Sprinter
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 10
0 0.2 0.4 0.6 0.8
Drop Ratio
0
100
200
300
400
MeanJobResponseTime
model - high
obs - high
model - low
obs - low
• Validation of response time for 2-priority jobs
Deflator DiAS
Deflator
Dropper
Scheduler
Sprinter
Dropping ratio: 20%
Response time: 152 seconds
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 11
Deflator DiAS
Deflator
Dropper
Scheduler
Sprinter
Original MR job
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 11
Deflator DiAS
Deflator
Dropper
Scheduler
Sprinter
Original MR job Approximate MR job
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 12
Deflator
Dropper
Scheduler
Sprinter
DiAS
• Implemented in Spark by modifying the function findMissingPartitions()
• Modification: return only [n(1 − θk )] partitions out of n
• Follows specifications of
Dropper
Deflator
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 13
Deflator
Dropper
Scheduler
Sprinter
DiAS
Sprinter
• Performs CPU frequency scaling
• Handle sprinting timer for each job
• Budget defined from different constraints
• Thermal
• Power
• Provisioning
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 14
Our Approach: Overview DiAS
Deflator
Dropper
Scheduler
Sprinter
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019
• Objetive
• Low waste
• Latency/accuracy requirements
• Multi-priority
14
Our Approach: Overview DiAS
Deflator
Dropper
Scheduler
Sprinter
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019
• Objetive
• Low waste
• Latency/accuracy requirements
• Multi-priority
14
Our Approach: Overview
• Challenges
• Many parameters
• Tail latency
• Priority
DiAS
Deflator
Dropper
Scheduler
Sprinter
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019
• Objetive
• Low waste
• Latency/accuracy requirements
• Multi-priority
• Goal
• Define: Task dropping ratio and sprinting timeout
• Given: Priority class, tolerance to accuracy degradation, available sprinting budget
14
Our Approach: Overview
• Challenges
• Many parameters
• Tail latency
• Priority
DiAS
Deflator
Dropper
Scheduler
Sprinter
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 15
• Spark processing engine
• Version 2.1
• 10 workers cluster
• 2 CPU cores and 4 GB memory per worker
• Intel Xeon E3-1270 v6 CPU, 64 cores and 128 GB memory
• Key parameters
• Ratio between low- and high priority jobs
• Average size
• Cluster load
• Workload
• Text analysis jobs
• Graph analysis jobs
Evaluation Setup
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 16
• Differential Approximation
• Two- and three priority system
• Sensitivity analisys
• Similar job size for all priorities
• Several high- to low-priority job ratio
• Several system load
• Differential Approximation and Sprinting
• Latency gain
• Energy gain
Evaluation
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019
• Tail and average latency of low priority job decreases
• Average latency of high priority job increases
17
1
10
100
1000
P NP DA
0/10
DA
0/20
-80
-60
-40
-20
0
20
40
60
80
Responsetime[s]
Difference[%]
High
Low
Evaluation: Differential Approximation
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019
• Tail and average latency of low priority job decreases
• Average latency of high priority job increases
17
1
10
100
1000
P NP DA
0/10
DA
0/20
-80
-60
-40
-20
0
20
40
60
80
Responsetime[s]
Difference[%]
High
Low
Evaluation: Differential Approximation
Preemptive
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019
• Tail and average latency of low priority job decreases
• Average latency of high priority job increases
17
1
10
100
1000
P NP DA
0/10
DA
0/20
-80
-60
-40
-20
0
20
40
60
80
Responsetime[s]
Difference[%]
High
Low
Evaluation: Differential Approximation
Preemptive
Non-Preemptive
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019
• Tail and average latency of low priority job decreases
• Average latency of high priority job increases
17
1
10
100
1000
P NP DA
0/10
DA
0/20
-80
-60
-40
-20
0
20
40
60
80
Responsetime[s]
Difference[%]
High
Low
Evaluation: Differential Approximation
Preemptive
Non-Preemptive
Differential Approximation
x/y: drop ratio on high- (x)
and low-priority (y)
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 18
1
10
100
1000
10000
P DiAS
0/10
DiAS
0/20
-80
-60
-40
-20
0
20
40
60
80
Responsetime[s]
Difference[%]
High
Low
• Tail and average latency of low priority job decreases even more (up to 90%)
• Average latency of high priority also decreases
Evaluation: Differential Approximation and Sprinting
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019
• Reduction of more than 20% on energy consumption
19
1
10
100
1000
10000
P DiAS
0/10
DiAS
0/20
-80
-60
-40
-20
0
20
40
60
80
Energy[kJ]
Difference[%]
Unimited
Limited
Evaluation: Differential Approximation and Sprinting
Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 20
• Main goal: reduce resource waste
• Strategy:
• Drop job eviction
• Deflate low-priority jobs
• Sprint high-priority jobs
• Additional gains:
• Up to 90% latency reduction
• More than 20% energy savings
• Implemented in Golang on top of Spark
Summary and Takeaways

Contenu connexe

Tendances

ACDKOCHI19 - Next Generation Data Analytics Platform on AWS
ACDKOCHI19 - Next Generation Data Analytics Platform on AWSACDKOCHI19 - Next Generation Data Analytics Platform on AWS
ACDKOCHI19 - Next Generation Data Analytics Platform on AWSAWS User Group Kochi
 
SAP HANA SPS10- Hadoop Integration
SAP HANA SPS10- Hadoop IntegrationSAP HANA SPS10- Hadoop Integration
SAP HANA SPS10- Hadoop IntegrationSAP Technology
 
CIO Guide to Using SAP HANA Platform For Big Data
CIO Guide to Using SAP HANA Platform For Big DataCIO Guide to Using SAP HANA Platform For Big Data
CIO Guide to Using SAP HANA Platform For Big DataSnehanshu Shah
 
Keynote – From MapReduce to Spark: An Ecosystem Evolves by Doug Cutting, Chie...
Keynote – From MapReduce to Spark: An Ecosystem Evolves by Doug Cutting, Chie...Keynote – From MapReduce to Spark: An Ecosystem Evolves by Doug Cutting, Chie...
Keynote – From MapReduce to Spark: An Ecosystem Evolves by Doug Cutting, Chie...Cloudera, Inc.
 
SAP HANA Vora SITMTY 20160707
SAP HANA Vora SITMTY 20160707SAP HANA Vora SITMTY 20160707
SAP HANA Vora SITMTY 20160707Henrique Pinto
 
Big Data, Big Thinking: Simplified Architecture Webinar Fact Sheet
Big Data, Big Thinking: Simplified Architecture Webinar Fact SheetBig Data, Big Thinking: Simplified Architecture Webinar Fact Sheet
Big Data, Big Thinking: Simplified Architecture Webinar Fact SheetSAP Technology
 
Advanced analytics with sap hana and r
Advanced analytics with sap hana and rAdvanced analytics with sap hana and r
Advanced analytics with sap hana and rSAP Technology
 
Building Information Platform - Integration of Hadoop with SAP HANA and HANA ...
Building Information Platform - Integration of Hadoop with SAP HANA and HANA ...Building Information Platform - Integration of Hadoop with SAP HANA and HANA ...
Building Information Platform - Integration of Hadoop with SAP HANA and HANA ...DataWorks Summit/Hadoop Summit
 
SAP HANA - The Foundation of Real Time, Now on the AWS Cloud Computing Platform
SAP HANA - The Foundation of Real Time, Now on the AWS Cloud Computing PlatformSAP HANA - The Foundation of Real Time, Now on the AWS Cloud Computing Platform
SAP HANA - The Foundation of Real Time, Now on the AWS Cloud Computing PlatformAmazon Web Services
 
Extending the Reach of R to the Enterprise with TERR and Spotfire
Extending the Reach of R to the Enterprise with TERR and SpotfireExtending the Reach of R to the Enterprise with TERR and Spotfire
Extending the Reach of R to the Enterprise with TERR and SpotfireLou Bajuk
 
SAP Helps Reduce Silos Between Business and Spatial Data
SAP Helps Reduce Silos Between Business and Spatial DataSAP Helps Reduce Silos Between Business and Spatial Data
SAP Helps Reduce Silos Between Business and Spatial DataSAP Technology
 

Tendances (11)

ACDKOCHI19 - Next Generation Data Analytics Platform on AWS
ACDKOCHI19 - Next Generation Data Analytics Platform on AWSACDKOCHI19 - Next Generation Data Analytics Platform on AWS
ACDKOCHI19 - Next Generation Data Analytics Platform on AWS
 
SAP HANA SPS10- Hadoop Integration
SAP HANA SPS10- Hadoop IntegrationSAP HANA SPS10- Hadoop Integration
SAP HANA SPS10- Hadoop Integration
 
CIO Guide to Using SAP HANA Platform For Big Data
CIO Guide to Using SAP HANA Platform For Big DataCIO Guide to Using SAP HANA Platform For Big Data
CIO Guide to Using SAP HANA Platform For Big Data
 
Keynote – From MapReduce to Spark: An Ecosystem Evolves by Doug Cutting, Chie...
Keynote – From MapReduce to Spark: An Ecosystem Evolves by Doug Cutting, Chie...Keynote – From MapReduce to Spark: An Ecosystem Evolves by Doug Cutting, Chie...
Keynote – From MapReduce to Spark: An Ecosystem Evolves by Doug Cutting, Chie...
 
SAP HANA Vora SITMTY 20160707
SAP HANA Vora SITMTY 20160707SAP HANA Vora SITMTY 20160707
SAP HANA Vora SITMTY 20160707
 
Big Data, Big Thinking: Simplified Architecture Webinar Fact Sheet
Big Data, Big Thinking: Simplified Architecture Webinar Fact SheetBig Data, Big Thinking: Simplified Architecture Webinar Fact Sheet
Big Data, Big Thinking: Simplified Architecture Webinar Fact Sheet
 
Advanced analytics with sap hana and r
Advanced analytics with sap hana and rAdvanced analytics with sap hana and r
Advanced analytics with sap hana and r
 
Building Information Platform - Integration of Hadoop with SAP HANA and HANA ...
Building Information Platform - Integration of Hadoop with SAP HANA and HANA ...Building Information Platform - Integration of Hadoop with SAP HANA and HANA ...
Building Information Platform - Integration of Hadoop with SAP HANA and HANA ...
 
SAP HANA - The Foundation of Real Time, Now on the AWS Cloud Computing Platform
SAP HANA - The Foundation of Real Time, Now on the AWS Cloud Computing PlatformSAP HANA - The Foundation of Real Time, Now on the AWS Cloud Computing Platform
SAP HANA - The Foundation of Real Time, Now on the AWS Cloud Computing Platform
 
Extending the Reach of R to the Enterprise with TERR and Spotfire
Extending the Reach of R to the Enterprise with TERR and SpotfireExtending the Reach of R to the Enterprise with TERR and Spotfire
Extending the Reach of R to the Enterprise with TERR and Spotfire
 
SAP Helps Reduce Silos Between Business and Spatial Data
SAP Helps Reduce Silos Between Business and Spatial DataSAP Helps Reduce Silos Between Business and Spatial Data
SAP Helps Reduce Silos Between Business and Spatial Data
 

Similaire à Differential Approximation and Sprinting for Multi-Priority Big Data Engines

Innovate 2014 - What's New in Reporting and Analytics
Innovate 2014 - What's New in Reporting and AnalyticsInnovate 2014 - What's New in Reporting and Analytics
Innovate 2014 - What's New in Reporting and AnalyticsDragos Cojocari
 
Best Laid Plans: Saving Time, Money and Trouble with Optimal Forecasting
Best Laid Plans: Saving Time, Money and Trouble with Optimal ForecastingBest Laid Plans: Saving Time, Money and Trouble with Optimal Forecasting
Best Laid Plans: Saving Time, Money and Trouble with Optimal ForecastingEric Kavanagh
 
Pr dc 2015 sql server is cheaper than open source
Pr dc 2015 sql server is cheaper than open sourcePr dc 2015 sql server is cheaper than open source
Pr dc 2015 sql server is cheaper than open sourceTerry Bunio
 
Pivotal Digital Transformation Forum: Journey to Become a Data-Driven Enterprise
Pivotal Digital Transformation Forum: Journey to Become a Data-Driven EnterprisePivotal Digital Transformation Forum: Journey to Become a Data-Driven Enterprise
Pivotal Digital Transformation Forum: Journey to Become a Data-Driven EnterpriseVMware Tanzu
 
Seeing Redshift: How Amazon Changed Data Warehousing Forever
Seeing Redshift: How Amazon Changed Data Warehousing ForeverSeeing Redshift: How Amazon Changed Data Warehousing Forever
Seeing Redshift: How Amazon Changed Data Warehousing ForeverInside Analysis
 
Enablement Session - IBP Transformation Final_C.pptx
Enablement Session - IBP Transformation Final_C.pptxEnablement Session - IBP Transformation Final_C.pptx
Enablement Session - IBP Transformation Final_C.pptxKaustubh M
 
Auto­matic Para­meter Tun­ing for Data­bases and Big Data Sys­tems
Auto­matic Para­meter Tun­ing for Data­bases and Big Data Sys­tems Auto­matic Para­meter Tun­ing for Data­bases and Big Data Sys­tems
Auto­matic Para­meter Tun­ing for Data­bases and Big Data Sys­tems Jiaheng Lu
 
The New Frontier: Optimizing Big Data Exploration
The New Frontier: Optimizing Big Data ExplorationThe New Frontier: Optimizing Big Data Exploration
The New Frontier: Optimizing Big Data ExplorationInside Analysis
 
Data Discovery and BI - Is there Really a Difference?
Data Discovery and BI - Is there Really a Difference?Data Discovery and BI - Is there Really a Difference?
Data Discovery and BI - Is there Really a Difference?Inside Analysis
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database RoundtableEric Kavanagh
 
Zurich Italia - IT Knowledge Base (English)
Zurich Italia - IT Knowledge Base (English)Zurich Italia - IT Knowledge Base (English)
Zurich Italia - IT Knowledge Base (English)Neo4j
 
SM Energy and Akili Discuss How to Accelerate Your New Asset Assimilation
SM Energy and Akili Discuss How to Accelerate Your New Asset AssimilationSM Energy and Akili Discuss How to Accelerate Your New Asset Assimilation
SM Energy and Akili Discuss How to Accelerate Your New Asset Assimilationrnaramore
 
Spark meetup2 final (Taboola)
Spark meetup2 final (Taboola) Spark meetup2 final (Taboola)
Spark meetup2 final (Taboola) tsliwowicz
 
Predicting When Your Applications Will Go Off the Rails! Managing DB2 Appli...
Predicting When Your Applications Will Go Off the Rails!  Managing DB2 Appli...Predicting When Your Applications Will Go Off the Rails!  Managing DB2 Appli...
Predicting When Your Applications Will Go Off the Rails! Managing DB2 Appli...CA Technologies
 
No REST till Production – Building and Deploying 9 Models to Production in 3 ...
No REST till Production – Building and Deploying 9 Models to Production in 3 ...No REST till Production – Building and Deploying 9 Models to Production in 3 ...
No REST till Production – Building and Deploying 9 Models to Production in 3 ...Databricks
 
KTern.AI-SAP-DXaaS-Workshop-PLAN
KTern.AI-SAP-DXaaS-Workshop-PLANKTern.AI-SAP-DXaaS-Workshop-PLAN
KTern.AI-SAP-DXaaS-Workshop-PLANKTern.AI
 
Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)Yellowfin
 
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...DataWorks Summit/Hadoop Summit
 

Similaire à Differential Approximation and Sprinting for Multi-Priority Big Data Engines (20)

Innovate 2014 - What's New in Reporting and Analytics
Innovate 2014 - What's New in Reporting and AnalyticsInnovate 2014 - What's New in Reporting and Analytics
Innovate 2014 - What's New in Reporting and Analytics
 
Best Laid Plans: Saving Time, Money and Trouble with Optimal Forecasting
Best Laid Plans: Saving Time, Money and Trouble with Optimal ForecastingBest Laid Plans: Saving Time, Money and Trouble with Optimal Forecasting
Best Laid Plans: Saving Time, Money and Trouble with Optimal Forecasting
 
Local Waste Service Standards Front End Integration Workshop
Local Waste Service Standards Front End Integration WorkshopLocal Waste Service Standards Front End Integration Workshop
Local Waste Service Standards Front End Integration Workshop
 
Pr dc 2015 sql server is cheaper than open source
Pr dc 2015 sql server is cheaper than open sourcePr dc 2015 sql server is cheaper than open source
Pr dc 2015 sql server is cheaper than open source
 
Pivotal Digital Transformation Forum: Journey to Become a Data-Driven Enterprise
Pivotal Digital Transformation Forum: Journey to Become a Data-Driven EnterprisePivotal Digital Transformation Forum: Journey to Become a Data-Driven Enterprise
Pivotal Digital Transformation Forum: Journey to Become a Data-Driven Enterprise
 
Seeing Redshift: How Amazon Changed Data Warehousing Forever
Seeing Redshift: How Amazon Changed Data Warehousing ForeverSeeing Redshift: How Amazon Changed Data Warehousing Forever
Seeing Redshift: How Amazon Changed Data Warehousing Forever
 
Enablement Session - IBP Transformation Final_C.pptx
Enablement Session - IBP Transformation Final_C.pptxEnablement Session - IBP Transformation Final_C.pptx
Enablement Session - IBP Transformation Final_C.pptx
 
Auto­matic Para­meter Tun­ing for Data­bases and Big Data Sys­tems
Auto­matic Para­meter Tun­ing for Data­bases and Big Data Sys­tems Auto­matic Para­meter Tun­ing for Data­bases and Big Data Sys­tems
Auto­matic Para­meter Tun­ing for Data­bases and Big Data Sys­tems
 
The New Frontier: Optimizing Big Data Exploration
The New Frontier: Optimizing Big Data ExplorationThe New Frontier: Optimizing Big Data Exploration
The New Frontier: Optimizing Big Data Exploration
 
Data Discovery and BI - Is there Really a Difference?
Data Discovery and BI - Is there Really a Difference?Data Discovery and BI - Is there Really a Difference?
Data Discovery and BI - Is there Really a Difference?
 
Horses for Courses: Database Roundtable
Horses for Courses: Database RoundtableHorses for Courses: Database Roundtable
Horses for Courses: Database Roundtable
 
Zurich Italia - IT Knowledge Base (English)
Zurich Italia - IT Knowledge Base (English)Zurich Italia - IT Knowledge Base (English)
Zurich Italia - IT Knowledge Base (English)
 
SM Energy and Akili Discuss How to Accelerate Your New Asset Assimilation
SM Energy and Akili Discuss How to Accelerate Your New Asset AssimilationSM Energy and Akili Discuss How to Accelerate Your New Asset Assimilation
SM Energy and Akili Discuss How to Accelerate Your New Asset Assimilation
 
Spark meetup2 final (Taboola)
Spark meetup2 final (Taboola) Spark meetup2 final (Taboola)
Spark meetup2 final (Taboola)
 
Predicting When Your Applications Will Go Off the Rails! Managing DB2 Appli...
Predicting When Your Applications Will Go Off the Rails!  Managing DB2 Appli...Predicting When Your Applications Will Go Off the Rails!  Managing DB2 Appli...
Predicting When Your Applications Will Go Off the Rails! Managing DB2 Appli...
 
Resume
ResumeResume
Resume
 
No REST till Production – Building and Deploying 9 Models to Production in 3 ...
No REST till Production – Building and Deploying 9 Models to Production in 3 ...No REST till Production – Building and Deploying 9 Models to Production in 3 ...
No REST till Production – Building and Deploying 9 Models to Production in 3 ...
 
KTern.AI-SAP-DXaaS-Workshop-PLAN
KTern.AI-SAP-DXaaS-Workshop-PLANKTern.AI-SAP-DXaaS-Workshop-PLAN
KTern.AI-SAP-DXaaS-Workshop-PLAN
 
Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)Making Big Data Analytics with Hadoop fast & easy (webinar slides)
Making Big Data Analytics with Hadoop fast & easy (webinar slides)
 
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...
Modernizing Business Processes with Big Data: Real-World Use Cases for Produc...
 

Plus de LEGATO project

Scrooge Attack: Undervolting ARM Processors for Profit
Scrooge Attack: Undervolting ARM Processors for ProfitScrooge Attack: Undervolting ARM Processors for Profit
Scrooge Attack: Undervolting ARM Processors for ProfitLEGATO project
 
A practical approach for updating an integrity-enforced operating system
A practical approach for updating an integrity-enforced operating systemA practical approach for updating an integrity-enforced operating system
A practical approach for updating an integrity-enforced operating systemLEGATO project
 
TEEMon: A continuous performance monitoring framework for TEEs
TEEMon: A continuous performance monitoring framework for TEEsTEEMon: A continuous performance monitoring framework for TEEs
TEEMon: A continuous performance monitoring framework for TEEsLEGATO project
 
secureTF: A Secure TensorFlow Framework
secureTF: A Secure TensorFlow FrameworksecureTF: A Secure TensorFlow Framework
secureTF: A Secure TensorFlow FrameworkLEGATO project
 
PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...
PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...
PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...LEGATO project
 
LEGaTO: Machine Learning Use Case
LEGaTO: Machine Learning Use CaseLEGaTO: Machine Learning Use Case
LEGaTO: Machine Learning Use CaseLEGATO project
 
Smart Home AI at the edge
Smart Home AI at the edgeSmart Home AI at the edge
Smart Home AI at the edgeLEGATO project
 
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the project
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the projectLEGaTO: Low-Energy Heterogeneous Computing Use of AI in the project
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the projectLEGATO project
 
LEGaTO: Software Stack Programming Models
LEGaTO: Software Stack Programming ModelsLEGaTO: Software Stack Programming Models
LEGaTO: Software Stack Programming ModelsLEGATO project
 
LEGaTO: Software Stack Runtimes
LEGaTO: Software Stack RuntimesLEGaTO: Software Stack Runtimes
LEGaTO: Software Stack RuntimesLEGATO project
 
LEGaTO Heterogeneous Hardware
LEGaTO Heterogeneous HardwareLEGaTO Heterogeneous Hardware
LEGaTO Heterogeneous HardwareLEGATO project
 
LEGaTO: Low-Energy Heterogeneous Computing Workshop
LEGaTO: Low-Energy Heterogeneous Computing WorkshopLEGaTO: Low-Energy Heterogeneous Computing Workshop
LEGaTO: Low-Energy Heterogeneous Computing WorkshopLEGATO project
 
TZ4Fabric: Executing Smart Contracts with ARM TrustZone
TZ4Fabric: Executing Smart Contracts with ARM TrustZoneTZ4Fabric: Executing Smart Contracts with ARM TrustZone
TZ4Fabric: Executing Smart Contracts with ARM TrustZoneLEGATO project
 
Infection Research with Maxeler Dataflow Computing
Infection Research with Maxeler Dataflow ComputingInfection Research with Maxeler Dataflow Computing
Infection Research with Maxeler Dataflow ComputingLEGATO project
 
Smart Home - AI at the edge
Smart Home - AI at the edgeSmart Home - AI at the edge
Smart Home - AI at the edgeLEGATO project
 
FPGA Undervolting and Checkpointing for Energy-Efficiency and Error-Resiliency
FPGA Undervolting and Checkpointing for Energy-Efficiency and Error-ResiliencyFPGA Undervolting and Checkpointing for Energy-Efficiency and Error-Resiliency
FPGA Undervolting and Checkpointing for Energy-Efficiency and Error-ResiliencyLEGATO project
 
Device Data Directory and Asynchronous execution: A path to heterogeneous com...
Device Data Directory and Asynchronous execution: A path to heterogeneous com...Device Data Directory and Asynchronous execution: A path to heterogeneous com...
Device Data Directory and Asynchronous execution: A path to heterogeneous com...LEGATO project
 
Scheduling Task-parallel Applications in Dynamically Asymmetric Environments
Scheduling Task-parallel Applications in Dynamically Asymmetric EnvironmentsScheduling Task-parallel Applications in Dynamically Asymmetric Environments
Scheduling Task-parallel Applications in Dynamically Asymmetric EnvironmentsLEGATO project
 

Plus de LEGATO project (20)

Scrooge Attack: Undervolting ARM Processors for Profit
Scrooge Attack: Undervolting ARM Processors for ProfitScrooge Attack: Undervolting ARM Processors for Profit
Scrooge Attack: Undervolting ARM Processors for Profit
 
A practical approach for updating an integrity-enforced operating system
A practical approach for updating an integrity-enforced operating systemA practical approach for updating an integrity-enforced operating system
A practical approach for updating an integrity-enforced operating system
 
TEEMon: A continuous performance monitoring framework for TEEs
TEEMon: A continuous performance monitoring framework for TEEsTEEMon: A continuous performance monitoring framework for TEEs
TEEMon: A continuous performance monitoring framework for TEEs
 
secureTF: A Secure TensorFlow Framework
secureTF: A Secure TensorFlow FrameworksecureTF: A Secure TensorFlow Framework
secureTF: A Secure TensorFlow Framework
 
PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...
PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...
PipeTune: Pipeline Parallelism of Hyper and System Parameters Tuning for Deep...
 
LEGaTO: Machine Learning Use Case
LEGaTO: Machine Learning Use CaseLEGaTO: Machine Learning Use Case
LEGaTO: Machine Learning Use Case
 
Smart Home AI at the edge
Smart Home AI at the edgeSmart Home AI at the edge
Smart Home AI at the edge
 
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the project
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the projectLEGaTO: Low-Energy Heterogeneous Computing Use of AI in the project
LEGaTO: Low-Energy Heterogeneous Computing Use of AI in the project
 
LEGaTO Integration
LEGaTO IntegrationLEGaTO Integration
LEGaTO Integration
 
LEGaTO: Use cases
LEGaTO: Use casesLEGaTO: Use cases
LEGaTO: Use cases
 
LEGaTO: Software Stack Programming Models
LEGaTO: Software Stack Programming ModelsLEGaTO: Software Stack Programming Models
LEGaTO: Software Stack Programming Models
 
LEGaTO: Software Stack Runtimes
LEGaTO: Software Stack RuntimesLEGaTO: Software Stack Runtimes
LEGaTO: Software Stack Runtimes
 
LEGaTO Heterogeneous Hardware
LEGaTO Heterogeneous HardwareLEGaTO Heterogeneous Hardware
LEGaTO Heterogeneous Hardware
 
LEGaTO: Low-Energy Heterogeneous Computing Workshop
LEGaTO: Low-Energy Heterogeneous Computing WorkshopLEGaTO: Low-Energy Heterogeneous Computing Workshop
LEGaTO: Low-Energy Heterogeneous Computing Workshop
 
TZ4Fabric: Executing Smart Contracts with ARM TrustZone
TZ4Fabric: Executing Smart Contracts with ARM TrustZoneTZ4Fabric: Executing Smart Contracts with ARM TrustZone
TZ4Fabric: Executing Smart Contracts with ARM TrustZone
 
Infection Research with Maxeler Dataflow Computing
Infection Research with Maxeler Dataflow ComputingInfection Research with Maxeler Dataflow Computing
Infection Research with Maxeler Dataflow Computing
 
Smart Home - AI at the edge
Smart Home - AI at the edgeSmart Home - AI at the edge
Smart Home - AI at the edge
 
FPGA Undervolting and Checkpointing for Energy-Efficiency and Error-Resiliency
FPGA Undervolting and Checkpointing for Energy-Efficiency and Error-ResiliencyFPGA Undervolting and Checkpointing for Energy-Efficiency and Error-Resiliency
FPGA Undervolting and Checkpointing for Energy-Efficiency and Error-Resiliency
 
Device Data Directory and Asynchronous execution: A path to heterogeneous com...
Device Data Directory and Asynchronous execution: A path to heterogeneous com...Device Data Directory and Asynchronous execution: A path to heterogeneous com...
Device Data Directory and Asynchronous execution: A path to heterogeneous com...
 
Scheduling Task-parallel Applications in Dynamically Asymmetric Environments
Scheduling Task-parallel Applications in Dynamically Asymmetric EnvironmentsScheduling Task-parallel Applications in Dynamically Asymmetric Environments
Scheduling Task-parallel Applications in Dynamically Asymmetric Environments
 

Dernier

FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryAlex Henderson
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learninglevieagacer
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...Scintica Instrumentation
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)Areesha Ahmad
 
Exploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfExploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfrohankumarsinghrore1
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flyPRADYUMMAURYA1
 
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...Monika Rani
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Serviceshivanisharma5244
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptxryanrooker
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY1301aanya
 
Chemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdfChemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdfSumit Kumar yadav
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxSuji236384
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusNazaninKarimi6
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspectsmuralinath2
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)Areesha Ahmad
 
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxClimate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxDiariAli
 
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)AkefAfaneh2
 
An introduction on sequence tagged site mapping
An introduction on sequence tagged site mappingAn introduction on sequence tagged site mapping
An introduction on sequence tagged site mappingadibshanto115
 

Dernier (20)

FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Module for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learningModule for Grade 9 for Asynchronous/Distance learning
Module for Grade 9 for Asynchronous/Distance learning
 
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
(May 9, 2024) Enhanced Ultrafast Vector Flow Imaging (VFI) Using Multi-Angle ...
 
GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)GBSN - Microbiology (Unit 1)
GBSN - Microbiology (Unit 1)
 
Exploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdfExploring Criminology and Criminal Behaviour.pdf
Exploring Criminology and Criminal Behaviour.pdf
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit flypumpkin fruit fly, water melon fruit fly, cucumber fruit fly
pumpkin fruit fly, water melon fruit fly, cucumber fruit fly
 
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS  ESCORT SERVICE In Bhiwan...
Bhiwandi Bhiwandi ❤CALL GIRL 7870993772 ❤CALL GIRLS ESCORT SERVICE In Bhiwan...
 
Site Acceptance Test .
Site Acceptance Test                    .Site Acceptance Test                    .
Site Acceptance Test .
 
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort ServiceCall Girls Ahmedabad +917728919243 call me Independent Escort Service
Call Girls Ahmedabad +917728919243 call me Independent Escort Service
 
300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx300003-World Science Day For Peace And Development.pptx
300003-World Science Day For Peace And Development.pptx
 
biology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGYbiology HL practice questions IB BIOLOGY
biology HL practice questions IB BIOLOGY
 
Chemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdfChemistry 5th semester paper 1st Notes.pdf
Chemistry 5th semester paper 1st Notes.pdf
 
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptxPSYCHOSOCIAL NEEDS. in nursing II sem pptx
PSYCHOSOCIAL NEEDS. in nursing II sem pptx
 
development of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virusdevelopment of diagnostic enzyme assay to detect leuser virus
development of diagnostic enzyme assay to detect leuser virus
 
Dr. E. Muralinath_ Blood indices_clinical aspects
Dr. E. Muralinath_ Blood indices_clinical  aspectsDr. E. Muralinath_ Blood indices_clinical  aspects
Dr. E. Muralinath_ Blood indices_clinical aspects
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptxClimate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
Climate Change Impacts on Terrestrial and Aquatic Ecosystems.pptx
 
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)COMPUTING ANTI-DERIVATIVES(Integration by SUBSTITUTION)
COMPUTING ANTI-DERIVATIVES (Integration by SUBSTITUTION)
 
An introduction on sequence tagged site mapping
An introduction on sequence tagged site mappingAn introduction on sequence tagged site mapping
An introduction on sequence tagged site mapping
 

Differential Approximation and Sprinting for Multi-Priority Big Data Engines

  • 1. Differential Approximation and Sprinting for Multi-Priority Big Data Engines Robert Birke1, Isabelly Rocha2, Juan Perez3, Valerio Schiavoni2, Pascal Felber2, Lydia Y. Chen4 ABB Research, Switzerland1 University of Neuchâtel, Switzerland2 Universidad del Rosario, Colombia3 TU Delft, The Netherlands4 December 13th, 2019
  • 2. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 2 Big Data Analytics FrameworksBig Data Analytics Applications Context
  • 3. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 2 Big Data Analytics FrameworksBig Data Analytics Applications Different requirements: • Latency • Accuracy Context
  • 4. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 2 Big Data Analytics FrameworksBig Data Analytics Applications Different requirements: • Latency • Accuracy Solution: • Priority scheduling Context
  • 5. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 3 Scheduling Queue Priority 2 Priority 1 Big Data Analytics Framework Priority Scheduling: Preemptive
  • 6. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 3 Scheduling Queue Priority 2 Priority 1 Big Data Analytics Framework Priority Scheduling: Preemptive
  • 7. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 3 Scheduling Queue Priority 2 Priority 1 Big Data Analytics Framework Priority Scheduling: Preemptive
  • 8. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 3 Scheduling Queue Priority 2 Priority 1 Big Data Analytics Framework Priority Scheduling: Preemptive
  • 9. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 3 Scheduling Queue Priority 2 Priority 1 Big Data Analytics Framework Priority Scheduling: Preemptive
  • 10. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 3 Scheduling Queue Priority 2 Priority 1 Big Data Analytics Framework Priority Scheduling: Preemptive
  • 11. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 3 Scheduling Queue Priority 2 Priority 1 Big Data Analytics Framework Priority Scheduling: Preemptive evict job!
  • 12. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 3 Scheduling Queue Priority 2 Priority 1 Big Data Analytics Framework Priority Scheduling: Preemptive evict job!
  • 13. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 3 Scheduling Queue Priority 2 Priority 1 Big Data Analytics Framework Priority Scheduling: Preemptive evict job!
  • 14. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 3 Scheduling Queue Priority 2 Priority 1 Big Data Analytics Framework Priority Scheduling: Preemptive evict job! waste
  • 15. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 4 machine Dme distribuDon Success execution Failed execution Priority eviction 20 % 35 % 45 % Priority Scheduling: Preemptive • Consequences: • Repetitive eviction of low priority jobs • Significant latency degradation • High amount of resources wasted on subsequent evictions Source: Demystifying Casualties of Evictions in Big Data Priority Scheduling. ACM SIGMETRICS Performance Evaluation Review 42, 4 (2015), 12–21.
  • 16. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 5 Scheduling Queue Priority 2 Priority 1 Big Data Analytics Framework Priority Scheduling: Non-Preemptive
  • 17. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 5 Scheduling Queue Priority 2 Priority 1 Big Data Analytics Framework Priority Scheduling: Non-Preemptive
  • 18. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 5 Scheduling Queue accuracy loss Priority 2 Priority 1 Big Data Analytics Framework Priority Scheduling: Non-Preemptive
  • 19. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 5 Scheduling Queue accuracy loss extra waiting Priority 2 Priority 1 Big Data Analytics Framework Priority Scheduling: Non-Preemptive
  • 20. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 6 DiAS Deflator • Dispatching • Monitoring Dropper • Defining ratios Scheduler Sprinter • Frequency scaling Our Approach: DiAS • Spark as big data processing engine • Augmented Spark with the capability of drop tasks • Prototype implemented in Golang • Implemented a workload generator for evaluation purpose
  • 21. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 7 Deflator DiAS Deflator Dropper Scheduler Sprinter • Decide how much data to process base on • Arrival time • Size of inputs • Current system load • Latency model • Given that certain inputs are dropped • Define how the entire latency distribution will change
  • 22. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 8 • Distributions: • Phase-Type job processing times to model concurrent task times of jobs • States: • Matrix analytics methods, to track states • Number of jobs in queue/engine • Number of task queue/engine • The job distribution across priorities • Solver: • MMAP[K]/PH[K]/1 priority queue from Horvath [EJOR’15] Deflator DiAS Deflator Dropper Scheduler Sprinter
  • 23. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 8 • Distributions: • Phase-Type job processing times to model concurrent task times of jobs • States: • Matrix analytics methods, to track states • Number of jobs in queue/engine • Number of task queue/engine • The job distribution across priorities • Solver: • MMAP[K]/PH[K]/1 priority queue from Horvath [EJOR’15] Deflator DiAS Deflator Dropper Scheduler Sprinter
  • 24. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 9 0 10 20 30 40 50 60 70 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 Meanabsolutepercenterror[%] Θm • Trade-off between accuracy and latency • Low dropping ratios -> low accuracy loss Deflator DiAS Deflator Dropper Scheduler Sprinter
  • 25. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 9 0 10 20 30 40 50 60 70 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 Meanabsolutepercenterror[%] Θm • Trade-off between accuracy and latency • Low dropping ratios -> low accuracy loss Deflator DiAS Deflator Dropper Scheduler Sprinter Dropping ratio
  • 26. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 9 0 10 20 30 40 50 60 70 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 Meanabsolutepercenterror[%] Θm • Trade-off between accuracy and latency • Low dropping ratios -> low accuracy loss Deflator DiAS Deflator Dropper Scheduler Sprinter Dropping ratio Dropping ratio: 50% MAE: 42%
  • 27. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 10 0 0.2 0.4 0.6 0.8 Drop Ratio 0 100 200 300 400 MeanJobResponseTime model - high obs - high model - low obs - low • Validation of response time for 2-priority jobs Deflator DiAS Deflator Dropper Scheduler Sprinter
  • 28. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 10 0 0.2 0.4 0.6 0.8 Drop Ratio 0 100 200 300 400 MeanJobResponseTime model - high obs - high model - low obs - low • Validation of response time for 2-priority jobs Deflator DiAS Deflator Dropper Scheduler Sprinter Dropping ratio: 20% Response time: 152 seconds
  • 29. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 11 Deflator DiAS Deflator Dropper Scheduler Sprinter Original MR job
  • 30. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 11 Deflator DiAS Deflator Dropper Scheduler Sprinter Original MR job Approximate MR job
  • 31. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 12 Deflator Dropper Scheduler Sprinter DiAS • Implemented in Spark by modifying the function findMissingPartitions() • Modification: return only [n(1 − θk )] partitions out of n • Follows specifications of Dropper Deflator
  • 32. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 13 Deflator Dropper Scheduler Sprinter DiAS Sprinter • Performs CPU frequency scaling • Handle sprinting timer for each job • Budget defined from different constraints • Thermal • Power • Provisioning
  • 33. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 14 Our Approach: Overview DiAS Deflator Dropper Scheduler Sprinter
  • 34. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 • Objetive • Low waste • Latency/accuracy requirements • Multi-priority 14 Our Approach: Overview DiAS Deflator Dropper Scheduler Sprinter
  • 35. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 • Objetive • Low waste • Latency/accuracy requirements • Multi-priority 14 Our Approach: Overview • Challenges • Many parameters • Tail latency • Priority DiAS Deflator Dropper Scheduler Sprinter
  • 36. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 • Objetive • Low waste • Latency/accuracy requirements • Multi-priority • Goal • Define: Task dropping ratio and sprinting timeout • Given: Priority class, tolerance to accuracy degradation, available sprinting budget 14 Our Approach: Overview • Challenges • Many parameters • Tail latency • Priority DiAS Deflator Dropper Scheduler Sprinter
  • 37. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 15 • Spark processing engine • Version 2.1 • 10 workers cluster • 2 CPU cores and 4 GB memory per worker • Intel Xeon E3-1270 v6 CPU, 64 cores and 128 GB memory • Key parameters • Ratio between low- and high priority jobs • Average size • Cluster load • Workload • Text analysis jobs • Graph analysis jobs Evaluation Setup
  • 38. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 16 • Differential Approximation • Two- and three priority system • Sensitivity analisys • Similar job size for all priorities • Several high- to low-priority job ratio • Several system load • Differential Approximation and Sprinting • Latency gain • Energy gain Evaluation
  • 39. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 • Tail and average latency of low priority job decreases • Average latency of high priority job increases 17 1 10 100 1000 P NP DA 0/10 DA 0/20 -80 -60 -40 -20 0 20 40 60 80 Responsetime[s] Difference[%] High Low Evaluation: Differential Approximation
  • 40. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 • Tail and average latency of low priority job decreases • Average latency of high priority job increases 17 1 10 100 1000 P NP DA 0/10 DA 0/20 -80 -60 -40 -20 0 20 40 60 80 Responsetime[s] Difference[%] High Low Evaluation: Differential Approximation Preemptive
  • 41. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 • Tail and average latency of low priority job decreases • Average latency of high priority job increases 17 1 10 100 1000 P NP DA 0/10 DA 0/20 -80 -60 -40 -20 0 20 40 60 80 Responsetime[s] Difference[%] High Low Evaluation: Differential Approximation Preemptive Non-Preemptive
  • 42. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 • Tail and average latency of low priority job decreases • Average latency of high priority job increases 17 1 10 100 1000 P NP DA 0/10 DA 0/20 -80 -60 -40 -20 0 20 40 60 80 Responsetime[s] Difference[%] High Low Evaluation: Differential Approximation Preemptive Non-Preemptive Differential Approximation x/y: drop ratio on high- (x) and low-priority (y)
  • 43. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 18 1 10 100 1000 10000 P DiAS 0/10 DiAS 0/20 -80 -60 -40 -20 0 20 40 60 80 Responsetime[s] Difference[%] High Low • Tail and average latency of low priority job decreases even more (up to 90%) • Average latency of high priority also decreases Evaluation: Differential Approximation and Sprinting
  • 44. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 • Reduction of more than 20% on energy consumption 19 1 10 100 1000 10000 P DiAS 0/10 DiAS 0/20 -80 -60 -40 -20 0 20 40 60 80 Energy[kJ] Difference[%] Unimited Limited Evaluation: Differential Approximation and Sprinting
  • 45. Isabelly Rocha Differential Approximation and Sprinting for Multi-Priority Big Data Engines | Middleware 2019 20 • Main goal: reduce resource waste • Strategy: • Drop job eviction • Deflate low-priority jobs • Sprint high-priority jobs • Additional gains: • Up to 90% latency reduction • More than 20% energy savings • Implemented in Golang on top of Spark Summary and Takeaways