SlideShare une entreprise Scribd logo
1  sur  51
Energy Efficiency in Large Scale Systems Gaurav Dhiman, Raid Ayoub Prof. Tajana ŠimunićRosing Dept. of Computer Science
Large scale systems: Clusters Power consumption is a critical  	design parameter: Operational costs ,[object Object]
CoolingBy 2010, US electricity bill for powering and cooling data centers ~$7B[1] Electricity input to data centers in the US exceeds electricity consumption of Italy! [1]: Meisner et al, ASPLOS 2008 2
Energy Savings with DVFS Reduction in CPU power Extra system power
Effectiveness of DVFS For energy savings ER > EE Factors in modern systems affecting this equation: Performance delay (tdelay) Idle CPU power consumption (PE) Power consumption of other devices (PE)
Performance Delay Lower tdelay=> higher energy savings Depends on memory/CPU intensiveness Experiments with SPEC CPU2000 mcf: highly memory intensive Expect low tdelay sixtrack: highly cache/CPU intensive Expect high tdelay Two state of the art processors AMD quad core Opteron On die memory controller (2.6GHz), DDR3 Intel quad core Xeon Off chip memory controller (1.3GHz), DDR2
Performance Delay mcf much closer to best case on Xeon mcf much closer to worst case on AMD Due to on die memory controller and fast DDR3 memory Due to slower memory controller and memory
Idle CPU power consumption Low power idle CPU states common now C1 state used be default Zero dynamic power consumption Support for deeper C-states appearing C6 on Nehalem Zero dynamic+leakage power ,[object Object]
Lower DVFS benefits,[object Object]
Lower DVFS benefits for memory intensive benchmarks,[object Object]
Methodology Run SPEC CPU2000 benchmarks at all v-f settings Estimate savings baselined against system with PM-(1:3) policies ,[object Object]
DVFS beneficial if:
%EsavingsPM-i > 0,[object Object]
On die memory controller,[object Object]
High perf delay ,[object Object]
Avg 7% savings
Avg 200% delay ,[object Object]
 Lower system idle power consumption,[object Object]
Server Power Breakdown
Energy Proportional Computing “The Case for  Energy-Proportional  Computing,” Luiz André Barroso, Urs Hölzle, IEEE Computer December 2007  Doing nothing well …NOT! Energy Efficiency = Utilization/Power Figure 2. Server power usage and energy efficiency at varying utilization levels, from idle to  peak performance. Even an energy-efficient server still consumes about half its full power when doing virtually no work. 17
Energy Proportional Computing “The Case for  Energy-Proportional  Computing,” Luiz André Barroso, Urs Hölzle, IEEE Computer December 2007  It is surprisingly hardto achieve high levelsof utilization of typical servers (and your homePC or laptop is even worse) Figure 1. Average CPU utilization of more than 5,000 servers during a six-month period. Servers  are rarely completely idle and seldom operate near their maximum utilization, instead operating  most of the time at between 10 and 50 percent of their maximum 18
Energy Proportional Computing “The Case for  Energy-Proportional  Computing,” Luiz André Barroso, Urs Hölzle, IEEE Computer December 2007  Doing nothing  VERY well Design for  wide dynamic  power range and  active low power modes Energy Efficiency = Utilization/Power Figure 4. Power usage and energy efficiency in a more energy-proportional server. This  server has a power efficiency of more than 80 percent of its peak value for utilizations of  30 percent and above, with efficiency remaining above 50 percent for utilization levels as  low as 10 percent. 19
Why not consolidate servers? Security Isolation Must use the same OS Solution: Use virtualization!
Virtualization Benefits: ,[object Object]
Different OS in each VM
Better resource utilization21
Virtualization Benefits: ,[object Object]
Dynamic load management
Energy savings through VM consolidation!22
How to Save Energy? VM consolidation is a common practice: Increases resource utilization Turn idle machines into sleep mode Active machines? Active power management: e.g. DVFS less effective in newer line of server processors  Leakage, faster memories, low voltage range Make the workload run faster Similar average power across machines Exploit workload characteristics to share resources efficiently 23
Motivation: Workload Characterization VM1 VM2 PM1 mcf 60% PM2 eon 24
Motivation: Workload Characterization 50W Workload characteristics determine: Power/performance profile Power distribution Co-schedule/consolidate heterogeneous VMs 25
Motivation: Workload Characterization Co-schedule/consolidate heterogeneous VMs 26
What about DVFS? 80% 40% 9% Poor performance  Energy inefficient Only good if homogeneously high MPC workload 27
vGreen A system for VM scheduling across a cluster of physical machines Dynamic VM characterization: ,[object Object]
Instruction throughput
CPU utilizationCo-schedule VMs with heterogeneous characteristics for better:  Performance Energy efficiency Balanced thermal profile 28
Scheduling with VMs VM1 VM2 VM1 Dom0 VM2 Xen Scheduler ,[object Object]
Management
I/O
VM Creation:
Specify CPU, memory, I/O config
CPU of VM referred to as VCPU:
Fundamental unit of executionVCPU2 VCPU1 VCPU2 VCPU1 ,[object Object]
Xen schedules VCPUs across PCPUs29
vGreen Architecture Main Components: ,[object Object]
vgxen: Characterizes the running VMs
vgdom: Exports information to vgservvgserv vgpolicy Updates Updates Commands ,[object Object]
Collects and analyzes the characterization information
Issues scheduling commands based on balancing policyvgdom vgdom VM1 Dom0 VM2 VM1 Dom0 VM2 Xen vgxen Xen vgxen vgnode1 vgnode2 30
vgnode (client physical machine) vgdom vgxen: characterizes the VMs ,[object Object]

Contenu connexe

Tendances

DigSILENT PF - 00 stability fundamentals
DigSILENT PF - 00 stability fundamentalsDigSILENT PF - 00 stability fundamentals
DigSILENT PF - 00 stability fundamentalsHimmelstern
 
Data Center Cooling, Critical Facility and Infrastructure Optimization
Data Center Cooling, Critical Facility and Infrastructure OptimizationData Center Cooling, Critical Facility and Infrastructure Optimization
Data Center Cooling, Critical Facility and Infrastructure OptimizationGreg Stover
 
Data center power infrastructure
Data center power infrastructureData center power infrastructure
Data center power infrastructureLivin Jose
 
Eaton Enterprise Data Centers Sep 2010
Eaton Enterprise Data Centers Sep 2010Eaton Enterprise Data Centers Sep 2010
Eaton Enterprise Data Centers Sep 2010uhlmanken
 
White Paper: Monitoring EMC Greenplum DCA with Nagios - EMC Greenplum Data Co...
White Paper: Monitoring EMC Greenplum DCA with Nagios - EMC Greenplum Data Co...White Paper: Monitoring EMC Greenplum DCA with Nagios - EMC Greenplum Data Co...
White Paper: Monitoring EMC Greenplum DCA with Nagios - EMC Greenplum Data Co...EMC
 
Calculating total power
Calculating total powerCalculating total power
Calculating total powerjeet69
 
D1.2 analysis and selection of low power techniques, services and patterns
D1.2 analysis and selection of low power techniques, services and patternsD1.2 analysis and selection of low power techniques, services and patterns
D1.2 analysis and selection of low power techniques, services and patternsBabak Sorkhpour
 
Frequency regulation of deregulated power system
Frequency regulation of deregulated power systemFrequency regulation of deregulated power system
Frequency regulation of deregulated power systemeSAT Publishing House
 
CA Nimsoft ecoMeter
CA Nimsoft ecoMeterCA Nimsoft ecoMeter
CA Nimsoft ecoMeterCA Nimsoft
 

Tendances (12)

DigSILENT PF - 00 stability fundamentals
DigSILENT PF - 00 stability fundamentalsDigSILENT PF - 00 stability fundamentals
DigSILENT PF - 00 stability fundamentals
 
E3 s binghamton
E3 s binghamtonE3 s binghamton
E3 s binghamton
 
Data Center Cooling, Critical Facility and Infrastructure Optimization
Data Center Cooling, Critical Facility and Infrastructure OptimizationData Center Cooling, Critical Facility and Infrastructure Optimization
Data Center Cooling, Critical Facility and Infrastructure Optimization
 
Data center power infrastructure
Data center power infrastructureData center power infrastructure
Data center power infrastructure
 
Eaton Enterprise Data Centers Sep 2010
Eaton Enterprise Data Centers Sep 2010Eaton Enterprise Data Centers Sep 2010
Eaton Enterprise Data Centers Sep 2010
 
hp power capping and hp dynamic
hp power capping and hp dynamic hp power capping and hp dynamic
hp power capping and hp dynamic
 
White Paper: Monitoring EMC Greenplum DCA with Nagios - EMC Greenplum Data Co...
White Paper: Monitoring EMC Greenplum DCA with Nagios - EMC Greenplum Data Co...White Paper: Monitoring EMC Greenplum DCA with Nagios - EMC Greenplum Data Co...
White Paper: Monitoring EMC Greenplum DCA with Nagios - EMC Greenplum Data Co...
 
Calculating total power
Calculating total powerCalculating total power
Calculating total power
 
D1.2 analysis and selection of low power techniques, services and patterns
D1.2 analysis and selection of low power techniques, services and patternsD1.2 analysis and selection of low power techniques, services and patterns
D1.2 analysis and selection of low power techniques, services and patterns
 
Frequency regulation of deregulated power system
Frequency regulation of deregulated power systemFrequency regulation of deregulated power system
Frequency regulation of deregulated power system
 
CA Nimsoft ecoMeter
CA Nimsoft ecoMeterCA Nimsoft ecoMeter
CA Nimsoft ecoMeter
 
Stmm white paper
Stmm white paperStmm white paper
Stmm white paper
 

En vedette

Deployability
DeployabilityDeployability
DeployabilityLen Bass
 
Architecting for the cloud elasticity security
Architecting for the cloud elasticity securityArchitecting for the cloud elasticity security
Architecting for the cloud elasticity securityLen Bass
 
Acm Tech Talk - Decomposition Paradigms for Large Scale Systems
Acm Tech Talk - Decomposition Paradigms for Large Scale SystemsAcm Tech Talk - Decomposition Paradigms for Large Scale Systems
Acm Tech Talk - Decomposition Paradigms for Large Scale SystemsVinayak Hegde
 
NEW ALGORITHMS FOR SECURE OUTSOURCING OF LARGE-SCALE SYSTEMS OF LINEAR EQUAT...
 NEW ALGORITHMS FOR SECURE OUTSOURCING OF LARGE-SCALE SYSTEMS OF LINEAR EQUAT... NEW ALGORITHMS FOR SECURE OUTSOURCING OF LARGE-SCALE SYSTEMS OF LINEAR EQUAT...
NEW ALGORITHMS FOR SECURE OUTSOURCING OF LARGE-SCALE SYSTEMS OF LINEAR EQUAT...Nexgen Technology
 
Designing Ultra Large Scale Systems List
Designing Ultra Large Scale Systems ListDesigning Ultra Large Scale Systems List
Designing Ultra Large Scale Systems ListCrafitti Consulting
 
Architectural Tactics for Large Scale Systems
Architectural Tactics for Large Scale SystemsArchitectural Tactics for Large Scale Systems
Architectural Tactics for Large Scale SystemsLen Bass
 
Principles of microservices velocity
Principles of microservices   velocityPrinciples of microservices   velocity
Principles of microservices velocitySam Newman
 

En vedette (7)

Deployability
DeployabilityDeployability
Deployability
 
Architecting for the cloud elasticity security
Architecting for the cloud elasticity securityArchitecting for the cloud elasticity security
Architecting for the cloud elasticity security
 
Acm Tech Talk - Decomposition Paradigms for Large Scale Systems
Acm Tech Talk - Decomposition Paradigms for Large Scale SystemsAcm Tech Talk - Decomposition Paradigms for Large Scale Systems
Acm Tech Talk - Decomposition Paradigms for Large Scale Systems
 
NEW ALGORITHMS FOR SECURE OUTSOURCING OF LARGE-SCALE SYSTEMS OF LINEAR EQUAT...
 NEW ALGORITHMS FOR SECURE OUTSOURCING OF LARGE-SCALE SYSTEMS OF LINEAR EQUAT... NEW ALGORITHMS FOR SECURE OUTSOURCING OF LARGE-SCALE SYSTEMS OF LINEAR EQUAT...
NEW ALGORITHMS FOR SECURE OUTSOURCING OF LARGE-SCALE SYSTEMS OF LINEAR EQUAT...
 
Designing Ultra Large Scale Systems List
Designing Ultra Large Scale Systems ListDesigning Ultra Large Scale Systems List
Designing Ultra Large Scale Systems List
 
Architectural Tactics for Large Scale Systems
Architectural Tactics for Large Scale SystemsArchitectural Tactics for Large Scale Systems
Architectural Tactics for Large Scale Systems
 
Principles of microservices velocity
Principles of microservices   velocityPrinciples of microservices   velocity
Principles of microservices velocity
 

Similaire à Energy Efficiency in Large Scale Systems

CNR @ VMUG.IT 20150304
CNR @ VMUG.IT 20150304CNR @ VMUG.IT 20150304
CNR @ VMUG.IT 20150304VMUG IT
 
Windows server power_efficiency___robben_and_worthington__final
Windows server power_efficiency___robben_and_worthington__finalWindows server power_efficiency___robben_and_worthington__final
Windows server power_efficiency___robben_and_worthington__finalBruce Worthington
 
Parallel and Distributed Computing Chapter 9
Parallel and Distributed Computing Chapter 9Parallel and Distributed Computing Chapter 9
Parallel and Distributed Computing Chapter 9AbdullahMunir32
 
Runtime Methods to Improve Energy Efficiency in HPC Applications
Runtime Methods to Improve Energy Efficiency in HPC ApplicationsRuntime Methods to Improve Energy Efficiency in HPC Applications
Runtime Methods to Improve Energy Efficiency in HPC ApplicationsFacultad de Informática UCM
 
A Study on Task Scheduling in Could Data Centers for Energy Efficacy
A Study on Task Scheduling in Could Data Centers for Energy Efficacy A Study on Task Scheduling in Could Data Centers for Energy Efficacy
A Study on Task Scheduling in Could Data Centers for Energy Efficacy Ehsan Sharifi
 
DataCenter:: Infrastructure Presentation
DataCenter:: Infrastructure PresentationDataCenter:: Infrastructure Presentation
DataCenter:: Infrastructure PresentationMuhammad Asad Rashid
 
E03403027030
E03403027030E03403027030
E03403027030theijes
 
MRI Energy-Efficient Cloud Computing
MRI Energy-Efficient Cloud ComputingMRI Energy-Efficient Cloud Computing
MRI Energy-Efficient Cloud ComputingRoger Rafanell Mas
 
KVM Tuning @ eBay
KVM Tuning @ eBayKVM Tuning @ eBay
KVM Tuning @ eBayXu Jiang
 
Mobile computing edited
Mobile computing editedMobile computing edited
Mobile computing editedm_hughes
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuninginside-BigData.com
 
Run-time power management in cloud and containerized environments
Run-time power management in cloud and containerized environmentsRun-time power management in cloud and containerized environments
Run-time power management in cloud and containerized environmentsNECST Lab @ Politecnico di Milano
 
Probabilistic consolidation of virtual machines in self organizing cloud data...
Probabilistic consolidation of virtual machines in self organizing cloud data...Probabilistic consolidation of virtual machines in self organizing cloud data...
Probabilistic consolidation of virtual machines in self organizing cloud data...Papitha Velumani
 
Windows Server 2008 R2 Hyper V
Windows Server 2008 R2 Hyper VWindows Server 2008 R2 Hyper V
Windows Server 2008 R2 Hyper VAmit Gatenyo
 

Similaire à Energy Efficiency in Large Scale Systems (20)

CNR @ VMUG.IT 20150304
CNR @ VMUG.IT 20150304CNR @ VMUG.IT 20150304
CNR @ VMUG.IT 20150304
 
Windows server power_efficiency___robben_and_worthington__final
Windows server power_efficiency___robben_and_worthington__finalWindows server power_efficiency___robben_and_worthington__final
Windows server power_efficiency___robben_and_worthington__final
 
Parallel and Distributed Computing Chapter 9
Parallel and Distributed Computing Chapter 9Parallel and Distributed Computing Chapter 9
Parallel and Distributed Computing Chapter 9
 
Hitec Brochure QPS English_LR
Hitec Brochure QPS English_LRHitec Brochure QPS English_LR
Hitec Brochure QPS English_LR
 
Emerson Energy Logic
Emerson Energy LogicEmerson Energy Logic
Emerson Energy Logic
 
Runtime Methods to Improve Energy Efficiency in HPC Applications
Runtime Methods to Improve Energy Efficiency in HPC ApplicationsRuntime Methods to Improve Energy Efficiency in HPC Applications
Runtime Methods to Improve Energy Efficiency in HPC Applications
 
A Study on Task Scheduling in Could Data Centers for Energy Efficacy
A Study on Task Scheduling in Could Data Centers for Energy Efficacy A Study on Task Scheduling in Could Data Centers for Energy Efficacy
A Study on Task Scheduling in Could Data Centers for Energy Efficacy
 
Virtualización para la Eficiencia
Virtualización para la EficienciaVirtualización para la Eficiencia
Virtualización para la Eficiencia
 
DataCenter:: Infrastructure Presentation
DataCenter:: Infrastructure PresentationDataCenter:: Infrastructure Presentation
DataCenter:: Infrastructure Presentation
 
E03403027030
E03403027030E03403027030
E03403027030
 
MRI Energy-Efficient Cloud Computing
MRI Energy-Efficient Cloud ComputingMRI Energy-Efficient Cloud Computing
MRI Energy-Efficient Cloud Computing
 
Green It
Green ItGreen It
Green It
 
KVM Tuning @ eBay
KVM Tuning @ eBayKVM Tuning @ eBay
KVM Tuning @ eBay
 
Mobile computing edited
Mobile computing editedMobile computing edited
Mobile computing edited
 
Energy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic TuningEnergy Efficient Computing using Dynamic Tuning
Energy Efficient Computing using Dynamic Tuning
 
GUI overhead
GUI overheadGUI overhead
GUI overhead
 
Run-time power management in cloud and containerized environments
Run-time power management in cloud and containerized environmentsRun-time power management in cloud and containerized environments
Run-time power management in cloud and containerized environments
 
Probabilistic consolidation of virtual machines in self organizing cloud data...
Probabilistic consolidation of virtual machines in self organizing cloud data...Probabilistic consolidation of virtual machines in self organizing cloud data...
Probabilistic consolidation of virtual machines in self organizing cloud data...
 
Build Energy Saving into Your Datacenter
Build Energy Saving into Your DatacenterBuild Energy Saving into Your Datacenter
Build Energy Saving into Your Datacenter
 
Windows Server 2008 R2 Hyper V
Windows Server 2008 R2 Hyper VWindows Server 2008 R2 Hyper V
Windows Server 2008 R2 Hyper V
 

Plus de Jerry Sheehan

Educause-Presidential/CIO
Educause-Presidential/CIOEducause-Presidential/CIO
Educause-Presidential/CIOJerry Sheehan
 
Montana State, Research Networking and the Outcomes from the First National R...
Montana State, Research Networking and the Outcomes from the First National R...Montana State, Research Networking and the Outcomes from the First National R...
Montana State, Research Networking and the Outcomes from the First National R...Jerry Sheehan
 
Technology, Complexity, and Change: The Creative Frictions of Today
Technology, Complexity, and Change:  The Creative Frictions of TodayTechnology, Complexity, and Change:  The Creative Frictions of Today
Technology, Complexity, and Change: The Creative Frictions of TodayJerry Sheehan
 
Supporting the National Research Platform with a Lean Cyberinfrastructure (CI...
Supporting the National Research Platform with a Lean Cyberinfrastructure (CI...Supporting the National Research Platform with a Lean Cyberinfrastructure (CI...
Supporting the National Research Platform with a Lean Cyberinfrastructure (CI...Jerry Sheehan
 
Best Practices, Cyberinfrastructure from Scratch
Best Practices, Cyberinfrastructure from ScratchBest Practices, Cyberinfrastructure from Scratch
Best Practices, Cyberinfrastructure from ScratchJerry Sheehan
 
DC Modular Datacenter for Improved Energy Efficiency
DC Modular Datacenter for Improved Energy EfficiencyDC Modular Datacenter for Improved Energy Efficiency
DC Modular Datacenter for Improved Energy EfficiencyJerry Sheehan
 
GreenLight Data Collection Architecture
GreenLight Data Collection ArchitectureGreenLight Data Collection Architecture
GreenLight Data Collection ArchitectureJerry Sheehan
 
GreenLight CENIC Award
GreenLight CENIC AwardGreenLight CENIC Award
GreenLight CENIC AwardJerry Sheehan
 
Cine grid exchange@cenic2010-5
Cine grid exchange@cenic2010-5Cine grid exchange@cenic2010-5
Cine grid exchange@cenic2010-5Jerry Sheehan
 
GreenLight: GLIF 2009
GreenLight:  GLIF 2009GreenLight:  GLIF 2009
GreenLight: GLIF 2009Jerry Sheehan
 
GreenLight Overview for Minority Serving Institute Workshop
GreenLight Overview for Minority Serving Institute WorkshopGreenLight Overview for Minority Serving Institute Workshop
GreenLight Overview for Minority Serving Institute WorkshopJerry Sheehan
 
GreenLight Project Overview
GreenLight Project OverviewGreenLight Project Overview
GreenLight Project OverviewJerry Sheehan
 

Plus de Jerry Sheehan (14)

Cisco educause
Cisco educauseCisco educause
Cisco educause
 
Educause-Presidential/CIO
Educause-Presidential/CIOEducause-Presidential/CIO
Educause-Presidential/CIO
 
Montana State, Research Networking and the Outcomes from the First National R...
Montana State, Research Networking and the Outcomes from the First National R...Montana State, Research Networking and the Outcomes from the First National R...
Montana State, Research Networking and the Outcomes from the First National R...
 
Technology, Complexity, and Change: The Creative Frictions of Today
Technology, Complexity, and Change:  The Creative Frictions of TodayTechnology, Complexity, and Change:  The Creative Frictions of Today
Technology, Complexity, and Change: The Creative Frictions of Today
 
Supporting the National Research Platform with a Lean Cyberinfrastructure (CI...
Supporting the National Research Platform with a Lean Cyberinfrastructure (CI...Supporting the National Research Platform with a Lean Cyberinfrastructure (CI...
Supporting the National Research Platform with a Lean Cyberinfrastructure (CI...
 
Best Practices, Cyberinfrastructure from Scratch
Best Practices, Cyberinfrastructure from ScratchBest Practices, Cyberinfrastructure from Scratch
Best Practices, Cyberinfrastructure from Scratch
 
Project GreenLight
Project GreenLightProject GreenLight
Project GreenLight
 
DC Modular Datacenter for Improved Energy Efficiency
DC Modular Datacenter for Improved Energy EfficiencyDC Modular Datacenter for Improved Energy Efficiency
DC Modular Datacenter for Improved Energy Efficiency
 
GreenLight Data Collection Architecture
GreenLight Data Collection ArchitectureGreenLight Data Collection Architecture
GreenLight Data Collection Architecture
 
GreenLight CENIC Award
GreenLight CENIC AwardGreenLight CENIC Award
GreenLight CENIC Award
 
Cine grid exchange@cenic2010-5
Cine grid exchange@cenic2010-5Cine grid exchange@cenic2010-5
Cine grid exchange@cenic2010-5
 
GreenLight: GLIF 2009
GreenLight:  GLIF 2009GreenLight:  GLIF 2009
GreenLight: GLIF 2009
 
GreenLight Overview for Minority Serving Institute Workshop
GreenLight Overview for Minority Serving Institute WorkshopGreenLight Overview for Minority Serving Institute Workshop
GreenLight Overview for Minority Serving Institute Workshop
 
GreenLight Project Overview
GreenLight Project OverviewGreenLight Project Overview
GreenLight Project Overview
 

Dernier

Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsRavi Sanghani
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesThousandEyes
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructureitnewsafrica
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfNeo4j
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Kaya Weers
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Nikki Chapple
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxfnnc6jmgwh
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024Lonnie McRorey
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesKari Kakkonen
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity PlanDatabarracks
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Mark Goldstein
 

Dernier (20)

Potential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and InsightsPotential of AI (Generative AI) in Business: Learnings and Insights
Potential of AI (Generative AI) in Business: Learnings and Insights
 
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyesHow to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
How to Effectively Monitor SD-WAN and SASE Environments with ThousandEyes
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical InfrastructureVarsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
Varsha Sewlal- Cyber Attacks on Critical Critical Infrastructure
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Connecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdfConnecting the Dots for Information Discovery.pdf
Connecting the Dots for Information Discovery.pdf
 
Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)Design pattern talk by Kaya Weers - 2024 (v2)
Design pattern talk by Kaya Weers - 2024 (v2)
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
Microsoft 365 Copilot: How to boost your productivity with AI – Part one: Ado...
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptxGenerative AI - Gitex v1Generative AI - Gitex v1.pptx
Generative AI - Gitex v1Generative AI - Gitex v1.pptx
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024TeamStation AI System Report LATAM IT Salaries 2024
TeamStation AI System Report LATAM IT Salaries 2024
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
Testing tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examplesTesting tools and AI - ideas what to try with some tool examples
Testing tools and AI - ideas what to try with some tool examples
 
How to write a Business Continuity Plan
How to write a Business Continuity PlanHow to write a Business Continuity Plan
How to write a Business Continuity Plan
 
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
Arizona Broadband Policy Past, Present, and Future Presentation 3/25/24
 

Energy Efficiency in Large Scale Systems

  • 1. Energy Efficiency in Large Scale Systems Gaurav Dhiman, Raid Ayoub Prof. Tajana ŠimunićRosing Dept. of Computer Science
  • 2.
  • 3. CoolingBy 2010, US electricity bill for powering and cooling data centers ~$7B[1] Electricity input to data centers in the US exceeds electricity consumption of Italy! [1]: Meisner et al, ASPLOS 2008 2
  • 4. Energy Savings with DVFS Reduction in CPU power Extra system power
  • 5. Effectiveness of DVFS For energy savings ER > EE Factors in modern systems affecting this equation: Performance delay (tdelay) Idle CPU power consumption (PE) Power consumption of other devices (PE)
  • 6. Performance Delay Lower tdelay=> higher energy savings Depends on memory/CPU intensiveness Experiments with SPEC CPU2000 mcf: highly memory intensive Expect low tdelay sixtrack: highly cache/CPU intensive Expect high tdelay Two state of the art processors AMD quad core Opteron On die memory controller (2.6GHz), DDR3 Intel quad core Xeon Off chip memory controller (1.3GHz), DDR2
  • 7. Performance Delay mcf much closer to best case on Xeon mcf much closer to worst case on AMD Due to on die memory controller and fast DDR3 memory Due to slower memory controller and memory
  • 8.
  • 9.
  • 10.
  • 11.
  • 13.
  • 14.
  • 15.
  • 17.
  • 18.
  • 20. Energy Proportional Computing “The Case for Energy-Proportional Computing,” Luiz André Barroso, Urs Hölzle, IEEE Computer December 2007 Doing nothing well …NOT! Energy Efficiency = Utilization/Power Figure 2. Server power usage and energy efficiency at varying utilization levels, from idle to peak performance. Even an energy-efficient server still consumes about half its full power when doing virtually no work. 17
  • 21. Energy Proportional Computing “The Case for Energy-Proportional Computing,” Luiz André Barroso, Urs Hölzle, IEEE Computer December 2007 It is surprisingly hardto achieve high levelsof utilization of typical servers (and your homePC or laptop is even worse) Figure 1. Average CPU utilization of more than 5,000 servers during a six-month period. Servers are rarely completely idle and seldom operate near their maximum utilization, instead operating most of the time at between 10 and 50 percent of their maximum 18
  • 22. Energy Proportional Computing “The Case for Energy-Proportional Computing,” Luiz André Barroso, Urs Hölzle, IEEE Computer December 2007 Doing nothing VERY well Design for wide dynamic power range and active low power modes Energy Efficiency = Utilization/Power Figure 4. Power usage and energy efficiency in a more energy-proportional server. This server has a power efficiency of more than 80 percent of its peak value for utilizations of 30 percent and above, with efficiency remaining above 50 percent for utilization levels as low as 10 percent. 19
  • 23. Why not consolidate servers? Security Isolation Must use the same OS Solution: Use virtualization!
  • 24.
  • 25. Different OS in each VM
  • 27.
  • 29. Energy savings through VM consolidation!22
  • 30. How to Save Energy? VM consolidation is a common practice: Increases resource utilization Turn idle machines into sleep mode Active machines? Active power management: e.g. DVFS less effective in newer line of server processors Leakage, faster memories, low voltage range Make the workload run faster Similar average power across machines Exploit workload characteristics to share resources efficiently 23
  • 31. Motivation: Workload Characterization VM1 VM2 PM1 mcf 60% PM2 eon 24
  • 32. Motivation: Workload Characterization 50W Workload characteristics determine: Power/performance profile Power distribution Co-schedule/consolidate heterogeneous VMs 25
  • 33. Motivation: Workload Characterization Co-schedule/consolidate heterogeneous VMs 26
  • 34. What about DVFS? 80% 40% 9% Poor performance Energy inefficient Only good if homogeneously high MPC workload 27
  • 35.
  • 37. CPU utilizationCo-schedule VMs with heterogeneous characteristics for better: Performance Energy efficiency Balanced thermal profile 28
  • 38.
  • 40. I/O
  • 42. Specify CPU, memory, I/O config
  • 43. CPU of VM referred to as VCPU:
  • 44.
  • 45. Xen schedules VCPUs across PCPUs29
  • 46.
  • 48.
  • 49. Collects and analyzes the characterization information
  • 50. Issues scheduling commands based on balancing policyvgdom vgdom VM1 Dom0 VM2 VM1 Dom0 VM2 Xen vgxen Xen vgxen vgnode1 vgnode2 30
  • 51.
  • 53.
  • 55. Exports to vgservwMPC wIPC util wMPC wIPC util wMPC wIPC util wMPC wIPC util 31 VCPU1 VCPU2 VCPU1 VCPU2
  • 56. Hierarchical Workload Characterization nMPC nIPC nutil Node Level Metrics (maintained by vgpolicy) VGNODE VM Level Metrics (maintained by vgpolicy and vgxen) vMPC vIPC vutil vMPC vIPC vutil VM1 VM2 VCPU Level Metrics (maintained by vgxen) wMPC wIPC util wMPC wIPC util wMPC wIPC util wMPC wIPC util 32 VCPU1 VCPU2 VCPU1 VCPU2
  • 57.
  • 58. MPC: performance and energy efficiency
  • 59.
  • 60.
  • 61. Migrate if it does not reverse imbalanceVM1 VM2 VM2 VM1 nMPC > nMPCth vgnodenMPCmin 34
  • 62. Implementation Xen 3.3.1 as the hypervisor vgxen implemented as part of the stock Xen credit scheduler vgdom implemented as a driver and application in Domain0 Communicates with vgxen through a shared page No modifications required to the guest OS! Used a testbed of Dual Intel Quad core Xeon based machines as vgnodes Linux based desktop used as vgserv vgdom VM1 Dom0 VM2 Xen vgxen 35
  • 63.
  • 64. Compare against ‘E+’: Eucalyptus + state of the art dynamic VM scheduling algorithms
  • 65. Perform VM consolidation based on CPU utilization36
  • 66. Weighted Speedup vs E+ Average 40% Weighted Speedup 20% speedup on average 37
  • 67. Energy Savings vs E+ Average 35% Energy Savings 38
  • 68. Balanced Thermal Profile Average power variance reduction of 30W 39
  • 69.
  • 70. How to minimize the cooling costs within a single machine?
  • 71. How to further reduce the cooling costs by creating a better temperature distribution across the physical machines1U server CPU CPU Fan subsystem
  • 72.
  • 73.
  • 74.
  • 75.
  • 77. Provides better stabilityReactive approach  Lowers cooling savings  Cannot minimize the noise level Impacts fan stability Challenge: Design of efficient proactive dynamic cooling aware workload management technique
  • 78.
  • 79. Migrate some of the active threads from the sockets with high fan speed to sockets with lower speed
  • 80. Swap some of the hot threads from sockets with high fan speed with colder threads from sockets with lower speed.VPW VPA VPA VPC VPw VPY VPC VPY VPX VPB VPD VPZ VPZ VPB VPX VPD High speed Low speed Moderate speed Moderate speed
  • 81.
  • 82. If Fan speedM≥Fan speedN, we can swap the hot thread from socket N with colder threads from socket MVPA VPW VPW VPY VPA VPC VPC VPY PW ≤ PC+PD VPB VPD VPD VPX VPB VPX Moderate speed Moderate speed Moderate speed Low speed 46 46
  • 83.
  • 84. VM management at the physical machine level
  • 85. VP management at the CPU socket levelVM migration Thread migration VP1 VP2 VP3 VP3 VP5 VP4 VP6 VP4 VP1 VP3 VP2 VP2 VP4 VP6 VP5 VP4 Low speed Moderate speed High speed Moderate speed Server i Server j 47
  • 86.
  • 87. VMs at the machine level
  • 88. VPs at the socket level
  • 90. Period ~ minutes @ the VM level
  • 91. Period ~ seconds @ the VP level Mark if savings exist Traverse VMs/VPs Schedule Evaluate Consolidation Savings Mark if savings exist 48
  • 92.
  • 93.
  • 94. Dynamic load balancing minimizes the differences in task queues across various levels49 K. Skadron, et al. Temperature-aware microarchitecture, ISCA 2003.
  • 95.

Notes de l'éditeur

  1. This figure shows a typical fan controller that is based on a classical close-loop approach. The fan controller decides the required fan speed. The output of the controller is fed to the actuator to actually adjust the fan speed. The feedback is collected using thermal sensors (each CPU core has a dedicated thermal sensor) where the fan speed is in proportional to the highest temperature <click> The cooling optimizations techniques up until now focused mainly on the fan controller without including workload management which we show later that including workload management can results in a big cooling savings
  2. Current load balancing do not consider cooling costs <click> Read the example to the audience (stop when you reach the equation) [The figure is the visual representation of the example]. In this figure we show a case of dual sockets (each socket has 4 cores where each runs 1 (thr=workload thread or job)<click> Thermal imbalance leads to cooling inefficiencies due to “cubic relation between fan speed and power” <click> This indicate that better workload assignment can improve the thermal distribution and lower cooling cost. The question is HOW and WHEN to schedule the workload
  3. We utilize the freedom in migrating the workload around to perform cooling aware workload scheduling to minimize the cooling costs<click> The good news is that the migration overhead of the threads between sockets is minor since the temperature change is quite slow (order of sec) compared to the migration time (order of micro sec)<click> In this example we show a case of thermal imbalance between two sockets (one fan run at high speed while the other at low speed)<click> The challenge is which threads to migrate to get a better thermal and cooling balance. Then read the second bullet in the yellow box
  4. The question that we need to answer is “when we should trigger the workload rescheduling”One way is to employ a reactive approach that acts when the system is in cooling inefficiency condition. The problem with this approach is that mitigating the inefficiencies require time (temperature changes slowly) which impacts the cooling savings, noise and may generate instability in the fan system <click> The alternative way is to use proactive researching that predict then avoid cooling inefficiencies at earlier point in time and reschedule accordingly. Read quickly the benefits in the green box<click> Read the challenge sentence
  5. In this slide and the following one we illustrate the fundamental ways to deliver cooling savings: This slide explains “spreading the hot threads” concept to obtain cooling savings through creating a better temperature distribution across the CPU sockets. This technique needs to be applied when there is an imbalance in the heat sink temperature across the CPU sockets. To implement job spreading we can employ either job migration or swapping (read the two bullets briefly). <click> The example in the bottom clearly shows how spreading works. In the left side we have a case of big imbalance. To solve the imbalance we swap the hot threads (C,D) with the colder ones (W,X). The two fans now run at a moderate speed (savings is expected due to the cubic relation between fan power and speed)
  6. Here we illustrate the second way to obtain cooling savings. The motivation is to concentrate more hot threads into fewer sockets while keeping their fan speed in almost the same. We apply this method when the average temperature across sockets is in similar range (it should be noted that consolidation is not opposite to the spreading but it can be applies on top of it)Consolidation can be implemented in two ways:Squeezing more hot jobs to the fan that is running more that what it should be (fan speeds is discrete, e.g 8 or 16 speeds)<click> The other way is to trade a (hot thread) from the socket that have lower fan with (colder threads but have similar total power) from the socket with higher fan speed to maintain temperature balance. This help lowering the fan speed of the socket that receives the cold threads while keeping the higher fan speed almost the same. The example below illustrate this case