SlideShare une entreprise Scribd logo
1  sur  4
Analysis of Parallel Algorithms for Energy
Conservation in Scalable Multicore Architecture
Problem statement
An important concern for computing, as for society, is conserving energy. Moreover, energy
conservation is critical in mobile devices for practical reasons. The relation between parallel
applications and their energy requirements on scalable multicore processors are examined. It
believe this sort of analysis can provide programmers with intuitions about the energy required by
the parallel algorithms they are using–thus guiding the choice of algorithm, architecture and the
number of cores used for a particular application. Researchers have studied performance scalability
of parallel algorithms for some time [10]. However, it is important to note that performance
scalability of a parallel algorithm is not the same as its energy scalability. This difference between
performance scalability and energy scalability is due to two important factors. There is a nonlinear
relationship between power and frequency at which the cores operate in multicore processors. In
fact, the power consumed by a core is (typically) proportional to the cube of its frequency.
Executing parallel algorithms typically involves communication (or shared memory accesses) as well
as computation. The power and performance characteristics of communication and computation
may be different. For example, in many algorithms, communication time may be masked by
overlapping communication and computation (e.g., [1]). However, the power required for
communication would be unaffected whether communication overlaps with the computation or not.
Objective
(a) To analyse energy characteristics of parallel algorithms executed on scalable multicore
processors.
(b) To study the sensitivity of the analysis to changes in parameters such as the ratio of power
required for computation versus power required for communication.
(c) To determine how many cores to use in order to minimize energy consumption.
Methodology
The methodology is to evaluate energy scalability under iso-performance of parallel applications.
Partitioning Step
Find the critical path of the parallel algorithm. The critical path is the longest path through the task
dependency graph (where edges represents task serialization) of the parallel algorithm. Note that
the critical path length gives a lower bound on execution time of the parallel algorithm. In the
parallel algorithm, the critical path is easy to find. It is the execution path of the core that has the
sum of all numbers at the end. Half of the cores send the sum they compute to the other half so that
no core receives a sum from more than one core. The receiving core then add the number the local
sum they have computed. We perform the same step recursively until there is only one core left. At
the end of computation, one core will store the sum of all N numbers.
Communication and computation steps
Partition the critical path into communication and computation steps. We can see that there are log
(M) communication steps and ((N=M) � 1 + log (M)) computation steps. Consider a simple parallel
algorithm to add N numbers using M cores. Initially all N numbers are equally distributed among the
M cores; at the end of the computation, one of the cores stores their sum. Without loss of
generality, assume that the number of cores available is some power of two. The algorithm runs in
log (M) steps.
Adding N numbers using 4 actors; Left most
line represents the critical path; embarrassingly
parallel application but represents a broad
class of tree algorithms
Agglomeration step
Scale computation steps of the critical path so that the parallel performance matches the
performance requirement. Scaling the computation time of the critical path to the difference of the
required performance and the communication time of the critical path, and obtain the new reduced
frequency at which all M cores should run. Reduced frequency obtained at which all M cores should
run to complete in time T.
Where β represents number of cycles required per addition. In order to achieve energy savings, we
require 0 < X0 < F. This restriction provides a lower bound on the input size as a function of M and Kc.
Evaluate the message complexity (total number of messages processed) of the parallel algorithm.
The example algorithms discuss before show that the message complexity of some parallel
algorithms may depend only on the number of cores, while for others it depends on both the input
size and the number of cores used. It is trivial to see that number of message transfers for this
parallel algorithm when running on M cores is (M-1). In the algorithm, the message complexity is
only dependent on M and not on the input size N.
Evaluate the total idle time at all the cores assuming the frequency obtained, running at new
frequency X’. Scaling the parallel algorithm critical path may lead to an increase in idle time in other
paths at other cores. Total idle time is:
Where the first term represents the total idle time spent by idle cores while other cores are busy
computing and second term represents the total idle time spent by idle cores while other cores are
involved in message communication.
Step 6 Frame an expression for energy consumption of the parallel algorithm using the energy
model. The energy expression is the sum of the energy consumed by Computation, Ecomp,
Communication, Ecomm and Idling (static power), Eidle
Ecomp is lower if the cores run at a lower frequency, while Eidle may increase as the busy cores take
longer to finish. Ecomm may increase as more cores are used since the computation is more
distributed. Observe that (N-1) is the total number of computation steps. The energy consumed for
computation, communication and idling while the algorithm is running on M cores at reduced
frequency X’ is:
Mapping step
Analyse the equation to obtain the number of cores required for minimum energy consumption as a
function of input size. The energy expression is dependent on many variables such as N (Input Size),
M (Number of cores), (Number of instruction per addition), Kc (no of cycles executed at maximum
frequency for single message communication time), Em (energy consumed for single message
communication between cores), Ps (static power) and the maximum frequency of a core. In most
architectures, the number of cycles involved per addition is just one, so we assume = 1. Set idle
energy consumed per cycle as (Ps=F) = 1, where the cycle is at the maximum frequency F. Express all
energy values with respect to this normalized energy value.
Figure 1 plot for any input size N, initially energy decreases with increasing M and later on increases
with increasing M. Energy for computation decreases with an increase in number of cores running at
reduced frequencies, and energy for communication increases with increasing cores. We can see
that increasing the input size leads to an increase in the optimal number of cores required for
minimum energy consumption. We now consider the sensitivity of this analysis with respect to the
ratio k. Figure 2 plots the optimal number of cores required for minimum energy consumption by
varying input size and k.
Figure 2 shows that for a fixed input size, the optimal number of cores required for minimum energy
consumption decreases with increasing k. Moreover, it is evident that with increasing input size, the
trend remains the same (approximating a negative exponential curve with negative coefficient).

Contenu connexe

Tendances

Enriched Firefly Algorithm for Solving Reactive Power Problem
Enriched Firefly Algorithm for Solving Reactive Power ProblemEnriched Firefly Algorithm for Solving Reactive Power Problem
Enriched Firefly Algorithm for Solving Reactive Power Problemijeei-iaes
 
Lut optimization for memory based computation
Lut optimization for memory based computationLut optimization for memory based computation
Lut optimization for memory based computationJohn Williams
 
Energy Efficient Geographical Forwarding Algorithm For Wireless Ad Hoc And Se...
Energy Efficient Geographical Forwarding Algorithm For Wireless Ad Hoc And Se...Energy Efficient Geographical Forwarding Algorithm For Wireless Ad Hoc And Se...
Energy Efficient Geographical Forwarding Algorithm For Wireless Ad Hoc And Se...zhendong
 
Presentation: Wind Speed Prediction using Radial Basis Function Neural Network
Presentation: Wind Speed Prediction using Radial Basis Function Neural NetworkPresentation: Wind Speed Prediction using Radial Basis Function Neural Network
Presentation: Wind Speed Prediction using Radial Basis Function Neural NetworkArzam Muzaffar Kotriwala
 
Kalman Filter Algorithm for Mitigation of Power System Harmonics
Kalman Filter Algorithm for Mitigation of Power System Harmonics  Kalman Filter Algorithm for Mitigation of Power System Harmonics
Kalman Filter Algorithm for Mitigation of Power System Harmonics IJECEIAES
 
A Threshold Enhancement Technique for Chaotic On-Off Keying Scheme
A Threshold Enhancement Technique for Chaotic On-Off Keying SchemeA Threshold Enhancement Technique for Chaotic On-Off Keying Scheme
A Threshold Enhancement Technique for Chaotic On-Off Keying SchemeCSCJournals
 
Economic/Emission Load Dispatch Using Artificial Bee Colony Algorithm
Economic/Emission Load Dispatch Using Artificial Bee Colony AlgorithmEconomic/Emission Load Dispatch Using Artificial Bee Colony Algorithm
Economic/Emission Load Dispatch Using Artificial Bee Colony AlgorithmIDES Editor
 
Design & Implementation of LUT Based Multiplier Using APCOMS Technique
Design & Implementation of LUT Based Multiplier Using APCOMS TechniqueDesign & Implementation of LUT Based Multiplier Using APCOMS Technique
Design & Implementation of LUT Based Multiplier Using APCOMS Techniqueijsrd.com
 
A study of dipole antennas using mat lab
A study of dipole antennas using mat labA study of dipole antennas using mat lab
A study of dipole antennas using mat labLe Professionist
 
A reduced complexity and an efficient channel
A reduced complexity and an efficient channelA reduced complexity and an efficient channel
A reduced complexity and an efficient channeleSAT Publishing House
 
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...ijgca
 
Reduction of Active Power Loss byUsing Adaptive Cat Swarm Optimization
Reduction of Active Power Loss byUsing Adaptive Cat Swarm OptimizationReduction of Active Power Loss byUsing Adaptive Cat Swarm Optimization
Reduction of Active Power Loss byUsing Adaptive Cat Swarm Optimizationijeei-iaes
 
Combination of Immune Genetic Particle Swarm Optimization algorithm with BP a...
Combination of Immune Genetic Particle Swarm Optimization algorithm with BP a...Combination of Immune Genetic Particle Swarm Optimization algorithm with BP a...
Combination of Immune Genetic Particle Swarm Optimization algorithm with BP a...paperpublications3
 
Performance of CBR Traffic on Node Overutilization in MANETs
Performance of CBR Traffic on Node Overutilization in MANETsPerformance of CBR Traffic on Node Overutilization in MANETs
Performance of CBR Traffic on Node Overutilization in MANETsRSIS International
 
Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...
Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...
Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...IDES Editor
 
Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...
Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...
Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...graphhoc
 

Tendances (19)

Enriched Firefly Algorithm for Solving Reactive Power Problem
Enriched Firefly Algorithm for Solving Reactive Power ProblemEnriched Firefly Algorithm for Solving Reactive Power Problem
Enriched Firefly Algorithm for Solving Reactive Power Problem
 
Lut optimization for memory based computation
Lut optimization for memory based computationLut optimization for memory based computation
Lut optimization for memory based computation
 
Energy Efficient Geographical Forwarding Algorithm For Wireless Ad Hoc And Se...
Energy Efficient Geographical Forwarding Algorithm For Wireless Ad Hoc And Se...Energy Efficient Geographical Forwarding Algorithm For Wireless Ad Hoc And Se...
Energy Efficient Geographical Forwarding Algorithm For Wireless Ad Hoc And Se...
 
Presentation: Wind Speed Prediction using Radial Basis Function Neural Network
Presentation: Wind Speed Prediction using Radial Basis Function Neural NetworkPresentation: Wind Speed Prediction using Radial Basis Function Neural Network
Presentation: Wind Speed Prediction using Radial Basis Function Neural Network
 
Kalman Filter Algorithm for Mitigation of Power System Harmonics
Kalman Filter Algorithm for Mitigation of Power System Harmonics  Kalman Filter Algorithm for Mitigation of Power System Harmonics
Kalman Filter Algorithm for Mitigation of Power System Harmonics
 
A Threshold Enhancement Technique for Chaotic On-Off Keying Scheme
A Threshold Enhancement Technique for Chaotic On-Off Keying SchemeA Threshold Enhancement Technique for Chaotic On-Off Keying Scheme
A Threshold Enhancement Technique for Chaotic On-Off Keying Scheme
 
Economic/Emission Load Dispatch Using Artificial Bee Colony Algorithm
Economic/Emission Load Dispatch Using Artificial Bee Colony AlgorithmEconomic/Emission Load Dispatch Using Artificial Bee Colony Algorithm
Economic/Emission Load Dispatch Using Artificial Bee Colony Algorithm
 
Design & Implementation of LUT Based Multiplier Using APCOMS Technique
Design & Implementation of LUT Based Multiplier Using APCOMS TechniqueDesign & Implementation of LUT Based Multiplier Using APCOMS Technique
Design & Implementation of LUT Based Multiplier Using APCOMS Technique
 
A study of dipole antennas using mat lab
A study of dipole antennas using mat labA study of dipole antennas using mat lab
A study of dipole antennas using mat lab
 
E0812730
E0812730E0812730
E0812730
 
A reduced complexity and an efficient channel
A reduced complexity and an efficient channelA reduced complexity and an efficient channel
A reduced complexity and an efficient channel
 
Economic load dispatch problem solving using "Cuckoo Search"
Economic load dispatch problem solving using "Cuckoo Search"Economic load dispatch problem solving using "Cuckoo Search"
Economic load dispatch problem solving using "Cuckoo Search"
 
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...
PERFORMANCE FACTORS OF CLOUD COMPUTING DATA CENTERS USING [(M/G/1) : (∞/GDMOD...
 
Reduction of Active Power Loss byUsing Adaptive Cat Swarm Optimization
Reduction of Active Power Loss byUsing Adaptive Cat Swarm OptimizationReduction of Active Power Loss byUsing Adaptive Cat Swarm Optimization
Reduction of Active Power Loss byUsing Adaptive Cat Swarm Optimization
 
Wireless Sensor Network Security Analytics
Wireless Sensor Network Security AnalyticsWireless Sensor Network Security Analytics
Wireless Sensor Network Security Analytics
 
Combination of Immune Genetic Particle Swarm Optimization algorithm with BP a...
Combination of Immune Genetic Particle Swarm Optimization algorithm with BP a...Combination of Immune Genetic Particle Swarm Optimization algorithm with BP a...
Combination of Immune Genetic Particle Swarm Optimization algorithm with BP a...
 
Performance of CBR Traffic on Node Overutilization in MANETs
Performance of CBR Traffic on Node Overutilization in MANETsPerformance of CBR Traffic on Node Overutilization in MANETs
Performance of CBR Traffic on Node Overutilization in MANETs
 
Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...
Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...
Adaptive Channel Equalization for Nonlinear Channels using Signed Regressor F...
 
Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...
Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...
Performance Analysis of Ultra Wideband Receivers for High Data Rate Wireless ...
 

Similaire à Analysis of parallel algorithms for energy consumption

OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKS
OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKSOPTIMIZED TASK ALLOCATION IN SENSOR NETWORKS
OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKSZac Darcy
 
Edd clustering algorithm for
Edd clustering algorithm forEdd clustering algorithm for
Edd clustering algorithm forcsandit
 
EDD CLUSTERING ALGORITHM FOR WIRELESS SENSOR NETWORKS
EDD CLUSTERING ALGORITHM FOR WIRELESS SENSOR NETWORKSEDD CLUSTERING ALGORITHM FOR WIRELESS SENSOR NETWORKS
EDD CLUSTERING ALGORITHM FOR WIRELESS SENSOR NETWORKScscpconf
 
Energy consumption of vlsi decoders
Energy consumption of vlsi decodersEnergy consumption of vlsi decoders
Energy consumption of vlsi decodersjpstudcorner
 
Parallel Algorithms: Sort & Merge, Image Processing, Fault Tolerance
Parallel Algorithms: Sort & Merge, Image Processing, Fault ToleranceParallel Algorithms: Sort & Merge, Image Processing, Fault Tolerance
Parallel Algorithms: Sort & Merge, Image Processing, Fault ToleranceUniversity of Technology - Iraq
 
A CLUSTER BASED STABLE ROUTING PROTOCOL USING BINARY PARTICLE SWARM OPTIMIZAT...
A CLUSTER BASED STABLE ROUTING PROTOCOL USING BINARY PARTICLE SWARM OPTIMIZAT...A CLUSTER BASED STABLE ROUTING PROTOCOL USING BINARY PARTICLE SWARM OPTIMIZAT...
A CLUSTER BASED STABLE ROUTING PROTOCOL USING BINARY PARTICLE SWARM OPTIMIZAT...ijmnct
 
Design Optimization of Energy and Delay Efficient Wireless Sensor Network wit...
Design Optimization of Energy and Delay Efficient Wireless Sensor Network wit...Design Optimization of Energy and Delay Efficient Wireless Sensor Network wit...
Design Optimization of Energy and Delay Efficient Wireless Sensor Network wit...IOSR Journals
 
Cluster head election using imperialist competitive algorithm (chei) for wire...
Cluster head election using imperialist competitive algorithm (chei) for wire...Cluster head election using imperialist competitive algorithm (chei) for wire...
Cluster head election using imperialist competitive algorithm (chei) for wire...ijmnct
 
E NERGY D EGREE D ISTANCE C LUSTERING A LGORITHM FOR W Sns
E NERGY D EGREE D ISTANCE C LUSTERING A LGORITHM FOR W SnsE NERGY D EGREE D ISTANCE C LUSTERING A LGORITHM FOR W Sns
E NERGY D EGREE D ISTANCE C LUSTERING A LGORITHM FOR W Snsijwmn
 
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...IJCNCJournal
 
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...IJCNCJournal
 
Energy-Efficient Hybrid K-Means Algorithm for Clustered Wireless Sensor Netw...
Energy-Efficient Hybrid K-Means Algorithm for Clustered  Wireless Sensor Netw...Energy-Efficient Hybrid K-Means Algorithm for Clustered  Wireless Sensor Netw...
Energy-Efficient Hybrid K-Means Algorithm for Clustered Wireless Sensor Netw...IJECEIAES
 
Efficient use of Energy in WSN using H-LEACH
Efficient use of Energy in WSN using H-LEACHEfficient use of Energy in WSN using H-LEACH
Efficient use of Energy in WSN using H-LEACHdbpublications
 
Comprehensive Performance Evaluation on Multiplication of Matrices using MPI
Comprehensive Performance Evaluation on Multiplication of Matrices using MPIComprehensive Performance Evaluation on Multiplication of Matrices using MPI
Comprehensive Performance Evaluation on Multiplication of Matrices using MPIijtsrd
 
PERFORMANCE AND COMPLEXITY ANALYSIS OF A REDUCED ITERATIONS LLL ALGORITHM
PERFORMANCE AND COMPLEXITY ANALYSIS OF A REDUCED ITERATIONS LLL ALGORITHMPERFORMANCE AND COMPLEXITY ANALYSIS OF A REDUCED ITERATIONS LLL ALGORITHM
PERFORMANCE AND COMPLEXITY ANALYSIS OF A REDUCED ITERATIONS LLL ALGORITHMIJCNCJournal
 
Approximated computing for low power neural networks
Approximated computing for low power neural networksApproximated computing for low power neural networks
Approximated computing for low power neural networksTELKOMNIKA JOURNAL
 
Performance evaluation of various routing techniques in wireless multimedia
Performance evaluation of various routing techniques in wireless multimediaPerformance evaluation of various routing techniques in wireless multimedia
Performance evaluation of various routing techniques in wireless multimediaIAEME Publication
 
Optimum Network Reconfiguration using Grey Wolf Optimizer
Optimum Network Reconfiguration using Grey Wolf OptimizerOptimum Network Reconfiguration using Grey Wolf Optimizer
Optimum Network Reconfiguration using Grey Wolf OptimizerTELKOMNIKA JOURNAL
 

Similaire à Analysis of parallel algorithms for energy consumption (20)

OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKS
OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKSOPTIMIZED TASK ALLOCATION IN SENSOR NETWORKS
OPTIMIZED TASK ALLOCATION IN SENSOR NETWORKS
 
Edd clustering algorithm for
Edd clustering algorithm forEdd clustering algorithm for
Edd clustering algorithm for
 
EDD CLUSTERING ALGORITHM FOR WIRELESS SENSOR NETWORKS
EDD CLUSTERING ALGORITHM FOR WIRELESS SENSOR NETWORKSEDD CLUSTERING ALGORITHM FOR WIRELESS SENSOR NETWORKS
EDD CLUSTERING ALGORITHM FOR WIRELESS SENSOR NETWORKS
 
Energy consumption of vlsi decoders
Energy consumption of vlsi decodersEnergy consumption of vlsi decoders
Energy consumption of vlsi decoders
 
Parallel Algorithms: Sort & Merge, Image Processing, Fault Tolerance
Parallel Algorithms: Sort & Merge, Image Processing, Fault ToleranceParallel Algorithms: Sort & Merge, Image Processing, Fault Tolerance
Parallel Algorithms: Sort & Merge, Image Processing, Fault Tolerance
 
A CLUSTER BASED STABLE ROUTING PROTOCOL USING BINARY PARTICLE SWARM OPTIMIZAT...
A CLUSTER BASED STABLE ROUTING PROTOCOL USING BINARY PARTICLE SWARM OPTIMIZAT...A CLUSTER BASED STABLE ROUTING PROTOCOL USING BINARY PARTICLE SWARM OPTIMIZAT...
A CLUSTER BASED STABLE ROUTING PROTOCOL USING BINARY PARTICLE SWARM OPTIMIZAT...
 
Design Optimization of Energy and Delay Efficient Wireless Sensor Network wit...
Design Optimization of Energy and Delay Efficient Wireless Sensor Network wit...Design Optimization of Energy and Delay Efficient Wireless Sensor Network wit...
Design Optimization of Energy and Delay Efficient Wireless Sensor Network wit...
 
Ijecet 06 09_003
Ijecet 06 09_003Ijecet 06 09_003
Ijecet 06 09_003
 
Cluster head election using imperialist competitive algorithm (chei) for wire...
Cluster head election using imperialist competitive algorithm (chei) for wire...Cluster head election using imperialist competitive algorithm (chei) for wire...
Cluster head election using imperialist competitive algorithm (chei) for wire...
 
E NERGY D EGREE D ISTANCE C LUSTERING A LGORITHM FOR W Sns
E NERGY D EGREE D ISTANCE C LUSTERING A LGORITHM FOR W SnsE NERGY D EGREE D ISTANCE C LUSTERING A LGORITHM FOR W Sns
E NERGY D EGREE D ISTANCE C LUSTERING A LGORITHM FOR W Sns
 
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...
 
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...
Investigation of Ant Colony Optimization Algorithm for Efficient Energy Utili...
 
Energy-Efficient Hybrid K-Means Algorithm for Clustered Wireless Sensor Netw...
Energy-Efficient Hybrid K-Means Algorithm for Clustered  Wireless Sensor Netw...Energy-Efficient Hybrid K-Means Algorithm for Clustered  Wireless Sensor Netw...
Energy-Efficient Hybrid K-Means Algorithm for Clustered Wireless Sensor Netw...
 
Efficient use of Energy in WSN using H-LEACH
Efficient use of Energy in WSN using H-LEACHEfficient use of Energy in WSN using H-LEACH
Efficient use of Energy in WSN using H-LEACH
 
Comprehensive Performance Evaluation on Multiplication of Matrices using MPI
Comprehensive Performance Evaluation on Multiplication of Matrices using MPIComprehensive Performance Evaluation on Multiplication of Matrices using MPI
Comprehensive Performance Evaluation on Multiplication of Matrices using MPI
 
PERFORMANCE AND COMPLEXITY ANALYSIS OF A REDUCED ITERATIONS LLL ALGORITHM
PERFORMANCE AND COMPLEXITY ANALYSIS OF A REDUCED ITERATIONS LLL ALGORITHMPERFORMANCE AND COMPLEXITY ANALYSIS OF A REDUCED ITERATIONS LLL ALGORITHM
PERFORMANCE AND COMPLEXITY ANALYSIS OF A REDUCED ITERATIONS LLL ALGORITHM
 
Fo2510211029
Fo2510211029Fo2510211029
Fo2510211029
 
Approximated computing for low power neural networks
Approximated computing for low power neural networksApproximated computing for low power neural networks
Approximated computing for low power neural networks
 
Performance evaluation of various routing techniques in wireless multimedia
Performance evaluation of various routing techniques in wireless multimediaPerformance evaluation of various routing techniques in wireless multimedia
Performance evaluation of various routing techniques in wireless multimedia
 
Optimum Network Reconfiguration using Grey Wolf Optimizer
Optimum Network Reconfiguration using Grey Wolf OptimizerOptimum Network Reconfiguration using Grey Wolf Optimizer
Optimum Network Reconfiguration using Grey Wolf Optimizer
 

Plus de are you

Business plan (air umbrella)
Business plan (air umbrella)Business plan (air umbrella)
Business plan (air umbrella)are you
 
A* algorithm
A* algorithmA* algorithm
A* algorithmare you
 
Dijkstra algorithm
Dijkstra algorithmDijkstra algorithm
Dijkstra algorithmare you
 
Application of least square method
Application of least square methodApplication of least square method
Application of least square methodare you
 
Comparative study of fish harvesting
Comparative study of fish harvestingComparative study of fish harvesting
Comparative study of fish harvestingare you
 
Business idea
Business ideaBusiness idea
Business ideaare you
 
Effective determinants of corporate nano
Effective determinants of corporate nanoEffective determinants of corporate nano
Effective determinants of corporate nanoare you
 
Govida chocolate
Govida chocolateGovida chocolate
Govida chocolateare you
 
Yourprezi
YourpreziYourprezi
Yourpreziare you
 
Skrip Arabic (الكتابة العربية)
Skrip Arabic (الكتابة العربية)Skrip Arabic (الكتابة العربية)
Skrip Arabic (الكتابة العربية)are you
 
Report ent600
Report  ent600Report  ent600
Report ent600are you
 
Mgt project
Mgt projectMgt project
Mgt projectare you
 
MyBride Presentation
MyBride PresentationMyBride Presentation
MyBride Presentationare you
 
Levi's jeans presentation
Levi's jeans presentationLevi's jeans presentation
Levi's jeans presentationare you
 
Portable Printer
Portable PrinterPortable Printer
Portable Printerare you
 
Air craft control system of parallel processing
Air craft control system of parallel processingAir craft control system of parallel processing
Air craft control system of parallel processingare you
 
Application of Parallel Processing
Application of Parallel ProcessingApplication of Parallel Processing
Application of Parallel Processingare you
 
Mybride gallery
Mybride galleryMybride gallery
Mybride galleryare you
 

Plus de are you (20)

Business plan (air umbrella)
Business plan (air umbrella)Business plan (air umbrella)
Business plan (air umbrella)
 
A* algorithm
A* algorithmA* algorithm
A* algorithm
 
Poster
PosterPoster
Poster
 
Dijkstra algorithm
Dijkstra algorithmDijkstra algorithm
Dijkstra algorithm
 
Application of least square method
Application of least square methodApplication of least square method
Application of least square method
 
Comparative study of fish harvesting
Comparative study of fish harvestingComparative study of fish harvesting
Comparative study of fish harvesting
 
Business idea
Business ideaBusiness idea
Business idea
 
Effective determinants of corporate nano
Effective determinants of corporate nanoEffective determinants of corporate nano
Effective determinants of corporate nano
 
Govida chocolate
Govida chocolateGovida chocolate
Govida chocolate
 
Yourprezi
YourpreziYourprezi
Yourprezi
 
Skrip Arabic (الكتابة العربية)
Skrip Arabic (الكتابة العربية)Skrip Arabic (الكتابة العربية)
Skrip Arabic (الكتابة العربية)
 
Report ent600
Report  ent600Report  ent600
Report ent600
 
Mgt project
Mgt projectMgt project
Mgt project
 
MyBride Presentation
MyBride PresentationMyBride Presentation
MyBride Presentation
 
Prezi
PreziPrezi
Prezi
 
Levi's jeans presentation
Levi's jeans presentationLevi's jeans presentation
Levi's jeans presentation
 
Portable Printer
Portable PrinterPortable Printer
Portable Printer
 
Air craft control system of parallel processing
Air craft control system of parallel processingAir craft control system of parallel processing
Air craft control system of parallel processing
 
Application of Parallel Processing
Application of Parallel ProcessingApplication of Parallel Processing
Application of Parallel Processing
 
Mybride gallery
Mybride galleryMybride gallery
Mybride gallery
 

Dernier

Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactdawncurless
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionSafetyChain Software
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingTechSoup
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxShobhayan Kirtania
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAssociation for Project Management
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...fonyou31
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3JemimahLaneBuaron
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdfSoniaTolstoy
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...EduSkills OECD
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfchloefrazer622
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Sapana Sha
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptxVS Mahajan Coaching Centre
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformChameera Dedduwage
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...Pooja Nehwal
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajanpragatimahajan3
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfJayanti Pande
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docxPoojaSen20
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfciinovamais
 

Dernier (20)

Accessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impactAccessible design: Minimum effort, maximum impact
Accessible design: Minimum effort, maximum impact
 
Mastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory InspectionMastering the Unannounced Regulatory Inspection
Mastering the Unannounced Regulatory Inspection
 
Grant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy ConsultingGrant Readiness 101 TechSoup and Remy Consulting
Grant Readiness 101 TechSoup and Remy Consulting
 
The byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptxThe byproduct of sericulture in different industries.pptx
The byproduct of sericulture in different industries.pptx
 
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
Mattingly "AI & Prompt Design: Structured Data, Assistants, & RAG"
 
APM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across SectorsAPM Welcome, APM North West Network Conference, Synergies Across Sectors
APM Welcome, APM North West Network Conference, Synergies Across Sectors
 
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
Ecosystem Interactions Class Discussion Presentation in Blue Green Lined Styl...
 
Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3Q4-W6-Restating Informational Text Grade 3
Q4-W6-Restating Informational Text Grade 3
 
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdfBASLIQ CURRENT LOOKBOOK  LOOKBOOK(1) (1).pdf
BASLIQ CURRENT LOOKBOOK LOOKBOOK(1) (1).pdf
 
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
Presentation by Andreas Schleicher Tackling the School Absenteeism Crisis 30 ...
 
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptxINDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
INDIA QUIZ 2024 RLAC DELHI UNIVERSITY.pptx
 
Disha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdfDisha NEET Physics Guide for classes 11 and 12.pdf
Disha NEET Physics Guide for classes 11 and 12.pdf
 
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111Call Girls in Dwarka Mor Delhi Contact Us 9654467111
Call Girls in Dwarka Mor Delhi Contact Us 9654467111
 
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions  for the students and aspirants of Chemistry12th.pptxOrganic Name Reactions  for the students and aspirants of Chemistry12th.pptx
Organic Name Reactions for the students and aspirants of Chemistry12th.pptx
 
A Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy ReformA Critique of the Proposed National Education Policy Reform
A Critique of the Proposed National Education Policy Reform
 
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...Russian Call Girls in Andheri Airport Mumbai WhatsApp  9167673311 💞 Full Nigh...
Russian Call Girls in Andheri Airport Mumbai WhatsApp 9167673311 💞 Full Nigh...
 
social pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajansocial pharmacy d-pharm 1st year by Pragati K. Mahajan
social pharmacy d-pharm 1st year by Pragati K. Mahajan
 
Web & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdfWeb & Social Media Analytics Previous Year Question Paper.pdf
Web & Social Media Analytics Previous Year Question Paper.pdf
 
mini mental status format.docx
mini    mental       status     format.docxmini    mental       status     format.docx
mini mental status format.docx
 
Activity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdfActivity 01 - Artificial Culture (1).pdf
Activity 01 - Artificial Culture (1).pdf
 

Analysis of parallel algorithms for energy consumption

  • 1. Analysis of Parallel Algorithms for Energy Conservation in Scalable Multicore Architecture Problem statement An important concern for computing, as for society, is conserving energy. Moreover, energy conservation is critical in mobile devices for practical reasons. The relation between parallel applications and their energy requirements on scalable multicore processors are examined. It believe this sort of analysis can provide programmers with intuitions about the energy required by the parallel algorithms they are using–thus guiding the choice of algorithm, architecture and the number of cores used for a particular application. Researchers have studied performance scalability of parallel algorithms for some time [10]. However, it is important to note that performance scalability of a parallel algorithm is not the same as its energy scalability. This difference between performance scalability and energy scalability is due to two important factors. There is a nonlinear relationship between power and frequency at which the cores operate in multicore processors. In fact, the power consumed by a core is (typically) proportional to the cube of its frequency. Executing parallel algorithms typically involves communication (or shared memory accesses) as well as computation. The power and performance characteristics of communication and computation may be different. For example, in many algorithms, communication time may be masked by overlapping communication and computation (e.g., [1]). However, the power required for communication would be unaffected whether communication overlaps with the computation or not. Objective (a) To analyse energy characteristics of parallel algorithms executed on scalable multicore processors. (b) To study the sensitivity of the analysis to changes in parameters such as the ratio of power required for computation versus power required for communication. (c) To determine how many cores to use in order to minimize energy consumption.
  • 2. Methodology The methodology is to evaluate energy scalability under iso-performance of parallel applications. Partitioning Step Find the critical path of the parallel algorithm. The critical path is the longest path through the task dependency graph (where edges represents task serialization) of the parallel algorithm. Note that the critical path length gives a lower bound on execution time of the parallel algorithm. In the parallel algorithm, the critical path is easy to find. It is the execution path of the core that has the sum of all numbers at the end. Half of the cores send the sum they compute to the other half so that no core receives a sum from more than one core. The receiving core then add the number the local sum they have computed. We perform the same step recursively until there is only one core left. At the end of computation, one core will store the sum of all N numbers. Communication and computation steps Partition the critical path into communication and computation steps. We can see that there are log (M) communication steps and ((N=M) � 1 + log (M)) computation steps. Consider a simple parallel algorithm to add N numbers using M cores. Initially all N numbers are equally distributed among the M cores; at the end of the computation, one of the cores stores their sum. Without loss of generality, assume that the number of cores available is some power of two. The algorithm runs in log (M) steps. Adding N numbers using 4 actors; Left most line represents the critical path; embarrassingly parallel application but represents a broad class of tree algorithms
  • 3. Agglomeration step Scale computation steps of the critical path so that the parallel performance matches the performance requirement. Scaling the computation time of the critical path to the difference of the required performance and the communication time of the critical path, and obtain the new reduced frequency at which all M cores should run. Reduced frequency obtained at which all M cores should run to complete in time T. Where β represents number of cycles required per addition. In order to achieve energy savings, we require 0 < X0 < F. This restriction provides a lower bound on the input size as a function of M and Kc. Evaluate the message complexity (total number of messages processed) of the parallel algorithm. The example algorithms discuss before show that the message complexity of some parallel algorithms may depend only on the number of cores, while for others it depends on both the input size and the number of cores used. It is trivial to see that number of message transfers for this parallel algorithm when running on M cores is (M-1). In the algorithm, the message complexity is only dependent on M and not on the input size N. Evaluate the total idle time at all the cores assuming the frequency obtained, running at new frequency X’. Scaling the parallel algorithm critical path may lead to an increase in idle time in other paths at other cores. Total idle time is: Where the first term represents the total idle time spent by idle cores while other cores are busy computing and second term represents the total idle time spent by idle cores while other cores are involved in message communication. Step 6 Frame an expression for energy consumption of the parallel algorithm using the energy model. The energy expression is the sum of the energy consumed by Computation, Ecomp, Communication, Ecomm and Idling (static power), Eidle Ecomp is lower if the cores run at a lower frequency, while Eidle may increase as the busy cores take longer to finish. Ecomm may increase as more cores are used since the computation is more distributed. Observe that (N-1) is the total number of computation steps. The energy consumed for computation, communication and idling while the algorithm is running on M cores at reduced frequency X’ is:
  • 4. Mapping step Analyse the equation to obtain the number of cores required for minimum energy consumption as a function of input size. The energy expression is dependent on many variables such as N (Input Size), M (Number of cores), (Number of instruction per addition), Kc (no of cycles executed at maximum frequency for single message communication time), Em (energy consumed for single message communication between cores), Ps (static power) and the maximum frequency of a core. In most architectures, the number of cycles involved per addition is just one, so we assume = 1. Set idle energy consumed per cycle as (Ps=F) = 1, where the cycle is at the maximum frequency F. Express all energy values with respect to this normalized energy value. Figure 1 plot for any input size N, initially energy decreases with increasing M and later on increases with increasing M. Energy for computation decreases with an increase in number of cores running at reduced frequencies, and energy for communication increases with increasing cores. We can see that increasing the input size leads to an increase in the optimal number of cores required for minimum energy consumption. We now consider the sensitivity of this analysis with respect to the ratio k. Figure 2 plots the optimal number of cores required for minimum energy consumption by varying input size and k. Figure 2 shows that for a fixed input size, the optimal number of cores required for minimum energy consumption decreases with increasing k. Moreover, it is evident that with increasing input size, the trend remains the same (approximating a negative exponential curve with negative coefficient).