SlideShare une entreprise Scribd logo
1  sur  8
Télécharger pour lire hors ligne
IOSR Journal of Computer Engineering (IOSR-JCE)
e-ISSN: 2278-0661, p- ISSN: 2278-8727Volume 11, Issue 1 (May. - Jun. 2013), PP 31-38
www.iosrjournals.org
www.iosrjournals.org 31 | Page
Adaptive check-pointing and replication strategy to tolerate faults
in computational grid
Subash Thapa1
, Abhijit bhowmick2
1
(computer Science, NIT Kurukshetra, INDIA)
2
(computer Science, NIT Kurukshetra, INDIA)
Abstract: We have considered both adaptive resource replication and adaptive check-pointing. Complete
resource failure has been considered which was ignored in most of the previous works. We have proposed an
efficient solution for scheduling as well as a fault tolerant method based on maintaining a balance between the
overheads of replication and check-pointing. The selection of resource from available resource is done such
that no partiality is done between jobs of equal priority. The fault tolerance method, i.e., resource failure has
been resolved in our proposed solution with the help of replication, three replicas for each task and number of
replicas will be increased by one on each permanent failure to maintain total replicas three and new replica
start its execution from the last saved checkpoint .
Keywords: Grid, fault,tolerance, redundancy,replication,adaptive,check pointing, Grid Computing
I. Introduction
.The main characteristics of grid are:
 Multiple administrative domains: Since, it spawns into multiple administrative domains, the policies and
autonomy of different domains needs to be maintained.
Heterogeneity: Grid contains different types of resources like personal computers to super computers to
specialized devices like telescopes. Grid provides seamless way to success different resources.
Scalability: A grid might grow from few integrated resources to millions. The performance might be degraded
as the size of grid increases.
1.3 Types of Grid Systems
The types of grid are mentioned below:
Computational Grid: A computational Grid is a system that aims at achieving higher aggregate computational
power than any single constituent machine.
Data Grid: A Data Grid is responsible for housing and providing access to data across multiple organizations.
Users are not concerned with where this data is located as long as they have access to the data.
1.4.1 Resource Allocation
Available resources should be used maximally or optimally in order to balance the distribution of tasks over the
network.
If node failure or communication link failure occurs, it is required to re-allocate and manage the tasks and
resources from start.
1.4.3 FAULT TOLERANCE
Fault-tolerance or graceful degradation is the property that enables a system (often computer-based) to
continue operating properly in the event of the failure of (or one or more faults within) some of its components.
II. Previous Work
In previous works, several mechanisms were proposed for grid or distributed computing systems.
However, some of them used only space redundancy (hardware replication), and others used only time
redundancy (check-pointing and rollback).In 2007, Sangho Yi, Derrick Kondo has published a paper for
reducing a number of replicas by using check-pointing and rollback and also analyzed the probability of
successful execution within the given deadline and reliability requirement of each task for selecting replicas. In
that paper major drawback analyzed by us are:
In this paper, author assumes both permanent and recoverable failures occuraccording to the Poisson
process with rate λfp and λfr, respectively for calculation of no replicas but If the failures occur with non-
Adaptive check-pointing and replication strategy to tolerate faults in computational grid
www.iosrjournals.org 32 | Page
exponential distribution, (e.g. Gamma, Weibull, etc.), it is very hard to use the existing probabilistic distribution
for the same.
In this paper, check-point interval is not major issue, does not talk about this and each recoverable
failure on each replica required to resume from its last saved check-point.
. Previous works done by Babar Nazir, Kalim Qureshi and Paul Manuel in 2008 maintained fault index
of grid resources which doesn’t keep track of permanent failure and recoverable failure. But in this paper major
drawback was:
The fault tolerance method used by the author in this paper is based only on check-pointing.
The priority (deadline) of task has not been considered in this paper. As it is a economy based system ,
task of different priority from user is a certainty.
III. Problem formulation:
Grid users submit their jobs to the grid resource broker (GRB) by specifying their QoS requirements,
i.e., deadline in which users want their jobs to be executed and the budget which users have for the completion
of jobs.
Grid resource broker schedules user jobs on the best available resource by optimizing time.
Result of the job is submitted to user upon successful completion of the job.
Such an economy based environment has following two major draw backs:
In the economy based grid environments, there are resources that fulfill the criteria of deadline and
budget constraints (QoS requirements), but they have a tendency toward faults. In such a scenario, the grid
resource broker goes ahead to select the same resource for the mere reason that the grid resource promises to
meet QoS requirements of the grid jobs. This eventually results in compromising the user’s QoS parameters in
order to complete the task.
In this paper, in order to address the first problem, we use a task check-pointing heuristic to enable the
economy based grid to tolerate faults gracefully, as we are able to restore the partially completed task from the
last checkpoint. And we also use replication on at most three best available resources on the basis of a
probability value of permanent resource failure.
In order to address the second problem, we make our check-pointing strategy adaptive by maintaining a
probability value of recoverable resource failure. In this way, we are able to introduce checkpoint mostly when
it is necessary.
IV. System Model and Assumption:
We have considered an economy based computational grid computing environment. In this system we assume
the resources R1, R2, R3……………………………..……… Rn with following characteristics value.
Some common notations and functions used throughout paper are:
Np: Denotes count of permanent failure.
Nr: Denotes count of Recoverable failure.
Ns: Denotes count of successful execution of task.
Pp: Probability value of permanent failure.
Such that Pp = Np / (Ns+Np+Nr)
Pr: Probability value of recoverable failure.
Such that Pr = Nr / (Ns+Np+Nr)
Pt: Probability value of Total failure.
Such that Pt = Pr + Pp
C(i), i = 1,2, . .. ,N, are the percentage of task completed such that
0 <= C(i) <= 100 and C(1) > C(2) > · · · > C(N)
Adaptive check-pointing and replication strategy to tolerate faults in computational grid
www.iosrjournals.org 33 | Page
F(i), i = 0, 1,2, . . . ,N, are integers such that F(0) < F(1) < ……… <F(N) and represent index range of fault
index.
Proposed Idea:
tr denote the work requirement time of a task
T(tr) denote expected total execution time such that in the absence of any failures T(tr ) = tr
The T(tr ) can expected execution time, Ti(ti,ci), where the ti and ci be calculated as the sum of all intervals are
the work requirement time and check-pointing overhead of an ith
interval.
td represents the deadline for a task.
Ps is a user-defined probability of successful execution of a task within the deadline.
P (tr , td ) is probability of successful execution of a task which has tr as the work .
requirement time and td as the deadline
He assume that the permanent and recoverable failures occur according to Poisson
process at rate λf p and λf r , respectively
Theorem 1
If we assume the recoverable failures occur according to the Poisson process with rate λfr and that
failures can occur during checkpointing, the expected execution time of the ith
interval with
check- pointing, Tri (ti , ci ) is given by
Theorem 2
If we assume both permanent and recover able failures occur according to the Poisson process with
rate λfp and λfr , respectively, the probability for successful execution for the time requirement tr
within the deadline td , P (tr , td ) is given by
V. Adaptive fault tolerant job scheduling strategy
Grid faults affect the performance of resource management strategy. As our proposed strategy
considers fault tolerance in an economy based computational grid environment, the aim is to optimize user-
centric metrics in the presence of faults
We propose to maintain and update probability value of permanent or recoverable resource failure of
all available resources of the grid. The probability value of the grid resource will suggest its vulnerability to
faults (i.e. higher the probability value is higher the failure rate).
VI. Components of proposed scheduling strategy:
Grid resource:
Grid Information Server: Grid users register themselves to the Grid Information Server (GIS) of a grid by
specifying the QoS requirements
Grid Resource Broker (GRB): Each grid job (composed of gridlets) is first submitted to its broker. When
GRB receives a grid job from a user, it gets the contact information of available grid resources from the GIS
and then requests the resources to send their current work load condition.
Adaptive check-pointing and replication strategy to tolerate faults in computational grid
www.iosrjournals.org 34 | Page
ALGORITHM A
Select the resource according to QoS from available resource.
Arrange them in increasing probability value of total failure.
Select the three replicas for equal priority job such that no partiality is done at time t as shown in fig.
F: probability value of the selected grid resource
F(i), i = 0, 1, 2, . . . ,N, are integers such that F(0) < F(1) < ··· < F(N)
C(i), i= 1, 2, . . . ,N, are the percentage of task completed such that
<= C(i) <= 100 and C(1) > C(2) > ··· > C(N)
IF F(0) <= F < F(1) THEN
The task is queued to that resource with a checkpoint at C(1) (It indicates that the grid resource has to send the
result of the task completion to the CPManager when C(0)% task is completed.)
GO TO Step 6.
END IF
IF F(1) <= F < F(2) THEN
The task is queued to that resource with a checkpoint at C(2).
GO TO Step 6.
END IF
IF F(2) <= F < F(3) THEN
The task is queued to that resource with a checkpoint at C(3).
GO TO Step 6.
END IF
IF F(N- 1) <= F < F(N) THEN
Adaptive check-pointing and replication strategy to tolerate faults in computational grid
www.iosrjournals.org 35 | Page
The task is queued to that resource with a checkpoint at C(N).
GO TO Step 6.
END IF
IF F(N) <=F THEN
Remove resource from the available resources list and label it as unavailable resource (i.e., no job is assigned) to
that resource until:
It has passed ALIVE time successfully. (ALIVE time is the time interval for which grid resource must be up so
that it can be again included in the grid available resource list.) END IF
Add the job to the unassigned job list and reapply the time optimization algorithm for this task on available
resources.
EXIT
Fault Tolerant Schedule Manager (FTScheduleManager): we assume that a fault occurs when a fault detector is
unable to receive any signal or response from grid resource in pull model / push model. Then a fault is detected
by a fault detector, the fault occurrence information about the grid resource is updated in Fault Tolerant
Schedule Manager that would be both recoverable failure information and permanent failure information. Task
successful execution information is also send by resource broker to Fault Tolerant Schedule Manager.
Thus fault probability information is used, while making a job allocation decision to the grid resource. It uses
following algorithm i.e. Algorithm B:
ALGORITHM B:
IF gridlet receptor receives the task completion result from the CPManager THEN a. IF resource fault index >=
1 THEN
Send a message to FTScheduleManager to increment Ns END IF
b. GOTO Step3 END IF
IF the to FTScheduleManager receives the task failure message from fault detector, THEN a. IF resource is
permanent failure THEN
Send a message to FTScheduleManager to increment Np
Send a message to CPManager, whether there is any checkpoint result of
this task.
iii. IF checkpoint result of the task exists in CPManager’s task checkpointing table, THEN Inform the resource
broker with that check point value to replicate on another resource.
GOTO Step 3. END IF
iv. IF checkpoint result of the task does not exist in CPManager’s task check-pointing table, THEN
Inform the resource broker with that check point value to replicate on another resource.
GOTO Step 3. END IF
END IF
EXIT
Checkpoint Manager (CPManager): It receives the partially executed result of a
task from a grid resource in the intervals specified by the grid resource broker based on the checkpoint. It maintains grid
tasks and their checkpoint table which
Adaptive check-pointing and replication strategy to tolerate faults in computational grid
www.iosrjournals.org 36 | Page
contains information of partially executed tasks by the grid resources. CPManager also receives and responds to
the task completion and task failure message from grid resources. For a particular task, the CPManager discards
the result of the previous checkpoint when a new value of checkpoint result is received with later value. For a
particular task, if CPManager receives the task completion message from resource, it removes its entity from the
task checkpoint result table. In our proposed algorithm it takes maximum of available check-point from
replication.
Grid dispatcher: It dispatches the tasks one by one to the respective grid resource as listed in the queue by GRB.
A small grid task is called a gridlet.
Gridlet receptor: It receives the task (gridlet) execution result from the grid resource where the task is dispatched
by the dispatcher.
WORKING:
Resource entity registers themselves to the Grid Information Service (GIS) entity.
Fault Tolerant Schedule Manager (FTScheduleManager) sends a synchronous event to the GIS entities to send
the list of resource available.
The GIS entity returns a list of registered resources to FTScheduleManager. FTScheduleManager then initializes
the resource fault occurrence history table
that contain the fault index (suggest resource vulnerability toward failures) of the resources
available.
IV. The GIS entity returns a list of registered resources and their contact details to the broker.
The broker entity finds the current work load condition of the resources by sending events to resource entities
requesting them to send their resource configuration and properties.
VI. Resource entities respond to this event by sending their dynamic information that includes resources cost,
capability, availability, load, and other configuration parameters.
VII. The broker entity sends synchronous events to FTScheduleManager to get the probability value of total
resource failure.
VIII. The broker entity, after receiving probability value of total resource failure index list, dispatches gridlet to
replicate on three resource entities if available such that all job of equal priority will get equal efficiency
resource entities at a time t.
Fault Tolerance method:
I. Resource failure has been resolved in our proposed solution with the help of replication, 3 replica for each
task.
Adaptive check-pointing and replication strategy to tolerate faults in computational grid
www.iosrjournals.org 37 | Page
When the resource entity fails to process the gridlet, detected by fault detector then fault detector sends gridlet
failure event to the CPManager entity and to FTScheduleManager.
III. The CPManager entity on the receipt of gridlet failure event from resource entity sends an asynchronous
gridlet failure event to broker entity .
IV. The checkpoint at the check point manager is adaptively updated by taking the maximum of the checkpoints
from the 3 replicated resource periodically.
On permanent failure of resource, maximum of the last saved check points is taken from the check point
manager to replicate task on another resource by resource broker but on recoverable failure restart the process
on same resource with last saved check-point. Thus taking advantage of the different reliability and computation
speed of the replicated resources.
Page 22 of 27
VI. Task is resumed from this check point on a new replicated resource .Thus in this way maximum number of
replicas for eack task is fixed at 3. i.e resource wastage is minimized.
VII. Conclusion
We have proposed an efficient solution for scheduling as well as an adaptive fault tolerant method
based on maintaining a balance between the overheads of replication and check-pointing. The case of complete
resource failure has been considered. Our proposed solution minimizes the resource wastage due to replication
by taking an optimal number of replicas. At any time the check point manager maintains a unique check point
which it keep on updating periodically , by taking the maximum of the received check points from the three
replicas. This utilizes the different computational speed of the replicated resources. Use of probability value
instead of index value further takes into account the case of permanent failure while scheduling and while
deciding the check point intervals.
References
[1] “Adaptive checkpointing strategy to tolerate faults in economy based grid” Babar Nazir ·
Kalim Qureshi · Paul Manuel Published online: 16 October 2008 © Springer Science+Business Media, LLC 2008
[2] “Using Replication and Checkpointing for Reliable Task Management in Computational Grids” Sangho Yi, Derrick Kondo INRIA,
France {sangho.yi, derrick.kondo}@inria.fr
[3] H. Z. E. Tsiakkouri, R. Sakellariou and M. D. Dikaiakos, “Scheduling Workflows with Budget Constraints,” in Proceedings of the
CoreGRID Workshop ”Integrated research in Grid Computing“, S. Gorlatch andM. Danelutto, Eds., Nov. 2009, pp. 347-357.
[4] H. Topcuouglu, S. Hariri, and Min-you Wu, “Performance-Effective and Low-Complexity Task Scheduling for Heterogeneous
Computing,” IEEE Trans. Parallel Distrib. Syst., vol. 13, no. 3, pp. 260-274, 2010.
Adaptive check-pointing and replication strategy to tolerate faults in computational grid
www.iosrjournals.org 38 | Page
[5] M. Wieczorek, A. Hoheisel, and R. Prodan, “Taxonomies of the Multi-criteria Grid Workflow Scheduling Problem,” in Proceedings
of the CoreGRID Workshop on Grid Middleware. Dresden, Germany: Springer-Verlag, June 2009.
[6] J. Yu and R. Buyya , “Scheduling Scientific Workflow Applications with Deadline and Budget Constraints using Genetic
Algorithms, ”Scientific Programming Journal, vol. 14, no. 1, 2009.
[7] J. Yu, R. Buyya, and C. K. Tham, “Cost-based Scheduling of Scientific Workflow Applications on Utility Grids” , in Proceedings
of the 1st IEEE International Conference on e-Science and Grid Computing (e-Science 2005), IEEE. Melbourne, Australia: IEEE
CS Press, Dec. 2009.
[8] J. Yu, M. Kirley, and R. Buyya, “Multi-objective Planning for Workflow Execution on Grids,” in Proceedings of the 8th
IEEE/ACM International Conference on Grid Computing (Grid 2007), IEEE. Los Alamitos, CA, USA: IEEE CS Press, June 2010.

Contenu connexe

Tendances

(5 10) chitra natarajan
(5 10) chitra natarajan(5 10) chitra natarajan
(5 10) chitra natarajanIISRTJournals
 
An Uncertainty-Aware Approach to Optimal Configuration of Stream Processing S...
An Uncertainty-Aware Approach to Optimal Configuration of Stream Processing S...An Uncertainty-Aware Approach to Optimal Configuration of Stream Processing S...
An Uncertainty-Aware Approach to Optimal Configuration of Stream Processing S...Pooyan Jamshidi
 
Genetic Algorithm for task scheduling in Cloud Computing Environment
Genetic Algorithm for task scheduling in Cloud Computing EnvironmentGenetic Algorithm for task scheduling in Cloud Computing Environment
Genetic Algorithm for task scheduling in Cloud Computing EnvironmentSwapnil Shahade
 
Optimized Assignment of Independent Task for Improving Resources Performance ...
Optimized Assignment of Independent Task for Improving Resources Performance ...Optimized Assignment of Independent Task for Improving Resources Performance ...
Optimized Assignment of Independent Task for Improving Resources Performance ...ijgca
 
Load Balancing in Cloud using Modified Genetic Algorithm
Load Balancing in Cloud using Modified Genetic AlgorithmLoad Balancing in Cloud using Modified Genetic Algorithm
Load Balancing in Cloud using Modified Genetic AlgorithmIJCSIS Research Publications
 
A Review on Scheduling in Cloud Computing
A Review on Scheduling in Cloud ComputingA Review on Scheduling in Cloud Computing
A Review on Scheduling in Cloud Computingijujournal
 
Transfer Learning for Improving Model Predictions in Highly Configurable Soft...
Transfer Learning for Improving Model Predictions in Highly Configurable Soft...Transfer Learning for Improving Model Predictions in Highly Configurable Soft...
Transfer Learning for Improving Model Predictions in Highly Configurable Soft...Pooyan Jamshidi
 
A survey of various scheduling algorithm in cloud computing environment
A survey of various scheduling algorithm in cloud computing environmentA survey of various scheduling algorithm in cloud computing environment
A survey of various scheduling algorithm in cloud computing environmenteSAT Publishing House
 
Optimal buffer allocation in
Optimal buffer allocation inOptimal buffer allocation in
Optimal buffer allocation incsandit
 
Resource scheduling algorithm
Resource scheduling algorithmResource scheduling algorithm
Resource scheduling algorithmShilpa Damor
 
HyperLogLog in Practice: Algorithmic Engineering of a State of The Art Cardin...
HyperLogLog in Practice: Algorithmic Engineering of a State of The Art Cardin...HyperLogLog in Practice: Algorithmic Engineering of a State of The Art Cardin...
HyperLogLog in Practice: Algorithmic Engineering of a State of The Art Cardin...Sunny Kr
 
Efficient Dynamic Scheduling Algorithm for Real-Time MultiCore Systems
Efficient Dynamic Scheduling Algorithm for Real-Time MultiCore Systems Efficient Dynamic Scheduling Algorithm for Real-Time MultiCore Systems
Efficient Dynamic Scheduling Algorithm for Real-Time MultiCore Systems iosrjce
 
Task Scheduling in Grid Computing.
Task Scheduling in Grid Computing.Task Scheduling in Grid Computing.
Task Scheduling in Grid Computing.Ramandeep Kaur
 
Comparative Analysis of Various Grid Based Scheduling Algorithms
Comparative Analysis of Various Grid Based Scheduling AlgorithmsComparative Analysis of Various Grid Based Scheduling Algorithms
Comparative Analysis of Various Grid Based Scheduling Algorithmsiosrjce
 
An enhanced adaptive scoring job scheduling algorithm with replication strate...
An enhanced adaptive scoring job scheduling algorithm with replication strate...An enhanced adaptive scoring job scheduling algorithm with replication strate...
An enhanced adaptive scoring job scheduling algorithm with replication strate...eSAT Publishing House
 
Scheduling Using Multi Objective Genetic Algorithm
Scheduling Using Multi Objective Genetic AlgorithmScheduling Using Multi Objective Genetic Algorithm
Scheduling Using Multi Objective Genetic Algorithmiosrjce
 

Tendances (16)

(5 10) chitra natarajan
(5 10) chitra natarajan(5 10) chitra natarajan
(5 10) chitra natarajan
 
An Uncertainty-Aware Approach to Optimal Configuration of Stream Processing S...
An Uncertainty-Aware Approach to Optimal Configuration of Stream Processing S...An Uncertainty-Aware Approach to Optimal Configuration of Stream Processing S...
An Uncertainty-Aware Approach to Optimal Configuration of Stream Processing S...
 
Genetic Algorithm for task scheduling in Cloud Computing Environment
Genetic Algorithm for task scheduling in Cloud Computing EnvironmentGenetic Algorithm for task scheduling in Cloud Computing Environment
Genetic Algorithm for task scheduling in Cloud Computing Environment
 
Optimized Assignment of Independent Task for Improving Resources Performance ...
Optimized Assignment of Independent Task for Improving Resources Performance ...Optimized Assignment of Independent Task for Improving Resources Performance ...
Optimized Assignment of Independent Task for Improving Resources Performance ...
 
Load Balancing in Cloud using Modified Genetic Algorithm
Load Balancing in Cloud using Modified Genetic AlgorithmLoad Balancing in Cloud using Modified Genetic Algorithm
Load Balancing in Cloud using Modified Genetic Algorithm
 
A Review on Scheduling in Cloud Computing
A Review on Scheduling in Cloud ComputingA Review on Scheduling in Cloud Computing
A Review on Scheduling in Cloud Computing
 
Transfer Learning for Improving Model Predictions in Highly Configurable Soft...
Transfer Learning for Improving Model Predictions in Highly Configurable Soft...Transfer Learning for Improving Model Predictions in Highly Configurable Soft...
Transfer Learning for Improving Model Predictions in Highly Configurable Soft...
 
A survey of various scheduling algorithm in cloud computing environment
A survey of various scheduling algorithm in cloud computing environmentA survey of various scheduling algorithm in cloud computing environment
A survey of various scheduling algorithm in cloud computing environment
 
Optimal buffer allocation in
Optimal buffer allocation inOptimal buffer allocation in
Optimal buffer allocation in
 
Resource scheduling algorithm
Resource scheduling algorithmResource scheduling algorithm
Resource scheduling algorithm
 
HyperLogLog in Practice: Algorithmic Engineering of a State of The Art Cardin...
HyperLogLog in Practice: Algorithmic Engineering of a State of The Art Cardin...HyperLogLog in Practice: Algorithmic Engineering of a State of The Art Cardin...
HyperLogLog in Practice: Algorithmic Engineering of a State of The Art Cardin...
 
Efficient Dynamic Scheduling Algorithm for Real-Time MultiCore Systems
Efficient Dynamic Scheduling Algorithm for Real-Time MultiCore Systems Efficient Dynamic Scheduling Algorithm for Real-Time MultiCore Systems
Efficient Dynamic Scheduling Algorithm for Real-Time MultiCore Systems
 
Task Scheduling in Grid Computing.
Task Scheduling in Grid Computing.Task Scheduling in Grid Computing.
Task Scheduling in Grid Computing.
 
Comparative Analysis of Various Grid Based Scheduling Algorithms
Comparative Analysis of Various Grid Based Scheduling AlgorithmsComparative Analysis of Various Grid Based Scheduling Algorithms
Comparative Analysis of Various Grid Based Scheduling Algorithms
 
An enhanced adaptive scoring job scheduling algorithm with replication strate...
An enhanced adaptive scoring job scheduling algorithm with replication strate...An enhanced adaptive scoring job scheduling algorithm with replication strate...
An enhanced adaptive scoring job scheduling algorithm with replication strate...
 
Scheduling Using Multi Objective Genetic Algorithm
Scheduling Using Multi Objective Genetic AlgorithmScheduling Using Multi Objective Genetic Algorithm
Scheduling Using Multi Objective Genetic Algorithm
 

En vedette

E-Healthcare Billing and Record Management Information System using Android w...
E-Healthcare Billing and Record Management Information System using Android w...E-Healthcare Billing and Record Management Information System using Android w...
E-Healthcare Billing and Record Management Information System using Android w...IOSR Journals
 
Improved Fuzzy Control Strategy for Power Quality in Distributed Generation’s...
Improved Fuzzy Control Strategy for Power Quality in Distributed Generation’s...Improved Fuzzy Control Strategy for Power Quality in Distributed Generation’s...
Improved Fuzzy Control Strategy for Power Quality in Distributed Generation’s...IOSR Journals
 
A Music Visual Interface via Emotion Detection Supervisor
A Music Visual Interface via Emotion Detection SupervisorA Music Visual Interface via Emotion Detection Supervisor
A Music Visual Interface via Emotion Detection SupervisorIOSR Journals
 
Some Aspects of Stress Distribution and Effect of Voids Having Different Gase...
Some Aspects of Stress Distribution and Effect of Voids Having Different Gase...Some Aspects of Stress Distribution and Effect of Voids Having Different Gase...
Some Aspects of Stress Distribution and Effect of Voids Having Different Gase...IOSR Journals
 
Impact of Insect Resistant Cotton
Impact of Insect Resistant CottonImpact of Insect Resistant Cotton
Impact of Insect Resistant CottonIOSR Journals
 
A New Single-Stage Multilevel Type Full-Bridge Converter Applied to closed lo...
A New Single-Stage Multilevel Type Full-Bridge Converter Applied to closed lo...A New Single-Stage Multilevel Type Full-Bridge Converter Applied to closed lo...
A New Single-Stage Multilevel Type Full-Bridge Converter Applied to closed lo...IOSR Journals
 
Temperature and Azimuth angle variation effect on the Building Integrated Pho...
Temperature and Azimuth angle variation effect on the Building Integrated Pho...Temperature and Azimuth angle variation effect on the Building Integrated Pho...
Temperature and Azimuth angle variation effect on the Building Integrated Pho...IOSR Journals
 
Efficient Forecasting of Exchange rates with Recurrent FLANN
Efficient Forecasting of Exchange rates with Recurrent FLANNEfficient Forecasting of Exchange rates with Recurrent FLANN
Efficient Forecasting of Exchange rates with Recurrent FLANNIOSR Journals
 
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...IOSR Journals
 
Performance Evaluation of Routing Protocols for MAC Layer Models
Performance Evaluation of Routing Protocols for MAC Layer ModelsPerformance Evaluation of Routing Protocols for MAC Layer Models
Performance Evaluation of Routing Protocols for MAC Layer ModelsIOSR Journals
 
A Hierarchical Feature Set optimization for effective code change based Defec...
A Hierarchical Feature Set optimization for effective code change based Defec...A Hierarchical Feature Set optimization for effective code change based Defec...
A Hierarchical Feature Set optimization for effective code change based Defec...IOSR Journals
 
Information Hiding for “Color to Gray and back” with Hartley, Slant and Kekre...
Information Hiding for “Color to Gray and back” with Hartley, Slant and Kekre...Information Hiding for “Color to Gray and back” with Hartley, Slant and Kekre...
Information Hiding for “Color to Gray and back” with Hartley, Slant and Kekre...IOSR Journals
 
A Novel Method to Improve the Control Performance of Multilevel Inverter by M...
A Novel Method to Improve the Control Performance of Multilevel Inverter by M...A Novel Method to Improve the Control Performance of Multilevel Inverter by M...
A Novel Method to Improve the Control Performance of Multilevel Inverter by M...IOSR Journals
 
Inflammatory Conditions Mimicking Tumours In Calabar: A 30 Year Study (1978-2...
Inflammatory Conditions Mimicking Tumours In Calabar: A 30 Year Study (1978-2...Inflammatory Conditions Mimicking Tumours In Calabar: A 30 Year Study (1978-2...
Inflammatory Conditions Mimicking Tumours In Calabar: A 30 Year Study (1978-2...IOSR Journals
 
New Skills for Testers
New Skills for TestersNew Skills for Testers
New Skills for TestersIOSR Journals
 
Open Source Software Survivability Analysis Using Communication Pattern Valid...
Open Source Software Survivability Analysis Using Communication Pattern Valid...Open Source Software Survivability Analysis Using Communication Pattern Valid...
Open Source Software Survivability Analysis Using Communication Pattern Valid...IOSR Journals
 
An optimized link state routing protocol based on a cross layer design for wi...
An optimized link state routing protocol based on a cross layer design for wi...An optimized link state routing protocol based on a cross layer design for wi...
An optimized link state routing protocol based on a cross layer design for wi...IOSR Journals
 
Public Verifiability in Cloud Computing Using Signcryption Based on Elliptic ...
Public Verifiability in Cloud Computing Using Signcryption Based on Elliptic ...Public Verifiability in Cloud Computing Using Signcryption Based on Elliptic ...
Public Verifiability in Cloud Computing Using Signcryption Based on Elliptic ...IOSR Journals
 

En vedette (20)

C0551417
C0551417C0551417
C0551417
 
J0935461
J0935461J0935461
J0935461
 
E-Healthcare Billing and Record Management Information System using Android w...
E-Healthcare Billing and Record Management Information System using Android w...E-Healthcare Billing and Record Management Information System using Android w...
E-Healthcare Billing and Record Management Information System using Android w...
 
Improved Fuzzy Control Strategy for Power Quality in Distributed Generation’s...
Improved Fuzzy Control Strategy for Power Quality in Distributed Generation’s...Improved Fuzzy Control Strategy for Power Quality in Distributed Generation’s...
Improved Fuzzy Control Strategy for Power Quality in Distributed Generation’s...
 
A Music Visual Interface via Emotion Detection Supervisor
A Music Visual Interface via Emotion Detection SupervisorA Music Visual Interface via Emotion Detection Supervisor
A Music Visual Interface via Emotion Detection Supervisor
 
Some Aspects of Stress Distribution and Effect of Voids Having Different Gase...
Some Aspects of Stress Distribution and Effect of Voids Having Different Gase...Some Aspects of Stress Distribution and Effect of Voids Having Different Gase...
Some Aspects of Stress Distribution and Effect of Voids Having Different Gase...
 
Impact of Insect Resistant Cotton
Impact of Insect Resistant CottonImpact of Insect Resistant Cotton
Impact of Insect Resistant Cotton
 
A New Single-Stage Multilevel Type Full-Bridge Converter Applied to closed lo...
A New Single-Stage Multilevel Type Full-Bridge Converter Applied to closed lo...A New Single-Stage Multilevel Type Full-Bridge Converter Applied to closed lo...
A New Single-Stage Multilevel Type Full-Bridge Converter Applied to closed lo...
 
Temperature and Azimuth angle variation effect on the Building Integrated Pho...
Temperature and Azimuth angle variation effect on the Building Integrated Pho...Temperature and Azimuth angle variation effect on the Building Integrated Pho...
Temperature and Azimuth angle variation effect on the Building Integrated Pho...
 
Efficient Forecasting of Exchange rates with Recurrent FLANN
Efficient Forecasting of Exchange rates with Recurrent FLANNEfficient Forecasting of Exchange rates with Recurrent FLANN
Efficient Forecasting of Exchange rates with Recurrent FLANN
 
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
Analysis of Rayleigh Quotient in Extrapolation Method to Accelerate the Compu...
 
Performance Evaluation of Routing Protocols for MAC Layer Models
Performance Evaluation of Routing Protocols for MAC Layer ModelsPerformance Evaluation of Routing Protocols for MAC Layer Models
Performance Evaluation of Routing Protocols for MAC Layer Models
 
A Hierarchical Feature Set optimization for effective code change based Defec...
A Hierarchical Feature Set optimization for effective code change based Defec...A Hierarchical Feature Set optimization for effective code change based Defec...
A Hierarchical Feature Set optimization for effective code change based Defec...
 
Information Hiding for “Color to Gray and back” with Hartley, Slant and Kekre...
Information Hiding for “Color to Gray and back” with Hartley, Slant and Kekre...Information Hiding for “Color to Gray and back” with Hartley, Slant and Kekre...
Information Hiding for “Color to Gray and back” with Hartley, Slant and Kekre...
 
A Novel Method to Improve the Control Performance of Multilevel Inverter by M...
A Novel Method to Improve the Control Performance of Multilevel Inverter by M...A Novel Method to Improve the Control Performance of Multilevel Inverter by M...
A Novel Method to Improve the Control Performance of Multilevel Inverter by M...
 
Inflammatory Conditions Mimicking Tumours In Calabar: A 30 Year Study (1978-2...
Inflammatory Conditions Mimicking Tumours In Calabar: A 30 Year Study (1978-2...Inflammatory Conditions Mimicking Tumours In Calabar: A 30 Year Study (1978-2...
Inflammatory Conditions Mimicking Tumours In Calabar: A 30 Year Study (1978-2...
 
New Skills for Testers
New Skills for TestersNew Skills for Testers
New Skills for Testers
 
Open Source Software Survivability Analysis Using Communication Pattern Valid...
Open Source Software Survivability Analysis Using Communication Pattern Valid...Open Source Software Survivability Analysis Using Communication Pattern Valid...
Open Source Software Survivability Analysis Using Communication Pattern Valid...
 
An optimized link state routing protocol based on a cross layer design for wi...
An optimized link state routing protocol based on a cross layer design for wi...An optimized link state routing protocol based on a cross layer design for wi...
An optimized link state routing protocol based on a cross layer design for wi...
 
Public Verifiability in Cloud Computing Using Signcryption Based on Elliptic ...
Public Verifiability in Cloud Computing Using Signcryption Based on Elliptic ...Public Verifiability in Cloud Computing Using Signcryption Based on Elliptic ...
Public Verifiability in Cloud Computing Using Signcryption Based on Elliptic ...
 

Similaire à Adaptive checkpointing and replication strategy

Efficient Resource Management Mechanism with Fault Tolerant Model for Computa...
Efficient Resource Management Mechanism with Fault Tolerant Model for Computa...Efficient Resource Management Mechanism with Fault Tolerant Model for Computa...
Efficient Resource Management Mechanism with Fault Tolerant Model for Computa...Editor IJCATR
 
Optimized Assignment of Independent Task for Improving Resources Performance ...
Optimized Assignment of Independent Task for Improving Resources Performance ...Optimized Assignment of Independent Task for Improving Resources Performance ...
Optimized Assignment of Independent Task for Improving Resources Performance ...Ricardo014
 
Optimized Assignment of Independent Task for Improving Resources Performance ...
Optimized Assignment of Independent Task for Improving Resources Performance ...Optimized Assignment of Independent Task for Improving Resources Performance ...
Optimized Assignment of Independent Task for Improving Resources Performance ...ijgca
 
Fault Tollerant scheduling system for computational grid
Fault Tollerant scheduling system for computational gridFault Tollerant scheduling system for computational grid
Fault Tollerant scheduling system for computational gridGhulam Asfia
 
A Survey of Job Scheduling Algorithms Whit Hierarchical Structure to Load Ba...
A Survey of Job Scheduling Algorithms Whit  Hierarchical Structure to Load Ba...A Survey of Job Scheduling Algorithms Whit  Hierarchical Structure to Load Ba...
A Survey of Job Scheduling Algorithms Whit Hierarchical Structure to Load Ba...Editor IJCATR
 
A study on dynamic load balancing in grid environment
A study on dynamic load balancing in grid environmentA study on dynamic load balancing in grid environment
A study on dynamic load balancing in grid environmentIJSRD
 
Job Scheduling on the Grid Environment using Max-Min Firefly Algorithm
Job Scheduling on the Grid Environment using Max-Min  Firefly AlgorithmJob Scheduling on the Grid Environment using Max-Min  Firefly Algorithm
Job Scheduling on the Grid Environment using Max-Min Firefly AlgorithmEditor IJCATR
 
A survey of various scheduling algorithm in cloud computing environment
A survey of various scheduling algorithm in cloud computing environmentA survey of various scheduling algorithm in cloud computing environment
A survey of various scheduling algorithm in cloud computing environmenteSAT Journals
 
Effective and Efficient Job Scheduling in Grid Computing
Effective and Efficient Job Scheduling in Grid ComputingEffective and Efficient Job Scheduling in Grid Computing
Effective and Efficient Job Scheduling in Grid ComputingAditya Kokadwar
 
Bragged Regression Tree Algorithm for Dynamic Distribution and Scheduling of ...
Bragged Regression Tree Algorithm for Dynamic Distribution and Scheduling of ...Bragged Regression Tree Algorithm for Dynamic Distribution and Scheduling of ...
Bragged Regression Tree Algorithm for Dynamic Distribution and Scheduling of ...Editor IJCATR
 
Task Scheduling using Hybrid Algorithm in Cloud Computing Environments
Task Scheduling using Hybrid Algorithm in Cloud Computing EnvironmentsTask Scheduling using Hybrid Algorithm in Cloud Computing Environments
Task Scheduling using Hybrid Algorithm in Cloud Computing Environmentsiosrjce
 
ESTIMATING HANDLING TIME OF SOFTWARE DEFECTS
ESTIMATING HANDLING TIME OF SOFTWARE DEFECTSESTIMATING HANDLING TIME OF SOFTWARE DEFECTS
ESTIMATING HANDLING TIME OF SOFTWARE DEFECTScsandit
 

Similaire à Adaptive checkpointing and replication strategy (20)

D0212326
D0212326D0212326
D0212326
 
Efficient Resource Management Mechanism with Fault Tolerant Model for Computa...
Efficient Resource Management Mechanism with Fault Tolerant Model for Computa...Efficient Resource Management Mechanism with Fault Tolerant Model for Computa...
Efficient Resource Management Mechanism with Fault Tolerant Model for Computa...
 
G216063
G216063G216063
G216063
 
Optimized Assignment of Independent Task for Improving Resources Performance ...
Optimized Assignment of Independent Task for Improving Resources Performance ...Optimized Assignment of Independent Task for Improving Resources Performance ...
Optimized Assignment of Independent Task for Improving Resources Performance ...
 
Optimized Assignment of Independent Task for Improving Resources Performance ...
Optimized Assignment of Independent Task for Improving Resources Performance ...Optimized Assignment of Independent Task for Improving Resources Performance ...
Optimized Assignment of Independent Task for Improving Resources Performance ...
 
Fault Tollerant scheduling system for computational grid
Fault Tollerant scheduling system for computational gridFault Tollerant scheduling system for computational grid
Fault Tollerant scheduling system for computational grid
 
C017241316
C017241316C017241316
C017241316
 
Ie3514301434
Ie3514301434Ie3514301434
Ie3514301434
 
A Survey of Job Scheduling Algorithms Whit Hierarchical Structure to Load Ba...
A Survey of Job Scheduling Algorithms Whit  Hierarchical Structure to Load Ba...A Survey of Job Scheduling Algorithms Whit  Hierarchical Structure to Load Ba...
A Survey of Job Scheduling Algorithms Whit Hierarchical Structure to Load Ba...
 
A study on dynamic load balancing in grid environment
A study on dynamic load balancing in grid environmentA study on dynamic load balancing in grid environment
A study on dynamic load balancing in grid environment
 
Job Scheduling on the Grid Environment using Max-Min Firefly Algorithm
Job Scheduling on the Grid Environment using Max-Min  Firefly AlgorithmJob Scheduling on the Grid Environment using Max-Min  Firefly Algorithm
Job Scheduling on the Grid Environment using Max-Min Firefly Algorithm
 
A survey of various scheduling algorithm in cloud computing environment
A survey of various scheduling algorithm in cloud computing environmentA survey of various scheduling algorithm in cloud computing environment
A survey of various scheduling algorithm in cloud computing environment
 
Effective and Efficient Job Scheduling in Grid Computing
Effective and Efficient Job Scheduling in Grid ComputingEffective and Efficient Job Scheduling in Grid Computing
Effective and Efficient Job Scheduling in Grid Computing
 
K017617786
K017617786K017617786
K017617786
 
9 ijcse-01223
9 ijcse-012239 ijcse-01223
9 ijcse-01223
 
Bragged Regression Tree Algorithm for Dynamic Distribution and Scheduling of ...
Bragged Regression Tree Algorithm for Dynamic Distribution and Scheduling of ...Bragged Regression Tree Algorithm for Dynamic Distribution and Scheduling of ...
Bragged Regression Tree Algorithm for Dynamic Distribution and Scheduling of ...
 
D04573033
D04573033D04573033
D04573033
 
Task Scheduling using Hybrid Algorithm in Cloud Computing Environments
Task Scheduling using Hybrid Algorithm in Cloud Computing EnvironmentsTask Scheduling using Hybrid Algorithm in Cloud Computing Environments
Task Scheduling using Hybrid Algorithm in Cloud Computing Environments
 
N0173696106
N0173696106N0173696106
N0173696106
 
ESTIMATING HANDLING TIME OF SOFTWARE DEFECTS
ESTIMATING HANDLING TIME OF SOFTWARE DEFECTSESTIMATING HANDLING TIME OF SOFTWARE DEFECTS
ESTIMATING HANDLING TIME OF SOFTWARE DEFECTS
 

Plus de IOSR Journals (20)

A011140104
A011140104A011140104
A011140104
 
M0111397100
M0111397100M0111397100
M0111397100
 
L011138596
L011138596L011138596
L011138596
 
K011138084
K011138084K011138084
K011138084
 
J011137479
J011137479J011137479
J011137479
 
I011136673
I011136673I011136673
I011136673
 
G011134454
G011134454G011134454
G011134454
 
H011135565
H011135565H011135565
H011135565
 
F011134043
F011134043F011134043
F011134043
 
E011133639
E011133639E011133639
E011133639
 
D011132635
D011132635D011132635
D011132635
 
C011131925
C011131925C011131925
C011131925
 
B011130918
B011130918B011130918
B011130918
 
A011130108
A011130108A011130108
A011130108
 
I011125160
I011125160I011125160
I011125160
 
H011124050
H011124050H011124050
H011124050
 
G011123539
G011123539G011123539
G011123539
 
F011123134
F011123134F011123134
F011123134
 
E011122530
E011122530E011122530
E011122530
 
D011121524
D011121524D011121524
D011121524
 

Dernier

Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupFlorian Wilhelm
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brandgvaughan
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteDianaGray10
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxLoriGlavin3
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubKalema Edgar
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxLoriGlavin3
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek SchlawackFwdays
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfAddepto
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsRizwan Syed
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024BookNet Canada
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsPixlogix Infotech
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr BaganFwdays
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Mark Simos
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024Stephanie Beckett
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxLoriGlavin3
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfPrecisely
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 

Dernier (20)

Streamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project SetupStreamlining Python Development: A Guide to a Modern Project Setup
Streamlining Python Development: A Guide to a Modern Project Setup
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
WordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your BrandWordPress Websites for Engineers: Elevate Your Brand
WordPress Websites for Engineers: Elevate Your Brand
 
Take control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test SuiteTake control of your SAP testing with UiPath Test Suite
Take control of your SAP testing with UiPath Test Suite
 
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptxDigital Identity is Under Attack: FIDO Paris Seminar.pptx
Digital Identity is Under Attack: FIDO Paris Seminar.pptx
 
Unleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding ClubUnleash Your Potential - Namagunga Girls Coding Club
Unleash Your Potential - Namagunga Girls Coding Club
 
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptxPasskey Providers and Enabling Portability: FIDO Paris Seminar.pptx
Passkey Providers and Enabling Portability: FIDO Paris Seminar.pptx
 
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
"Subclassing and Composition – A Pythonic Tour of Trade-Offs", Hynek Schlawack
 
Gen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdfGen AI in Business - Global Trends Report 2024.pdf
Gen AI in Business - Global Trends Report 2024.pdf
 
Scanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL CertsScanning the Internet for External Cloud Exposures via SSL Certs
Scanning the Internet for External Cloud Exposures via SSL Certs
 
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: BNC CataList - Tech Forum 2024
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
The Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and ConsThe Ultimate Guide to Choosing WordPress Pros and Cons
The Ultimate Guide to Choosing WordPress Pros and Cons
 
"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan"ML in Production",Oleksandr Bagan
"ML in Production",Oleksandr Bagan
 
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
Tampa BSides - Chef's Tour of Microsoft Security Adoption Framework (SAF)
 
What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024What's New in Teams Calling, Meetings and Devices March 2024
What's New in Teams Calling, Meetings and Devices March 2024
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptxA Deep Dive on Passkeys: FIDO Paris Seminar.pptx
A Deep Dive on Passkeys: FIDO Paris Seminar.pptx
 
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdfHyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
Hyperautomation and AI/ML: A Strategy for Digital Transformation Success.pdf
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 

Adaptive checkpointing and replication strategy

  • 1. IOSR Journal of Computer Engineering (IOSR-JCE) e-ISSN: 2278-0661, p- ISSN: 2278-8727Volume 11, Issue 1 (May. - Jun. 2013), PP 31-38 www.iosrjournals.org www.iosrjournals.org 31 | Page Adaptive check-pointing and replication strategy to tolerate faults in computational grid Subash Thapa1 , Abhijit bhowmick2 1 (computer Science, NIT Kurukshetra, INDIA) 2 (computer Science, NIT Kurukshetra, INDIA) Abstract: We have considered both adaptive resource replication and adaptive check-pointing. Complete resource failure has been considered which was ignored in most of the previous works. We have proposed an efficient solution for scheduling as well as a fault tolerant method based on maintaining a balance between the overheads of replication and check-pointing. The selection of resource from available resource is done such that no partiality is done between jobs of equal priority. The fault tolerance method, i.e., resource failure has been resolved in our proposed solution with the help of replication, three replicas for each task and number of replicas will be increased by one on each permanent failure to maintain total replicas three and new replica start its execution from the last saved checkpoint . Keywords: Grid, fault,tolerance, redundancy,replication,adaptive,check pointing, Grid Computing I. Introduction .The main characteristics of grid are:  Multiple administrative domains: Since, it spawns into multiple administrative domains, the policies and autonomy of different domains needs to be maintained. Heterogeneity: Grid contains different types of resources like personal computers to super computers to specialized devices like telescopes. Grid provides seamless way to success different resources. Scalability: A grid might grow from few integrated resources to millions. The performance might be degraded as the size of grid increases. 1.3 Types of Grid Systems The types of grid are mentioned below: Computational Grid: A computational Grid is a system that aims at achieving higher aggregate computational power than any single constituent machine. Data Grid: A Data Grid is responsible for housing and providing access to data across multiple organizations. Users are not concerned with where this data is located as long as they have access to the data. 1.4.1 Resource Allocation Available resources should be used maximally or optimally in order to balance the distribution of tasks over the network. If node failure or communication link failure occurs, it is required to re-allocate and manage the tasks and resources from start. 1.4.3 FAULT TOLERANCE Fault-tolerance or graceful degradation is the property that enables a system (often computer-based) to continue operating properly in the event of the failure of (or one or more faults within) some of its components. II. Previous Work In previous works, several mechanisms were proposed for grid or distributed computing systems. However, some of them used only space redundancy (hardware replication), and others used only time redundancy (check-pointing and rollback).In 2007, Sangho Yi, Derrick Kondo has published a paper for reducing a number of replicas by using check-pointing and rollback and also analyzed the probability of successful execution within the given deadline and reliability requirement of each task for selecting replicas. In that paper major drawback analyzed by us are: In this paper, author assumes both permanent and recoverable failures occuraccording to the Poisson process with rate λfp and λfr, respectively for calculation of no replicas but If the failures occur with non-
  • 2. Adaptive check-pointing and replication strategy to tolerate faults in computational grid www.iosrjournals.org 32 | Page exponential distribution, (e.g. Gamma, Weibull, etc.), it is very hard to use the existing probabilistic distribution for the same. In this paper, check-point interval is not major issue, does not talk about this and each recoverable failure on each replica required to resume from its last saved check-point. . Previous works done by Babar Nazir, Kalim Qureshi and Paul Manuel in 2008 maintained fault index of grid resources which doesn’t keep track of permanent failure and recoverable failure. But in this paper major drawback was: The fault tolerance method used by the author in this paper is based only on check-pointing. The priority (deadline) of task has not been considered in this paper. As it is a economy based system , task of different priority from user is a certainty. III. Problem formulation: Grid users submit their jobs to the grid resource broker (GRB) by specifying their QoS requirements, i.e., deadline in which users want their jobs to be executed and the budget which users have for the completion of jobs. Grid resource broker schedules user jobs on the best available resource by optimizing time. Result of the job is submitted to user upon successful completion of the job. Such an economy based environment has following two major draw backs: In the economy based grid environments, there are resources that fulfill the criteria of deadline and budget constraints (QoS requirements), but they have a tendency toward faults. In such a scenario, the grid resource broker goes ahead to select the same resource for the mere reason that the grid resource promises to meet QoS requirements of the grid jobs. This eventually results in compromising the user’s QoS parameters in order to complete the task. In this paper, in order to address the first problem, we use a task check-pointing heuristic to enable the economy based grid to tolerate faults gracefully, as we are able to restore the partially completed task from the last checkpoint. And we also use replication on at most three best available resources on the basis of a probability value of permanent resource failure. In order to address the second problem, we make our check-pointing strategy adaptive by maintaining a probability value of recoverable resource failure. In this way, we are able to introduce checkpoint mostly when it is necessary. IV. System Model and Assumption: We have considered an economy based computational grid computing environment. In this system we assume the resources R1, R2, R3……………………………..……… Rn with following characteristics value. Some common notations and functions used throughout paper are: Np: Denotes count of permanent failure. Nr: Denotes count of Recoverable failure. Ns: Denotes count of successful execution of task. Pp: Probability value of permanent failure. Such that Pp = Np / (Ns+Np+Nr) Pr: Probability value of recoverable failure. Such that Pr = Nr / (Ns+Np+Nr) Pt: Probability value of Total failure. Such that Pt = Pr + Pp C(i), i = 1,2, . .. ,N, are the percentage of task completed such that 0 <= C(i) <= 100 and C(1) > C(2) > · · · > C(N)
  • 3. Adaptive check-pointing and replication strategy to tolerate faults in computational grid www.iosrjournals.org 33 | Page F(i), i = 0, 1,2, . . . ,N, are integers such that F(0) < F(1) < ……… <F(N) and represent index range of fault index. Proposed Idea: tr denote the work requirement time of a task T(tr) denote expected total execution time such that in the absence of any failures T(tr ) = tr The T(tr ) can expected execution time, Ti(ti,ci), where the ti and ci be calculated as the sum of all intervals are the work requirement time and check-pointing overhead of an ith interval. td represents the deadline for a task. Ps is a user-defined probability of successful execution of a task within the deadline. P (tr , td ) is probability of successful execution of a task which has tr as the work . requirement time and td as the deadline He assume that the permanent and recoverable failures occur according to Poisson process at rate λf p and λf r , respectively Theorem 1 If we assume the recoverable failures occur according to the Poisson process with rate λfr and that failures can occur during checkpointing, the expected execution time of the ith interval with check- pointing, Tri (ti , ci ) is given by Theorem 2 If we assume both permanent and recover able failures occur according to the Poisson process with rate λfp and λfr , respectively, the probability for successful execution for the time requirement tr within the deadline td , P (tr , td ) is given by V. Adaptive fault tolerant job scheduling strategy Grid faults affect the performance of resource management strategy. As our proposed strategy considers fault tolerance in an economy based computational grid environment, the aim is to optimize user- centric metrics in the presence of faults We propose to maintain and update probability value of permanent or recoverable resource failure of all available resources of the grid. The probability value of the grid resource will suggest its vulnerability to faults (i.e. higher the probability value is higher the failure rate). VI. Components of proposed scheduling strategy: Grid resource: Grid Information Server: Grid users register themselves to the Grid Information Server (GIS) of a grid by specifying the QoS requirements Grid Resource Broker (GRB): Each grid job (composed of gridlets) is first submitted to its broker. When GRB receives a grid job from a user, it gets the contact information of available grid resources from the GIS and then requests the resources to send their current work load condition.
  • 4. Adaptive check-pointing and replication strategy to tolerate faults in computational grid www.iosrjournals.org 34 | Page ALGORITHM A Select the resource according to QoS from available resource. Arrange them in increasing probability value of total failure. Select the three replicas for equal priority job such that no partiality is done at time t as shown in fig. F: probability value of the selected grid resource F(i), i = 0, 1, 2, . . . ,N, are integers such that F(0) < F(1) < ··· < F(N) C(i), i= 1, 2, . . . ,N, are the percentage of task completed such that <= C(i) <= 100 and C(1) > C(2) > ··· > C(N) IF F(0) <= F < F(1) THEN The task is queued to that resource with a checkpoint at C(1) (It indicates that the grid resource has to send the result of the task completion to the CPManager when C(0)% task is completed.) GO TO Step 6. END IF IF F(1) <= F < F(2) THEN The task is queued to that resource with a checkpoint at C(2). GO TO Step 6. END IF IF F(2) <= F < F(3) THEN The task is queued to that resource with a checkpoint at C(3). GO TO Step 6. END IF IF F(N- 1) <= F < F(N) THEN
  • 5. Adaptive check-pointing and replication strategy to tolerate faults in computational grid www.iosrjournals.org 35 | Page The task is queued to that resource with a checkpoint at C(N). GO TO Step 6. END IF IF F(N) <=F THEN Remove resource from the available resources list and label it as unavailable resource (i.e., no job is assigned) to that resource until: It has passed ALIVE time successfully. (ALIVE time is the time interval for which grid resource must be up so that it can be again included in the grid available resource list.) END IF Add the job to the unassigned job list and reapply the time optimization algorithm for this task on available resources. EXIT Fault Tolerant Schedule Manager (FTScheduleManager): we assume that a fault occurs when a fault detector is unable to receive any signal or response from grid resource in pull model / push model. Then a fault is detected by a fault detector, the fault occurrence information about the grid resource is updated in Fault Tolerant Schedule Manager that would be both recoverable failure information and permanent failure information. Task successful execution information is also send by resource broker to Fault Tolerant Schedule Manager. Thus fault probability information is used, while making a job allocation decision to the grid resource. It uses following algorithm i.e. Algorithm B: ALGORITHM B: IF gridlet receptor receives the task completion result from the CPManager THEN a. IF resource fault index >= 1 THEN Send a message to FTScheduleManager to increment Ns END IF b. GOTO Step3 END IF IF the to FTScheduleManager receives the task failure message from fault detector, THEN a. IF resource is permanent failure THEN Send a message to FTScheduleManager to increment Np Send a message to CPManager, whether there is any checkpoint result of this task. iii. IF checkpoint result of the task exists in CPManager’s task checkpointing table, THEN Inform the resource broker with that check point value to replicate on another resource. GOTO Step 3. END IF iv. IF checkpoint result of the task does not exist in CPManager’s task check-pointing table, THEN Inform the resource broker with that check point value to replicate on another resource. GOTO Step 3. END IF END IF EXIT Checkpoint Manager (CPManager): It receives the partially executed result of a task from a grid resource in the intervals specified by the grid resource broker based on the checkpoint. It maintains grid tasks and their checkpoint table which
  • 6. Adaptive check-pointing and replication strategy to tolerate faults in computational grid www.iosrjournals.org 36 | Page contains information of partially executed tasks by the grid resources. CPManager also receives and responds to the task completion and task failure message from grid resources. For a particular task, the CPManager discards the result of the previous checkpoint when a new value of checkpoint result is received with later value. For a particular task, if CPManager receives the task completion message from resource, it removes its entity from the task checkpoint result table. In our proposed algorithm it takes maximum of available check-point from replication. Grid dispatcher: It dispatches the tasks one by one to the respective grid resource as listed in the queue by GRB. A small grid task is called a gridlet. Gridlet receptor: It receives the task (gridlet) execution result from the grid resource where the task is dispatched by the dispatcher. WORKING: Resource entity registers themselves to the Grid Information Service (GIS) entity. Fault Tolerant Schedule Manager (FTScheduleManager) sends a synchronous event to the GIS entities to send the list of resource available. The GIS entity returns a list of registered resources to FTScheduleManager. FTScheduleManager then initializes the resource fault occurrence history table that contain the fault index (suggest resource vulnerability toward failures) of the resources available. IV. The GIS entity returns a list of registered resources and their contact details to the broker. The broker entity finds the current work load condition of the resources by sending events to resource entities requesting them to send their resource configuration and properties. VI. Resource entities respond to this event by sending their dynamic information that includes resources cost, capability, availability, load, and other configuration parameters. VII. The broker entity sends synchronous events to FTScheduleManager to get the probability value of total resource failure. VIII. The broker entity, after receiving probability value of total resource failure index list, dispatches gridlet to replicate on three resource entities if available such that all job of equal priority will get equal efficiency resource entities at a time t. Fault Tolerance method: I. Resource failure has been resolved in our proposed solution with the help of replication, 3 replica for each task.
  • 7. Adaptive check-pointing and replication strategy to tolerate faults in computational grid www.iosrjournals.org 37 | Page When the resource entity fails to process the gridlet, detected by fault detector then fault detector sends gridlet failure event to the CPManager entity and to FTScheduleManager. III. The CPManager entity on the receipt of gridlet failure event from resource entity sends an asynchronous gridlet failure event to broker entity . IV. The checkpoint at the check point manager is adaptively updated by taking the maximum of the checkpoints from the 3 replicated resource periodically. On permanent failure of resource, maximum of the last saved check points is taken from the check point manager to replicate task on another resource by resource broker but on recoverable failure restart the process on same resource with last saved check-point. Thus taking advantage of the different reliability and computation speed of the replicated resources. Page 22 of 27 VI. Task is resumed from this check point on a new replicated resource .Thus in this way maximum number of replicas for eack task is fixed at 3. i.e resource wastage is minimized. VII. Conclusion We have proposed an efficient solution for scheduling as well as an adaptive fault tolerant method based on maintaining a balance between the overheads of replication and check-pointing. The case of complete resource failure has been considered. Our proposed solution minimizes the resource wastage due to replication by taking an optimal number of replicas. At any time the check point manager maintains a unique check point which it keep on updating periodically , by taking the maximum of the received check points from the three replicas. This utilizes the different computational speed of the replicated resources. Use of probability value instead of index value further takes into account the case of permanent failure while scheduling and while deciding the check point intervals. References [1] “Adaptive checkpointing strategy to tolerate faults in economy based grid” Babar Nazir · Kalim Qureshi · Paul Manuel Published online: 16 October 2008 © Springer Science+Business Media, LLC 2008 [2] “Using Replication and Checkpointing for Reliable Task Management in Computational Grids” Sangho Yi, Derrick Kondo INRIA, France {sangho.yi, derrick.kondo}@inria.fr [3] H. Z. E. Tsiakkouri, R. Sakellariou and M. D. Dikaiakos, “Scheduling Workflows with Budget Constraints,” in Proceedings of the CoreGRID Workshop ”Integrated research in Grid Computing“, S. Gorlatch andM. Danelutto, Eds., Nov. 2009, pp. 347-357. [4] H. Topcuouglu, S. Hariri, and Min-you Wu, “Performance-Effective and Low-Complexity Task Scheduling for Heterogeneous Computing,” IEEE Trans. Parallel Distrib. Syst., vol. 13, no. 3, pp. 260-274, 2010.
  • 8. Adaptive check-pointing and replication strategy to tolerate faults in computational grid www.iosrjournals.org 38 | Page [5] M. Wieczorek, A. Hoheisel, and R. Prodan, “Taxonomies of the Multi-criteria Grid Workflow Scheduling Problem,” in Proceedings of the CoreGRID Workshop on Grid Middleware. Dresden, Germany: Springer-Verlag, June 2009. [6] J. Yu and R. Buyya , “Scheduling Scientific Workflow Applications with Deadline and Budget Constraints using Genetic Algorithms, ”Scientific Programming Journal, vol. 14, no. 1, 2009. [7] J. Yu, R. Buyya, and C. K. Tham, “Cost-based Scheduling of Scientific Workflow Applications on Utility Grids” , in Proceedings of the 1st IEEE International Conference on e-Science and Grid Computing (e-Science 2005), IEEE. Melbourne, Australia: IEEE CS Press, Dec. 2009. [8] J. Yu, M. Kirley, and R. Buyya, “Multi-objective Planning for Workflow Execution on Grids,” in Proceedings of the 8th IEEE/ACM International Conference on Grid Computing (Grid 2007), IEEE. Los Alamitos, CA, USA: IEEE CS Press, June 2010.